Benchmark - DEV Community

Skip to content

DEV Community

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Odilon HUGONNOT

Jul 25

39,000 Torrents: The Bug My Green Benchmark Never Caught

#matching #algorithms #benchmark #selfhosting

8 min read

smakosh

Jul 22

OpenRouter vs Vercel vs LLMGateway Performance

#ai #performance #benchmark #llm

6 min read

Wei Dou

Jul 22

MCPMark v2: InsForge on Sonnet 4.6

#ai #mcp #benchmark #database

3 min read

Pneumetron

Jul 17

SDABench: A New Benchmark for Evaluating LLMs in Scientific Discovery

#llm #scientificdiscovery #benchmark #airesearch

4 min read

Rob

Jul 15

Model Showdown Round 9: Qwen 3.6 27B vs Qwen 3.6 35B-A3B vs Qwythos-9B vs GLM-4.7-Flash vs Nemotron-3-Nano

#modelshowdown #benchmark #ai #llm

14 min read

Jul 15

DeepSeek vs GLM vs Qwen: Which Free LLM API is Best for Your Project?

#ai #comparison #llm #benchmark

4 min read

Pneumetron

Jul 14

AdvancedMathBench: A New Benchmark for LLM Advanced Mathematical Reasoning

#llm #mathematics #benchmark #proofgeneration

3 min read

Rob

Jul 13

TurboQuant, Four Months Later: Chasing Google's 6x VRAM Claim Into the Wild

#homelab #ai #llm #benchmark

6 min read

Adeline

Jul 13

Your agent's memory remembers what you chose. Does it remember what you rejected?

#ai #memory #opensource #benchmark

5 min read

Jul 12

Which LLM should I actually code with? I built a small benchmark to find out

#ai #llm #benchmark #programming

2 min read

Jul 10

I Benchmarked 42 Compression Formats Spanning Four Decades. Here's What to Actually Use.

#compression #zip #benchmark #cli

5 min read

Rob

Jul 7

ComfyUI, Lemonade, and LocalAI: Scouting the Next Wave of Homelab AI Tools

#homelab #ai #llm #benchmark

7 min read

Jul 7

AI Coding Tools Benchmark 2026: Cursor vs Copilot vs Windsurf vs Claude Code

#coding #benchmark #cursor #githubcopilot

5 min read

Cleiton Augusto Correa Bezerra

Jul 4

I built a neutral benchmarking layer for quantum simulators in Rust — and it revealed a silent disagreement between two backends

#rust #quantumcomputing #opensource #benchmark

1 min read

xbill for Google Developer Experts

Jun 30

Debugging Deployments with Gemma 12B, TPU v6e-4, MCP, and Antigravity CLI

#mcps #gemma #tpu #benchmark

16 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.