DEV Community

# benchmarking

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
My AI memory benchmark said 98.3%. The number was true — and worthless.

My AI memory benchmark said 98.3%. The number was true — and worthless.

Comments
4 min read
Is AI-Generated Code Buggier? The 2025-26 Data

Is AI-Generated Code Buggier? The 2025-26 Data

Comments 1
3 min read
Four Performance Bugs AI Coders Introduce Every Day

Four Performance Bugs AI Coders Introduce Every Day

Comments
4 min read
prima.cpp local llm benchmark: 15% Faster Than llama.cpp

prima.cpp local llm benchmark: 15% Faster Than llama.cpp

Comments
8 min read
Building an Official Performance Baseline for Vix.cpp Core v2.6.3

Building an Official Performance Baseline for Vix.cpp Core v2.6.3

Comments
3 min read
I measure how fast 42 LLMs actually answer. Here's the honest method.

I measure how fast 42 LLMs actually answer. Here's the honest method.

1
Comments 1
2 min read
Comparing Node.js Postgres Client Libraries: brianc/node-postgres vs. porsager/postgres for Efficiency and Use Cases

Comparing Node.js Postgres Client Libraries: brianc/node-postgres vs. porsager/postgres for Efficiency and Use Cases

1
Comments
10 min read
I needed up-to-date .NET mapper benchmarks. They didn't exist. So I built them.

I needed up-to-date .NET mapper benchmarks. They didn't exist. So I built them.

1
Comments
3 min read
How do you benchmark a product you built yourself?

How do you benchmark a product you built yourself?

1
Comments
2 min read
Benchmarking API reliability under load: when zero downtime migration becomes critical

Benchmarking API reliability under load: when zero downtime migration becomes critical

Comments
3 min read
How I Built a 95K-Line Cognitive AI Pipeline That Takes an 8B Model to GPT-4 Territory

How I Built a 95K-Line Cognitive AI Pipeline That Takes an 8B Model to GPT-4 Territory

Comments
4 min read
Building a Rust Benchmarking Agent

Building a Rust Benchmarking Agent

1
Comments
21 min read
We Benchmarked SupportSage Against Traditional Supports: Here's the Data

We Benchmarked SupportSage Against Traditional Supports: Here's the Data

Comments
3 min read
Why I spun my benchmark into its own repo (and why every dev tool with a benchmark should)

Why I spun my benchmark into its own repo (and why every dev tool with a benchmark should)

Comments
4 min read
KVQuant / BitForge: same model, smarter context, better answer

KVQuant / BitForge: same model, smarter context, better answer

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.