#benchmark
Posts
We benchmarked 10 LLMs on 10 real agent coding tasks — here are the results
Vilius, May 9, 2 min read
#ai #llm #benchmark #agents

How we almost wrote off 3 models as broken — the thinking-mode tax
Vilius, May 9, 2 min read, 1 reaction
#ai #llm #benchmark #postmortem

Model Showdown Round 2: Adding Gemma, Kimi, and 579 GB of Stubborn Optimism
Rob, May 8, 11 min read
#ai #llm #benchmark #homelab

The Agentic Gap: Claude Oneshots, Gemma Fails
Rob, May 8, 9 min read
#ai #llm #benchmark #homelab

Slaying the Gemma Beast: How We Fixed Local AI and Shipped Search
Rob, May 8, 13 min read
#ai #llm #benchmark #homelab

Model Showdown: Benchmarking Local vs Cloud LLMs on a Real Coding Task
Rob, May 8, 14 min read
#ai #llm #benchmark #homelab

I Ran 5 LLMs Through 10 Real Agent Coding Tasks. The Free One Won.
Vilius, May 9, 2 min read, 2 reactions, 1 comment
#ai #agents #benchmark #llm

Optimize benchmark in Next.js 15 vs Astro 4: What You Need to Know
ANKUSH CHOUDHARY JOHAL, May 7, 3 min read
#optimize #benchmark #nextjs #astro

CPU Inference on AMD EPYC 9334: Real Numbers for LLM and TTS Workloads
RubberDuckOps for Leaseweb, May 6, 4 min read
#machinelearning #llm #benchmark #infrastructure

Benchmark: Claude 3.5 vs. GPT-4o for Cloud Cost Anomaly Detection in AWS and GCP
ANKUSH CHOUDHARY JOHAL, May 6, 19 min read
#benchmark #claude #gpt4o #cloud

Benchmark: Discord 20 Loads 30% Faster Than Microsoft Teams 5 on Chrome 130
ANKUSH CHOUDHARY JOHAL, May 4, 2 min read
#benchmark #discord #loads #faster

Benchmark: JetBrains DataGrip 2026 vs. DBeaver 24.0: Query Execution Speed for PostgreSQL 17
ANKUSH CHOUDHARY JOHAL, May 4, 3 min read
#benchmark #jetbrains #datagrip #2026

Vector Search Benchmark: FAISS 1.9 vs. Chroma 0.6 vs. Pinecone 1.6 for 100M Embedding Datasets
ANKUSH CHOUDHARY JOHAL, May 3, 15 min read
#vector #search #benchmark #faiss

Benchmark: Gitea 1.24 vs. GitLab 17.0 for Git Repository Performance
ANKUSH CHOUDHARY JOHAL, May 3, 14 min read
#benchmark #gitea #gitlab #repository

Benchmark CI/CD in Docker 25 vs Cilium: What You Need to Know
ANKUSH CHOUDHARY JOHAL, May 3, 4 min read
#benchmark #cicd #docker #cilium