DEV Community

# agents

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
We benchmarked 10 LLMs on 10 real agent coding tasks — here are the results

We benchmarked 10 LLMs on 10 real agent coding tasks — here are the results

Comments
2 min read
Have you ever built a dashboard nobody opens?

Have you ever built a dashboard nobody opens?

Comments
4 min read
What 16 Parallel Claude Agents Built Around Themselves: Deconstructing Anthropic's C Compiler Experiment

What 16 Parallel Claude Agents Built Around Themselves: Deconstructing Anthropic's C Compiler Experiment

Comments
12 min read
The Overlooked Gem in Microsoft Entra That Gives Your AI Agents Super-Powers

The Overlooked Gem in Microsoft Entra That Gives Your AI Agents Super-Powers

Comments
4 min read
Scheduled agent runs are now more reliable

Scheduled agent runs are now more reliable

Comments
3 min read
I Built a Local AI Coding Agent on M5 Max 128GB — It Failed 164 Times Before Passing 35 Tests

I Built a Local AI Coding Agent on M5 Max 128GB — It Failed 164 Times Before Passing 35 Tests

Comments
7 min read
I built an AI agent that turns Gmail receipts into a spreadsheet — automatically

I built an AI agent that turns Gmail receipts into a spreadsheet — automatically

Comments
3 min read
Have you ever told an AI 'never do this' and watched it do it anyway?

Have you ever told an AI 'never do this' and watched it do it anyway?

Comments
4 min read
Fast, Efficient, and Confidently Delivered — But Wrong

Fast, Efficient, and Confidently Delivered — But Wrong

Comments
4 min read
My Bookkeeper AI Agent Does a Much Better Job Than Me

My Bookkeeper AI Agent Does a Much Better Job Than Me

Comments
5 min read
Nine Seconds, No Backups: An Agent’s “Confession”

Nine Seconds, No Backups: An Agent’s “Confession”

5
Comments
10 min read
Read by Something Without a Body

Read by Something Without a Body

Comments
1 min read
36 Days of Claude Code Logs: Silent Model Switching, 11.5x Efficiency Gap

36 Days of Claude Code Logs: Silent Model Switching, 11.5x Efficiency Gap

Comments
8 min read
How Stripe, Shopify, and Airbnb Build AI Harnesses

How Stripe, Shopify, and Airbnb Build AI Harnesses

Comments
3 min read
I audited 18 A2A agent cards. 17 graded F. Mine was the 18th.

I audited 18 A2A agent cards. 17 graded F. Mine was the 18th.

1
Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.