DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
LLMs + Tool Calls: Clever But Cursed

LLMs + Tool Calls: Clever But Cursed

1
Comments
2 min read
Base LLMs vs Instruction-Tuned LLMs: Understanding the Architecture Behind ChatGPT and Claude

Base LLMs vs Instruction-Tuned LLMs: Understanding the Architecture Behind ChatGPT and Claude

Comments
3 min read
Reranking and Two-Stage Retrieval: Precision When It Matters Most

Reranking and Two-Stage Retrieval: Precision When It Matters Most

Comments
2 min read
🚀 How I Created an AI-Powered Secret Santa Using Cognee as the Memory Layer

🚀 How I Created an AI-Powered Secret Santa Using Cognee as the Memory Layer

8
Comments 4
5 min read
Why We Replaced Our Orchestrator with a 'Regex' Switch

Why We Replaced Our Orchestrator with a 'Regex' Switch

Comments
4 min read
Bifrost: The Fastest Open Source LLM Gateway

Bifrost: The Fastest Open Source LLM Gateway

Comments
4 min read
Adaptive Load Balancing: Why Your LLM Gateway Needs It

Adaptive Load Balancing: Why Your LLM Gateway Needs It

Comments
3 min read
TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

Comments
3 min read
Creating Personal AI Agents in Multiplayer Games with LoRA Adapters: An Efficient and Memory-Saving Solution

Creating Personal AI Agents in Multiplayer Games with LoRA Adapters: An Efficient and Memory-Saving Solution

1
Comments
4 min read
Dense vs Sparse Retrieval: Mastering FAISS, BM25, and Hybrid Search

Dense vs Sparse Retrieval: Mastering FAISS, BM25, and Hybrid Search

Comments
15 min read
AWS Bedrock with LangChain

AWS Bedrock with LangChain

1
Comments
3 min read
Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

3
Comments 2
2 min read
Prompt Length vs. Context Window: The Real Limits Behind LLM Performance

Prompt Length vs. Context Window: The Real Limits Behind LLM Performance

Comments
4 min read
Prompt‑Powered User Personas: From Messy Logs to Living Profiles

Prompt‑Powered User Personas: From Messy Logs to Living Profiles

Comments
14 min read
Beyond the Black Box: Neuro‑Symbolic AI, Metacognition, and the Next Leap in Machine Intelligence

Beyond the Black Box: Neuro‑Symbolic AI, Metacognition, and the Next Leap in Machine Intelligence

Comments
12 min read
The Prompting Trick That Fixed My AI Image Generation

The Prompting Trick That Fixed My AI Image Generation

2
Comments
7 min read
How an LLM Gateway Can Help You Build Better AI Applications

How an LLM Gateway Can Help You Build Better AI Applications

Comments
11 min read
Finally Got My Dify Agent Working in Discord, Telegram and Slack

Finally Got My Dify Agent Working in Discord, Telegram and Slack

Comments
3 min read
The Wipe & Inject Pattern: Full Context for Implementation After Long Planning Sessions

The Wipe & Inject Pattern: Full Context for Implementation After Long Planning Sessions

Comments
3 min read
You’re Talking to Your AI Wrong. Here’s How to Fix It.

You’re Talking to Your AI Wrong. Here’s How to Fix It.

Comments
3 min read
Fine-Tuning Llama 3 with PEFT

Fine-Tuning Llama 3 with PEFT

1
Comments
2 min read
Taming Opus 4.5's Efficiency: Using TodoWrite to Keep Claude Code on Track

Taming Opus 4.5's Efficiency: Using TodoWrite to Keep Claude Code on Track

Comments
2 min read
How to Build an AI Agent Evaluation Framework from Scratch

How to Build an AI Agent Evaluation Framework from Scratch

1
Comments
5 min read
I Built a Distributed AI Search Engine That Lets Websites 'Talk' Directly to LLMs (No Indexing Required)

I Built a Distributed AI Search Engine That Lets Websites 'Talk' Directly to LLMs (No Indexing Required)

4
Comments
8 min read
Why your AI assistant lies to you (and how to fix it)

Why your AI assistant lies to you (and how to fix it)

Comments
4 min read
loading...