DEV Community

Nitin Srivastava profile picture

Nitin Srivastava

404 bio not found

Joined Joined on 
Production Reranker Layer for RAG in Python: Cross-Encoder, Cohere Fallback, and Reciprocal Rank Fusion (Runnable Code)

Production Reranker Layer for RAG in Python: Cross-Encoder, Cohere Fallback, and Reciprocal Rank Fusion (Runnable Code)

Comments
10 min read
Bulletproofing LLM Structured Output in Python: Healing Retries, Cost Caps, and Drift Detection (Runnable Code)

Bulletproofing LLM Structured Output in Python: Healing Retries, Cost Caps, and Drift Detection (Runnable Code)

Comments
10 min read
Building a Production LLM Evaluation Harness in Pytest: Cost-Bounded, Flake-Aware, CI-Gated (Runnable Python)

Building a Production LLM Evaluation Harness in Pytest: Cost-Bounded, Flake-Aware, CI-Gated (Runnable Python)

Comments
9 min read
How We Cut API Response Time from 2.3s to 180ms Using Redis + Smart Caching

How We Cut API Response Time from 2.3s to 180ms Using Redis + Smart Caching

Comments
8 min read
Semantic Chunking with Overlap and Section-Awareness: The RAG Tutorial Nobody Wrote

Semantic Chunking with Overlap and Section-Awareness: The RAG Tutorial Nobody Wrote

Comments
8 min read
loading...