DEV Community

# mlops

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Speculative decoding shifted our output distribution and evals missed it

Speculative decoding shifted our output distribution and evals missed it

1
Comments
4 min read
LLMOps in 2026: AI Demo to Production Guide

LLMOps in 2026: AI Demo to Production Guide

Comments
8 min read
Streamlining MLOps: Model Deployment with MLflow

Streamlining MLOps: Model Deployment with MLflow

Comments
2 min read
The latency tax of an LLM gateway: I measured Bifrost's overhead

The latency tax of an LLM gateway: I measured Bifrost's overhead

Comments
4 min read
AI Workloads Are Reshaping Kubernetes in 2026: GPU Scheduling, MLOps, and the Platform Engineering Reckoning

AI Workloads Are Reshaping Kubernetes in 2026: GPU Scheduling, MLOps, and the Platform Engineering Reckoning

Comments
4 min read
Winograd convolutions cost us 2 mAP and we didn't notice for a month

Winograd convolutions cost us 2 mAP and we didn't notice for a month

Comments
4 min read
What DevOps Taught Me About AI Governance

What DevOps Taught Me About AI Governance

Comments
4 min read
From ML Tooling to Analytical Governance: Recent Updates to KMDS

From ML Tooling to Analytical Governance: Recent Updates to KMDS

1
Comments
3 min read
RLAIF Is Eating RLHF — Here Are the Four Places Human Feedback Still Wins

RLAIF Is Eating RLHF — Here Are the Four Places Human Feedback Still Wins

Comments
6 min read
A 9-point eval gain vanished when we deduped train against test

A 9-point eval gain vanished when we deduped train against test

Comments
4 min read
OpenAI Already Told Us the Kubernetes Scaling Story, Most People Just Did Not Read It Closely

OpenAI Already Told Us the Kubernetes Scaling Story, Most People Just Did Not Read It Closely

Comments
10 min read
I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

Comments
3 min read
Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Comments
7 min read
I Built a Production RAG System on My M1 Mac for $0

I Built a Production RAG System on My M1 Mac for $0

Comments
3 min read
I built a feature store in pure Python to finally understand the point-in-time join

I built a feature store in pure Python to finally understand the point-in-time join

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.