DEV Community

# pytorch

Posts

Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?

5 min read
QAT vs PTQ on our edge vision model: 6 months of A/B data

4 min read
LLM-as-judge variance broke our DPO training signal for 3 weeks

4 min read
The bf16 grad accumulator that killed our SDXL LoRA training

4 min read
I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

2 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)

5 min read
Prefix caching in vLLM under multi-tenant agent traffic

Comments 1
4 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)

4 min read
Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

3 min read
Distilling SAM 2 into a 6MB student for industrial inspection

4 min read
My high-res image-to-video kept OOMing — turns out I was decoding outside no_grad

4 min read
Building Smaller Graph Neural Networks for Edge Healthcare Systems

4 min read
How to Serve a PyTorch Model with FastAPI: A Complete Guide

4 min read
CUDA Out of Memory at 60% Utilization: Tracing PyTorch GPU Memory Fragmentation

4 min read
What Google's New Chips Mean If You Train Your Own Models

Google Cloud NEXT '26 Challenge Submission

9 min read