Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
gpu
Follow
Hide
Posts
Left menu
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Qwen3.6 GGUF, RTX 4080 Cooling & Pragmata GPU Benchmarks Drive Performance
soy
soy
soy
Follow
Apr 17
Qwen3.6 GGUF, RTX 4080 Cooling & Pragmata GPU Benchmarks Drive Performance
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
What Happens When an AI Agent Gets Kernel-Level GPU Traces
Ingero Team
Ingero Team
Ingero Team
Follow
Apr 16
What Happens When an AI Agent Gets Kernel-Level GPU Traces
#
gpu
#
ebpf
#
observability
#
gpuobservability
Comments
Add Comment
5 min read
NVIDIA DLSS 4 & RTX VSR Updates, CUDA Shared Memory Optimization Challenges
soy
soy
soy
Follow
Apr 16
NVIDIA DLSS 4 & RTX VSR Updates, CUDA Shared Memory Optimization Challenges
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
NVIDIA 50-Series GDDR7 Rumors, Mesa 26.1 AMD APU Drivers, WebGPU 1-bit LLMs
soy
soy
soy
Follow
Apr 15
NVIDIA 50-Series GDDR7 Rumors, Mesa 26.1 AMD APU Drivers, WebGPU 1-bit LLMs
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
4 min read
LLM Auto-Tunes llama.cpp, SASS Latency Analysis, DLSS Frame Gen for RTX 40
soy
soy
soy
Follow
Apr 14
LLM Auto-Tunes llama.cpp, SASS Latency Analysis, DLSS Frame Gen for RTX 40
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
VRAMăćąăăă°è§Łæ±șăăăăŻç©ççă«ééăŁăŠăă â HBMă»CXLă»Unified MemoryăćăăȘăăŁăăăź
plasmon
plasmon
plasmon
Follow
Apr 14
VRAMăćąăăă°è§Łæ±șăăăăŻç©ççă«ééăŁăŠăă â HBMă»CXLă»Unified MemoryăćăăȘăăŁăăăź
#
llm
#
gpu
#
vram
Comments
Add Comment
4 min read
llama.cppăźèšćźă§8GBăźæ§èœă5ćć€ăă â äž»èŠăȘăă·ă§ăłăźæé©ć€ăćșăă
plasmon
plasmon
plasmon
Follow
Apr 14
llama.cppăźèšćźă§8GBăźæ§èœă5ćć€ăă â äž»èŠăȘăă·ă§ăłăźæé©ć€ăćșăă
#
llm
#
llamacpp
#
gpu
Comments
Add Comment
4 min read
One Query, Four GPUs: Tracing a Distributed Training Stall Across Nodes
Ingero Team
Ingero Team
Ingero Team
Follow
Apr 13
One Query, Four GPUs: Tracing a Distributed Training Stall Across Nodes
#
gpu
#
ebpf
#
distributedcomputing
Comments
Add Comment
7 min read
Task Manager is lying about your GPU temps. Here is how to read the real data in Python
Yaroslav Pristupa
Yaroslav Pristupa
Yaroslav Pristupa
Follow
Apr 13
Task Manager is lying about your GPU temps. Here is how to read the real data in Python
#
ai
#
hardware
#
softwaredevelopment
#
gpu
Comments
Add Comment
4 min read
Tutorial: Build an AI-Powered GPU Fleet Optimizer
DigitalOcean
DigitalOcean
DigitalOcean
Follow
for
DigitalOcean
Apr 17
Tutorial: Build an AI-Powered GPU Fleet Optimizer
#
gpu
#
nvidia
#
ai
#
tutorial
3
 reactions
Comments
Add Comment
12 min read
AMD ML Complete Stack
compilersutra
compilersutra
compilersutra
Follow
Apr 12
AMD ML Complete Stack
#
gpu
#
cpu
#
ai
#
llm
Comments
Add Comment
1 min read
RTX 5090 cuBLAS Bug, Neural Texture Compression, Multi-GPU vLLM Inference
soy
soy
soy
Follow
Apr 11
RTX 5090 cuBLAS Bug, Neural Texture Compression, Multi-GPU vLLM Inference
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
CUDA SGEMM Bug on RTX 5090, Kernel-Fusing for SGEMV, & Radeon RX 9070 XT Price Surge
soy
soy
soy
Follow
Apr 10
CUDA SGEMM Bug on RTX 5090, Kernel-Fusing for SGEMV, & Radeon RX 9070 XT Price Surge
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
4 min read
TGI - Text Generation Inference - Install, Config, Troubleshoot
Rost
Rost
Rost
Follow
Apr 10
TGI - Text Generation Inference - Install, Config, Troubleshoot
#
docker
#
gpu
#
observability
#
selfhosting
Comments
Add Comment
9 min read
Memory Coalescing: Same computation, 6x Performance Difference
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Apr 9
Memory Coalescing: Same computation, 6x Performance Difference
#
cuda
#
gpu
#
aiops
#
cpp
Comments
Add Comment
6 min read
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account