Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
inference
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
The Inference Inversion
David Aronchick
David Aronchick
David Aronchick
Follow
May 5
The Inference Inversion
#
distributedcomputing
#
edgecomputing
#
nvidia
#
inference
Comments
Add Comment
7 min read
Muse Spark beats Llama 4 with 10x less compute. Here's how.
Gabriel Anhaia
Gabriel Anhaia
Gabriel Anhaia
Follow
Apr 26
Muse Spark beats Llama 4 with 10x less compute. Here's how.
#
ai
#
llm
#
architecture
#
inference
Comments
Add Comment
7 min read
First Words: LLM Inference on RISC-V
Bruno Verachten
Bruno Verachten
Bruno Verachten
Follow
Apr 22
First Words: LLM Inference on RISC-V
#
bananapi
#
benchmark
#
inference
#
llamacpp
Comments
Add Comment
9 min read
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 13
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
#
bayesian
#
supervisedlearning
#
probabilistic
#
inference
Comments
Add Comment
13 min read
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
Alan West
Alan West
Alan West
Follow
Apr 7
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
#
turboquant
#
locallm
#
inference
#
opensource
1
 reaction
Comments
Add Comment
6 min read
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 26
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
#
bayesian
#
probabilistic
#
inference
#
pymc
1
 reaction
Comments
Add Comment
13 min read
From MLE to Bayesian Inference: Why Your Estimate Needs a Prior
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 29
From MLE to Bayesian Inference: Why Your Estimate Needs a Prior
#
bayesian
#
inference
#
statistics
#
probabilistic
Comments
Add Comment
15 min read
The EM Algorithm: An Intuitive Guide with the Coin Toss Example
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 27
The EM Algorithm: An Intuitive Guide with the Coin Toss Example
#
unsupervisedlearning
#
inference
#
optimisation
#
probabilistic
Comments
Add Comment
10 min read
Maximum Likelihood Estimation from Scratch: From Coin Flips to Gaussians
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 26
Maximum Likelihood Estimation from Scratch: From Coin Flips to Gaussians
#
statistics
#
inference
#
optimisation
#
probabilistic
Comments
Add Comment
13 min read
DGX Spark Inference Performance: Local LLM vs Cloud Benchmarks (2026)
MrJHSN
MrJHSN
MrJHSN
Follow
Mar 19
DGX Spark Inference Performance: Local LLM vs Cloud Benchmarks (2026)
#
dgx
#
llm
#
inference
#
benchmark
Comments
Add Comment
5 min read
Estimating Operational Costs for CLIP-Based Image Search on 1 Million Images: Infrastructure Expenses Focused
Artyom Kornilov
Artyom Kornilov
Artyom Kornilov
Follow
Mar 10
Estimating Operational Costs for CLIP-Based Image Search on 1 Million Images: Infrastructure Expenses Focused
#
clip
#
gpu
#
inference
#
cost
Comments
Add Comment
12 min read
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
deharoalexandre-cyber
deharoalexandre-cyber
deharoalexandre-cyber
Follow
Apr 8
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
#
ai
#
llm
#
cpp
#
inference
Comments
1
 comment
4 min read
How to Optimize AI Agent Costs — Inference, API Calls, and Infrastructure
Custodia-Admin
Custodia-Admin
Custodia-Admin
Follow
Mar 13
How to Optimize AI Agent Costs — Inference, API Calls, and Infrastructure
#
agents
#
costs
#
optimization
#
inference
Comments
1
 comment
3 min read
Why Inference Compression Compounds for Modular Agents
Rotifer Protocol
Rotifer Protocol
Rotifer Protocol
Follow
Mar 31
Why Inference Compression Compounds for Modular Agents
#
inference
#
compression
#
agents
#
gene
1
 reaction
Comments
Add Comment
4 min read
Model Serving Infrastructure: Building Scalable Inference
Matt Frank
Matt Frank
Matt Frank
Follow
Feb 23
Model Serving Infrastructure: Building Scalable Inference
#
modelserving
#
inference
#
mlops
Comments
Add Comment
7 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account