DEV Community

Deep Learning

This tag is for discussions, articles, and questions about deep learning, a subfield of machine learning.

Posts

Transformers — The Architecture That Changed AI (Part 1 of 3)

13 min read
Vision Transformers — How Transformers Learned to See (Part 2 of 3)

12 min read
GNN vs. Trees: High-Speed Hybrid Architecture for XLA Runtime Prediction

2 min read
Gradient Descent: The Engine That Made Deep Learning Possible : How one simple idea changed the way machines learn

5 min read
The Big Bang of Deep Learning: How 2012 Changed Everything

4 min read
The GPU Utilization Number That's Quietly Wrecking AI Team Budgets

5 min read
An Interpretation of Lilian Weng's Scaling Law Blog Post

2 min read
How I Cut My AI API Costs by 61% with a Unified Gateway

5 min read
One "+x" That Made 100-Layer Networks Trainable: ResNet Skip Connections

2 min read
Activation Functions: Why Non-Linearity Is Everything

3 min read
AI Deep Learning: Explained Simply

3 min read
Your gradient dies on the way to layer 1 (and how to save it)

4 min read
Understanding Backpropagation: Calculating Gradients for Hidden Layer Weights and Biases

3 min read
Dropout: Switch Off Neurons to Stop Overfitting

1 min read
Free from-scratch deep learning notes: tensors, attention, and a tiny GPT

1 min read