DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why Softmax is Used Instead of Argmax in Neural Network Training

Why Softmax is Used Instead of Argmax in Neural Network Training

Comments
4 min read
How Search Engines Actually Answer Your Questions

How Search Engines Actually Answer Your Questions

Comments
11 min read
Giving AI Eyes: Multi-Modal LLMs

Giving AI Eyes: Multi-Modal LLMs

Comments
9 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data

Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Linear Algebra for AI

Linear Algebra for AI

1
Comments
2 min read
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Stock Price Prediction by ML Models

Comments
1 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Transformers and Attention: How LLMs Actually Process Text

Transformers and Attention: How LLMs Actually Process Text

4
Comments
18 min read
Star Multi-Class Classification Neural Network With Pytorch

Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

5
Comments
5 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Comments
4 min read
🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

5
Comments
4 min read
Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Comments
8 min read
Stock Price Prediction

Stock Price Prediction

Comments
1 min read
Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

4
Comments
7 min read
The Evolution of Sequential Learning Models: RNN LSTM Transformers

The Evolution of Sequential Learning Models: RNN LSTM Transformers

Comments
2 min read
Devtool for running and benchmarking local AI

Devtool for running and benchmarking local AI

2
Comments
3 min read
BIGRAM LANGUAGE MODELS USING A NEURAL NET

BIGRAM LANGUAGE MODELS USING A NEURAL NET

Comments
14 min read
loading...