DEV Community

Deep Learning

This tag is for discussions, articles, and questions primarily about deep learning, a subfield of machine learning.

Posts

DragonMemory: Neural Sequence Compression for Production RAG

1
Comments
8 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Comments
1 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Comments
4 min read
🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

5
Comments
4 min read
Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Comments
8 min read
Stock Price Prediction

Comments
1 min read
Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

4
Comments
7 min read
The Evolution of Sequential Learning Models: RNN LSTM Transformers

Comments
2 min read
Devtool for running and benchmarking local AI

2
Comments
3 min read
Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

1
Comments
14 min read
BIGRAM LANGUAGE MODELS USING A NEURAL NET

Comments
14 min read
Seeing Shapes: Unveiling Neural Network Vision with Fourier Geometry by Arvind Sundararajan

Comments
2 min read
LLMs Speaking in Tongues: Unlocking Direct Semantic Exchange

Comments
2 min read
LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

1
Comments
4 min read
Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

2
Comments
7 min read
AI Art Turbocharged: Differentiable Diffusion for Hyper-Realistic Results

Comments
2 min read
Unlocking AI Vision with the Wisdom of Cats: Building Generalizable Models

Comments
2 min read