DEV Community

# reinforcementlearning

Posts

Embodied AI Systems: Extending Intelligence Through Learning in the Environment

2 min read
Policy Gradients: REINFORCE from Scratch with NumPy

16 min read
Deep Q-Networks: Experience Replay and Target Networks

18 min read
Q-Learning from Scratch: Navigating the Frozen Lake

11 min read
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

4 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

52 min read
How I Built a Readable AlphaZero From Scratch — A Deep Dive Into the Code

10 min read
From Pixels to Physicality ☃️: Engineering Olaf with Reinforcement ✨ Learning, Control Systems, and Illusion Design 🤖

8 min read
I Built an AI Arena and Trained AlphaZero to Play Gomoku: Here’s How

4 min read
[Meta-RL] We told an AI agent 'you can fail 3 times.' Accuracy went up 19%.

4 min read