DEV Community

# reinforcementlearning

Posts

Value Iteration vs Q-Learning: Dynamic Programming Meets RL

Comments
12 min read
Evolution Is Back: A New Way to Fine‑Tune LLMs

1
Comments
7 min read
Solving CartPole Without Gradients: Simulated Annealing

Comments
13 min read
The Cross-Entropy Method: Solving RL Without Gradients

1
Comments
12 min read
Self-Learning AI Agents; Architectures and Challenges

1
Comments 1
3 min read
Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet"

5
Comments
2 min read
Top 15 Reinforcement Learning Questions That Will Appear in Exams

6
Comments
2 min read
Embodied AI Systems: Extending Intelligence Through Learning in the Environment

Comments
2 min read
Policy Gradients: REINFORCE from Scratch with NumPy

Comments
16 min read
Deep Q-Networks: Experience Replay and Target Networks

Comments
18 min read
Q-Learning from Scratch: Navigating the Frozen Lake

Comments
11 min read
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

1
Comments
4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

Comments
11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

5
Comments
4 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

1
Comments
52 min read