DEV Community

# optimisation

Posts

Value Iteration vs Q-Learning: Dynamic Programming Meets RL

12 min read
Solving CartPole Without Gradients: Simulated Annealing

13 min read
The Cross-Entropy Method: Solving RL Without Gradients

12 min read
AI Experts Are Dead. Long Live the AI Experts.

13 min read
Hyperparameter Optimization: Grid vs Random vs Bayesian

16 min read
Policy Gradients: REINFORCE from Scratch with NumPy

16 min read
Deep Q-Networks: Experience Replay and Target Networks

18 min read