⌘Ctrlk

Introduction
Deep RL Course
Related Papers
- Meta RL
Resources
- Resources
- Spinning Up by OpenAI
RL in practice
- Policy gradients

Powered by GitBook

On this page

Deep RL Course

Deep RL with Q-Function

Correlated samples and unstable target The accuracy of Q-function Continuous actions

PreviousLearning theory NextCorrelated samples and unstable target

Last updated 6 years ago