In this brief tutorial you're going to learn the fundamentals of deep reinforcement learning, and the basic concepts behind actor critic methods. We'll cover the Markov decision process, the agent's policy, reward discounting and why it's necessary, and the actor critic algorithm. We'll implement an actor critic algorithm using Tensorflow 2 to handle the cart pole environment from the Open AI Gym.
Actor critic methods form the basis for more advanced algorithms such as deep deterministic policy gradients, soft actor critic, and twin delayed deep deterministic policy gradients, among others.
You can find the code for this video here:
[ Ссылка ]
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
[ Ссылка ]
Or, pickup my Udemy courses here:
Deep Q Learning:
[ Ссылка ]
Actor Critic Methods:
[ Ссылка ]
Curiosity Driven Deep Reinforcement Learning
[ Ссылка ]
Natural Language Processing from First Principles:
[ Ссылка ]
Reinforcement Learning Fundamentals
[ Ссылка ]
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: [ Ссылка ]
Grokking Deep Learning: [ Ссылка ]
Grokking Deep Reinforcement Learning: [ Ссылка ]
Come hang out on Discord here:
[ Ссылка ]
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Website: [ Ссылка ]
Github: [ Ссылка ]
Twitter: [ Ссылка ]
![](https://i.ytimg.com/vi/LawaN3BdI00/maxresdefault.jpg)