Hosted by
Sean Welleck
Assistant Professor CMU
Support the podcast:
Buy me a coffee
Become a Patron!
Episode 7 | John Schulman
Optimizing Expectations: From Deep RL to Stochastic Computation Graphs
Papers and Links:
John Schulman's Homepage
Thesis
Proximal Policy Optimization Algorithms
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Open AI Five
Listen:
The Thesis Review
ยท
[42] Charles Sutton - Efficient Training Methods for Conditional Random Fields