Music |
Video |
Movies |
Chart |
Show |
Policy Gradients with Human Advice for Safe Reinforcement Learning! (Papers and Chill) View | |
From Policy Gradient with baseline to Actor-Critic (RLVS 2021 version) (Olivier Sigaud) View | |
Policy Gradient Intro (CIS 522 - Deep Learning) View | |
Better Reinforcement Learning for Human in the Loop Systems | Emma Brunskill | WiDS 2019 (ICMEStudio) View | |
CoRL 2020, Spotlight Talk 171: Safe Policy Learning for Continuous Control (Conference on Robot Learning) View | |
Reinforcement Learning Like a PRO (Responsible AI) View | |
Adaptive Continuous Control of Spacecraft Attitude Using Deep Reinforcement Learning - AAS 2020 (Jake Elkins) View | |
Reinforcement Learning From Human Feedback, RLHF. Overview of the Process. Strengths and Weaknesses. (AemonAlgiz) View | |
Multi-agent Reinforcement Learning (Dylan Klein) View | |
Uncertainty Aware Action Advising for Deep Reinforcement Learning Agents - Paper Explained! (Papers and Chill) View |