RL) ETH Zurich & UC Berkeley Method Automates Deep Reward-Learning by Simulating the Past

2021. 4. 20. 20:32관심있는 주제/RL

728x90

리워드 관련 논문...

 

읽을게 너무 많다.

 

 

 

 

 

medium.com/syncedreview/eth-zurich-uc-berkeley-method-automates-deep-reward-learning-by-simulating-the-past-f4aa7281b23f

 

ETH Zurich & UC Berkeley Method Automates Deep Reward-Learning by Simulating the Past

In the field of reinforcement learning (RL), task specifications are typically designed by experts. Learning from demonstrations and…

medium.com

arxiv.org/pdf/2104.03946.pdf

 

728x90