Paper Review) [TODO] Online Decision Transformer

2022. 5. 25. 21:22 | Topics of Interest/RL

I'll get to this one when I have time...

https://arxiv.org/abs/2202.05607

 

Online Decision Transformer

Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling. However, any practical instantiatio

arxiv.org

 
