강화학습 기초 자료 모음집

2020. 7. 18. 17:56관심있는 주제/RL

728x90

강화학습 기초부터 DQN까지 (Reinforcement Learning from Basics to DQN)

 

 

https://greentec.github.io/reinforcement-learning-second/

 

강화학습 알아보기(2) - DQN · greentec's blog

강화학습 알아보기(2) - DQN 01 Apr 2019 • 0 Comments 가치 함수, 감가율 지난 글에서는 강화학습에 대해 처음으로 알아보는 시간과 함께 감가율(discount rate), 가치 함수(value function) 등의 개념을 소개했�

greentec.github.io

https://naifmehanna.com/2018-10-18-implementing-sarsa-in-python/

 

Implementing SARSA(λ) in Python

This post show how to implement the SARSA algorithm, using eligibility traces in Python. It is part of a serie of articles about reinforcement learning that I will be writing. Please note that I will go in further details as soon as I can. This is the firs

naifmehanna.com

medium.com/zero-equals-false/n-step-td-method-157d3875b9cb

 

N-step TD Method

The unification of SARSA and Monte Carlo Simulation

medium.com

https://www.slideshare.net/SuHyunCho2/sac-overview

 

728x90