Bellman

딥러닝(20)

[3줄 AGI] 사실 AGI는 우리 옆에 있었다.
https://www.sciencedirect.com/science/article/pii/S0004370221000862?fbclid=IwAR00HAZ1VgULd647jwVdXSCG58RlcWsC9GpPUimy0JvEgGNLYeKNI-_UWWc Reward is enough In this article we hypothesise that intelligence, and its associated abilities, can be understood as subserving the maximisation of reward. Accordingl… www.sciencedirect.com 1. 근본적인 의문: 어떻게 자연에서의 에이전트(동물), 사람은 똑똑하게 행동하는가? 에 대답하기 위한 답변으로 “모든것이 g..
2021.07.04
[3줄 RL] 자가지도학습과 강화학습의 샘플 효율성
https://openreview.net/pdf?id=uCQfPZwRaUu 1.self-supervised representation learning은 data efficiency에 있어 큰 발전을 보였는데, 강화학습에서 sample efficiency는 매우 중요한 이슈이므로 이 두가지의 연결점을 찾고자 하였다. 2.state와 next state와 그 augmentation들에 대해 representation learning을 수행하여 self-predictive 한 형태로 학습한다. 3. 실험 결과는 실제로 다소 작은 step(100k)을 진행한 atari26 환경에서 기존 data-efficient RL 방법론을 아웃퍼폼했는데, 이것은 representation learning과 data-effi..
2021.07.03

1 2 3 4

티스토리툴바