[논문 리뷰] FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control 작성자: 김재훈 | 2025, Jun 26
[논문 리뷰] Preference Transformer: Modeling Human Preferences Using Transformers for RL 작성자: 김민경 | 2025, Jun 12
[논문 리뷰] Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions 작성자: 김동민 | 2025, May 22
[논문 리뷰] Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone 작성자: 민예린 | 2025, Mar 27