[논문 리뷰] Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions 작성자: 김동민 | 2025, May 22
[논문 리뷰] Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone 작성자: 민예린 | 2025, Mar 27
[논문 리뷰] SURF: semi-supervised reward learning with data augmentation for feedback-efficient preference-based reinforcement learning 작성자: 김민경 | 2025, Mar 20