장영수

Reinforcement Learning (RL)
Large Language Models (LLMs)
Reasoning and Planning
Reinforcement Learning from Human Feedback (RLHF)