Minjae Oh

I am a Ph.D. student in Data Science at Seoul National University, advised by Prof. Yohan Jo in the Human-Oriented Language Intelligence (HOLI) Lab. My research centers on reinforcement learning and reasoning with large language models. Recently, I have been exploring physical AI and world models that incorporate reasoning.

Education

Seoul National University
Ph.D. in Data Science (Advisor: Prof. Yohan Jo) 2026 Mar. – Present
Seoul National University
M.S. in Data Science (Advisor: Prof. Yohan Jo) 2024 Mar. – 2026 Feb.
Korea University
B.S. in Industrial Management Engineering 2018 Mar. – 2024 Feb.

Publications

^* Equal contribution

KL for a KL: On-Policy Distillation with Control Variate Baseline
Minjae Oh, Sangjun Song, Gyubin Choi, Yunho Choi, Yohan Jo
AI4MATH @ International Conference on Machine Learning (ICML), 2026
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
Yunho Choi^*, Jongwon Lim^*, Woojin Ahn, Minjae Oh, Jeonghoon Shim, Yohan Jo
Under Review, 2026
SHAPE of Chain-of-Thought in Math Reasoning
Jonghyun Song, Sangjun Song, Minjae Oh, Haesung Pyun, Sungsik Lee, Yohan Jo
AI4MATH @ International Conference on Machine Learning (ICML), 2026
Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning
Minjae Oh, Yunho Choi, Dongmin Choi, Yohan Jo
Under Review, 2026
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
Sangjun Song^*, Minjae Oh^*, Seungkyu Lee, Sungmin Jo, Yohan Jo
Findings of the Association for Computational Linguistics (ACL Findings), 2026
Medication Recommendation for Parkinson's Disease Based on Dynamics of Symptom Progression
Minjae Oh, Hongbum Kim, Hyo Kyung Lee
Scientific Reports, Vol. 14, Article 25051, 2024