Minjae Oh

I am a Ph.D. student in Data Science at Seoul National University, advised by Prof. Yohan Jo in the Human-Oriented Language Intelligence (HOLI) Lab. My research centers on reinforcement learning and reasoning with large language models. Recently, I have been looking into physical AI and world models with reasoning.

Education

Publications

* Equal contribution
  1. KL for a KL: On-Policy Distillation with Control Variate Baseline
    Minjae Oh, Sangjun Song, Gyubin Choi, Yunho Choi, Yohan Jo
    AI4MATH @ International Conference on Machine Learning (ICML), 2026 paper
  2. Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
    Yunho Choi*, Jongwon Lim*, Woojin Ahn, Minjae Oh, Jeonghoon Shim, Yohan Jo
    Under Review, 2026 paper
  3. SHAPE of Chain-of-Thought in Math Reasoning
    Jonghyun Song, Sangjun Song, Minjae Oh, Haesung Pyun, Sungsik Lee, Yohan Jo
    AI4MATH @ International Conference on Machine Learning (ICML), 2026
  4. Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning
    Minjae Oh, Yunho Choi, Dongmin Choi, Yohan Jo
    Under Review, 2026 paper
  5. ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
    Sangjun Song*, Minjae Oh*, Seungkyu Lee, Sungmin Jo, Yohan Jo
    Findings of the Association for Computational Linguistics (ACL Findings), 2026 paper
  6. Medication Recommendation for Parkinson's Disease Based on Dynamics of Symptom Progression
    Minjae Oh, Hongbum Kim, Hyo Kyung Lee
    Scientific Reports, Vol. 14, Article 25051, 2024 paper