I am PhD student at KAIST advised by professor Jinwoo Shin ,
and a research intern at NVIDIA GEAR hosted by Jim Fan and Yuke Zhu .
I was a student researcher at Google DeepMind hosted by Yinxiao Li .
I closely worked with Kihyuk Sohn via Google University Relations program.
My research interests lie in probabilistic machine learning, generative modeling, and representation learning, with a focus on their applications to visual understanding and generation. Recently, I have been particularly interested in generative modeling for visual synthesis — including image, video, and 3D generation — as a step toward world modeling. Also, I am exploring applications in robotics, especially utilizing video world-models for robotics policy. My previous works have focused on the fundamentals of training diffusion and flow models, as well as their adaptations to tasks such as 3D generation, personalization, and preference-based fine-tuning.
I plan to graduate in 2026 and am seeking industry research positions. Please feel free to reach out if you’re interested!
Email: kyungmnlee (at) kaist (dot) ac (dot) kr | Curriculum Vitae
📝 Publications
Selected
All
World Action Models are Zero-Shot Policies
Seonghyeon Ye ,
Yunhao Ge ,
Kaiyuan Zheng ,
Shenyuan Gao ,
Sihyun Yu ,
George Kurian ,
Suneel Indupuru ,
You Liang Tan ,
Chuning Zhu ,
Jiannan Xiang ,
Ayaan Malik ,
Kyungmin Lee ,
William Liang ,
Nadun Ranawaka ,
Jiasheng Gu ,
Yinzhen Xu ,
Guanzhi Wang ,
Fengyuan Hu ,
Avnish Narayan ,
Johan Bjorck ,
Jing Wang ,
Gwanghyun Kim ,
Dantong Niu ,
Ruijie Zheng ,
Yuqi Xie ,
Jimmy Wu ,
Qi Wang ,
Ryan Julian ,
Danfei Xu ,
Yilun Du ,
Yevgen Chebotar ,
Scott Reed ,
Jan Kautz ,
Yuke Zhu ,
Jim Fan ,
Joel Jang
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
John Won ,
Kyungmin Lee ,
Huiwon Jang ,
Dongyoung Kim ,
Jinwoo Shin
Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling
Kyungmin Lee ,
Sihyun Yu ,
Jinwoo Shin
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
Kyungmin Lee ,
Xiaohang Li ,
Qifei Wang ,
Junfeng He ,
Junjie Ke ,
Ming-Hsuan Yang ,
Irfan Essa ,
Jinwoo Shin ,
Feng Yang ,
Yinxiao Li
Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models
Kyungmin Lee ,
Sangkyung Kwak ,
Kihyuk Sohn ,
Jinwoo Shin
DreamFlow: High-quality Text-to-3D generation by Approximating Probability Flow
Kyungmin Lee ,
Kihyuk Sohn ,
Jinwoo Shin
Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi ,
Sangkyung Kwak ,
Kyungmin Lee ,
Hyungwon Choi,
Jinwoo Shin
World Action Models are Zero-Shot Policies
Seonghyeon Ye ,
Yunhao Ge ,
Kaiyuan Zheng ,
Shenyuan Gao ,
Sihyun Yu ,
George Kurian ,
Suneel Indupuru ,
You Liang Tan ,
Chuning Zhu ,
Jiannan Xiang ,
Ayaan Malik ,
Kyungmin Lee ,
William Liang ,
Nadun Ranawaka ,
Jiasheng Gu ,
Yinzhen Xu ,
Guanzhi Wang ,
Fengyuan Hu ,
Avnish Narayan ,
Johan Bjorck ,
Jing Wang ,
Gwanghyun Kim ,
Dantong Niu ,
Ruijie Zheng ,
Yuqi Xie ,
Jimmy Wu ,
Qi Wang ,
Ryan Julian ,
Danfei Xu ,
Yilun Du ,
Yevgen Chebotar ,
Scott Reed ,
Jan Kautz ,
Yuke Zhu ,
Jim Fan ,
Joel Jang
Contrastive Representation Regularization for Vision-Language-Action Models
Taeyoung Kim ,
Jimin Lee ,
Myungkoo Koo ,
Dongyoung Kim ,
Kyungmin Lee ,
Changyeon Kim ,
Younggyo Seo ,
Jinwoo Shin
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
John Won ,
Kyungmin Lee ,
Huiwon Jang ,
Dongyoung Kim ,
Jinwoo Shin
Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance
Junesuk Choi ,
Kyungmin Lee ,
Sihyun Yu ,
Yisol Choi ,
Jinwoo Shin ,
Kimin Lee
HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy
Myungkoo Koo ,
Daewon Choi ,
Taeyoung Kim ,
Kyungmin Lee ,
Changyeon Kim ,
Younggyo Seo ,
Jinwoo Shin
Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling
Kyungmin Lee ,
Sihyun Yu ,
Jinwoo Shin
StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment
Younghyun Kim*, Jongheon Jeong*,
Sangkyung Kwak ,
Kyungmin Lee ,
Juho Lee,
Jinwoo Shin
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
Kyungmin Lee ,
Xiaohang Li ,
Qifei Wang ,
Junfeng He ,
Junjie Ke ,
Ming-Hsuan Yang ,
Irfan Essa ,
Jinwoo Shin ,
Feng Yang ,
Yinxiao Li
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
Junesuk Choi ,
Kyungmin Lee ,
Jongheon Jeong ,
Saining Xie ,
Jinwoo Shin ,
Kimin Lee
Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models
Kyungmin Lee ,
Sangkyung Kwak ,
Kihyuk Sohn ,
Jinwoo Shin
Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi ,
Sangkyung Kwak ,
Kyungmin Lee ,
Hyungwon Choi,
Jinwoo Shin
Discovering and Mitigating Visual Biases through Keyword Explanation
Younghyun Kim, Sangwoo Mo, Minkyu Kim,
Kyungmin Lee ,
Jaeho Lee,
Jinwoo Shin
DreamFlow: High-quality Text-to-3D generation by Approximating Probability Flow
Kyungmin Lee ,
Kihyuk Sohn ,
Jinwoo Shin
Fine-tuning Protein Language Models by ranking protein fitness
Minji Lee
Kyungmin Lee ,
Jinwoo Shin
Collaborative Score Distillation for Consistent Visual Editing
Subin Kim* ,
Kyungmin Lee\* ,
Junesuk Choi ,
Jongheon Jeong ,
Kihyuk Sohn ,
Jinwoo Shin
S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
Sangwoo Mo, Minkyu Kim,
Kyungmin Lee ,
Jinwoo Shin
Slimmed Asymmetrical Contrastive Learning and Cross Distillation for Lightweight Model Training
Jian Meng, Li Yang,
Kyungmin Lee ,
Jinwoo Shin ,
Deliang Fan and Jae-sun Seo
STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables
Jaehyun Nam ,
Jihoon Tack ,
Kyungmin Lee ,
Hankook Lee ,
Jinwoo Shin
RényiCL: Contrastive Representation Learning with Skew Rényi divergence
Kyungmin Lee and Jinwoo Shin
GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization
Gilhyun Nam, Gyeongjae Choi, Kyungmin Lee
Representation Distillation by Prototypical Contrastive Predictive Coding
Kyungmin Lee
Pseudo-spherical Knowledge Distillation
Kyungmin Lee and Hyeongkeun Lee
Provable Defense by Denoised Smoothing with Learned Score function
Kyungmin Lee
💻 Work Experience
2025.12 - 2026.03 , Research Intern @ NVIDIA GEAR, Remote.
2024.07 - 2024.12 , Student Researcher @ Google DeepMind, Mountain View, CA, US.
2023.02 - 2024.03 , University Relation Program @ Google Research, Remote.
🤝 Academic Services
Conference reviewer: NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, BMCV, WACV, AISTATS
Journal reviewer: TMLR, TPAMI
📖 Educations
2022.09 - 2026.06 , KAIST, Ph.D. in Artificial Intelligence (expected).
2015.03 - 2019.02 , KAIST, B.S. in Mathematics, Electrical and Computer Engineering (double major).