
I am a Staff AI Scientist at the IFM MBZUAI Silicon Valley Lab, where I lead data mixing for LLM pre-training.
Before that, I was a research manager at the MIT‑IBM Watson AI Lab, where I led the Statistical Large Language Modeling group.
I completed my PhD in Statistics at the University of Michigan, where I worked with Prof. Long Nguyen.
I am interested in a variety of LLM‑related problems, including pre‑ and post‑training, data quality, reasoning, evaluation, routing, and efficient inference, and I enjoy exploring statistical modeling approaches to solving them. I have also worked on OOD generalization, algorithmic fairness, optimal transport, federated learning, and Bayesian nonparametrics.

Felipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin
ICML Workshop on Efficient Systems for Foundation Models, 2024
Neural Information Processing Systems (NeurIPS), 2024
arXiv / Code / Data / Twitter

Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jerret Ross
ICML Workshop on Models of Human Feedback for AI Alignment, 2024
Neural Information Processing Systems (NeurIPS), 2024
arXiv / Available in TRL

Felipe Maia Polo, Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun
Neural Information Processing Systems (NeurIPS), 2024
arXiv / Code

Lilian Ngweta, Mayank Agarwal, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin
Tiny Papers at the International Conference on Learning Representations (ICLR), 2024 (Notable)
Findings of the Association for Computational Linguistics: EMNLP, 2024
arXiv / Code / Data / Model

Tal Shnitzer, Anthony Ou, Mírian Silva, Kate Soule, Yuekai Sun, Justin Solomon, Neil Thompson, Mikhail Yurochkin
NeurIPS Workshop on Distribution Shifts (DistShift), 2023 (Oral)
Conference on Language Modeling (COLM), 2024
arXiv / Code (see supplementary material)

Michael Feffer, Ronald Xu, Yuekai Sun, Mikhail Yurochkin
Conference on Language Modeling (COLM), 2024
arXiv

Felipe Maia Polo, Lucas Weber, Leshem Choshen, Yuekai Sun, Gongjun Xu, Mikhail Yurochkin
ICLR Workshop on Mathematical and Empirical Understanding of Foundation Models, 2024
International Conference on Machine Learning (ICML), 2024
arXiv / Code / Data / Twitter

Jiacheng Zhu, Kristjan Greenewald, Kimia Nadjahi, Haitz Sáez de Ocáriz Borde, Rickard Brüel Gabrielsson, Leshem Choshen, Marzyeh Ghassemi, Mikhail Yurochkin, Justin Solomon
ICLR Workshop on Mathematical and Empirical Understanding of Foundation Models, 2024
International Conference on Machine Learning (ICML), 2024
arXiv / Code

Apoorva Nitsure, Youssef Mroueh, Mattia Rigotti, Kristjan Greenewald, Brian Belgodere, Mikhail Yurochkin, Jiri Navratil, Igor Melnyk, Jerret Ross
NeurIPS Workshop on Socially Responsible Language Modelling Research (SoLaR), 2023
International Conference on Machine Learning (ICML), 2024
arXiv / Code (see supplementary material)

Hongyi Wang, Felipe Maia Polo, Yuekai Sun, Souvik Kundu, Eric Xing, Mikhail Yurochkin
NeurIPS Workshop on Distribution Shifts (DistShift), 2023
International Conference on Learning Representations (ICLR), 2024
arXiv

Subha Maity, Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun
International Conference on Learning Representations (ICLR), 2024
arXiv / Code

Felix Petersen, Aashwin Ananda Mishra, Hilde Kuehne, Christian Borgelt, Oliver Deussen, Mikhail Yurochkin
International Conference on Learning Representations (ICLR), 2024
arXiv / Code

Yuchen Zeng, Kristjan Greenewald, Luann Jung, Kangwook Lee, Justin Solomon, Mikhail Yurochkin
NeurIPS Workshop on Distribution Shifts (DistShift), 2023
arXiv / Code

Rickard Gabrielsson, Mikhail Yurochkin, Justin Solomon
Transactions on Machine Learning Research (TMLR), 2023
arXiv

Lilian Ngweta, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin
International Conference on Machine Learning (ICML), 2023
arXiv / Code

Kristjan Greenewald, Anming Gu, Mikhail Yurochkin, Justin Solomon, Edward Chien
Transactions on Machine Learning Research (TMLR), 2023
arXiv / Code

Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun
International Conference on Learning Representations (ICLR), 2023
arXiv / Code

Lingxiao Li, Noam Aigerman, Vladimir Kim, Jiajin Li, Kristjan Greenewald, Mikhail Yurochkin, Justin Solomon
International Conference on Learning Representations (ICLR), 2023
arXiv / Code

Lingxiao Li, Qiang Liu, Anna Korba, Mikhail Yurochkin, Justin Solomon
International Conference on Learning Representations (ICLR), 2023
arXiv / Code

Zahra Ashktorab, Benjamin Hoover, Mayank Agarwal, Casey Dugan, Werner Geyer, Hao Bang Yang, Mikhail Yurochkin
CHI Conference on Human Factors in Computing Systems, 2023
arXiv

Songkai Xue, Yuekai Sun, Mikhail Yurochkin
Neural Information Processing Systems (NeurIPS), 2022 (Oral)
arXiv

Debarghya Mukherjee, Felix Petersen, Mikhail Yurochkin, Yuekai Sun
Neural Information Processing Systems (NeurIPS), 2022
arXiv

Mikhail Yurochkin and Yuekai Sun
Chapter 7 of Federated Learning: A Comprehensive Overview of Methods and Applications (edited by Heiko Ludwig and Nathalie Baracaldo), 2022
PDF

Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun
Chapter 4 of Federated Learning: A Comprehensive Overview of Methods and Applications (edited by Heiko Ludwig and Nathalie Baracaldo), 2022
PDF

Tal Shnitzer, Mikhail Yurochkin, Kristjan Greenewald, Justin Solomon
International Conference on Machine Learning (ICML), 2022
arXiv / Code

Ioana Baldini, Dennis Wei, Karthikeyan Natesan Ramamurthy, Mikhail Yurochkin, Moninder Singh
Findings of the Association for Computational Linguistics: ACL, 2022
arXiv

William Stephenson, Soumya Ghosh, Tin Nguyen, Mikhail Yurochkin, Sameer Deshpande, Tamara Broderick
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
arXiv / Code

Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun
Neural Information Processing Systems (NeurIPS), 2021
arXiv

Felix Petersen, Debarghya Mukherjee, Yuekai Sun, Mikhail Yurochkin
Neural Information Processing Systems (NeurIPS), 2021
arXiv / Code / Video

Subha Maity, Debarghya Mukherjee, Mikhail Yurochkin, Yuekai Sun
Neural Information Processing Systems (NeurIPS), 2021
arXiv

Viet Huynh, Nhat Ho, Nhan Dam, XuanLong Nguyen, Mikhail Yurochkin, Hung Bui, Dinh Phung
Journal of Machine Learning Research (JMLR), 2021
PDF / Code

Debarghya Mukherjee, Aritra Guha, Justin Solomon, Yuekai Sun, Mikhail Yurochkin
International Conference on Machine Learning (ICML), 2021
arXiv / Code

Mikhail Yurochkin and Yuekai Sun
International Conference on Learning Representations (ICLR), 2021 (Oral)
arXiv / Code (see supplementary material) / Video / Blog

Amanda Bower, Hamid Eftekhari, Mikhail Yurochkin, Yuekai Sun
International Conference on Learning Representations (ICLR), 2021
arXiv / Code (see supplementary material) / Video

Subha Maity, Songkai Xue, Mikhail Yurochkin, Yuekai Sun
International Conference on Learning Representations (ICLR), 2021
arXiv / Code / Video

Alexander Vargo, Fan Zhang, Mikhail Yurochkin, Yuekai Sun
International Conference on Learning Representations (ICLR), 2021 (Spotlight)
arXiv / Code (see supplementary material) / Video

Lingxiao Li, Aude Genevay, Mikhail Yurochkin, Justin Solomon
Neural Information Processing Systems (NeurIPS), 2020
arXiv / Code

Mark Weber, Mikhail Yurochkin, Sherif Botros, Vanio Markov
NeurIPS Fair AI in Finance Workshop, 2020 (Spotlight Talk)
arXiv / Blog

Sebastian Claici, Mikhail Yurochkin, Soumya Ghosh, Justin Solomon
International Conference on Machine Learning (ICML), 2020
arXiv / Code

Debarghya Mukherjee, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun
International Conference on Machine Learning (ICML), 2020
arXiv / Code

Songkai Xue, Mikhail Yurochkin, Yuekai Sun
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
arXiv

Hongyi Wang, Mikhail Yurochkin, Yuekai Sun, Dimitris Papailiopoulos, Yasaman Khazaeni
International Conference on Learning Representations (ICLR), 2020 (Oral)
arXiv / Code / Video / Blog

Mikhail Yurochkin, Amanda Bower, Yuekai Sun
International Conference on Learning Representations (ICLR), 2020 (Spotlight)
arXiv / Code / Video / Blog

Mikhail Yurochkin, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon
Neural Information Processing Systems (NeurIPS), 2019
arXiv / Code / Blog / MIT News

Pierre Monteiller, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon, Mikhail Yurochkin
Neural Information Processing Systems (NeurIPS), 2019
arXiv / Code / Blog

Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang
Neural Information Processing Systems (NeurIPS), 2019
arXiv / Code / Blog

Mikhail Yurochkin, Zhiwei Fan, Aritra Guha, Paraschos Koutris, XuanLong Nguyen
Neural Information Processing Systems (NeurIPS), 2019
arXiv / Code / Blog

Mikhail Yurochkin, Aritra Guha, Yuekai Sun, XuanLong Nguyen
International Conference on Machine Learning (ICML), 2019 (Long Talk)
arXiv / Code / Video

Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang, Yasaman Khazaeni
International Conference on Machine Learning (ICML), 2019
arXiv / Code / Video

Mikhail Yurochkin, Sohini Upadhyay, Djallel Bouneffouf, Mayank Agarwal, Yasaman Khazaeni
ICLR Limited Labeled Data (LLD) Workshop, 2019
PDF

Mikhail Yurochkin
PhD Thesis, University of Michigan, 2018
PDF / Slides

Mikhail Yurochkin, Aritra Guha, XuanLong Nguyen
Neural Information Processing Systems (NeurIPS), 2017
arXiv / Code

Mikhail Yurochkin, XuanLong Nguyen, Nikolaos Vasiloglou
Neural Information Processing Systems (NeurIPS), 2017
arXiv / Code

Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung
International Conference on Machine Learning (ICML), 2017
arXiv / Code

Mikhail Yurochkin and XuanLong Nguyen
Neural Information Processing Systems (NeurIPS), 2016
arXiv / Code