Yigit Korkmaz

I am a third-year PhD student in Computer Science at University of Southern California. I am lucky to be advised by Prof. Erdem Bıyık.

My Research interests lie in Robotics, Imitation Learning, Human-in-the-loop Learning, Human-Robot Interaction and Reinforcement Learning. I specifically focus on developing algorithms that enable AI agents to model the behaviors and goals of humans and other agents by leveraging different forms of information, including explicit forms such as human demonstrations and comparisons, and more implicit forms such as human gaze and gestures. My aim is to equip AI agents and robots with the capability to understand and align with humans' goals and preferences.

Previously, I worked with Prof. Xiaolong Wang on Imitiation Learning for dexterous manipulation. I started Machine Learning research with Prof. Burak Acar at Bogazici University.

Fun Fact: My name is pronounced as Yeet.

                   

profile photo
Education

PhD. in Computer Science, University of Southern California 2023-present

Msc. in Electrical and Computer Engineering, UC San Diego 2021-2023

Bsc. in Electrical and Electronics Engineering (w/ Mechanical Engineering Minor), Bogazici University 2016-2021

Updates

Jan 2026 - ReCouPLe is accepted to ICLR 2026!
Jan 2026 - LfCD-GRIP is accepted to ICLR 2026!
Sep 2025 - Q3C is accepted to NeurIPS 2025!
June 2025 - Organized Human-in-the-Loop Robot Learning Workshop at RSS 2025!
Jan 2025 - MILE is accepted to ICRA 2025!
Feb 2024 - CyberDemo is accepted to CVPR 2024!
Aug 2023 - Started my PhD in Computer Science at University of Southern California!

Publications
Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons
Anthony Liang*, Yigit Korkmaz*, Jiahui Zhang, Minyoung Hwang, Abrar Anwar, Sidhant Kaushik, Aditya Shah, Alex S. Huang, Luke Zettlemoyer, Dieter Fox, Yu Xiang, Anqi Li, Andreea Bobu, Abhishek Gupta, Stephen Tu†, Erdem Bıyık†, Jesse Zhang†
In submission
[ Website, Paper, Code ]

Causally Robust Reward Learning from Reason-Augmented Preference Feedback
Minjune Hwang, Yigit Korkmaz, Daniel Seita†, Erdem Bıyık†
International Conference on Learning Representations (ICLR) 2026
Human-in-the-Loop Robot Learning Workshop @ RSS 2025
[ Paper, Code ]

When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
Xinhu Li, Ayush Jain, Zhaojing Yang, Yigit Korkmaz, Erdem Bıyık
International Conference on Learning Representations (ICLR) 2026
[ Website, Paper ]

Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz*, Urvi Bhuwania*, Ayush Jain†, Erdem Bıyık†
Conference on Neural Information Processing Systems (NeurIPS) 2025
[ Paper, Code ]

MILE: Model-based Intervention Learning
Yigit Korkmaz, Erdem Bıyık
International Conference on Robotics and Automation (ICRA) 2025
[ Website, Paper, Code ]

CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation
Jun Wang*, Yuzhe Qin*, Kaiming Kuang, Yigit Korkmaz, Akhilan Gurumoorthy, Hao Su, Xiaolong Wang
Conference on Computer Vision and Pattern Recognition (CVPR) 2024
[ Website, Paper, Code ]

Selected Projects
Using Longformers for Argument Classification
Yigit Korkmaz
code

Transformer networks are highly utilized in Natural Language Processing(NLP) tasks. In this project, the use of Longformer, which is a transformer based network with advanced attention mechanism, in classifying argumentative elements of students’ writing will be discussed.

Motion Planning with Artificial Potential Fields
Yigit Korkmaz
code

In this project, APF(artificial potential fields), which a method used to formulate obstacle and goal interactions in a robot’s path, is implemented. With the assumption of a known map, obstacles act as repulsive forces where targets act as attractive ones. The code is written for ROS environment, and experimented on real robots.


Template is from Jon Barron's awesome website.