Ege C. Kaya

PhD Candidate in Electrical and Computer Engineering, Purdue University

I work on reinforcement learning, optimization, and stochastic decision-making, with an emphasis on mathematically grounded algorithms for distributional RL, multi-objective and risk-sensitive decision-making, stochastic approximation, and robust discrete optimization.

kayae [at] purdue [dot] edu | Google Scholar | LinkedIn | GitHub

Research

My current research develops theoretical foundations and algorithms for reinforcement learning and stochastic decision-making. I am especially interested in settings where the object of interest is richer than a scalar expected return, including distributional RL, multi-objective RL, risk-sensitive decision-making, and coupled-dynamics environments.

A second line of work studies stochastic approximation and optimization methods that arise in reinforcement learning, including categorical distributional temporal-difference learning, average-reward distributional RL, non-expansive fixed-point problems, and non-convex stochastic optimization under weak noise assumptions.

I am also studying and working on practical aspects of modern large language models, including decoder-only transformers, causal language modeling, supervised fine-tuning, post-training, evaluation, and the role of reinforcement learning and reward modeling in preference optimization.

Earlier in my PhD, I worked on robust and submodular optimization, with applications to sensor selection, multi-task subset selection, federated learning, and distributed online optimization.

Selected Papers and Preprints

Joint MDPs and Reinforcement Learning in Coupled-Dynamics Environments. Ege C. Kaya, Mahsa Ghasemi, and Abolfazl Hashemi. Uncertainty in Artificial Intelligence (UAI), 2026. Oral presentation. Top 2.2% of 1,087 submissions. arXiv
Stochastic Dominance Driven First-Order Policy Optimization for Multi-Objective Reinforcement Learning. Ege C. Kaya, Kadierdan Kaheman, Jason M. Cloud, and Abolfazl Hashemi. Uncertainty in Artificial Intelligence, 2026.
A Finite-Iteration Theory for Asynchronous Categorical Distributional Temporal-Difference Learning. Ege C. Kaya and Abolfazl Hashemi. Preprint, 2026. arXiv
Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning. Ege C. Kaya, Aliasghar Pourghani, Vijay Gupta, and Abolfazl Hashemi. Preprint, 2026. arXiv
Lower Bounds and Proximally Anchored SGD for Non-Convex Minimization Under Unbounded Variance. Arda Fazla, Ege C. Kaya, Antesh Upadhyay, and Abolfazl Hashemi. Preprint, 2026. arXiv
Randomized Greedy Methods for Weak Submodular Sensor Selection with Robustness Considerations. Ege C. Kaya, Michael Hibbard, Takashi Tanaka, Ufuk Topcu, and Abolfazl Hashemi. Automatica, 2025.
Localized Distributional Robustness in Submodular Multi-Task Subset Selection. Ege C. Kaya and Abolfazl Hashemi. IEEE Transactions on Signal Processing, 2024.

Earlier Work

Equitable Client Selection in Federated Learning via Truncated Submodular Maximization. IEEE CDC, 2024.
Relative Entropy Regularization for Robust Submodular Multi-Task Subset Selection. Allerton, 2023.
Communication-Constrained Exchange of Zeroth-Order Information with Application to Collaborative Target Tracking. ICASSP, 2023.
High Probability Guarantees for Submodular Maximization via Boosted Stochastic Greedy. Asilomar, 2023.
Communication-Efficient Zeroth-Order Distributed Online Optimization: Algorithm, Theory, and Applications. IEEE Access, 2023.