Research

  1. Towards Optimal Offline Reinforcement Learning
    Mengmeng Li, Daniel Kuhn, and Tobias Sutter
    Submitted, 2025
    Second Place, 2025 Dupačová-Prékopa Best Student Paper in Stochastic Programming
  2. Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits
    Mengmeng Li, Daniel Kuhn, and Bahar Taşkesen
    Major revision in Operations Research, 2025
    Extended abstract appeared in WINE 2024
  3. A Large Deviations Perspective on Policy Gradient Algorithms
    (α-β) Wouter Jongeneel, Daniel Kuhn, and Mengmeng Li
    In Learning for Dynamics and Control Conference (L4DC), 2024
  4. Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
    Mengmeng Li, Daniel Kuhn, and Tobias Sutter
    Major revision in SIAM Journal on Optimization, 2023
  5. Distributionally Robust Optimization with Markovian Data
    Mengmeng Li, Tobias Sutter, and Daniel Kuhn
    In International Conference on Machine Learning (ICML), 2021