Publications & Preprints

(2024). Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation. NeurIPS 2024.

PDF

(2024). Nearly Minimax Optimal Regret for Multinomial Logistic Bandit. NeurIPS 2024.

PDF

(2024). Demystifying Linear MDPs and Novel Dynamics Aggregation Framework. ICLR 2024.

PDF

(2024). Learning Uncertainty-Aware Temporally-Extended Actions. AAAI 2024.

PDF