Publications & Preprints

(2025). Improved Online Confidence Bounds for Multinomial Logistic Bandits. ICML 2025.

PDF

(2025). Combinatorial Reinforcement Learning with Preference Feedback. ICML 2025.

PDF

(2024). Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation. NeurIPS 2024.

PDF

(2024). Demystifying Linear MDPs and Novel Dynamics Aggregation Framework. ICLR 2024.

PDF

(2024). Learning Uncertainty-Aware Temporally-Extended Actions. AAAI 2024.

PDF