Selected Publications

No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes, NeurIPS 2025
Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds, ICML 2025
Near-Optimal Sample Complexity in Reward-Free Reinforcement Learning, AISTATS 2025
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm, NeurIPS 2024
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning, COLT 2024
Reward-Free Kernel-Based Reinforcement Learning, ICML 2024
Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency, ICML 2024
Optimal Regret Bounds for Collaborative Learning in Bandits, ALT 2024
Adversarial Contextual Bandits Go Kernelized, ALT 2024
Kernelized Reinforcement Learning with Order Optimal Regret Bounds, NeurIPS 2023
Collaborative Learning in Kernel-based Bandits for Distributed Users, IEEE Trans. Signal Processing 2023
Delayed Feedback in Kernel Bandits, ICML 2023
Provably and Practically Efficient Neural Contextual Bandits, ICML 2023
Image Generation with Shortest Path Diffusion, ICML 2023
Sample Complexity of Kernel-Based Q-Learning, AISTATS 2023
Fisher-Legendre (FishLeg) Optimization of Deep Neural Networks, ICLR 2023
Uniform Generalization Bounds for Overparameterized Neural Networks, ISIT 2023
Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling, GLOBECOM 2023
Near-Optimal Collaborative Learning in Bandits, NeurIPS 2022 (Oral)
Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning, ICML 2022 (Spotlight)
On Information Gain and Regret Bounds in Gaussian Process Bandits, AISTATS 2021
Optimal Order Simple Regret for Gaussian Process Bandits, NeurIPS 2021
Scalable Thompson Sampling using Sparse Gaussian Process Models, NeurIPS 2021
Open Problem: Tight Online Confidence Intervals for RKHS Elements, COLT 2021
A Domain-Shrinking based Bayesian Optimization Algorithm with Order-Optimal Regret Performance, NeurIPS 2021
Stochastic Coordinate Minimization with Progressive Precision for Stochastic Convex Optimization, ICML 2020
Amortized Variance Reduction for Doubly Stochastic Objective, UAI 2020
Adaptive Sensor Placement for Continuous Spaces, ICML 2019
Multi-armed Bandits on Partially Revealed Unit Interval Graphs, IEEE Trans. Network Science and Engineering 2019
Risk-Averse Multi-Armed Bandit Problems Under Mean-Variance Measure, IEEE J. Selected Topics in Signal Processing 2016
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems, IEEE J. Selected Topics in Signal Processing 2013