Global Optimality Guarantees For Policy Gradient Methods [pdf]
Jalaj Bhandari and Daniel Russo
OptRL workshop, NeurIPS 2019; RLDM 2019
Journal version in preparation
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation [pdf]
Jalaj Bhandari, Daniel Russo and Raghav Singal
Conference on Learning Theory (COLT), 2018; OptRL workshop, NeurIPS 2019; RLDM 2019
Journal version accepted to Operations Research with minor revisions.
Annular Augmentation Sampling [pdf]
Francois Fagan, Jalaj Bhandari and John P. Cunningham
AISTATS, 2017 as full oral presentation (top 5%).
Elliptical Slice Sampling with Expectation Propagation [pdf]
Francois Fagan, Jalaj Bhandari and John P. Cunningham
UAI, 2016.
On the tightness of an LP relaxation for Rational Optimization [pdf]
Vashist Avadhanula, Jalaj Bhandari, Vineet Goyal and Assaf Zeevi
Operations Research Letters, 2016
User Scheduling in Cognitive Radio Networks [pdf]
Jalaj Bhandari and Nomesh Bolia
Journal of Computations & Modelling, 2013