Shipra Agrawal is an Associate Professor of the Department of Industrial Engineering and Operations Research at Columbia University. She is also affiliated with the Department of Computer Science and the Data Science Institute. She received her PhD in Computer Science from Stanford University in June 2011 under the guidance of Prof. Yinyu Ye,
and was a researcher at Microsoft Research India from July 2011 to August 2015. Her research spans several areas of optimization and machine learning,
including online optimization under uncertainty, multi-armed bandits, online learning, reinforcement learning, and learning in games and auctions.
Shipra serves as an associate editor for Management Science,
INFORMS Journal on Optimization, and Journal of Machine Learning Research (JMLR).
Her research is supported by a
Google Faculty research award (2017), Amazon research award (2017),
and an NSF CAREER Award.
For further information, please see CV .
Yunhao Tang (Graduated 2021, research scientist at Google Deepmind London)
Xingyu Zhang (Graduated 2021, research scientist at Amazon)
Randy Jia (Graduated 2020, research scientist at Amazon)
Mohammad Shadravan (Graduated 2020)
Vashist Avadhanula (Graduated 2018, research scientist at Facebook)
Postdoc
Wei Tang (Sep 2022-Aug 2024, now an Assistant professor at CUHK)
Elaheh Fata (Sep-Aug 2022)
Selected Publications
(A full list of journal and conference papers is here )
Optimistic Q-learning for average reward and episodic reinforcement learning, Priyank Agrawal, Shipra Agrawal. Manuscript, July 2024. [ arXiv ]
Dynamic pricing and learning with long-term reference effects, S. Agrawal, W. Tang. EC 2024 [ arXiv]
Dynamic pricing and learning with Bayesian persuasion, S. Agrawal, Y. Feng, W. Tang, NeurIPS 2023 [ arXiv ]
Dynamic pricing and learning under Bass model, S. Agrawal, S. Yin, A. Zeevi, EC 2021. [arXiv]
Robust Repeated Auctions under Heterogeneous Buyer Behavior. S. Agrawal, C. Daskalakis, V. Mirrokni, B. Sivan, EC 2018. [ arXiv ]
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds. S. Agrawal and R. Jia, NIPS 2017 (spotlight) [ arXiv ]
Thompson Sampling for MNL-bandit. S. Agrawal, V. Avandhanula, V. Goyal, A. Zeevi. COLT 2017. [ arXiv ]
Fast algorithms for online stochastic convex programming. S. Agrawal, N. R. Devanur. SODA 2015. [ EE ] [ arXiv ]
Bandits with concave rewards and convex knapsacks. S. Agrawal, N. R. Devanur. EC 2014. [ EE ] [ arXiv ]
Near-Optimal Regret Bounds for Thompson Sampling JACM 64 Issue 5, October 2017 [ EE ][ pdf ].
(The pdf version here includes a correction made to the published version: in Lemma 2.13, page 15:16. Modified text is in blue)
Thompson Sampling for contextual bandits with linear payoffs. S. Agrawal, N. Goyal. ICML 2013. [ EE ]
[Newer version with improved results is here: arXiv ]
Analysis of Thompson Sampling for the multi-armed bandit problem. S. Agrawal, N. Goyal. COLT 2012. [ pdf ] [ arXiv ]
Associate Editor
Management Science (Optimization area), INFORMS Journal on Optimization and Journal of Machine Learning Research (JMLR).
Program co-chair AISTATS 2025
Co-chair, Conference on Learning Theory (COLT) 2024
COLT Steering committe member (Elected July 2022).
Co-chair of the 34th international conference on algorithmic learning theory (ALT), Feb 20-23, 2023
Co-organizer: Fall 2022 semester on Data-driven decision processes at Simons Institute, Berkeley.
Tutorials Co-chair ACML (Asian Conference on Machine Learning) 2022.
Program Committees: COLT 2023 (SPC), ALT 2022 (SPC), INFORMS Nicholson prize committee 2022.
Contact Information
Mudd Building, Office 423
500 West 120th Street
New York, NY 10027
Videos
Title: A tutorial on multi-armed bandits at Simons Institute, Berkeley, CA. Aug 22, 2022.
Title: Dynamic Pricing and Learning under the Bass model, July 19, 2021.
Title: "Recent advances in multiarmed bandits for sequential decision making"
Tutorial in OR at at INFORMS annual meeting 2019
Title: Learning MDPs with convex cost functions: improved regret bounds for inventory management
at NeurIPS 2019 Workshop on Optimization foundations of Reinforcement Learning
( my talk begins at 37:33 )
News and Events
2024
Sep 12 Lightning Talk at Columbia Sports AI Symposium
Oct 8 Keynote at RL workshop, Amazon Machine Learning Conference.
July 17/18 Invited talk at Theory of RL workshop at ICML 2020.
July 17/18 Invited talk on "Learning to manage inventory" at Real-World Experiment Design & Active Learning workshop at ICML 2020. Video is here (at 6:01:00))