Publications
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye*, Xiaoyu Chen*, Liwei Wang, Simon S Du
ICML 2023Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen*, Han Zhong*, Zhuoran Yang, Zhaoran Wang, Liwei Wang
ICML 2022Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen*, Jiachen Hu*, Chi Jin, Lihong Li, Liwei Wang
ICLR 2022 spotlightNear-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver
Xiaoyu Chen, Jiachen Hu, Lin F. Yang, Liwei Wang
ICLR 2022 spotlightNear-optimal Representation Learning for Linear Bandits and Linear RL
Jiachen Hu*, Xiaoyu Chen*, Chi Jin, Lihong Li, Liwei Wang ICML 2021Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL
Xiaoyu Chen, Jiachen Hu, Lihong Li, Liwei Wang
ICLR 2021(Locally) Differentially Private Combinatorial Semi-bandits
Xiaoyu Chen*, Kai Zheng*, Zixin Zhou, Yunchang Yang, Wei Chen, Liwei Wang
ICML 2021Distributed Bandit Learning: Near-optimal Regret with Efficient Communication
Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang
ICLR 2021Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP
Kefan Dong, Yuanhao Wang, Xiaoyu Chen, Liwei Wang
ICLR 2021