Chenjia Bai
Shanghai AI Laboratory
Verified email at pjlab.org.cn
Title
Cited by
Year
Exploration in Deep Reinforcement Learning: A Comprehensive Survey
T Yang, H Tang, C Bai, J Liu, J Hao, Z Meng, P Liu
arXiv preprint arXiv:2109.06668, 2021
Cited by: 28; Year: 2021
Survey on Sparse Reward in Deep Reinforcement Learning
W Yang, C Bai, C Cai, Y Zhao, P Liu
Computer Science (计算机科学) 47 (3), 182-191, 2020
Cited by: 20*; Year: 2020
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang
International Conference on Learning Representations (ICLR), 2022
Cited by: 18; Year: 2022
Principled Exploration via Optimistic Bootstrapping and Backward Induction
C Bai, L Wang, L Han, J Hao, A Garg, P Liu, Z Wang
International Conference on Machine Learning (ICML), 2021
Cited by: 14; Year: 2021
Guided Goal Generation for Hindsight Multi-Goal Reinforcement Learning
C Bai, P Liu, W Zhao, X Tang
Neurocomputing 359, 353-367, 2019
Cited by: 12; Year: 2019
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
C Bai, P Liu, K Liu, L Wang, Y Zhao, L Han, Z Wang
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Cited by: 7; Year: 2021
Active Sampling for Deep Q-learning Based on TD-error Adaptive Correction
C Bai, P Liu, W Zhao, X Tang
Journal of Computer Research and Development (计算机研究与发展) 56 (2), 262-280, 2019
Cited by: 7*; Year: 2019
Addressing Hindsight Bias in Multi-Goal Reinforcement Learning
C Bai, L Wang, Y Wang, Z Wang, R Zhao, C Bai, P Liu
IEEE Transactions on Cybernetics, 2021
Cited by: 6; Year: 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
C Bai, L Wang, L Han, A Garg, J Hao, P Liu, Z Wang
Neural Information Processing Systems (NeurIPS), 2021
Cited by: 6; Year: 2021
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
R Yang, C Bai, X Ma, Z Wang, C Zhang, L Han
Neural Information Processing Systems (NeurIPS), 2022
Cited by: 5; Year: 2022
Generating Attentive Goals for Prioritized Hindsight Reinforcement Learning
P Liu, C Bai, Y Zhao, C Bai, W Zhao, X Tang
Knowledge-Based Systems 203, 106140, 2020
Cited by: 5; Year: 2020
Research on Autonomous Driving Methods Based on Computer Vision and Deep Learning
C Bai
Harbin Institute of Technology, 2017
Cited by: 4; Year: 2017
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
S Qiu, L Wang, C Bai, Z Yang, Z Wang
International Conference on Machine Learning (ICML), 18168-18210, 2022
Cited by: 3; Year: 2022
SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, Z Wang, J Jiang
arXiv preprint arXiv:2110.12468, 2021
Cited by: 3; Year: 2021
Obtaining Accurate Estimated Action Values in Categorical Distributional Reinforcement Learning
Y Zhao, P Liu, C Bai, W Zhao, X Tang
Knowledge-Based Systems 194, 105511, 2020
Cited by: 2; Year: 2020
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
C Bai, T Xiao, Z Zhu, L Wang, F Zhou, A Garg, B He, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2022
Year: 2022
OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning
J Liu, W Zhi, Y Zheng, J Hao, J Ye, C Bai, P Li
NeurIPS 2021 Deep Reinforcement Learning Workshop, 2021
Year: 2021