Offline reinforcement learning with value-based episodic memory. X Ma, Y Yang, H Hu, Q Liu, J Yang, C Zhang, Q Zhao, B Liang. arXiv preprint arXiv:2110.09796, 2021 | 34 | 2021 |
Light aircraft game: A lightweight, scalable, gym-wrapped aircraft competitive environment with baseline reinforcement learning algorithms. Q Liu, Y Jiang, X Ma | 7 | 2022 |
Safe opponent-exploitation subgame refinement. M Liu, C Wu, Q Liu, Y Jing, J Yang, P Tang, C Zhang. Advances in Neural Information Processing Systems 35, 27610-27622, 2022 | 5 | 2022 |
Learning Diverse Risk Preferences in Population-Based Self-Play. Y Jiang, Q Liu, X Ma, C Li, Y Yang, J Yang, B Liang, Q Zhao. Proceedings of the AAAI Conference on Artificial Intelligence 38 (11), 12910 …, 2024 | 1 | 2024 |
CVaR-Constrained Policy Optimization for Safe Reinforcement Learning. Q Zhang, S Leng, X Ma, Q Liu, X Wang, B Liang, Y Liu, J Yang. IEEE Transactions on Neural Networks and Learning Systems, 2024 | | 2024 |
Efficient Multi-agent Reinforcement Learning by Planning. Q Liu, J Ye, X Ma, J Yang, B Liang, C Zhang. The Twelfth International Conference on Learning Representations, 2023 | | 2023 |