Multi-Agent Game Abstraction via Graph Attention Neural Network Y Liu*, W Wang*, Y Hu, J Hao, X Chen, Y Gao AAAI 2020, 2020 | 133 | 2020 |
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping Y Hu, W Wang, H Jia, Y Wang, Y Chen, J Hao, F Wu, C Fan Advances in Neural Information Processing Systems 33, 2020 | 71 | 2020 |
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao AAAI 2020, 2020 | 70 | 2020 |
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas W Wang, J Hao, Y Wang, M Taylor Proceedings of the First International Conference on Distributed Artificial …, 2019 | 46* | 2019 |
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao ICLR 2020, 2020 | 26 | 2020 |
The 37 implementation details of proximal policy optimization S Huang, RFJ Dossa, A Raffin, A Kanervisto, W Wang ICLR Blog Track, 2022 | 22 | 2022 |
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ... IJCAI 2020, 2020 | 21 | 2020 |
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge P Zhang, J Hao, W Wang, H Tang, Y Ma, Y Duan, Y Zheng IJCAI2020, 2020 | 20 | 2020 |
Background-free upconversion-encoded microspheres for mycotoxin detection based on a rapid visualization method M Yang, M Cui, W Wang, Y Yang, J Chang, J Hao, H Wang Analytical and bioanalytical chemistry 412, 81-91, 2020 | 19 | 2020 |
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning T Yang*, W Wang*, H Tang*, HAO Jianye, Z Meng, H Mao, D Li, W Liu, ... Thirty-Fifth Conference on Neural Information Processing Systems, 2021 | 18* | 2021 |
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems X Hao*, W Wang*, J Hao, Y Yang Proceedings of the 18th International Conference on Autonomous Agents and …, 2019 | 13 | 2019 |
Learning Adaptive Display Exposure for Real-Time Advertising W Wang, J Jin, J Hao, C Chen, C Yu, W Zhang, J Wang, X Hao, Y Wang, ... CIKM 2019, 2019 | 13* | 2019 |
Efficient Deep Reinforcement Learning through Policy Transfer. T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, Z Wang, ... AAMAS, 2053-2055, 2020 | 8 | 2020 |
Individual Reward Assisted Multi-Agent Reinforcement Learning L Wang, Y Zhang, Y Hu, W Wang, C Zhang, Y Gao, J Hao, T Lv, C Fan International Conference on Machine Learning, 23417-23432, 2022 | 7 | 2022 |
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ... arXiv preprint arXiv:2106.00517, 2021 | 7 | 2021 |
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks HAO Jianye, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang The Eleventh International Conference on Learning Representations, 2023 | 3* | 2023 |
A2C is a special case of PPO S Huang, A Kanervisto, A Raffin, W Wang, S Ontañón, RFJ Dossa arXiv preprint arXiv:2205.09123, 2022 | 3 | 2022 |
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing Y Yang, G Chen, W Wang, X Hao, HAO Jianye, PA Heng Advances in Neural Information Processing Systems, 2022 | 3 | 2022 |
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning S Hu, Y Zhong, M Gao, W Wang, H Dong, Z Li, X Liang, X Chang, Y Yang arXiv preprint arXiv:2210.13708, 2022 | 2 | 2022 |
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization J Zhao, Y Zhang, X Hu, W Wang, W Zhou, J Hao, J Zhu, H Li arXiv preprint arXiv:2202.04427, 2022 | 2 | 2022 |