Follow
Yuqing Du
Title
Cited by
Cited by
Year
Aligning text-to-image models using human feedback
K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...
arXiv preprint arXiv:2302.12192, 2023
1052023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Y Du*, O Watkins*, Z Wang, C Colas, T Darrell, P Abbeel, A Gupta, ...
International Conference on Machine Learning (ICML) 2023, 2023
852023
Robust Reinforcement Learning using Adversarial Populations
E Vinitsky*, Y Du*, K Parvate*, K Jang, P Abbeel, A Bayen
arXiv preprint arXiv:2008.01825, 2020
832020
Auto-tuned sim-to-real transfer
Y Du*, O Watkins*, T Darrell, P Abbeel, D Pathak
2021 IEEE International Conference on Robotics and Automation (ICRA), 1290-1296, 2021
682021
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ...
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS …, 2023
532023
Ave: Assistance via empowerment
Y Du, S Tiomkin, E Kiciman, D Polani, P Abbeel, A Dragan
Advances in Neural Information Processing Systems 33, 4560-4571, 2020
352020
Group surfing: A pedestrian-based approach to sidewalk robot navigation
Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ...
2019 international conference on robotics and automation (ICRA), 6518-6524, 2019
332019
Vision-Language Models as Success Detectors
Y Du, K Konyushkova, M Denil, A Raju, J Landon, F Hill, N de Freitas, ...
Conference on Lifelong Learning Agents (CoLLAs) 2023, 2023
312023
Learning to model the world with language
J Lin, Y Du, O Watkins, D Hafner, P Abbeel, D Klein, A Dragan
arXiv preprint arXiv:2308.01399, 2023
202023
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Y Du, P Abbeel, A Grover
International Conference on Learning Representations (ICLR) 2022, 2022
152022
Practical Visual Deep Imitation Learning via Task-Level Domain Consistency
M Khansari, D Ho, Y Du, A Fuentes, M Bennice, N Sievers, S Kirmani, ...
2023 IEEE International Conference on Robotics and Automation (ICRA), 1837-1844, 2023
6*2023
Bayesian Imitation Learning for End-to-End Mobile Manipulation
Y Du, D Ho, AA Alemi, E Jang, M Khansari
International Conference on Machine Learning (ICML) 2022, 2022
52022
Sidewalk Delivery Robot Navigation: A Pedestrian-Based Approach
Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ...
Human-Aiding Robotics: Open Issues and Future Direction 2018, 2018
22018
Teaching Large Language Models to Reason with Reinforcement Learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
12024
A Study on Improving Reasoning in Language Models
Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu
I Can’t Believe It’s Not Better! (ICBINB) Workshop @ NeurIPS 2023, 2023
1*2023
What can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration
Y Du, E Kosoy, A Dayan, M Rufova, P Abbeel, A Gopnik
NeurIPS 2023 workshop: Information-Theoretic Principles in Cognitive Systems, 2023
12023
Using embeddings, generated using robot action models, in controlling robot to perform robotic task
D Ho, E Jang, M Khansari, YQ Du, AA Alemi
US Patent App. 18/102,053, 2024
2024
Mitigating reality gap through feature-level domain adaptation in training of vision-based robot action model
M Khansari, D Ho, E Jang, YQ Du
US Patent App. 17/986,428, 2023
2023
Human-Centric Reward Design
YQ Du
UC Berkeley, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–19