A generalist agent S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022 | 693 | 2022 |
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022 | 639 | 2022 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 568 | 2023 |
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019 | 113 | 2019 |
Fairness for unobserved characteristics: Insights from technological impacts on queer communities N Tomasev, KR McKee, J Kay, S Mohamed Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 254-265, 2021 | 78 | 2021 |
Self-supervised sim-to-real adaptation for visual robotic manipulation R Jeong, Y Aytar, D Khosid, Y Zhou, J Kay, T Lampe, K Bousmalis, F Nori 2020 IEEE International Conference on Robotics and Automation (ICRA), 2718-2724, 2020 | 59 | 2020 |
Sociotechnical safety evaluation of generative AI systems L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ... arXiv preprint arXiv:2310.11986, 2023 | 39 | 2023 |
Learning dexterous manipulation from suboptimal experts R Jeong, JT Springenberg, J Kay, D Zheng, Y Zhou, A Galashov, N Heess, ... arXiv preprint arXiv:2010.08587, 2020 | 35 | 2020 |
Modelling generalized forces with reinforcement learning for sim-to-real transfer R Jeong, J Kay, F Romano, T Lampe, T Rothorl, A Abdolmaleki, T Erez, ... arXiv preprint arXiv:1910.09471, 2019 | 21 | 2019 |
Real-time control in ROS and ROS 2.0 J Kay, AR Tsouroukdissian ROSCon15, 2015 | 21 | 2015 |
Subverting machines, fluctuating identities: Re-learning human categorization C Lu, J Kay, K McKee Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 18 | 2022 |
A generalist agent, 2022 S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... URL https://arxiv.org/abs/2205.06175 3, 2022 | 18 | 2022 |
Local search for policy iteration in continuous control JT Springenberg, N Heess, D Mankowitz, J Merel, A Byravan, ... arXiv preprint arXiv:2010.05545, 2020 | 17 | 2020 |
A generalist Agent. arXiv S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022 | 15 | 2022 |
A generalist agent. arXiv 2022 S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 1-40, 0 | 13 | |
Queer in AI: a case study in community-led participatory AI OO Queerinai, A Ovalle, A Subramonian, A Singh, C Voelcker, ... Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023 | 12 | 2023 |
Finetuning from offline reinforcement learning: Challenges, trade-offs and practical solutions Y Luo, J Kay, E Grefenstette, MP Deisenroth arXiv preprint arXiv:2303.17396, 2023 | 11 | 2023 |
Proposal for Implementation of Real-time Systems in ROS 2 J Kay ROS.org, 2016 | 11 | 2016 |
URDF I Sucan, J Kay nd ROS.org, from http://wiki.ros.org/urdf, 2017 | 6 | 2017 |
Few-shot keypoint detection as task adaptation via latent embeddings M Vecerik, J Kay, R Hadsell, L Agapito, J Scholz 2022 International Conference on Robotics and Automation (ICRA), 1251-1257, 2022 | 1 | 2022 |