Robert Kirk

Cited by

	All	Since 2019
Citations	541	541
h-index	6	6
i10-index	6	6

220

110

165

20212022202320245 120 193 218

Co-authors

Edward GrefenstetteDirector of Research, Google DeepMind | Honorary Professor, UCLVerified email at google.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindVerified email at cs.ucl.ac.uk
Eric HambroAnthropicVerified email at anthropic.com
David Scott KruegerUniversity Assistant Professor, University of CambridgeVerified email at cam.ac.uk
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at AustinVerified email at austin.utexas.edu
Minqi JiangResearch Scientist at Google DeepMindVerified email at ucl.ac.uk
Roberta RaileanuResearch Scientist, MetaVerified email at fb.com
Usman AnwarUniversity of CambridgeVerified email at cam.ac.uk
Vitaly KurinResearch Scientist at Isomorphic LabsVerified email at isomorphiclabs.com
Mikayel SamvelyanMeta AI, UCLVerified email at meta.com
Fabio PetroniSamaya AIVerified email at samaya.ai
Heinrich KüttlerxAIVerified email at math.lmu.de
Jack Parker-HolderGoogle DeepMind, UCLVerified email at google.com
Hidenori TanakaGroup Leader, CBS-NTT Program in "Physics of Intelligence", Harvard UniversityVerified email at fas.harvard.edu
Robert DickUniversity of Michigan, StrydVerified email at rpdmail.dyndns.org
Ekdeep Singh LubanaUniversity of MichiganVerified email at umich.edu
Samyak JainUndergrad at Indian Institute of Technology(BHU),VaranasiVerified email at itbhu.ac.in
Thomas CosteNoah's Ark Lab & University of CambridgeVerified email at cam.ac.uk
Christoforos NalmpantisPostdoctoral Researcher, Fundamental AI Research at MetaVerified email at fb.com
Jelena LuketinaOxford UniversityVerified email at cs.ox.ac.uk

Robert Kirk

PhD Student, University College London

Verified email at ucl.ac.uk - Homepage

AI Alignment AI Safety Language Models Fine-tuning Generalisation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A survey of zero-shot generalisation in deep reinforcement learning R Kirk, A Zhang, E Grefenstette, T Rocktäschel Journal of Artificial Intelligence Research 76, 201-264, 2023	322	2023
Minihack the planet: A sandbox for open-ended reinforcement learning research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... arXiv preprint arXiv:2109.13202, 2021	77	2021
Reward model ensembles help mitigate overoptimization T Coste, U Anwar, R Kirk, D Krueger arXiv preprint arXiv:2310.02743, 2023	42	2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... ICLR 2024, 2023	40	2023
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ... arXiv preprint arXiv:2311.12786, 2023	26	2023
Insights from the neurips 2021 nethack challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	18	2022
Generalization to new sequential decision making tasks with in-context learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu arXiv preprint arXiv:2312.03801, 2023	5	2023
A study of off-policy learning in environments with procedural content generation A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel ICLR Workshop on Agent Learning in Open-Endedness, 2022	5	2022
Graph backup: Data efficient backup exploiting markovian transitions Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette arXiv preprint arXiv:2205.15824, 2022	4*	2022
Leading the Pack: N-player Opponent Shaping A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel arXiv preprint arXiv:2312.12564, 2023	1	2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger arXiv preprint arXiv:2211.14827, 2022	1	2022
Analyzing the Generalization and Reliability of Steering Vectors--ICML 2024 D Tan, D Chanin, A Lynch, D Kanoulas, B Paige, A Garriga-Alonso, R Kirk arXiv preprint arXiv:2407.12404, 2024		2024
Analyzing the Generalization and Reliability of Steering Vectors DCH Tan, D Chanin, A Lynch, A Garriga-Alonso, D Kanoulas, B Paige, ... ICML 2024 Workshop on Mechanistic Interpretability, 2024		2024
What Mechanisms Does Knowledge Distillation Distill? C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger Proceedings of UniReps: the First Workshop on Unifying Representations in …, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–14

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors