Katherine Lee

Cited by

	All	Since 2019
Citations	22264	22245
h-index	19	19
i10-index	20	20

Citations per year
Year	Citations
2020	735
2021	2058
2022	4425
2023	10299
2024	4625

Public access

3 articles available
0 articles not available

Based on funding mandates

Co-authors

Adam Roberts	Google Brain	Verified email at google.com
Colin Raffel	University of Toronto, Vector Institute and Hugging Face	Verified email at cs.toronto.edu
Nicholas Carlini	Google DeepMind	Verified email at google.com
Florian Tramèr	Assistant Professor of Computer Science, ETH Zurich	Verified email at inf.ethz.ch
Sharan Narang	Research Engineer, Meta AI	Verified email at meta.com
Matthew Jagielski	Google DeepMind	Verified email at google.com
Yanqi Zhou	Google	Verified email at google.com
Noam Shazeer	Character.ai	Verified email at character.ai
Daphne Ippolito	Google Brain	Verified email at google.com
Chiyuan Zhang	Google Research	Verified email at google.com
Eric Wallace	UC Berkeley	Verified email at berkeley.edu
Milad Nasr	Google DeepMind	Verified email at srxzr.com
Christopher A. Choquette-Choo	Google DeepMind	Verified email at google.com
A. Feder Cooper	Co-founder, The GenLaw Center	Verified email at cornell.edu
Orhan Firat	Google AI	Verified email at google.com
David Sussillo	Meta Reality Labs and Adjunct Professor @ Stanford	Verified email at stanford.edu

Katherine Lee

Researcher, Google Brain Research

Verified email at google.com

natural language processing, translation, machine learning, computational neuroscience, attention


Title	Cited by	Year
Exploring the limits of transfer learning with a unified text-to-text transformer. C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ... The Journal of Machine Learning Research 21 (1), 5485-5551, 2020	14787	2020
PaLM: Scaling language modeling with pathways. A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (240), 1-113, 2023	3409	2023
Extracting Training Data from Large Language Models. N Carlini, F Tramèr, E Wallace, M Jagielski, A Herbert-Voss, K Lee, ... USENIX Security Symposium 6, 2021	1198	2021
PaLM 2 Technical Report. R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	775	2023
Gemini: a family of highly capable multimodal models. G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	443	2023
Quantifying Memorization Across Neural Language Models. N Carlini, D Ippolito, M Jagielski, K Lee, F Tramèr, C Zhang. arXiv preprint arXiv:2202.07646, 2022	369	2022
Deduplicating training data makes language models better. K Lee, D Ippolito, A Nystrom, C Zhang, D Eck, C Callison-Burch, N Carlini. arXiv preprint arXiv:2107.06499, 2021	338	2021
WT5?! Training Text-to-Text Models to Explain their Predictions. S Narang, C Raffel, K Lee, A Roberts, N Fiedel, K Malkan. arXiv preprint arXiv:2004.14546, 2020	167	2020
What Does it Mean for a Language Model to Preserve Privacy? H Brown, K Lee, F Mireshghallah, R Shokri, F Tramèr. 2022 ACM Conference on Fairness, Accountability, and Transparency, 2280-2292, 2022	114	2022
Hallucinations in neural machine translation. K Lee, O Firat, A Agarwal, C Fannjiang, D Sussillo	112	2018
Are aligned neural networks adversarially aligned? N Carlini, M Nasr, CA Choquette-Choo, M Jagielski, I Gao, PWW Koh, ... Advances in Neural Information Processing Systems 36, 2024	96	2024
Counterfactual memorization in neural language models. C Zhang, D Ippolito, K Lee, M Jagielski, F Tramèr, N Carlini. Advances in Neural Information Processing Systems 36, 2024	70	2024
Propagation of information along the cortical hierarchy as a function of attention while reading and listening to stories. M Regev, E Simony, K Lee, KM Tan, J Chen, U Hasson. Cerebral Cortex 29 (10), 4017-4034, 2019	70	2019
Scalable Extraction of Training Data from (Production) Language Models. M Nasr, N Carlini, J Hayase, M Jagielski, AF Cooper, D Ippolito, ... arXiv preprint arXiv:2311.17035, 2023	67	2023
Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy. D Ippolito, F Tramèr, M Nasr, C Zhang, M Jagielski, K Lee, ... arXiv preprint arXiv:2210.17546, 2022	59*	2022
Measuring Forgetting of Memorized Training Examples. M Jagielski, O Thakkar, F Tramèr, D Ippolito, K Lee, N Carlini, E Wallace, ... arXiv preprint arXiv:2207.00099, 2022	57	2022
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. S Longpre, G Yauney, E Reif, K Lee, A Roberts, B Zoph, D Zhou, J Wei, ... arXiv preprint arXiv:2305.13169, 2023	42	2023
Gemma: Open Models Based on Gemini Research and Technology. G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	28	2024
MADLAD-400: A multilingual and document-level large audited dataset. S Kudugunta, I Caswell, B Zhang, X Garcia, D Xin, A Kusupati, R Stella, ... Advances in Neural Information Processing Systems 36, 2024	24	2024
Talkin' 'Bout AI Generation: Copyright and the Generative AI Supply Chain. K Lee, AF Cooper, J Grimmelmann. Available at SSRN 4523551, 2023	18	2023
