Kartik Audhkhasi

Cited by

	All	Since 2019
Citations	3651	2588
h-index	32	27
i10-index	64	49

540

270

135

405

20102011201220132014201520162017201820192020202120222023202410 26 34 50 83 92 127 213 386 416 463 533 510 445 220

Public access

View all

9 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Bhuvana RamabhadranManager, GoogleVerified email at google.com
Shrikanth (Shri) NarayananUniversity Professor and Niki & Max Nikias Chair in Engineering, University of Southern CaliforniaVerified email at sipi.usc.edu
Brian KingsburyDistinguished Research Staff Member and Manager, IBM T. J. Watson Research Center, Yorktown HeightsVerified email at us.ibm.com
Samuel ThomasIBM Research AIVerified email at us.ibm.com
George SaonDistinguished Research Staff Member, IBM Research AIVerified email at us.ibm.com
Abhinav SethyAmazon AlexaVerified email at amazon.com
Zoltán TüskeAppTekVerified email at apptek.com
Andrew RosenbergGoogleVerified email at google.com
Osonde OsobaLinkedIn, RAND Corporation, University of Southern CaliforniaVerified email at rand.org
Bart KoskoProfessor of Electrical Engineering, University of Southern CaliforniaVerified email at usc.edu
Panayiotis (Panos) GeorgiouApple/University of Southern CaliforniaVerified email at apple.com
Xiaodong CuiPrincipal Research Scientist, IBM T. J. Watson Research CenterVerified email at us.ibm.com
Rahul GuptaAmazon AlexaVerified email at amazon.com
Tom SercuEvolutionaryScaleVerified email at evolutionaryscale.ai
Jia CuiTencentVerified email at tencent.com
Daniel BoneAmazon, AlexaVerified email at amazon.com
Om DeshmukhVerified email at xerox.com
Dimitrios DimitriadisApplied Scientist - AmazonVerified email at amazon.com
Markus Nussbaum-ThomRWTH Aachen UniversityVerified email at i6.informatik.rwth-aachen.de
Sungbok LeeResearch Professor of Electrical Engineering, University of Southern CaliforniaVerified email at usc.edu

Kartik Audhkhasi

Google

Verified email at google.com - Homepage

Speech recognition Machine learning Neural networks


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
English conversational telephone speech recognition by humans and machines G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ... arXiv preprint arXiv:1703.02136, 2017	472	2017
Applying machine learning to facilitate autism diagnostics: pitfalls and promises D Bone, MS Goodwin, MP Black, CC Lee, K Audhkhasi, S Narayanan Journal of autism and developmental disorders 45, 1121-1136, 2015	287	2015
Direct acoustics-to-word models for english conversational speech recognition K Audhkhasi, B Ramabhadran, G Saon, M Picheny, D Nahamoo arXiv preprint arXiv:1703.07754, 2017	166	2017
Avlnet: Learning audio-visual language representations from instructional videos A Rouditchenko, A Boggust, D Harwath, B Chen, D Joshi, S Thomas, ... arXiv preprint arXiv:2006.09199, 2020	142	2020
Building competitive direct acoustics-to-word models for english conversational speech recognition K Audhkhasi, B Kingsbury, B Ramabhadran, G Saon, M Picheny 2018 IEEE international conference on acoustics, speech and signal …, 2018	140	2018
End-to-End ASR-free Keyword Search from Speech K Audhkhasi, A Rosenberg, A Sethy, B Ramabhadran, B Kingsbury arXiv preprint arXiv:1701.04313, 2017	133	2017
Noise-enhanced convolutional neural networks K Audhkhasi, O Osoba, B Kosko Neural Networks 78, 15-23, 2016	122	2016
Multilingual representations for low resource speech recognition and keyword search J Cui, B Kingsbury, B Ramabhadran, A Sethy, K Audhkhasi, Z Tüske, ... Proc. ASRU, 2015	114	2015
Joint modeling of accents and acoustics for multi-accent speech recognition X Yang, K Audhkhasi, A Rosenberg, S Thomas, B Ramabhadran, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	86	2018
Invariant representations for noisy speech recognition D Serdyuk, K Audhkhasi, P Brakel, B Ramabhadran, S Thomas, Y Bengio arXiv preprint arXiv:1612.01928, 2016	81	2016
Formant-based technique for automatic filled-pause detection in spontaneous spoken English K Audhkhasi, K Kandhway, OD Deshmukh, A Verma 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009	80	2009
Single headed attention based sequence-to-sequence model for state-of-the-art results on switchboard Z Tüske, G Saon, K Audhkhasi, B Kingsbury arXiv preprint arXiv:2001.07263, 2020	79	2020
Which ASR should I choose for my dialogue system? F Morbini, K Audhkhasi, K Sagae, R Artstein, D Can, P Georgiou, ... SIGDIAL 2013, 2013	75	2013
Knowledge distillation across ensembles of multilingual models for low-resource languages J Cui, B Kingsbury, B Ramabhadran, G Saon, T Sercu, K Audhkhasi, ... 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017	74	2017
End-to-end speech recognition and keyword search on low-resource languages A Rosenberg, K Audhkhasi, A Sethy, B Ramabhadran, M Picheny 2017 ieee international conference on acoustics, speech and signal …, 2017	71	2017
Leveraging unpaired text data for training end-to-end speech-to-intent systems Y Huang, HK Kuo, S Thomas, Z Kons, K Audhkhasi, B Kingsbury, R Hoory, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	68	2020
Noise-enhanced convolutional neural networks K Audhkhasi, B Kosko, O Osoba US Patent 11,256,982, 2022	58	2022
Alignment-length synchronous decoding for RNN transducer G Saon, Z Tüske, K Audhkhasi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	57	2020
Guiding CTC posterior spike timings for improved posterior fusion and knowledge distillation G Kurata, K Audhkhasi arXiv preprint arXiv:1904.08311, 2019	55	2019
External word embedding neural network language models K Audhkhasi, B Ramabhadran, A Sethy US Patent 10,019,438, 2018	55	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors