Andrew Rouditchenko

Cited by

	All	Since 2019
Citations	1283	1265
h-index	10	10
i10-index	10	10

380

190

285

201820192020202120222023202413 65 136 186 244 368 266

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Hilde KuehneUniversity of Bonn , MIT-IBM Watson LabVerified email at uni-bonn.de
Rogerio FerisResearch Manager, MIT-IBM Watson AI LabVerified email at us.ibm.com
Samuel ThomasIBM Research AIVerified email at us.ibm.com
David HarwathThe University of Texas at AustinVerified email at utexas.edu
Antonio TorralbaProfessor of Computer Science, MITVerified email at csail.mit.edu
Brian KingsburyDistinguished Research Staff Member and Manager, IBM T. J. Watson Research Center, Yorktown HeightsVerified email at us.ibm.com
Hang ZhaoAssistant Professor, Tsinghua UniversityVerified email at csail.mit.edu
Chuang GanUMass Amherst | MIT-IBM Watson AI LabVerified email at csail.mit.edu
Alexander H. LiuMassachusetts Institute of TechnologyVerified email at mit.edu
Michael Alan PichenyNYU - Courant CS and CDSVerified email at nyu.edu
Angie BoggustMassachusetts Institute of TechnologyVerified email at mit.edu
Rameswar PandaResearch Scientist, MIT-IBM Watson AI LabVerified email at ibm.com
Carl VondrickAssociate Professor, Columbia UniversityVerified email at columbia.edu
Brian ChenColumbia UniversityVerified email at columbia.edu
Shih-Fu ChangProfessor of Electrical Engineering and Computer Science, Columbia UniversityVerified email at columbia.edu
Dhiraj JoshiIBM T. J. Watson ResearchVerified email at us.ibm.com
Kartik AudhkhasiGoogleVerified email at google.com
Kevin DuarteUniversity of Central FloridaVerified email at knights.ucf.edu

Andrew Rouditchenko

PhD Student at MIT CSAIL

Verified email at mit.edu - Homepage

Speech and Language Processing Multimodal Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The sound of pixels H Zhao, C Gan, A Rouditchenko, C Vondrick, J McDermott, A Torralba Proceedings of the European conference on computer vision (ECCV), 570-586, 2018	574	2018
Avlnet: Learning audio-visual language representations from instructional videos A Rouditchenko, A Boggust, D Harwath, B Chen, D Joshi, S Thomas, ... Proc. Interspeech 2021, 1584-1588, 2021	142	2021
Everything at once-multi-modal fusion transformer for video retrieval N Shvetsova, B Chen, A Rouditchenko, S Thomas, B Kingsbury, RS Feris, ... Proceedings of the ieee/cvf conference on computer vision and pattern …, 2022	136	2022
Self-supervised audio-visual co-segmentation A Rouditchenko, H Zhao, C Gan, J McDermott, A Torralba ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	129	2019
Contrastive audio-visual masked autoencoder Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ... arXiv preprint arXiv:2210.07839, 2022	98	2022
Multimodal clustering networks for self-supervised learning from unlabeled videos B Chen, A Rouditchenko, K Duarte, H Kuehne, S Thomas, A Boggust, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	78	2021
Cross-modal discrete representation learning AH Liu, SY Jin, CIJ Lai, A Rouditchenko, A Oliva, J Glass arXiv preprint arXiv:2106.05438, 2021	41	2021
Cmkd: Cnn/transformer-based cross-model knowledge distillation for audio classification Y Gong, S Khurana, A Rouditchenko, J Glass arXiv preprint arXiv:2203.06760, 2022	31	2022
Uavm: Towards unifying audio and visual models Y Gong, AH Liu, A Rouditchenko, J Glass IEEE Signal Processing Letters 29, 2437-2441, 2022	17*	2022
Comparison of multilingual self-supervised and weakly-supervised speech pre-training for adaptation to unseen languages A Rouditchenko, S Khurana, S Thomas, R Feris, L Karlinsky, H Kuehne, ... arXiv preprint arXiv:2305.12606, 2023	11	2023
Cascaded Multilingual Audio-Visual Learning from Videos A Rouditchenko, A Boggust, D Harwath, S Thomas, H Kuehne, B Chen, ... Proc. Interspeech 2021, 3006-3010, 2021	6	2021
Label-efficient audio classification through multitask learning and self-supervision T Lee, T Gong, S Padhy, A Rouditchenko, A Ndirango arXiv preprint arXiv:1910.12587, 2019	6	2019
C2kd: Cross-lingual cross-modal knowledge distillation for multilingual text-video retrieval A Rouditchenko, YS Chuang, N Shvetsova, S Thomas, R Feris, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	5	2023
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset I Palmer, A Rouditchenko, A Barbu, B Katz, J Glass Proc. Interspeech 2021, 3650-3654, 2021	5	2021
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions B Chen, N Shvetsova, A Rouditchenko, D Kondermann, S Thomas, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	3	2024
Av-cpl: Continuous pseudo-labeling for audio-visual speech recognition A Rouditchenko, R Collobert, T Likhomanenko arXiv preprint arXiv:2309.17395, 2023	1	2023
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation A Rouditchenko, Y Gong, S Thomas, L Karlinsky, H Kuehne, R Feris, ... arXiv preprint arXiv:2406.10082, 2024		2024
Learning Audio-Video Language Representations A Rouditchenko Massachusetts Institute of Technology, 2021		2021

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors