Follow
Vedanuj Goswami
Vedanuj Goswami
Research Engineer, Meta AI
Verified email at meta.com
Title
Cited by
Cited by
Year
Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
36202023
12-in-1: Multi-task vision and language representation learning
J Lu*, V Goswami*, M Rohrbach, D Parikh, S Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
4902020
The hateful memes challenge: Detecting hate speech in multimodal memes
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, P Ringshia, ...
Advances in neural information processing systems 33, 2611-2624, 2020
4272020
Flava: A foundational language and vision alignment model
A Singh*, R Hu*, V Goswami*, G Couairon, W Galuba, M Rohrbach, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
4222022
No language left behind: Scaling human-centered machine translation
MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ...
arXiv preprint arXiv:2207.04672, 2022
3972022
MMF: A multimodal framework for vision and language research
A Singh, V Goswami, V Natarajan, Y Jiang, X Chen, M Shah, M Rohrbach, ...
URL: https://github. com/facebookresearch/mmf, 0
342*
Only time can tell: Discovering temporal data for temporal modeling
L Sevilla-Lara, S Zha, Z Yan, V Goswami, M Feiszli, L Torresani
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021
752021
Creative sketch generation
S Ge, V Goswami, CL Zitnick, D Parikh
arXiv preprint arXiv:2011.10039, 2020
622020
The hateful memes challenge: Competition report
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ...
NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021
532021
Human-adversarial visual question answering
S Sheng, A Singh, V Goswami, J Magana, T Thrush, W Galuba, D Parikh, ...
Advances in Neural Information Processing Systems 34, 20346-20359, 2021
512021
Are we pretraining it right? digging deeper into visio-linguistic pretraining
A Singh, V Goswami, D Parikh
arXiv preprint arXiv:2004.08744, 2020
452020
Movie: Revisiting modulated convolutions for visual counting and beyond
DK Nguyen, V Goswami, X Chen
arXiv preprint arXiv:2004.11883, 2020
312020
Tricks for training sparse translation models
D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan
arXiv preprint arXiv:2110.08246, 2021
202021
Speechmatrix: A large-scale mined corpus of multilingual speech-to-speech translations
PA Duquenne, H Gong, N Dong, J Du, A Lee, V Goswani, C Wang, J Pino, ...
arXiv preprint arXiv:2211.04508, 2022
142022
Causes and cures for interference in multilingual translation
U Shaham, M Elbayad, V Goswami, O Levy, S Bhosale
arXiv preprint arXiv:2212.07530, 2022
132022
Knowledge extraction and annotation for cross-domain textual case-based reasoning in biologically inspired design
S Rugaber, S Bhati, V Goswami, E Spiliopoulou, S Azad, S Koushik, ...
Case-Based Reasoning Research and Development: 24th International Conference …, 2016
132016
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation
M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang
arXiv preprint arXiv:2303.00628, 2023
122023
Unsupervised image-to-video clothing transfer
A Pumarola, V Goswami, F Vicente, F De la Torre, F Moreno-Noguer
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
112019
Revisiting machine translation for cross-lingual classification
M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer
arXiv preprint arXiv:2305.14240, 2023
82023
Building recommender systems with PyTorch
D Mudigere, M Naumov, J Spisak, G Chauhan, N Kokhlikyan, A Singh, ...
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020
82020
The system can't perform the operation now. Try again later.
Articles 1–20