Brian Chen
Cited by
Cited by
Avlnet: Learning audio-visual language representations from instructional videos
JG Andrew Rouditchenko, Angie Boggust, David Harwath, Brian Chen, Dhiraj ...
Interspeech, 2021, 2021
Everything at once-multi-modal fusion transformer for video retrieval
N Shvetsova, B Chen, A Rouditchenko, S Thomas, B Kingsbury, RS Feris, ...
Proceedings of the ieee/cvf conference on computer vision and pattern …, 2022
Multi-level multimodal common semantic space for image-phrase grounding
H Akbari, S Karaman, S Bhargava, B Chen, C Vondrick, SF Chang
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
Gaia: A fine-grained multimedia knowledge extraction system
M Li, A Zareian, Y Lin, X Pan, S Whitehead, B Chen, B Wu, H Ji, ...
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
B Chen, A Rouditchenko, K Duarte, H Kuehne, S Thomas, A Boggust, ...
Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2021
Resin: A dockerized schema-guided cross-document cross-lingual cross-media information extraction and event tracking system
H Wen, Y Lin, T Lai, X Pan, S Li, X Lin, B Zhou, M Li, H Wang, H Zhang, ...
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Joint multimedia event extraction from video and article
B Chen, X Lin, C Thomas, M Li, S Yoshida, L Chum, H Ji, SF Chang
Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
GAIA-A Multi-media Multi-lingual Knowledge Extraction and Hypothesis Generation System.
T Zhang, A Subburathinam, G Shi, L Huang, D Lu, X Pan, M Li, B Zhang, ...
TAC, 2018
General Partial Label Learning via Dual Bipartite Graph Autoencoder
B Chen, B Wu, A Zareian, H Zhang, SF Chang
Proceedings of the AAAI Conference on Artificial Intelligence, 10502-10509., 2020
Weakly-supervised temporal article grounding
L Chen, Y Niu, B Chen, X Lin, G Han, C Thomas, H Ayyubi, H Ji, ...
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
GAIA at SMKBP 2020-a dockerlized multi-media multi-lingual knowledge extraction, clustering, temporal tracking and hypothesis generation system
M Li, Y Lin, TM Lai, X Pan, H Wen, S Li, Z Wang, P Yu, L Huang, D Lu, ...
Proceedings of Thirteenth Text Analysis Conference (TAC 2020), 2020
Cascaded multilingual audio-visual learning from videos
A Rouditchenko, A Boggust, D Harwath, S Thomas, H Kuehne, B Chen, ...
Presented at Interspeech 2021., 2021
GAIA at SM-KBP 2019-A Multi-media Multi-lingual Knowledge Extraction and Hypothesis Generation System.
M Li, Y Lin, A Subburathinam, S Whitehead, X Pan, D Lu, Q Wang, ...
TAC, 2019
Egotv: Egocentric task verification from natural language task descriptions
R Hazra, B Chen, A Rai, N Kamra, R Desai
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Routing with self-attention for multimodal capsule networks
K Duarte, B Chen, N Shvetsova, A Rouditchenko, S Thomas, A Liu, ...
arXiv preprint arXiv:2112.00775, 2021
What, when, and where?--Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
B Chen, N Shvetsova, A Rouditchenko, D Kondermann, S Thomas, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2024
Previts: contrastive pretraining with video tracking supervision
B Chen, RR Selvaraju, SF Chang, JC Niebles, N Naik
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2023
The system can't perform the operation now. Try again later.
Articles 1–17