Xiyang Dai
Verified email at microsoft.com
Title
Cited by
Year
Cvt: Introducing convolutions to vision transformers
H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision, 22-31, 2021
Cited by: 471; Year: 2021
Dynamic convolution: Attention over convolution kernels
Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu
CVF Conference on Computer Vision and Pattern Recognition, CVPR, 13-19, 2020
Cited by: 278; Year: 2020
Temporal context network for activity localization in videos
X Dai, B Singh, G Zhang, LS Davis, Y Qiu Chen
Proceedings of the IEEE International Conference on Computer Vision, 5793-5802, 2017
Cited by: 250; Year: 2017
Man: Moment alignment network for natural language moment retrieval via iterative graph adjustment
D Zhang, X Dai, X Wang, YF Wang, LS Davis
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 193; Year: 2019
Focal Self-attention for Local-Global Interactions in Vision Transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
Advances in Neural Information Processing Systems, 2021
Cited by: 123; Year: 2021
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding
P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 109; Year: 2021
Dynamic head: Unifying object detection heads with attentions
X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
Cited by: 95; Year: 2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by: 80; Year: 2021
Dynamic ReLU
Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu
European Conference on Computer Vision, 351-367, 2020
Cited by: 67; Year: 2020
Fason: First and second order information fusion network for texture recognition
X Dai, J Yue-Hei Ng, LS Davis
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
Cited by: 66; Year: 2017
Mobile-former: Bridging mobilenet and transformer
Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 64; Year: 2022
Efficient self-supervised vision transformers for representation learning
C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao
arXiv preprint arXiv:2106.09785, 2021
Cited by: 57; Year: 2021
Dynamic detr: End-to-end object detection with dynamic attention
X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 45; Year: 2021
S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Network
D Zhang, X Dai, X Wang, YF Wang
British Machine Vision Conference (BMVC), 2018
Cited by: 43; Year: 2018
Bevt: Bert pretraining of video transformers
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 34; Year: 2022
Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection
D Zhang, X Dai, YF Wang
Asian Conference on Computer Vision (ACCV), 2018
Cited by: 29; Year: 2018
Revisiting dynamic convolution via matrix decomposition
Y Li, Y Chen, X Dai, M Liu, D Chen, Y Yu, L Yuan, Z Liu, M Chen, ...
arXiv preprint arXiv:2103.08756, 2021
Cited by: 18; Year: 2021
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
X Dai, B Singh, JYH Ng, LS Davis
IEEE Winter Conf. on Applications of Computer Vision (WACV), 2019, 2018
Cited by: 13; Year: 2018
UFO: A unified transformer for vision-language representation learning
J Wang, X Hu, Z Gan, Z Yang, X Dai, Z Liu, Y Lu, L Wang
arXiv preprint arXiv:2111.10023, 2021
Cited by: 12; Year: 2021
Da-nas: Data adapted pruning for efficient neural architecture search
X Dai, D Chen, M Liu, Y Chen, L Yuan
European Conference on Computer Vision, 584-600, 2020
Cited by: 12; Year: 2020