Follow
Zhuofan Xia
Zhuofan Xia
Other names夏 卓凡
PhD candidate, Department of Automation, Tsinghua University
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
Vision Transformer with Deformable Attention
Z Xia, X Pan, S Song, LE Li, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022
5902022
3D Object Detection with Pointformer
X Pan, Z Xia, S Song, LE Li, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021
4292021
Adaptive Rotated Convolution for Rotated Object Detection
Y Pu, Y Wang, Z Xia, Y Han, Y Wang, W Gan, Z Wang, S Song, G Huang
IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023
832023
Slide-transformer: Hierarchical vision transformer with local self-attention
X Pan, T Ye, Z Xia, S Song, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), 2023
592023
Agent attention: On the integration of softmax and linear attention
D Han, T Ye, Y Han, Z Xia, S Song, G Huang
European Conference on Computer Vision (ECCV 2024), 2024
522024
Demystify Mamba in Vision: A Linear Attention Perspective
D Han, Z Wang, Z Xia, Y Han, Y Pu, C Ge, J Song, S Song, B Zheng, ...
arXiv preprint arXiv:2405.16605, 2024
252024
GSVA: Generalized Segmentation via Multimodal Large Language Models
Z Xia, D Han, Y Han, X Pan, S Song, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024
252024
Dat++: Spatially dynamic vision transformer with deformable attention
Z Xia, X Pan, S Song, LE Li, G Huang
arXiv preprint arXiv:2309.01430, 2023
152023
Budgeted Training for Vision Transformer
Z Xia, X Pan, X Jin, Y He, H Xue, S Song, G Huang
International Conference on Learning Representations (ICLR 2023), 2023
9*2023
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Y Pu, Z Xia, J Guo, D Han, Q Li, D Li, Y Yuan, J Li, Y Han, S Song, ...
European Conference on Computer Vision (ECCV 2024), 0
5*
Bridging the divide: Reconsidering softmax and linear attention
D Han, Y Pu, Z Xia, Y Han, X Pan, X Li, J Lu, S Song, G Huang
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
22024
Generalized Activation via Multivariate Projection
J Li, Y Cheng, Y Lu, Z Xia, Y Mo, G Huang
arXiv preprint arXiv:2309.17194, 2023
12023
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
R Huang, H Zheng, Y Wang, Z Xia, M Pavone, G Huang
arXiv preprint arXiv:2411.15657, 2024
2024
Training an Open-Vocabulary Monocular 3D Detection Model without 3D Data
R Huang, H Zheng, Y Wang, Z Xia, M Pavone, G Huang
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 0
The system can't perform the operation now. Try again later.
Articles 1–14