LayoutLMv2: Multi-modal pre-training for visually-rich document understanding Y Xu, Y Xu, T Lv, L Cui, F Wei, G Wang, Y Lu, D Florencio, C Zhang, ... arXiv preprint arXiv:2012.14740, 2020 | 488 | 2020 |
RGB-T object tracking: Benchmark and baseline C Li, X Liang, Y Lu, N Zhao, J Tang Pattern Recognition 96, 106977, 2019 | 393 | 2019 |
Feature selection using principal feature analysis Y Lu, I Cohen, XS Zhou, Q Tian Proceedings of the 15th ACM international conference on Multimedia, 301-304, 2007 | 362 | 2007 |
TrOCR: Transformer-based optical character recognition with pre-trained models M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio, C Zhang, Z Li, F Wei Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13094 …, 2023 | 345 | 2023 |
Spatial coding for large scale partial-duplicate web image search W Zhou, Y Lu, H Li, Y Song, Q Tian Proceedings of the 18th ACM international conference on Multimedia, 511-520, 2010 | 300 | 2010 |
Principal visual word discovery for automatic license plate detection W Zhou, H Li, Y Lu, Q Tian IEEE transactions on image processing 21 (9), 4269-4279, 2012 | 269 | 2012 |
Weighted sparse representation regularized graph learning for RGB-T object tracking C Li, N Zhao, Y Lu, C Zhu, J Tang Proceedings of the 25th ACM international conference on Multimedia, 1856-1864, 2017 | 197 | 2017 |
TAP: Text-aware pre-training for Text-VQA and Text-Caption Z Yang, Y Lu, J Wang, X Yin, D Florencio, L Wang, C Zhang, L Zhang, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 164 | 2021 |
A comparison of methods for sketch-based 3D shape retrieval B Li, Y Lu, A Godil, T Schreck, B Bustos, A Ferreira, T Furuya, MJ Fonseca, ... Computer Vision and Image Understanding 119, 57-80, 2014 | 164 | 2014 |
Shape retrieval of non-rigid 3D human models D Pickup, X Sun, PL Rosin, RR Martin, Z Cheng, Z Lian, M Aono, ... International Journal of Computer Vision 120, 169-193, 2016 | 162 | 2016 |
A comparison of 3D shape retrieval methods based on a large-scale benchmark supporting multimodal queries B Li, Y Lu, C Li, A Godil, T Schreck, M Aono, M Burtscher, Q Chen, ... Computer Vision and Image Understanding 131, 1-27, 2015 | 159 | 2015 |
SHREC'14 track: extended large scale sketch-based 3D shape retrieval B Li, Y Lu, C Li, A Godil, T Schreck, M Aono, M Burtscher, H Fu, T Furuya, ... Eurographics Workshop on 3D Object Retrieval, 121-130, 2014 | 151 | 2014 |
SHREC'13 track: large scale sketch-based 3D shape retrieval B Li, Y Lu, A Godil, T Schreck, M Aono, H Johan, JM Saavedra, S Tashiro Proceedings of the Sixth Eurographics Workshop on 3D Object Retrieval, 89-96, 2013 | 133 | 2013 |
LayoutXLM: Multimodal pre-training for multilingual visually-rich document understanding Y Xu, T Lv, L Cui, G Wang, Y Lu, D Florencio, C Zhang, F Wei arXiv preprint arXiv:2104.08836, 2021 | 119 | 2021 |
Scalar quantization for large scale image search W Zhou, Y Lu, H Li, Q Tian Proceedings of the 20th ACM international conference on Multimedia, 169-178, 2012 | 109 | 2012 |
SIFT match verification by geometric coding for large-scale partial-duplicate web image search W Zhou, H Li, Y Lu, Q Tian ACM Transactions on Multimedia Computing, Communications, and Applications …, 2013 | 104 | 2013 |
M3S-NIR: Multi-modal multi-scale noise-insensitive ranking for RGB-T saliency detection Z Tu, T Xia, C Li, Y Lu, J Tang 2019 IEEE Conference on Multimedia Information Processing and Retrieval …, 2019 | 94 | 2019 |
BSIFT: Toward data-independent codebook for large scale image search W Zhou, H Li, R Hong, Y Lu, Q Tian IEEE Transactions on Image Processing 24 (3), 967-979, 2015 | 75 | 2015 |
SHREC'17 track large-scale 3D shape retrieval from ShapeNet Core55 M Savva, F Yu, H Su, A Kanezaki, T Furuya, R Ohbuchi, Z Zhou, R Yu, ... Proceedings of the Eurographics Workshop on 3D Object Retrieval 10, 2017 | 69 | 2017 |
Large scale image search with geometric coding W Zhou, H Li, Y Lu, Q Tian Proceedings of the 19th ACM international conference on Multimedia, 1349-1352, 2011 | 68 | 2011 |