Xiaotian Han

Cited by

	All	Since 2019
Citations	87	85
h-index	4	4
i10-index	3	3

201720182019202020212022202320241 1 5 8 6 25 23 18

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Quanzeng YouByteDanceVerified email at microsoft.com
Houdong HuMicrosoft，Principal Engineering ManagerVerified email at microsoft.com
Jianghao Xiong 熊江浩Beijing Institute of TechnologyVerified email at bit.edu.cn
Mingshu ZhaoVerified email at umd.edu
Lei ZhangInternational Digital Economy Academy (IDEA)Verified email at idea.edu.cn
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondVerified email at microsoft.com
Pengchuan ZhangMeta AIVerified email at fb.com
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Jiang WangGoogleVerified email at google.com
Zicheng LiuMicrosoftVerified email at microsoft.com
Peng ChuMicrosoftVerified email at microsoft.com
Jianbo YuanBytedanceVerified email at cs.rochester.edu
Yongfei LiuBytedanceVerified email at bytedance.com
Hongxia YangByteDance, Alibaba Group, Yahoo!, IBM Watson

Xiaotian Han

TikTok (Bytedance)

Verified email at bytedance.com - Homepage

Machine learning Computer Vision Multimodal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Real-time micro-scale temperature imaging at low cost based on fluorescent intensity ratio J Xiong, M Zhao, X Han, Z Cao, X Wei, Y Chen, C Duan, M Yin Scientific Reports 7 (1), 41311, 2017	34	2017
Image scene graph generation (sgg) benchmark X Han, J Yang, H Hu, L Zhang, J Gao, P Zhang arXiv preprint arXiv:2107.12604, 2021	28	2021
Mmptrack: Large-scale densely annotated multi-camera multiple people tracking benchmark X Han, Q You, C Wang, Z Zhang, P Chu, H Hu, J Wang, Z Liu Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023	21*	2023
Exploring the reasoning abilities of multimodal large language models (mllms): A comprehensive survey on emerging trends in multimodal reasoning Y Wang, W Chen, X Han, X Lin, H Zhao, Y Liu, B Zhai, J Yuan, Q You, ... arXiv preprint arXiv:2401.06805, 2024	4	2024
ViTAR: Vision Transformer with Any Resolution Q Fan, Q You, X Han, Y Liu, Y Tao, H Huang, R He, H Yang arXiv preprint arXiv:2403.18361, 2024		2024
InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding H Liu, Q You, X Han, Y Wang, B Zhai, Y Liu, Y Tao, H Huang, R He, ... arXiv preprint arXiv:2403.01487, 2024		2024
COCO is" ALL''You Need for Visual Instruction Fine-tuning X Han, Y Wang, B Zhai, Q You, H Yang arXiv preprint arXiv:2401.08968, 2024		2024
CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models X Han, Q You, Y Liu, W Chen, H Zheng, K Mrini, X Lin, Y Wang, B Zhai, ... arXiv preprint arXiv:2311.11567, 2023		2023
InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models X Han, Q You, Y Liu, W Chen, H Zheng, K Mrini, X Lin, Y Wang, B Zhai, ... arXiv e-prints, arXiv: 2311.11567, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–9

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors