Minglun Han
Research Scientist, Seed Team, ByteDance; Previously CASIA.
Verified email at bytedance.com
Title
Cited by
Year
VLP: A Survey on Vision-language Pre-training
F Chen, D Zhang, M Han, X Chen, J Shi, S Xu, B Xu
Machine Intelligence Research 20 (1), 38-56, 2023
Cited by: 217; Year: 2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu
arXiv preprint arXiv:2305.04160, 2023
Cited by: 111; Year: 2023
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 41; Year: 2022
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition
M Han*, Q Wang*, T Zhang*, Y Wang, D Zhang, B Xu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 102-109, 2023
Cited by: 37; Year: 2023
CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition
M Han, L Dong, S Zhou, B Xu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 26; Year: 2021
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Y Bai, Z Chen, M Han, J Chen, J Chen, W Chen, X Hu, Y Hu, D Hua, ...
arXiv preprint arXiv:2407.04675, 2024
Cited by: 14; Year: 2024
Knowledge Transfer from Pre-trained Language Models to CIF-based Speech Recognizers via Hierarchical Distillation
M Han, F Chen, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
Cited by: 12; Year: 2023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition
M Han, Z Ni, F Chen, L Meng, J Shi, S Xu, B Xu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 8*; Year: 2023
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding
Z Hu, X Chen, H Wu, M Han, Z Ni, J Shi, S Xu, B Xu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 5; Year: 2023
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training
M Han, Y Bai, C Shen, Y Huang, M Huang, Z Lin, L Dong, L Lu, Y Wang
arXiv preprint arXiv:2409.08680, 2024
Year: 2024
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers
F Chen, M Han, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
Year: 2023