Speech-transformer: a no-recurrence sequence-to-sequence model for speech recognition L Dong, S Xu, B Xu 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 1096 | 2018 |
Syllable-based sequence-to-sequence speech recognition with the transformer in mandarin chinese S Zhou, L Dong, S Xu, B Xu arXiv preprint arXiv:1804.10752, 2018 | 132 | 2018 |
Cif: Continuous integrate-and-fire for end-to-end speech recognition L Dong, B Xu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 110 | 2020 |
Self-attention aligner: A latency-control end-to-end model for asr using self-attention network and chunk-hopping L Dong, F Wang, B Xu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 89 | 2019 |
A comparison of modeling units in sequence-to-sequence speech recognition with the transformer on mandarin chinese S Zhou, L Dong, S Xu, B Xu International Conference on Neural Information Processing, 210-220, 2018 | 64 | 2018 |
Extending recurrent neural aligner for streaming end-to-end speech recognition in mandarin L Dong, S Zhou, W Chen, B Xu arXiv preprint arXiv:1806.06342, 2018 | 35 | 2018 |
Improving end-to-end contextual speech recognition with fine-grained contextual knowledge selection M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 31 | 2022 |
Cif-based collaborative decoding for end-to-end contextual speech recognition M Han, L Dong, S Zhou, B Xu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
A comparison of label-synchronous and frame-synchronous end-to-end models for speech recognition L Dong, C Yi, J Wang, S Zhou, S Xu, X Jia, B Xu arXiv preprint arXiv:2005.10113, 2020 | 15 | 2020 |
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring. Y Zou, L Dong, B Xu INTERSPEECH, 2055-2059, 2019 | 5 | 2019 |
Sequence-level speaker change detection with difference-based continuous integrate-and-fire Z Fan, L Dong, M Cai, Z Ma, B Xu IEEE Signal Processing Letters 29, 1551-1554, 2022 | 4 | 2022 |
Syllable-based acoustic modeling with CTC for multi-scenarios Mandarin speech recognition Y Zhao, L Dong, S Xu, B Xu 2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018 | 4 | 2018 |
Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu, Z Ma arXiv preprint arXiv:2306.05279, 2023 | 3 | 2023 |
Token-level speaker change detection using speaker difference and speech content via continuous integrate-and-fire Z Fan, Z Liang, L Dong, Y Liu, S Zhou, M Cai, J Zhang, Z Ma, B Xu arXiv preprint arXiv:2211.09381, 2022 | 2 | 2022 |
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training L Dong, Z An, P Wu, J Zhang, L Lu, Z Ma arXiv preprint arXiv:2305.17499, 2023 | 1 | 2023 |
Method, apparatus, device, and storage medium for speaker change point detection D Linhao, Z Fan, Z Ma US Patent App. 18/394,143, 2024 | | 2024 |
Model training method, speech recognition method, device, medium, and apparatus D Linhao, Z Ma US Patent App. 18/276,769, 2024 | | 2024 |
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR Z Fan, L Dong, J Zhang, L Lu, Z Ma ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |