VMamba: Visual state space model Y Liu, Y Tian, Y Zhao, H Yu, L Xie, Y Wang, Q Ye, Y Liu arXiv preprint arXiv:2401.10166, 2024 | 78 | 2024 |
DiffuMask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models W Wu, Y Zhao, MZ Shou, H Zhou, C Shen Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 68 | 2023 |
DatasetDM: Synthesizing data with perception annotations using diffusion models W Wu, Y Zhao, H Chen, Y Gu, R Zhao, Y He, H Zhou, MZ Shou, C Shen Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS …, 2023 | 26 | 2023 |
Generative prompt model for weakly supervised object localization Y Zhao, Q Ye, W Wu, C Shen, F Wan Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 10 | 2023 |
Explore faster localization learning for scene text detection Y Zhao, Y Cai, W Wu, W Wang 2023 IEEE International Conference on Multimedia and Expo (ICME), 156-161, 2023 | 7 | 2023 |
ICDAR 2023 Competition on Video Text Reading for Dense and Small Text W Wu, Y Zhao, Z Li, J Li, MZ Shou, U Pal, D Karatzas, X Bai International Conference on Document Analysis and Recognition, 405-419, 2023 | 6* | 2023 |
A large cross-modal video retrieval dataset with reading comprehension W Wu, Y Zhao, Z Li, J Li, H Zhou, MZ Shou, X Bai arXiv preprint arXiv:2305.03347, 2023 | 5 | 2023 |
FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation Y Zhao, W Wu, Z Li, J Li, W Wang 2023 IEEE International Conference on Multimedia and Expo (ICME), 2023 | 4 | 2023 |
Meticulously selecting 1% of the dataset for pre-training! Generating differentially private images data with semantics query K Li, C Gong, Z Li, Y Zhao, X Hou, T Wang arXiv preprint arXiv:2311.12850, 2023 | 1 | 2023 |
Detect Arbitrary-Shaped Text via Adaptive Thresholding and Localization Quality Estimation P Cheng, Y Zhao, W Wang IEEE Transactions on Circuits and Systems for Video Technology, 2023 | 1 | 2023 |
Direct regression scene text detection with accuracy scoring P Cheng, Y Zhao, Y Cai, W Wang Neurocomputing 501, 705-714, 2022 | 1 | 2022 |
Controllable Dense Captioner with Multimodal Embedding Bridging Y Zhao, Y Liu, Z Guo, W Wu, C Gong, Q Ye, F Wan arXiv preprint arXiv:2401.17910, 2024 | | 2024 |
Continual Learning for Image Segmentation with Dynamic Query W Wu, Y Zhao, Z Li, L Shan, H Zhou, MZ Shou IEEE Transactions on Circuits and Systems for Video Technology, 2023 | | 2023 |
Supplementary Material for ‘DatasetDM’ W Wu, Y Zhao, H Chen, Y Gu, R Zhao, Y He, H Zhou, MZ Shou, C Shen | | |