Selected Publications
The full list of my publications can be found on my Google Scholar profile.
Journal Papers
Nayu Liu; Kaiwen Wei; Yong Yang; Jianhua Tao; Xian Sun; Fanglong Yao; Hongfeng Yu; Li Jin; Zhao Lv; Cunhang Fan, Multimodal Cross-lingual Summarization for Videos: A Revisit in Knowledge Distillation Induced Triple-stage Training Method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46:10697-10714. paper code (top journal in artificial intelligence, CCF A, sole corresponding author)
Enrui Liu; Andong Li; Cunhang Fan; Chengshi Zheng; Jiangyan Yi; Ruibo Fu; Xinhui Li; Jian Zhou; Zhao Lv, SSE-Net: Towards Low-Power-Consumption Spiking Neural Network for Monaural Speech Enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 2026, 34:2077-2089. paper (top journal in speech processing, CAAI A, CCF B, sole corresponding author)
Cunhang Fan; Jiahao Li; Enrui Liu; Jiangyan Yi; Xinhui Li; Ruibo Fu; Zhao Lv, A Joint Training Framework for Noise-Robust Speech Recognition through Multi-Level Feature Fusion. IEEE Transactions on Audio, Speech, and Language Processing, 2025, 33:4808-4820. paper (top journal in speech processing, CAAI A, CCF B)
Cunhang Fan; Wang Xiang; Jianhua Tao; Jiangyan Yi; Zhao Lv, Cross-Modal Knowledge Distillation with Multi-Stage Adaptive Feature Fusion for Speech Separation. IEEE Transactions on Audio, Speech, and Language Processing, 2025, 33:935-948. paper (top journal in speech processing, CAAI A, CCF B)
Cunhang Fan; Mingming Ding; Jianhua Tao; Ruibo Fu; Jiangyan Yi; Zhengqi Wen; Zhao Lv, Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024, 32:2453-2466. paper code (top journal in speech processing, CAAI A, CCF B)
Cunhang Fan; Kang Zhu; Jianhua Tao; Guofeng Yi; Jun Xue; Zhao Lv, Multi-level Contrastive Learning: Hierarchical Alleviation of Heterogeneity in Multimodal Sentiment Analysis. IEEE Transactions on Affective Computing, 2025, 16:207-222. paper code (top journal in affective computing, CAAI A, CCF B)
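Cunhang Fan; Junling Li; Jingjing Zhang; Youdian Gao; Ying Chen; Jiangyan Yi; Zhao Lv, Neuro-Oriented Speaker Extraction Based on Gated Cross-Attention Fusion (in Chinese; original title: 基于门控交叉注意力融合的神经导向说话人提取方法). Journal of Computer Research and Development (计算机研究与发展), 2026. paper (CCF A)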
Cunhang Fan; Fan Yang; Jingjing Zhang; Jingpeng Sun; Hao Che; Su Hu; Zhengqi Wen; Zhao Lv, A Domain Adaptation Framework by Aligning the Inverse Gram Matrices for Cross-Subject Motor Imagery Classification. IEEE Transactions on Consumer Electronics, 2025, 5390-5403. paper (CAS Zone 1 Top)
Cunhang Fan; Jianhua Tao; Bin Liu; Jiangyan Yi; Zhengqi Wen; Xuefei Liu, End-to-End Post-filter for Speech Separation with Deep Attention Fusion Features, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, 28:1303-1314. paper (top journal in speech processing, CAAI A, CCF B)
Cunhang Fan; Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Bin Liu; Zhengqi Wen, Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 29:198-209. paper (top journal in speech processing, CAAI A, CCF B)
Cunhang Fan; Hongyu Zhang; Qinke Ni; Jingjing Zhang; Jianhua Tao; Jian Zhou; Jiangyan Yi; Zhao Lv; Xiaopei Wu, Seeing Helps Hearing: A Multi-modal Dataset and a Mamba-based Dual Branch Parallel Network for Auditory Attention Decoding, Information Fusion, 2025, Volume 118, 102946. paper (CAS Zone 1 Top, CAAI A)
Cunhang Fan; Jinqin Wang; Wei Huang; Xiaoke Yang; Guangxiong Pei; Taihao Li; Zhao Lv. Light-weight residual convolution-based capsule network for EEG emotion recognition. Advanced Engineering Informatics, 2024, Volume 61, 102522. paper code (CAS Zone 1 Top, CCF B)
Cunhang Fan; Hongyu Zhang; Wei Huang; Jun Xue; Jianhua Tao; Jiangyan Yi; Zhao Lv; Xiaopei Wu, DGSD: Dynamical graph self-distillation for EEG-based auditory spatial attention detection, Neural Networks, 2024, Volume 179, 106580. paper (CAS Zone 2 Top, CCF B)
Cunhang Fan; Jun Xue; Jianhua Tao; Jiangyan Yi; Chenglong Wang; Chengshi Zheng; Zhao Lv, Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection. Neural Networks, 2024, Volume 175, 106320. paper code (CAS Zone 2 Top, CCF B)
Cunhang Fan; Hongmei Zhang; Andong Li; Wang Xiang; Chengshi Zheng; Zhao Lv; Xiaopei Wu, CompNet: Complementary network for single-channel speech enhancement, Neural Networks, 2023, 168:508-517. paper code (CAS Zone 2 Top, CCF B)
Cunhang Fan; Jun Xue; Shunbo Dong; Mingming Ding; Jiangyan Yi; Jinpeng Li; Zhao Lv, Subband Fusion of Complex Spectrogram for Fake Speech Detection, Speech Communication, 2023, 155:102988. paper (top-2 journal in speech processing, CCF B)
Guofeng Yi; Cunhang Fan#; Kang Zhu; Zhao Lv; Shan Liang; Zhengqi Wen; Guanxiong Pei; Taihao Li; Jianhua Tao, VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis, Knowledge-Based Systems, 2024, 283:111136. paper (CAS Zone 1 Top, corresponding author)
Jun Xue; Cunhang Fan#; Jiangyan Yi; Jian Zhou; Zhao Lv, Dynamic Ensemble Teacher-Student Distillation Framework for Light-weight Fake Audio Detection, IEEE Signal Processing Letters, 2024, 31:2305-2309. paper (IF: 3.2, corresponding author)
Conference Papers
Cunhang Fan; Sheng Zhang; Jingjing Zhang; Enrui Liu; Xinhui Li; Gangming Zhao; Zhao Lv, DMF2Mel: A Dynamic Multiscale Fusion Network for EEG-Driven Mel Spectrogram Reconstruction. ACM International Conference on Multimedia (ACM MM), 2025: 6977 - 6985. paper (CCF A)
Jian Zhou; Yingjie Xie; Cunhang Fan; Huabin Wang; Zhao Lv; Liang Tao, DHGCN: Dual HyperGraph Convolutional Network for EEG-Based Auditory Attention Detection. ACM International Conference on Multimedia (ACM MM), 2025: 612-620. paper (CCF A, sole corresponding author)
Cunhang Fan; Xiaoke Yang; Hongyu Zhang; Ying Chen; Lu Li; Jian Zhou; Zhao Lv, ListenNet: A Lightweight Spatio-Temporal Enhancement Nested Network for Auditory Attention Detection. International Joint Conference on Artificial Intelligence (IJCAI 2025), 2025, 4137-4145. paper (CCF A)
Cunhang Fan; Ying Chen; Jian Zhou; Zexu Pan; Jingjing Zhang; Youdian Gao; Xiaoke Yang; Zhengqi Wen; Zhao Lv, M3ANet: Multi-scale and Multi-Modal Alignment Network for Brain-Assisted Target Speaker Extraction. International Joint Conference on Artificial Intelligence (IJCAI 2025), 2025, 8040-8048. paper (CCF A)
Lu Li; Cunhang Fan; Hongyu Zhang; Jingjing Zhang; Xiaoke Yang; Jian Zhou; Zhao Lv, MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection. International Joint Conference on Artificial Intelligence (IJCAI 2025), 2025, 4173-4181. paper (CCF A, sole corresponding author)
Sheng Yan; Cunhang Fan; Hongyu Zhang; Xiaoke Yang; Jianhua Tao; Zhao Lv, DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention Detection. Annual Conference on Neural Information Processing Systems (NeurIPS), 2024, 37: 31688-31707. paper (top conference in artificial intelligence, CCF A, sole corresponding author, co-first author)
Cunhang Fan; Jingjing Zhang; Hongyu Zhang; Wang Xiang; Jianhua Tao; Xinhui Li; Jiangyan Yi; Dianbo Sui; Zhao Lv, MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction. ACM International Conference on Multimedia (ACM MM), 2024, 1652 - 1661. paper (CCF A)
Cunhang Fan; Enrui Liu; Andong Li; Jianhua Tao; Jian Zhou; Jiahao Li; Chengshi Zheng; Zhao Lv. BSDB-Net: Band-Split Dual-Branch Network with Selective State Spaces Mechanism for Monaural Speech Enhancement. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39(22): 23850-23858. paper (CCF A)
Cunhang Fan; Yujie Chen; Jun Xue; Yonghui Kong; Jianhua Tao; Zhao Lv. Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(8), 8380-8388. paper code (CCF A)
Zhao Lv, Haoran Zhou, Ying Chen, Youdian Gao, Xinhui Li, Ruibo Fu, Cunhang Fan. Trainable EEG Interpolation and Structure-Sharing Dual-Path Encoders for Brain-Assisted Target Speaker Extraction. Proceedings of the AAAI Conference on Artificial Intelligence, 2026, 40(38), 32392-32400. paper (CCF A, sole corresponding author)
Qinke Ni; Hongyu Zhang; Cunhang Fan#; Shengbing Pei; Chang Zhou; Zhao Lv, DBPNet: Dual-Branch Parallel Network with Temporal-Frequency Fusion for Auditory Attention Detection. International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju, 2024:3115-3123. paper code (top conference in artificial intelligence, CCF A, co-corresponding author, co-first author)
Cunhang Fan, Zhao Lv, Shengbing Pei, Mingyue Niu, CSENet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction, 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore, 2022, pp. 546-550. paper code (top conference in speech processing, CCF B)
Jun Xue, Cunhang Fan#, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang and Zhao Lv, Learning from Yourself: A Self-Distillation Method for Fake Speech Detection, 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 2023, pp. 1-5. paper (top conference in speech processing, CCF B, corresponding author)
Kang Zhu; Cunhang Fan#; Jianhua Tao; Jun Xue; Heng Xie; Xuefei Liu; Yongwei Li; Zhengqi Wen; Zhao Lv, Dual-View Multimodal Interaction in Multimodal Sentiment Analysis. IEEE International Conference on Multimedia and Expo, 2024, pp. 1-6. paper (CCF B, co-corresponding author)
Yonghui Kong; Cunhang Fan#; Yujie Chen; Shuai Zhang; Zhao Lv; Jianhua Tao, Bilateral Masking with prompt for Knowledge Graph Completion. In: Findings of 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). 2024, pp. 240–249. paper (CCF B, co-corresponding author)
Cunhang Fan; Youdian Gao; Zexu Pan; Jingjing Zhang; Hongyu Zhang; Jie Zhang; Zhao Lv, Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction, 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025. paper (top conference in speech processing, CCF B)
Cunhang Fan; Sheng Zhang; Jingjing Zhang; Zexu Pan; Zhao Lv, SSM2Mel: State Space Model to Reconstruct Mel Spectrogram from the EEG, 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025. paper (top conference in speech processing, CCF B)
