Publications

You can search all visible text in the publication entries, including titles, author names, conference/journal names, years, and abstract content.

2026

  1. ICASSP
    Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods
    Bingshen Mu, Pengcheng Guo, Zhaokai Sun, Shuai Wang, Hexin Liu, Mingchen Shao, and 5 more authors
    In ICASSP, 2026
  2. ICASSP
    WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
    Yuhang Dai, Ziyu Zhang, Shuai Wang, Longhao Li, Zhao Guo, Tianlun Zuo, and 10 more authors
    In ICASSP, 2026
  3. ICASSP
    Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages
    Mingchen Shao, Bingshen Mu, Chengyou Wang, Hai Li, Ying Yan, Zhonghua Fu, and 1 more author
    In ICASSP, 2026
  4. ICASSP
    MeanVC: Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
    Guobin Ma, Jixun Yao, Ziqian Ning, Yuepeng Jiang, Lingxin Xiong, Lei Xie, and 1 more author
    In ICASSP, 2026
  5. ICASSP
    S²Voice: Style-Aware Autoregressive Modeling with Enhanced Conditioning for Singing Style Conversion
    Ziqian Wang, Xianjun Xia, Chuanzeng Huang, and Lei Xie
    In ICASSP, 2026
  6. ICASSP
    The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge
    Guobin Ma, Yuxuan Xia, Jixun Yao, Huixin Xue, Hexin Liu, Shuai Wang, and 2 more authors
    In ICASSP, 2026
  7. ICASSP
    The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era
    Zhixian Zhao, Shuiyuan Wang, Guojian Li, Hongfei Xue, Chengyou Wang, Shuai Wang, and 10 more authors
    In ICASSP, 2026
  8. ICASSP
    Easy Turn: Integrating Acoustic and Linguistic Modalities for Robust Turn-Taking in Full-Duplex Spoken Dialogue Systems
    Guojian Li, Chengyou Wang, Hongfei Xue, Shuiyuan Wang, Dehui Gao, Zihan Zhang, and 5 more authors
    In ICASSP, 2026

2025

  1. ASRU
    DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization
    Huakang Chen, Yuepeng Jiang, Guobin Ma, Chunbo Hao, Shuai Wang, Jixun Yao, and 4 more authors
    In ASRU, 2025
  2. AAAI
    Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation
    Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, and 2 more authors
    In AAAI, 2025
  3. AAAI
    StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching
    Jixun Yao, Yang Yuguang, Yu Pan, Ziqian Ning, Jianhao Ye, Hongbin Zhou, and 1 more author
    In AAAI, 2025
  4. ICASSP
    ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training
    Xinfa Zhu, Lei He, Yujia Xiao, Xi Wang, Xu Tan, Sheng Zhao, and 1 more author
    In ICASSP, 2025
  5. ICASSP
    CAMEL: Cross-Attention Enhanced Mixture-of-Experts and Language Bias for Code-Switching Speech Recognition
    He Wang, Xucheng Wan, Naijun Zheng, Kai Liu, Huan Zhou, Guojian Li, and 1 more author
    In ICASSP, 2025
  6. ICASSP
    HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
    Bingshen Mu, Kun Wei, Qijie Shao, Yong Xu, and Lei Xie
    In ICASSP, 2025
  7. ICASSP
    DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
    Qing Wang, Jixun Yao, Zhaokai Sun, Pengcheng Guo, Lei Xie, and John H.L. Hansen
    In ICASSP, 2025
  8. ICLR
    GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling
    Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Eng Siong Chng, and Lei Xie
    In ICLR, 2025
  9. Interspeech
    EASY: Emotion-aware Speaker Anonymization via Factorized Distillation
    Jixun Yao, Hexin Liu, Eng Siong Chng, and Lei Xie
    In Interspeech, 2025
  10. Interspeech
    Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis
    Tianyi Xu, Hongjie Chen, Wang Qing, Lv Hang, Jian Kang, Li Jie, and 3 more authors
    In Interspeech, 2025
  11. Interspeech
    Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty
    Hongfei Xue, Yufeng Tang, Jun Zhang, Xuelong Geng, and Lei Xie
    In Interspeech, 2025
  12. Interspeech
    AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition
    Yuhang Dai, He Wang, Xingchen Li, Zihan Zhang, Shuiyuan Wang, Lei Xie, and 5 more authors
    In Interspeech, 2025
  13. Interspeech
    FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching
    Ziqian Wang, Zikai Liu, Xinfa Zhu, Yike Zhu, Mingshuai Liu, Jun Chen, and 3 more authors
    In Interspeech, 2025
  14. Interspeech
    Weakly Supervised Data Refinement and Flexible Sequence Compression for Efficient Thai LLM-based ASR
    Mingchen Shao, Xinfa Zhu, Chengyou Wang, Bingshen Mu, Hai Li, Ying Yan, and 3 more authors
    In Interspeech, 2025
  15. Interspeech
    Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM
    Zhaokai Sun, Li Zhang, Qing Wang, Pan Zhou, and Lei Xie
    In Interspeech, 2025
  16. Interspeech
    U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding
    Ziqian Wang, Xianjun Xia, Xinfa Zhu, and Lei Xie
    In Interspeech, 2025
  17. Interspeech
    Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR
    Longhao Li, Yangze Li, Hongfei Xue, Jie Liu, Shuai Fang, Kai Wang, and 1 more author
    In Interspeech, 2025
  18. Interspeech
    Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation
    Zhennan Lin, Kaixun Huang, Wei Ren, Linju Yang, and Lei Xie
    In Interspeech, 2025
  19. ACM MM
    Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
    Hongfei Xue, Yufeng Tang, Hexin Liu, Jun Zhang, Xuelong Geng, and Lei Xie
    In ACM MM, 2025
  20. ACM MM
    DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
    Wenjie Tian, Xinfa Zhu, Haohe Liu, Zhixian Zhao, Zihao Chen, Chaofan Ding, and 3 more authors
    In ACM MM, 2025
  21. ASRU
    EchoFree: Towards Ultra Lightweight and Efficient Neural Acoustic Echo Cancellation
    Xingchen Li, Boyi Kang, Ziqian Wang, Zihan Zhang, Mingshuai Liu, Zhonghua Fu, and 1 more author
    In ASRU, 2025
  22. ASRU
    XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
    Tianlun Zuo, Jingbin Hu, Yuke Li, Xinfa Zhu, Hai Li, Ying Yan, and 3 more authors
    In ASRU, 2025
  23. ASRU
    Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis
    Wenjie Tian, Xinfa Zhu, Hanke Xie, Zhen Ye, Wei Xue, and Lei Xie
    In ASRU, 2025
  24. ASRU
    Efficient Scaling for LLM-based ASR
    Bingshen Mu, Yiwen Shao, Kun Wei, Dong Yu, and Lei Xie
    In ASRU, 2025
  25. ASRU
    REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers
    Yuepeng Jiang, Ziqian Ning, Shuai Wang, Chengjia Wang, Mengxiao Bi, Pengcheng Zhu, and 2 more authors
    In ASRU, 2025
  26. NCMMSC
    Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text
    Hongfei Xue, Wei Ren, Xuelong Geng, Kun Wei, Longhao Li, Qijie Shao, and 3 more authors
    In NCMMSC, 2025
  27. NCMMSC
    StreamFlow: Streaming Flow Matching with Block-wise Guided Attention Mask for Speech Token Decoding
    Dake Guo, Jixun Yao, Linhan Ma, He Wang, and Lei Xie
    In NCMMSC, 2025
  28. NCMMSC
    HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis
    Ziyu Zhang, Hanzhao Li, Jingbin Hu, Wenhao Li, and Lei Xie
    In NCMMSC, 2025
  29. NCMMSC
    MPO: Multidimensional Preference Optimization for Language Model-based Text-to-Speech
    Kangxiang Xia, Xinfa Zhu, Jixun Yao, and Lei Xie
    In NCMMSC, 2025
  30. NCMMSC
    SynthVC: Leveraging Synthetic Data for End-to-End Low Latency Streaming Voice Conversion
    Zhao Guo, Ziqian Ning, Guobin Ma, and Lei Xie
    In NCMMSC, 2025
  31. NCMMSC
    Serial-Parallel Dual-Path Architecture for Speaking Style Recognition
    Guojian Li, Qijie Shao, Zhixian Zhao, Shuiyuan Wang, Zhonghua Fu, and Lei Xie
    In NCMMSC, 2025
  32. AAAI
    KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction
    Kangxiang Xia, Xinfa Zhu, Jixun Yao, Wenjie Tian, Wenhao Li, and Lei Xie
    In AAAI, 2025
  33. AAAI
    Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
    Bingshen Mu, Hexin Liu, Hongfei Xue, Kun Wei, and Lei Xie
    In AAAI, 2025
  34. AAAI
    WenetSpeech-Yue: A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
    Longhao Li, Zhao Guo, Hongjie Chen, Yuhang Dai, Ziyu Zhang, Hongfei Xue, and 12 more authors
    In AAAI, 2025
  35. TASLP
    Vec-Tok Speech: speech vectorization and tokenization for neural speech generation
    Xinfa Zhu, Yuanjun Lv, Yi Lei, Tao Li, Wendi He, Hongbin Zhou, and 2 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025
  36. TASLP
    DQ-Data2vec: Decoupling Quantization for Multilingual Speech Recognition
    Qijie Shao, Linhao Dong, Kun Wei, Sining Sun, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025
  37. TASLP
    MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement
    Jixun Yao, Qing Wang, Pengcheng Guo, Ziqian Ning, Yuguang Yang, Yu Pan, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025
  38. TASLP
    Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
    Bingshen Mu, Kun Wei, Pengcheng Guo, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025
  39. TASLP
    Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought
    Zhixian Zhao, Xinfa Zhu, Xinsheng Wang, Shuiyuan Wang, Xuelong Geng, Wenjie Tian, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025
  40. TASLP
    FPO: Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
    Jixun Yao, Yuguang Yang, Yu Pan, Yuan Feng, Ziqian Ning, Jianhao Ye, and 2 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025

2024

  1. SLT
    Optimizing Dysarthria Wake-Up Word Spotting: an End-to-End Approach For SLT 2024 LRDWWS Challenge
    Shuiyun Liu, Yuxiang Kong, Pengcheng Guo, Weiji Zhuang, Peng Gao, Yujun Wang, and 1 more author
    In SLT, 2024
  2. SLT
    DualSep: A Light-weight dual-encoder convolutional recurrent network for real-time in-car speech separation
    Ziqian Wang, Jiayao Sun, Zihan Zhang, Xingchen Li, Jie Liu, and Lei Xie
    In SLT, 2024
  3. ICASSP
    PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
    Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, and 4 more authors
    In ICASSP, 2024
  4. ICASSP
    DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
    Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, and 1 more author
    In ICASSP, 2024
  5. ICASSP
    SponTTS: modeling and transferring spontaneous style for TTS
    Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, and Lei Xie
    In ICASSP, 2024
  6. ICASSP
    BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators
    Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, and Lei Xie
    In ICASSP, 2024
  7. ICASSP
    RaD-Net: A Repairing and Denoising Network for Speech Signal Improvement
    Mingshuai Liu, Zhuangqi Chen, Xiaopeng Yan, Yuanjun Lv, Xianjun Xia, Chuanzeng Huang, and 2 more authors
    In ICASSP, 2024
  8. ICASSP
    SELM: Speech Enhancement Using Discrete Tokens and Language Models
    Ziqian Wang, Xinfa Zhu, Zihan Zhang, Yuanjun Lv, Ning Jiang, Guoqing Zhao, and 1 more author
    In ICASSP, 2024
  9. ICASSP
    MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition
    He Wang, Pengcheng Guo, Pan Zhou, and Lei Xie
    In ICASSP, 2024
  10. ICASSP
    Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies
    Bingshen Mu, Pengcheng Guo, Dake Guo, Pan Zhou, Wei Chen, and Lei Xie
    In ICASSP, 2024
  11. ICME
    Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning
    Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, and Lei Xie
    In ICME, 2024
  12. ACL
    StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion
    Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Lei Xie, and Yuping Wang
    In ACL, 2024
  13. Interspeech
    FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
    Yuanjun Lv, Hai Li, Ying Yan, Junhui Liu, Danming Xie, and Lei Xie
    In Interspeech, 2024
  14. Interspeech
    Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation
    Hanzhao Li, Liumeng Xue, Haohan Guo, Xinfa Zhu, Yuanjun Lv, Lei Xie, and 3 more authors
    In Interspeech, 2024
  15. Interspeech
    Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
    Yuepeng Jiang, Tao Li, Fengyu Yang, Lei Xie, Meng Meng, and Yujun Wang
    In Interspeech, 2024
  16. Interspeech
    RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention
    Mingshuai Liu, Zhuangqi Chen, Xiaopeng Yan, Yuanjun Lv, Xianjun Xia, Chuanzeng Huang, and 2 more authors
    In Interspeech, 2024
  17. Interspeech
    BS-PLCNet 2: Two-stage Band-split Packet Loss Concealment Network with Intra-model Knowledge Distillation
    Zihan Zhang, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, and Lei Xie
    In Interspeech, 2024
  18. Interspeech
    SCDNet: Self-supervised Learning Feature-based Speaker Change Detection
    Yue Li, Xinsheng Wang, Li Zhang, and Lei Xie
    In Interspeech, 2024
  19. SLT
    Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
    Hongfei Xue, Rong Gong, Mingchen Shao, Xin Xu, Lezhi Wang, Lei Xie, and 7 more authors
    In SLT, 2024
  20. Interspeech
    DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion
    Ziqian Ning, Shuai Wang, Pengcheng Zhu, Zhichao Wang, Jixun Yao, Lei Xie, and 1 more author
    In Interspeech, 2024
  21. Interspeech
    Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy
    Linhan Ma, Xinfa Zhu, Yuanjun Lv, Zhichao Wang, Ziqian Wang, Wendi He, and 2 more authors
    In Interspeech, 2024
  22. Interspeech
    WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
    Linhan Ma, Dake Guo, Kun Song, Yuepeng Jiang, Shuai Wang, Liumeng Xue, and 4 more authors
    In Interspeech, 2024
  23. Interspeech
    Text-aware and Context-aware Expressive Audiobook Speech Synthesis
    Dake Guo, Xinfa Zhu, Liumeng Xue, Yongmao Zhang, Wenjie Tian, and Lei Xie
    In Interspeech, 2024
  24. Interspeech
    Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study
    Peikun Chen, Sining Sun, Changhao Shan, Qing Yang, and Lei Xie
    In Interspeech, 2024
  25. Interspeech
    A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
    Yangze Li, Xiong Wang, Songjun Cao, Yike Zhang, Long Ma, and Lei Xie
    In Interspeech, 2024
  26. Interspeech
    Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper
    Tianyi Xu, Kaixun Huang, Pengcheng Guo, Yu Zhou, Longtao Huang, Hui Xue, and 1 more author
    In Interspeech, 2024
  27. Speech Communication
    Whisper-SV: Adapting Whisper for low-data-resource speaker verification
    Li Zhang, Ning Jiang, Qing Wang, Yue Li, Quan Lu, and Lei Xie
    Speech Communication, 2024
  28. SPL
    MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition
    Bingshen Mu, Yangze Li, Qijie Shao, Kun Wei, Xucheng Wan, Naijun Zheng, and 2 more authors
    IEEE Signal Processing Letters, 2024
  29. SPL
    Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement
    Runduo Han, Weiming Xu, Zihan Zhang, Mingshuai Liu, and Lei Xie
    IEEE Signal Processing Letters, 2024
  30. TASLP
    U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning
    Tao Li, Zhichao Wang, Xinfa Zhu, Jian Cong, Qiao Tian, Yuping Wang, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  31. SPL
    StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
    Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Lei Xie, and Yuping Wang
    IEEE Signal Processing Letters, 2024
  32. TASLP
    SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR
    Pengcheng Guo, Xuankai Chang, Hang Lv, Shinji Watanabe, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  33. TASLP
    METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer
    Xinfa Zhu, Yi Lei, Tao Li, Yongmao Zhang, Hongbin Zhou, Heng Lu, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  34. TASLP
    Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation
    Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  35. TASLP
    Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion
    Zhichao Wang, Liumeng Xue, Qiuqiang Kong, Lei Xie, Yuanzhe Chen, Qiao Tian, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  36. TASLP
    Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix
    Jixun Yao, Qing Wang, Pengcheng Guo, Ziqian Ning, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

2023

  1. ASRU
    MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement
    Weiming Xu, Zhouxuan Chen, Zhili Tan, Shubo Lv, Runduo Han, Wenjiang Zhou, and 2 more authors
    In ASRU, 2023
  2. ASRU
    The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR
    Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, and 8 more authors
    In ASRU, 2023
  3. ASRU
    PromptSpeaker: Speaker Generation Based on Text Descriptions
    Yongmao Zhang, Guanghou Liu, Yi Lei, Yunlin Chen, Hao Yin, Lei Xie, and 1 more author
    In ASRU, 2023
  4. ASRU
    U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias
    Ao Zhang, Pan Zhou, Kaixun Huang, Yong Zou, Ming Liu, and Lei Xie
    In ASRU, 2023
  5. SLT
    MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
    Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, and 1 more author
    In SLT, 2023
  6. ICASSP
    Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling
    Xinfa Zhu, Yi Lei, Kun Song, Yongmao Zhang, Tao Li, and Lei Xie
    In ICASSP, 2023
  7. ICASSP
    DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSP
    Kun Song, Yongmao Zhang, Yi Lei, Jian Cong, Hanzhao Li, Lei Xie, and 2 more authors
    In ICASSP, 2023
  8. ICASSP
    Preserving background sound in noise-robust voice conversion via multi-task learning
    Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, and 3 more authors
    In ICASSP, 2023
  9. ICASSP
    Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling
    Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, and 1 more author
    In ICASSP, 2023
  10. ICASSP
    Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints
    Zhichao Wang, Xinsheng Wang, Lei Xie, Yuanzhe Chen, Qiao Tian, and Yuping Wang
    In ICASSP, 2023
  11. ICASSP
    Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
    Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, and 2 more authors
    In ICASSP, 2023
  12. ICASSP
    VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting
    Ao Zhang, He Wang, Pengcheng Guo, Yihui Fu, Lei Xie, Yingying Gao, and 2 more authors
    In ICASSP, 2023
  13. ICASSP
    Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation
    Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, and 2 more authors
    In ICASSP, 2023
  14. ICASSP
    Distance-based Weight Transfer from Near-field to Far-field Speaker Verification
    Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, and 1 more author
    In ICASSP, 2023
  15. ICASSP
    Two-step Band-split Neural Network Approach for Full-band Residual Echo Suppression
    Zihan Zhang, Shimin Zhang, Mingshuai Liu, Yanhong Leng, Zhe Han, Li Chen, and 1 more author
    In ICASSP, 2023
  16. ICASSP
    Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge
    Mingshuai Liu, Shubo Lv, Zihan Zhang, Runduo Han, Xiang Hao, Xianjun Xia, and 3 more authors
    In ICASSP, 2023
  17. Interspeech
    DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
    Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, and 1 more author
    In Interspeech, 2023
  18. Interspeech
    PromptStyle: Controllable Style Transfer for Text-to-Speech with Natural Language Descriptions
    Guanghou Liu, Yongmao Zhang, Yi Lei, Yunlin Chen, Rui Wang, Zhifei Li, and 1 more author
    In Interspeech, 2023
  19. Interspeech
    VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
    Yongmao Zhang, Heyang Xue, Hanzhao Li, Lei Xie, Tingwei Guo, Ruixiong Zhang, and 1 more author
    In Interspeech, 2023
  20. Interspeech
    StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation
    Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, and 2 more authors
    In Interspeech, 2023
  21. Interspeech
    DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting
    Shubo Lv, Xiong Wang, Sining Sun, Long Ma, and Lei Xie
    In Interspeech, 2023
  22. Interspeech
    Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer
    Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, and Lei Xie
    In Interspeech, 2023
  23. Interspeech
    Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
    Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, and 3 more authors
    In Interspeech, 2023
  24. Interspeech
    Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
    Kaixun Huang, Ao Zhang, Zhanheng Yang, Pengcheng Guo, Bingshen Mu, Tianyi Xu, and 1 more author
    In Interspeech, 2023
  25. Interspeech
    TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition
    Hongfei Xue, Qijie Shao, Peikun Chen, Pengcheng Guo, Lei Xie, and Jie Liu
    In Interspeech, 2023
  26. Interspeech
    BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR
    Yuhao Liang, Fan Yu, Yangze Li, Pengcheng Guo, Shiliang Zhang, Qian Chen, and 1 more author
    In Interspeech, 2023
  27. Interspeech
    Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification
    Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, and Lei Xie
    In Interspeech, 2023
  28. ASRU
    SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
    Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, and Lei Xie
    In ASRU, 2023
  29. ASRU
    SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR
    Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, and 2 more authors
    In ASRU, 2023
  30. ASRU
    BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
    Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, and 2 more authors
    In ASRU, 2023
  31. ASRU
    An Exploration of Task-decoupling on Two-stage Neural Post Filter for Real-time Personalized Acoustic Echo Cancellation
    Zihan Zhang, Jiayao Sun, Xianjun Xia, Ziqian Wang, Xiaopeng Yan, Yijian Xiao, and 1 more author
    In ASRU, 2023
  32. ASRU
    Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition
    Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, and Lei Xie
    In ASRU, 2023
  33. ASRU
    Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
    Yuke Li, Xinfa Zhu, Yi Lei, Hai Li, Junhui Liu, Danming Xie, and 1 more author
    In ASRU, 2023
  34. ASRU
    HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS
    Dake Guo, Xinfa Zhu, Liumeng Xue, Tao Li, Yuanjun Lv, Yuepeng Jiang, and 1 more author
    In ASRU, 2023
  35. AAAI
    UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
    Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, and 1 more author
    In AAAI, 2023
  36. SLT
    Spatial-DCCRN: DCCRN Equipped with Frame-level Angle Feature and Hybrid Filtering for Multi-channel Speech Enhancement
    Shubo Lv, Yihui Fu, Yukai Jv, Lei Xie, Weixin Zhu, Wei Rao, and 1 more author
    In SLT, 2023
  37. SLT
    TEA-PSE 2.0: Sub-Band Network for Real-Time Personalized Speech Enhancement
    Yukai Ju, Shimin Zhang, Wei Rao, Yannan Wang, Tao Yu, Lei Xie, and 1 more author
    In SLT, 2023
  38. TASLP
    Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
    Qijie Shao, Pengcheng Guo, Jinghao Yan, Pengfei Hu, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
  39. TASLP
    DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech – A Study between English and Mandarin
    Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, and 2 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
  40. TASLP
    MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling
    Zhichao Wang, Xinsheng Wang, Qicong Xie, Tao Li, Lei Xie, Qiao Tian, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
  41. SPL
    LM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models
    Zhichao Wang, Yuanzhe Chen, Lei Xie, Qiao Tian, and Yuping Wang
    IEEE Signal Processing Letters, 2023
  42. TASLP
    Timbre-reserved Adversarial Attack in Speaker Identification
    Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023

2022

  1. ISCSLP
    AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
    Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, and 3 more authors
    In ISCSLP, 2022
  2. ISCSLP
    AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with Accents
    Yongmao Zhang, Zhichao Wang, Peiji Yang, Hongshen Sun, Zhisheng Wang, and Lei Xie
    In ISCSLP, 2022
  3. ISCSLP
    Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS
    Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, and 1 more author
    In ISCSLP, 2022
  4. ISCSLP
    End-to-End Voice Conversion with Information Perturbation
    Qicong Xie, Shan Yang, Yi Lei, Lei Xie, and Dan Su
    In ISCSLP, 2022
  5. ISCSLP
    Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios
    Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, and 1 more author
    In ISCSLP, 2022
  6. VLSP
    MSV Challenge 2022: NPU-HC Speaker Verification System for Low-resource Indian Languages
    Yue Li, Li Zhang, Namin Wang, Jie Liu, and Lei Xie
    In VLSP, 2022
  7. Interspeech
    NPU-HC Speaker Verification System for Far-field Speaker Verification Challenge 2022
    Li Zhang, Yue Li, Namin Wang, Jie Liu, and Lei Xie
    In Interspeech, 2022
  8. Interspeech
    NWPU-ASLP System for the VoicePrivacy 2022 Challenge
    Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, and Lei Xie
    In Interspeech, 2022
  9. Interspeech
    Backend Ensemble for Speaker Verification and Spoofing Countermeasure
    Li Zhang, Yue Li, Huan Zhao, Qing Wang, and Lei Xie
    In Interspeech, 2022
  10. Interspeech
    Personalized Acoustic Echo Cancellation for Full-duplex Communications
    Shimin Zhang, Ziteng Wang, Yukai Ju, Yihui Fu, Yueyue Na, Qiang Fu, and 1 more author
    In Interspeech, 2022
  11. Interspeech
    Learning noise-independent speech representations for high-quality voice conversion for noisy target speakers
    Liumeng Xue, Shan Yang, Na Hu, Dan Su, and Lei Xie
    In Interspeech, 2022
  12. ICASSP
    Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by learning from Singing Teachers
    Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie, Pengcheng Zhu, and Mengxiao Bi
    In ICASSP, 2022
  13. Interspeech
    CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer
    Zhanheng Yang, Sining Sun, Jin Li, Xiaoming Zhang, Xiong Wang, Long Ma, and 1 more author
    In Interspeech, 2022
  14. Interspeech
    Minimizing Sequential Confusion Error in Speech Command Recognition
    Zhanheng Yang, Hang Lv, Xiong Wang, Ao Zhang, and Lei Xie
    In Interspeech, 2022
  15. Interspeech
    A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
    Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, and Lei Xie
    In Interspeech, 2022
  16. Interspeech
    Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
    Yu Wang, Xinsheng Wang, Pengcheng Zhu, Jie Wu, Hanzhao Li, Heyang Xue, and 3 more authors
    In Interspeech, 2022
  17. Interspeech
    WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
    Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, and 4 more authors
    In Interspeech, 2022
  18. Interspeech
    Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis
    Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, and Lei Xie
    In Interspeech, 2022
  19. Interspeech
    Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
    Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, and 1 more author
    In Interspeech, 2022
  20. Interspeech
    Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
    Kun Wei, Pengcheng Guo, and Ning Jiang
    In Interspeech, 2022
  21. Interspeech
    Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
    Kun Wei, Yike Zhang, Sining Sun, Lei Xie, and Long Ma
    In Interspeech, 2022
  22. Interspeech
    Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion
    Yi Lei, Shan Yang, Jian Cong, Lei Xie, and Dan Su
    In Interspeech, 2022
  23. ICASSP
    Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
    Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, and 10 more authors
    In ICASSP, 2022
  24. ICASSP
    M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
    Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, and 6 more authors
    In ICASSP, 2022
  25. ICASSP
    VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis
    Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, and Mengxiao Bi
    In ICASSP, 2022
  26. ICASSP
    S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement
    Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, and 2 more authors
    In ICASSP, 2022
  27. ICASSP
    TEA-PSE: Tencent-ethereal-audiolab personalized speech enhancement system for ICASSP 2022 DNS Challenge
    Yukai Ju, Wei Rao, Xiaopeng Yan, Yihui Fu, Shubo Lv, Luyao Cheng, and 3 more authors
    In ICASSP, 2022
  28. ICASSP
    Conversational Speech Recognition by Learning Conversation-level Characteristics
    Kun Wei, Yike Zhang, Sining Sun, Lei Xie, and Long Ma
    In ICASSP, 2022
  29. ICASSP
    WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
    Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, and 6 more authors
    In ICASSP, 2022
  30. ICASSP
    Uformer: A unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
    Yihui Fu, Yun Liu, Jingdong Li, Dawei Luo, Shubo Lv, Yukai Jv, and 1 more author
    In ICASSP, 2022
  31. ICASSP
    One-shot Voice Conversion for Style Transfer Based on Speaker Adaptation
    Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Pengcheng Zhu, and 1 more author
    In ICASSP, 2022
  32. ICASSP
    Multi-Task Deep Residual Echo Suppression with Echo-aware Loss
    Shimin Zhang, Ziteng Wang, Jiayao Sun, Yihui Fu, Biao Tian, Qiang Fu, and 1 more author
    In ICASSP, 2022
  33. ISCSLP
    The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
    Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, and 4 more authors
    In ISCSLP, 2022
  34. ISCSLP
    The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Switching ASR Challenge
    Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Yingying Gao, and 1 more author
    In ISCSLP, 2022
  35. ISCSLP
    TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge
    Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, and 2 more authors
    In ISCSLP, 2022
  36. Neural Networks
    Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution
    Jingyong Hou, Lei Xie, and Shilei Zhang
    Neural Networks, 2022
  37. TASLP
    MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis
    Yi Lei, Shan Yang, Xinsheng Wang, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
  38. TMM
    Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
    Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, and Yufei Zha
    IEEE Transactions on Multimedia, 2022
  39. TMM
    AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons
    Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, and Odette Scharenborg
    IEEE Transactions on Multimedia, 2022
  40. Neural Networks
    Neural speech enhancement with unsupervised pre-training and mixture training
    Xiang Hao, Chenglin Xu, and Lei Xie
    Neural Networks, 2022
  41. SPL
    Cross-speaker Emotion Transfer through Information Perturbation in Emotional Speech Synthesis
    Yi Lei, Shan Yang, Xinfa Zhu, Lei Xie, and Dan Su
    IEEE Signal Processing Letters, 2022
  42. TASLP
    ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
    Liumeng Xue, Frank K. Soong, Shaofei Zhang, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
  43. Neural Networks
    Noise-robust voice conversion with domain adversarial training
    Hongqiang Du, Lei Xie, and Haizhou Li
    Neural Networks, 2022
  44. TASLP
    Disentangling Style and Speaker Attributes for TTS Style Transfer
    Xiaochun An, Frank K. Soong, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022

2021

  1. SLT
    Multi-Channel Automatic Speech Recognition Using Deep Complex Unet
    Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, and 1 more author
    In SLT, 2021
  2. SLT
    Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
    Haoneng Luo, Shiliang Zhang, Ming Lei, and Lei Xie
    In SLT, 2021
  3. SLT
    DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
    Yihui Fu, Jian Wu, Yanxin Hu, Mengtao Xing, and Lei Xie
    In SLT, 2021
  4. SLT
    IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
    Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, and 6 more authors
    In SLT, 2021
  5. SLT
    The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines
    Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, and 3 more authors
    In SLT, 2021
  6. SLT
    Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis
    Yi Lei, Shan Yang, and Lei Xie
    In SLT, 2021
  7. SLT
    Learn2Sing: Target Speaker Singing Voice Synthesis by Learning from a Singing Teacher
    Heyang Xue, Shan Yang, Yi Lei, Lei Xie, and Xiulin Li
    In SLT, 2021
  8. SLT
    Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
    Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, and Lei Xie
    In SLT, 2021
  9. SLT
    Conversational End-to-End TTS for Voice Agent
    Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, and Lei Xie
    In SLT, 2021
  10. SLT
    Optimizing voice conversion network with cycle consistency loss of speaker identity
    Hongqiang Du, Xiaohai Tian, Lei Xie, and Haizhou Li
    In SLT, 2021
  11. ISCSLP
    Controllable Emotion Transfer For End-to-End Speech Synthesis
    Tao Li, Shan Yang, Liumeng Xue, and Lei Xie
    In ISCSLP, 2021
  12. ISCSLP
    Accent and Speaker Disentanglement in Many-to-many Voice Conversion
    Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, and 3 more authors
    In ISCSLP, 2021
  13. ISCSLP
    Context-aware RNNLM Rescoring for Conversational Speech Recognition
    Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie, and Xiulin Li
    In ISCSLP, 2021
  14. ISCSLP
    Adversarial Training for Multi-domain Speaker Recognition
    Qing Wang, Wei Rao, Pengcheng Guo, and Lei Xie
    In ISCSLP, 2021
  15. APSIPA ASC
    Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting
    Qijie Shao, Jingyong Hou, Yanxin Hu, Qing Wang, Lei Xie, and Xin Lei
    In APSIPA ASC, 2021
  16. ICML
    Efficient Gradient-Based Neural Architecture Search For End-to-End ASR
    Xian Shi, Pan Zhou, Wei Chen, and Lei Xie
    In ICML, 2021
  17. ICML
    TeNC: Low Bit-Rate Speech Coding with VQ-VAE and GAN
    Yi Chen, Shan Yang, Na Hu, Lei Xie, and Dan Su
    In ICML, 2021
  18. ICML
    Noise Robust Singing Voice Synthesis Using Gaussian Mixture Variational Autoencoder
    Heyang Xue, Xiao Zhang, Jie Wu, Jian Luan, Yujun Wang, and Lei Xie
    In ICML, 2021
  19. Interspeech
    Controllable Context-aware Conversational Speech Synthesis
    Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, and Dan Su
    In Interspeech, 2021
  20. Interspeech
    Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
    Jian Cong, Shan Yang, Lei Xie, and Dan Su
    In Interspeech, 2021
  21. Interspeech
    Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion
    Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, and 3 more authors
    In Interspeech, 2021
  22. Interspeech
    Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition
    Xiong Wang, Sining Sun, Lei Xie, and Long Ma
    In Interspeech, 2021
  23. Interspeech
    Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
    Pengcheng Guo, Xuankai Chang, Shinji Watanabe, and Lei Xie
    In Interspeech, 2021
  24. Interspeech
    F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement
    Shimin Zhang, Yuxiang Kong, Shubo Lv, Yanxin Hu, and Lei Xie
    In Interspeech, 2021
  25. Interspeech
    AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
    Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, and 7 more authors
    In Interspeech, 2021
  26. Interspeech
    Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification
    Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, and Haizhou Li
    In Interspeech, 2021
  27. Interspeech
    Improving robustness of one-shot voice conversion with deep discriminative speaker encoder
    Hongqiang Du and Lei Xie
    In Interspeech, 2021
  28. Interspeech
    Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS
    Xiaochun An, Frank K. Soong, and Lei Xie
    In Interspeech, 2021
  29. Interspeech
    DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement
    Lv Shubo, Hu Yanxin, Zhang Shimin, and Xie Lei
    In Interspeech, 2021
  30. Interspeech
    Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
    Wang Jingsong, He Yuxuan, Zhao Chunyu, Shao Qijie, Tu Wei-Wei, Ko Tom, and 2 more authors
    In Interspeech, 2021
  31. Interspeech
    WeNet: Production Oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
    Yao Zhuoyuan, Wu Di, Wang Xiong, Zhang Binbin, Yu Fan, and 5 more authors
    In Interspeech, 2021
  32. ICASSP
    An asynchronous WFST-based decoder for automatic speech recognition
    Lv Hang, Chen Zhehuai, Xu Hainan, Povey Daniel, Xie Lei, and Khudanpur Sanjeev
    In ICASSP, 2021
  33. ICASSP
    Wake word detection with streaming transformers
    Wang Yiming, Lv Hang, Povey Daniel, Xie Lei, and Khudanpur Sanjeev
    In ICASSP, 2021
  34. ICASSP
    The Multi-Speaker Multi-Style Voice Cloning Challenge 2021
    Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, and 6 more authors
    In ICASSP, 2021
  35. ICASSP
    The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
    Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, and 2 more authors
    In ICASSP, 2021
  36. ISCSLP
    The NPU System for the 2020 Personalized Voice Trigger Challenge
    Jingyong Hou, Li Zhang, Yihui Fu, Qing Wang, Zhanheng Yang, Qijie Shao, and 1 more author
    In ISCSLP, 2021
  37. SLT
    Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter
    Xiong Wang, Zhuoyuan Yao, Xian Shi, and Lei Xie
    In SLT, 2021
  38. ASRU
    Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR
    Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, and 3 more authors
    In ASRU, 2021
  39. ASRU
    Duality Temporal-channel-frequency Attention Enhanced Speaker Representation Learning
    Li Zhang, Qing Wang, and Lei Xie
    In ASRU, 2021
  40. TASLP
    Cross-Speaker Emotion Disentangling and Transfer for End-to-End Speech Synthesis
    Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, and Lei Xie
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021
  41. Speech Communication
    Factorized WaveNet for voice conversion with limited data
    Hongqiang Du, Xiaohai Tian, Lei Xie, and Haizhou Li
    Speech Communication, 2021
  42. Computer Speech and Language
    Effective and direct control of neural TTS prosody by removing interactions between different attributes
    Xiaochun An, Frank K. Soong, Shan Yang, and Lei Xie
    Computer Speech & Language, 2021
  43. SPL
    LET-Decoder: A WFST-based lazy-evaluation token-group decoder with exact lattice generation
    Hang Lv, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang, Lei Xie, and 1 more author
    IEEE Signal Processing Letters, 2021
  44. Neural Networks
    Cycle consistent network for end-to-end style transfer TTS training
    Liumeng Xue, Shifeng Pan, Lei He, Lei Xie, and Frank K. Soong
    Neural Networks, 2021

2020

  1. NeurIPS (NIPS)
    Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
    Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, and 2 more authors
    In NeurIPS (NIPS), 2020
  2. Interspeech
    DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
    Shan Yang, Yuxuan Wang, and Lei Xie
    In Interspeech, 2020
  3. Interspeech
    Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis
    Fengyu Yang, Shan Yang, Qinghua Wu, Yujun Wang, and Lei Xie
    In Interspeech, 2020
  4. Interspeech
    An End-to-end Architecture of Online Multi-channel Speech Separation
    Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, and 2 more authors
    In Interspeech, 2020
  5. Interspeech
    Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition
    Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, and 1 more author
    In Interspeech, 2020
  6. Interspeech
    Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition
    Qing Wang, Pengcheng Guo, and Lei Xie
    In Interspeech, 2020
  7. Interspeech
    NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
    Li Zhang, Jian Wu, and Lei Xie
    In Interspeech, 2020
  8. Interspeech
    Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music
    Haohe Liu, Lei Xie, Jian Wu, and Geng Yang
    In Interspeech, 2020
  9. Interspeech
    Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training
    Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, and Guanglu Wan
    In Interspeech, 2020
  10. Interspeech
    Wake Word Detection with Alignment-Free Lattice-Free MMI
    Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, and Sanjeev Khudanpur
    In Interspeech, 2020
  11. Interspeech
    AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification
    Jingsong Wang, Tom Ko, Zhen Xu, Xiawei Guo, Souxiang Liu, Wei-Wei Tu, and 1 more author
    In Interspeech, 2020
  12. ICASSP
    Mining Effective Negative Training Samples for Keyword Spotting
    Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, and Lei Xie
    In ICASSP, 2020
  13. ICASSP
    Effective Wavenet Adaptation for Voice Conversion with Limited Data
    Hongqiang Du, Xiaohai Tian, Lei Xie, and Haizhou Li
    In ICASSP, 2020
  14. ICASSP
    Time-Domain Neural Network Approach for Speech Bandwidth Extension
    Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, and Haizhou Li
    In ICASSP, 2020
  15. SPL
    Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise
    Shan Yang, Yuxuan Wang, and Lei Xie
    IEEE Signal Processing Letters, 2020
  16. TASLP
    Fast Query-by-example Speech Search using Attention-based Deep Binary Embeddings
    Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, and Bin Ma
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  17. Neural Networks
    On the localness modeling for the self-attention based end-to-end speech synthesis
    Shan Yang, Heng Lu, Shiyin Kang, Liumeng Xue, Jinba Xiao, Dan Su, and 2 more authors
    Neural Networks, 2020
  18. TALLIP
    Loanword Identification in Low-resource Languages with Minimal Supervision
    Chenggang Mi, Lei Xie, and Yanning Zhang
    ACM Transactions on Asian and Low-Resource Language Information Processing, 2020

2019

  1. ASRU
    Time Domain Audio Visual Speech Separation
    Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, and 1 more author
    In ASRU, 2019
  2. ASRU
    Wavenet Factorization with Singular Value Decomposition for Voice Conversion
    Hongqiang Du, Xiaohai Tian, Lei Xie, and Haizhou Li
    In ASRU, 2019
  3. ASRU
    Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias
    Fengyu Yang, Shan Yang, Pengcheng Zhu, Pengju Yan, and Lei Xie
    In ASRU, 2019
  4. ASRU
    Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings
    Yougen Yuan, Zhiqiang Lv, Shen Huang, and Lei Xie
    In ASRU, 2019
  5. ASRU
    Controlling Emotion Strength with Relative Attribute for End-To-End Speech Synthesis
    Xiaolian Zhu, Shan Yang, Geng Yang, and Lei Xie
    In ASRU, 2019
  6. ASRU
    Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis
    Xiaochun An, Yuxuan Wang, Shan Yang, Zejun Ma, and Lei Xie
    In ASRU, 2019
  7. ASRU
    Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting
    Xiong Wang, Sining Sun, and Lei Xie
    In ASRU, 2019
  8. ASRU
    ESPRESSO: A Fast End-To-End Neural Speech Recognition Toolkit
    Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, and 4 more authors
    In ASRU, 2019
  9. ASRU
    Incremental Lattice Determinization for WFST Decoders
    Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, and 1 more author
    In ASRU, 2019
  10. ICMI
    Deep Audio-visual System for Closed-set Word-level Speech Recognition
    Yougen Yuan, Wei Tang, Minhao Fan, Yue Chao, Peng Zhang, and Lei Xie
    In ICMI, 2019
  11. APSIPA ASC
    Exploring RNN-Transducer for Chinese Speech Recognition
    Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, and Lei Xie
    In APSIPA ASC, 2019
  12. APSIPA ASC
    Multiple Fixed Beamformers with a Spacial Wiener-form Postfilter for Far-Field Speech Recognition
    Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, and Xin Lei
    In APSIPA ASC, 2019
  13. Interspeech
    A New GAN-based End-to-End TTS Training Algorithm
    Haohan Guo, Frank K. Soong, Lei He, and Lei Xie
    In Interspeech, 2019
  14. Interspeech
    Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS
    Haohan Guo, Frank K. Soong, Lei He, and Lei Xie
    In Interspeech, 2019
  15. Interspeech
    Building a mixed-lingual neural TTS system with only monolingual data
    Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, and Zhizheng Wu
    In Interspeech, 2019
  16. Interspeech
    Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition
    Pengcheng Guo, Sining Sun, and Lei Xie
    In Interspeech, 2019
  17. Interspeech
    Improved Speaker-Dependent Separation for CHiME-5 Challenge
    Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, and 1 more author
    In Interspeech, 2019
  18. Interspeech
    Adversarial Regularization for End-to-end Robust Speaker Verification
    Qing Wang, Pengcheng Guo, Sining Sun, Lei Xie, and John H.L. Hansen
    In Interspeech, 2019
  19. Interspeech
    Towards Language-Universal Mandarin-English Speech Recognition
    Shiliang Zhang, Yuan Liu, Ming Lei, Bin Ma, and Lei Xie
    In Interspeech, 2019
  20. ICASSP
    Enhancing Hybrid Self-Attention Structure with Relative-Position-Aware Bias for Speech Synthesis
    Shan Yang, Heng Lu, Shiyin Kang, Lei Xie, and Dong Yu
    In ICASSP, 2019
  21. ICASSP
    Investigating End-To-End Speech Recognition for Mandarin-English Code-Switching
    Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, and 1 more author
    In ICASSP, 2019
  22. ICASSP
    Component Fusion: Learning Replaceable Language Model Component for End-To-End Speech Recognition System
    Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, and 1 more author
    In ICASSP, 2019
  23. ICASSP
    A Pitch-Aware Approach to Single-Channel Speech Separation
    Ke Wang, Frank Soong, and Lei Xie
    In ICASSP, 2019
  24. ICASSP
    Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech
    Jingyong Hou, Pengcheng Guo, Sining Sun, Frank K. Soong, Wenping Hu, and Lei Xie
    In ICASSP, 2019
  25. ICASSP
    An Attention-Based Neural Network Approach for Single Channel Speech Enhancement
    Xiang Hao, Changhao Shan, Yong Xu, Sining Sun, and Lei Xie
    In ICASSP, 2019
  26. ICASSP
    Adversarial Examples for Improving End-To-End Attention-Based Small-Footprint Keyword Spotting
    Xiong Wang, Sining Sun, Changhao Shan, Jingyong Hou, Lei Xie, Shen Li, and 1 more author
    In ICASSP, 2019
  27. ICASSP
    Robust Audio-Visual Speech Recognition Using Bimodal DFSMN with Multi-Condition Training and Dropout Regularization
    Shiliang Zhang, Ming Lei, Bin Ma, and Lei Xie
    In ICASSP, 2019
  28. CHiME
    The NWPU System for CHiME-5 Challenge
    Wu Jian, Xu Yong, Zhang Shi-Xiong, Chen Lian-Wu, Yu Meng, Xie Lei, and 1 more author
    In CHiME, 2019
  29. CHiME
    Multiple Beamformers with ROVER for the CHiME-5 Challenge
    Sining Sun, Yangyang Shi, Ching-Feng Yeh, Suliang Bu, Mei-Yuh Hwang, and Lei Xie
    In CHiME, 2019
  30. TETCI
    Improving Adversarial Neural Machine Translation for Morphologically Rich Language
    Chenggang Mi, Lei Xie, and Yanning Zhang
    IEEE Transactions on Emerging Topics in Computational Intelligence, 2019
  31. TASLP
    Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition
    Sining Sun, Pengcheng Guo, Lei Xie, and Mei-Yuh Hwang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019
  32. SPL
    Region Proposal Network Based Small-Footprint Keyword Spotting
    Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, and Lei Xie
    IEEE Signal Processing Letters, 2019
  33. Access
    Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis
    Xiaolian Zhu, Yuchao Zhang, Shan Yang, Liumeng Xue, and Lei Xie
    IEEE Access, 2019
  34. Access
    Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context
    Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, and Bin Ma
    IEEE Access, 2019

2018

  1. Interspeech
    Attention-based End-to-End Models for Small-Footprint Keyword Spotting
    Changhao Shan, Junbo Zhang, Yujun Wang, and Lei Xie
    In Interspeech, 2018
  2. Interspeech
    Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition
    Ke Wang, Junbo Zhang, Sining Sun, Yujun Wang, Fei Xiang, and Lei Xie
    In Interspeech, 2018
  3. Interspeech
    Training Augmentation with Adversarial Examples for Robust Speech Recognition
    Sining Sun, Ching-Feng Yeh, Mari Ostendorf, Mei-Yuh Hwang, and Lei Xie
    In Interspeech, 2018
  4. Interspeech
    Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model
    Ke Wang, Junbo Zhang, Yujun Wang, and Lei Xie
    In Interspeech, 2018
  5. Interspeech
    Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search
    Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, and Haizhou Li
    In Interspeech, 2018
  6. Interspeech
    Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
    Pengcheng Guo, Haihua Xu, Lei Xie, and Eng Siong Chng
    In Interspeech, 2018
  7. ICASSP
    Domain Adversarial Training for Accented Speech Recognition
    Sining Sun, Ching-Feng Yeh, Mei-Yuh Hwang, Mari Ostendorf, and Lei Xie
    In ICASSP, 2018
  8. ICASSP
    Attention-Based End-To-End Speech Recognition on Voice Search
    Changhao Shan, Junbo Zhang, Yujun Wang, and Lei Xie
    In ICASSP, 2018
  9. ISCSLP
    A Refined Query-by-Example Approach to Spoken-Term-Detection on ESL Learners’ Speech
    Jingyong Hou, Wenping Hu, Frank K. Soong, and Lei Xie
    In ISCSLP, 2018
  10. ACM MM
    A Kullback-Leibler Divergence Based Recurrent Mixture Density Network for Acoustic Modeling in Emotional Statistical Parametric Speech Synthesis
    Xiaochun An, Yuchao Zhang, Bing Liu, Liumeng Xue, and Lei Xie
    In ACM MM, 2018
  11. ACM MM
    A Comparison of Expressive Speech Synthesis Approaches based on Neural Network
    Liumeng Xue, Xiaolian Zhu, Xiaochun An, and Lei Xie
    In ACM MM, 2018
  12. ICASSP
    Unsupervised Domain Adaptation Via Domain Adversarial Training for Speaker Recognition
    Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, and Haizhou Li
    In ICASSP, 2018
  13. Journal of Signal Processing Systems
    Guest Editorial: Advances in Deep Learning for Speech Processing
    Lei Xie, Tan Lee, and Man-Wai Mak
    Journal of Signal Processing Systems, 2018

2017

  1. ASRU
    Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework
    Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, and 1 more author
    In ASRU, 2017
  2. Interspeech
    Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling
    Wenpeng Li, Binbin Zhang, Lei Xie, and Dong Yu
    In Interspeech, 2017
  3. ASRU
    Multilingual Bottle-Neck Feature Learning from Untranscribed Speech
    Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In ASRU, 2017
  4. ASRU
    Extracting Bottleneck Features and Word-Like Pairs from Untranscribed Speech for Feature Representation
    Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, and Haizhou Li
    In ASRU, 2017
  5. APSIPA ASC
    An End-to-End Neural Network Approach to Story Segmentation
    Jia Yu, Lei Xie, Xiong Xiao, and Eng Siong Chng
    In APSIPA ASC, 2017
  6. APSIPA ASC
    Topic Embedding of Sentences for Story Segmentation
    Jia Yu, Lei Xie, Xiong Xiao, and Eng Siong Chng
    In APSIPA ASC, 2017
  7. APSIPA ASC
    A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification
    Jie Yan, Lei Xie, Guangsen Wang, and Zhong-Hua Fu
    In APSIPA ASC, 2017
  8. Interspeech
    Denoising Recurrent Neural Network for Deep Bidirectional LSTM based Voice Conversion
    Jie Wu, Dongyan Huang, Lei Xie, and Haizhou Li
    In Interspeech, 2017
  9. ICASSP
    Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection
    Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, and Haizhou Li
    In ICASSP, 2017
  10. ICMI
    The I2R-NWPU Text-to-Speech System for Blizzard Challenge 2017
    Yanfeng Lu, Zhengchen Zhang, Chenyu Yang, Huaiping Ming, Xiaolian Zhu, Yuchao Zhang, and 4 more authors
    In ICMI, 2017
  11. Frontiers of Computer Science
    Sound image externalization for headphone based real-time 3D audio
    Yougen Yuan, Lei Xie, Zhong-Hua Fu, and Qi Cong
    Frontiers of Computer Science, 2017
  12. Journal of Signal Processing Systems
    A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection
    Chenglin Xu, Lei Xie, and Xiong Xiao
    Journal of Signal Processing Systems, 2017
  13. J-STSP
    Multi-Task Feature Learning for Low-Resource Query-by-Example Spoken Term Detection
    Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    IEEE Journal of Selected Topics in Signal Processing, 2017
  14. Signal Processing
    Learning Distributed Sentence Representations for Story Segmentation
    Jia Yu, Lei Xie, Xiong Xiao, and Eng Siong Chng
    Signal Processing, 2017

2016

  1. ISCSLP
    Investigating Neural Network based Query-by-Example Keyword Spotting Approach for Personalized Wake-up Word Detection in Mandarin Chinese
    Jingyong Hou, Lei Xie, and Zhonghua Fu
    In ISCSLP, 2016
  2. ISCSLP
    A Bi-directional LSTM Approach for Polyphone Disambiguation in Mandarin Chinese
    Changhao Shan, Lei Xie, and Kaisheng Yao
    In ISCSLP, 2016
  3. ISCSLP
    Investigating LSTM for Punctuation Prediction
    Kaituo Xu, Lei Xie, and Kaisheng Yao
    In ISCSLP, 2016
  4. APSIPA ASC
    Predicting Articulatory Movement From Text Using Deep Architecture with Stacked Bottleneck Features
    Zhen Wei, Zhizheng Wu, and Lei Xie
    In APSIPA ASC, 2016
  5. Interspeech
    Unsupervised Bottleneck Features for Low-Resource Query-By-Example Spoken Term Detection
    Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In Interspeech, 2016
  6. Interspeech
    Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information
    Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In Interspeech, 2016
  7. Interspeech
    A DNN-HMM Approach to Story Segmentation
    Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, and Haizhou Li
    In Interspeech, 2016
  8. Interspeech
    Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion
    Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong, and Haizhou Li
    In Interspeech, 2016
  9. Interspeech
    Toward High-Performance Language-Independent Query-By-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis
    Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, and 6 more authors
    In Interspeech, 2016
  10. ICME
    Deep Neural Network Derived Bottleneck Features for Accurate Audio Classification
    Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang, and Mingli Song
    In ICME, 2016
  11. ICASSP
    Exemplar-Based Sparse Representation of Timbre and Prosody for Voice Conversion
    Huaiping Ming, Dongyan Huang, Lei Xie, Shaofei Zhang, Minghui Dong, and Haizhou Li
    In ICASSP, 2016
  12. ICASSP
    Approximate Search of Audio Queries Using DTW with Phone Time Boundary and Data Augmentation
    Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, and 6 more authors
    In ICASSP, 2016
  13. ASRU
    Automatic Prosody Prediction for Chinese Speech Synthesis Using BLSTM-RNN and Embedding Features
    Chuang Ding, Lei Xie, Jie Yan, Weini Zhang, and Yang Liu
    In ASRU, 2016
  14. APSIPA ASC
    A Waveform Representation Framework for High-Quality Statistical Parametric Speech Synthesis
    Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie, and Minghui Dong
    In APSIPA ASC, 2016
  15. APSIPA ASC
    A Density Peak Clustering Approach to Unsupervised Acoustic Subword Units Discovery
    Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, and Haizhou Li
    In APSIPA ASC, 2016
  16. APSIPA ASC
    Non-Negative Matrix Factorization Using Stable Alternating Direction Method of Multipliers for Source Separation
    Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, and Minghui Dong
    In APSIPA ASC, 2016
  17. Interspeech
    Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study
    Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In Interspeech, 2016
  18. Interspeech
    An Alternating Optimization Approach for Phase Retrieval
    Huaiping Ming, Dongyan Huang, Lei Xie, Haizhou Li, and Minghui Dong
    In Interspeech, 2016
  19. Interspeech
    Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks and Word/Phone Embeddings
    Pengcheng Zhu, Lei Xie, and Yunlin Chen
    In Interspeech, 2016
  20. Interspeech
    Regularized Non-Negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source Separation
    Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, and Minghui Dong
    In Interspeech, 2016
  21. ICASSP
    Photo-Real Talking Head with Deep Bidirectional LSTM
    Bo Fan, Lijuan Wang, Frank K. Soong, and Lei Xie
    In ICASSP, 2016
  22. ICASSP
    Language Independent Query-By-Example Spoken Term Detection Using N-Best Phone Sequences and Partial Matching
    Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, and 7 more authors
    In ICASSP, 2016
  23. TASLP
    Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News
    Hongjie Chen, Lei Xie, Cheung-Chi Leung, Bin Ma, and Haizhou Li
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016
  24. Neurocomputing
    An Unsupervised Deep Domain Adaptation Approach for Robust Speech Recognition
    Sining Sun, Binbin Zhang, Lei Xie, and Yanning Zhang
    Neurocomputing, 2016

2015

  1. MTA
    A Deep Bidirectional LSTM Approach for Video-Realistic Talking Head
    Bo Fan, Lei Xie, Shan Yang, Lijuan Wang, and Frank K. Soong
    Multimedia Tools and Applications, 2015

2014

  1. APSIPA ASC
    Multi-View Features in a DNN-CRF Model for Improved Sentence Unit Detection on English Broadcast News
    Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Eng Siong Chng, and Haizhou Li
    In APSIPA ASC, 2014
  2. Interspeech
    Speech-Driven Head Motion Synthesis Using Neural Networks
    Chuang Ding, Pengcheng Zhu, Lei Xie, Dongmei Jiang, and Zhonghua Fu
    In Interspeech, 2014
  3. Interspeech
    A Deep Neural Network Approach for Sentence Boundary Detection in Broadcast News
    Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng, and Haizhou Li
    In Interspeech, 2014
  4. Interspeech
    Intrinsic Spectral Analysis Based on Temporal Context Features for Query By Example Spoken Term Detection
    Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In Interspeech, 2014
  5. Interspeech
    Stereo Acoustic Echo Suppression Using Widely Linear Filtering in the Frequency Domain
    Zhong-Hua Fu and Lei Xie
    In Interspeech, 2014
  6. ISCSLP
    A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency
    Shaofei Zhang, Lei Xie, and Zhong-Hua Fu
    In ISCSLP, 2014
  7. ISCSLP
    Experimental Study on Dereverberation and Noise Reduction for Distant Speech Recognition
    Zhong-Hua Fu and Lei Xie
    In ISCSLP, 2014
  8. ICIP
    An Ensemble of Deep Neural Networks for Object Tracking
    Xiangzeng Zhou, Lei Xie, Peng Zhang, and Yanning Zhang
    In ICIP, 2014
  9. ICASSP
    Unsupervised Broadcast News Story Segmentation Using Distance Dependent Chinese Restaurant Processes
    Chao Yang, Lei Xie, and Xiangzeng Zhou
    In ICASSP, 2014
  10. CHINA SIP
    Learning Optimal Features for Music Transcription
    Huaiping Ming, Dongyan Huang, Lei Xie, and Haizhou Li
    In CHINA SIP, 2014
  11. CHINA SIP
    Sentence Boundary Detection in Chinese Broadcast News Using Conditional Random Fields and Prosodic Features
    Chenglin Xu, Lei Xie, and Zhonghua Fu
    In CHINA SIP, 2014
  12. MTA
    Multimodal Joint Information Processing in Human Machine Interaction: Recent Advances
    Lei Xie, Zhigang Deng, and Stephen Cox
    Multimedia Tools and Applications, 2014
  13. MTA
    A Statistical Parametric Approach to Video-Realistic Text-Driven Talking Avatar
    Lei Xie, Naicai Sun, and Bo Fan
    Multimedia Tools and Applications, 2014
  14. SOFT COMPUT
    Topic Segmentation on Spoken Documents Using Self-Validated Acoustic Cuts
    Hongjie Chen, Lei Xie, Wei Feng, Lilei Zheng, and Yanning Zhang
    Soft Computing, 2014
  15. MTA
    Head Motion Synthesis From Speech Using Deep Neural Networks
    Chuang Ding, Lei Xie, and Pengcheng Zhu
    Multimedia Tools and Applications, 2014
  16. TMM
    Tennis Ball Tracking Using a Two-Layered Data Association Approach
    Xiangzeng Zhou, Lei Xie, Qiang Huang, Stephen Cox, and Yanning Zhang
    IEEE Transactions on Multimedia, 2014

2013

  1. ACM MM
    Online Object Tracking Based on CNN with Metropolis-Hasting Re-Sampling
    Xiangzeng Zhou, Lei Xie, Peng Zhang, and Yanning Zhang
    In ACM MM, 2013
  2. YES
    Filter Bank Design for Automatic Music Transcription
    Huaiping Ming, Lei Xie, and Haizhou Li
    In YES, 2013
  3. APSIPA ASC
    Personalized 3-D Facial Expression Synthesis Based on Landmark Constraint
    Haoran Liang, Mingli Song, Lei Xie, and Ronghua Liang
    In APSIPA ASC, 2013
  4. APSIPA ASC
    Numerical Calculation of the Head-Related Transfer Functions with Chinese Dummy Head
    Ling Tang, Zhong-Hua Fu, and Lei Xie
    In APSIPA ASC, 2013
  5. ICASSP
    A Tighter Lower Bound Estimate for Dynamic Time Warping
    Peng Yang, Lei Xie, Qiao Luan, and Wei Feng
    In ICASSP, 2013
  6. ICASSP
    A Two Layered Data Association Approach for Ball Tracking
    Xiangzeng Zhou, Qiang Huang, Lei Xie, and Stephen Cox
    In ICASSP, 2013
  7. ICASSP
    Broadcast News Story Segmentation Using Latent Topics on Data Manifold
    Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In ICASSP, 2013
  8. ICASSP
    Measuring Semantic Similarity by Contextual Word Connections in Chinese News Story Segmentation
    Xuecheng Nie, Wei Feng, Liang Wan, and Lei Xie
    In ICASSP, 2013
  9. APSIPA ASC
    Face Sketch-To-Photo Synthesis From Simple Line Drawing
    Yang Liang, Mingli Song, Lei Xie, Jiajun Bu, and Chun Chen
    In APSIPA ASC, 2013
  10. ACL
    Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions
    Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma, and Haizhou Li
    In ACL, 2013
  11. APSIPA ASC
    Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications
    Jianwei Niu, Lei Xie, Lei Jia, and Na Hu
    In APSIPA ASC, 2013
  12. QHDXXB
    Head Motion Generation for Speech-Driven Talking Avatar
    Bingfeng Li, Lei Xie, Pengcheng Zhu, and Bo Fan
    Journal of Tsinghua University (Science and Technology), 2013
  13. QHDXXB
    Mandarin speech pattern discovery using segmental dynamic tim
    Peng Yang, Lei Xie, and Hongjie Chen
    Journal of Tsinghua University (Science and Technology), 2013

2012

  1. Interspeech
    Speech Pattern Discovery Using Audio-Visual Fusion and Canonical Correlation Analysis
    Lei Xie, Yinqing Xu, Lilei Zheng, Qiang Huang, and Bingfeng Li
    In Interspeech, 2012
  2. Interspeech
    A Two Stage Mask Estimation Approach to Robust Speaker Verification
    Yali Zhao, Lei Xie, and Zhonghua Fu
    In Interspeech, 2012
  3. Interspeech
    Lexical Story Co-Segmentation of Chinese Broadcast News
    Wei Feng, Xuecheng Nie, Liang Wan, Lei Xie, and Jianmin Jiang
    In Interspeech, 2012
  4. ISCSLP
    Prosody-Based Sentence Boundary Detection in Chinese Broadcast News
    Lei Xie, Chenglin Xu, and Xiaoxuan Wang
    In ISCSLP, 2012
  5. APSIPA ASC
    Detection of Ball Hits in a Tennis Game Using Audio and Visual Information
    Qiang Huang, Stephen Cox, Xiangzeng Zhou, and Lei Xie
    In APSIPA ASC, 2012
  6. ICASSP
    Acoustic TextTiling for Story Segmentation of Spoken Documents
    Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In ICASSP, 2012
  7. ICALIP
    Dual-Microphone Based Binary Mask Estimation for Robust Speaker Verification
    Yali Zhao, Zhong-Hua Fu, Lei Xie, Jian Zhang, and Yanning Zhang
    In ICALIP, 2012
  8. ICALIP
    Comprehensive Comparison of the Least Mean Square Algorithm and the Fast Deconvolution Algorithm for Crosstalk Cancellation
    Dan Li, Zhong-Hua Fu, and Lei Xie
    In ICALIP, 2012

2011

  1. Interspeech
    Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation
    Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In Interspeech, 2011
  2. APSIPA ASC
    Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps
    Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li
    In APSIPA ASC, 2011
  3. APSIPA ASC
    Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering
    Xiaoyu Chen, Zhonghua Fu, and Lei Xie
    In APSIPA ASC, 2011
  4. APSIPA ASC
    A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array
    Jian Zhang, Zhonghua Fu, and Lei Xie
    In APSIPA ASC, 2011
  5. Information Sciences
    On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News
    Lei Xie, Yulian Yang, and Zhi-Qiang Liu
    Information Sciences, 2011
  6. Multimedia Syst
    Pitch-Density-Based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News
    Lei Xie, Zhong-Hua Fu, Wei Feng, and Yong Luo
    Multimedia Systems, 2011
  7. QHDXXB
    Real-Time Speech Driven Talking Avatar
    Li Bingfeng, Xie Lei, Zhou Xiangzeng, Fu Zhonghua, and Zhang Yanning
    Journal of Tsinghua University (Science and Technology), 2011
  8. QHDXXB
    Semi-Blind Dual-Microphone Noise Reduction with Known Target Localization
    Zhang Jian, Fu Zhonghua, Xie Lei, and Zhao Yali
    Journal of Tsinghua University (Science and Technology), 2011
  9. CJE
    An Automatic Caption Generator for Mandarin Broadcast News
    Zheng Li-Lei, Xie Lei, Lu Mi-Mi, Wang Xiao-Xuan, Yang Yu-Lian, and Zhang Yan-Ning
    Chinese Journal of Electronics, 2011
  10. TASLP
    Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News
    Lei Xie, Lilei Zheng, Zihan Liu, and Yanning Zhang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2011

2010

  1. ISCSLP
    Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News
    Mimi Lu, Lei Xie, Zhonghua Fu, and Dongmei Jiang
    In ISCSLP, 2010
  2. APSIPA ASC
    Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation
    Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, and Haizhou Li
    In APSIPA ASC, 2010
  3. Interspeech
    Maximum Lexical Cohesion for Fine-Grained News Story Segmentation
    Zihan Liu, Lei Xie, and Wei Feng
    In Interspeech, 2010
  4. Interspeech
    Phoneme Lattice Based TextTiling Towards Multilingual Story Segmentation
    Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, and Haizhou Li
    In Interspeech, 2010
  5. ICALIP
    Integrating Acoustic and Lexical Features in Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach
    Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng, and Zihan Liu
    In ICALIP, 2010
  6. ICALIP
    Laplacian Eigenmaps for Automatic News Story Segmentation
    Zihan Liu, Lei Xie, and Lilei Zheng
    In ICALIP, 2010
  7. UIC
    Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications
    Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, Bingfeng Li, Naicai Sun, and 2 more authors
    In UIC, 2010
  8. ICWMMN
    An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-Based Virtual Auditory on Chinese Subjects
    Xiaohai Tian, Zhonghua Fu, and Lei Xie
    In ICWMMN, 2010
  9. Information Sciences
    Minimizing the Expected Complete Influence Time of a Social Network
    Yaodong Ni, Lei Xie, and Zhi-Qiang Liu
    Information Sciences, 2010

2009

  1. ACCV
    Multicue Graph Mincut for Image Segmentation
    Wei Feng, Lei Xie, and Zhi-Qiang Liu
    In ACCV, 2009
  2. AIRS
    A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News
    Jin Zhang, Lei Xie, Wei Feng, and Yanning Zhang
    In AIRS, 2009
  3. ISCSLP
    A Two-Stage Multi-Feature Integration Approach to Unsupervised Speaker Change Detection in Real-Time News Broadcasting
    Lei Xie and Guangsen Wang
    In ISCSLP, 2009
  4. HHME
    Anchor Labeling System for Broadcast News Using ALIZE Toolkit
    Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, and Yanning Zhang
    In HHME, 2009
  5. JVLC
    Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models
    Wei Feng, Lei Xie, Jia Zeng, and Zhi-Qiang Liu
    Journal of Visual Languages and Computing, 2009
  6. Information Sciences
    Cascade Markov Random Fields for Stroke Extraction of Chinese Characters
    Jia Zeng, Wei Feng, Lei Xie, and Zhi-Qiang Liu
    Information Sciences, 2009
  7. IEICE TIS
    Dynamic Bayesian Network Inversion for Robust Speech Recognition
    Lei Xie
    IEICE Transactions on Information and Systems, 2009

2008

  1. ISCSLP
    Subword Latent Semantic Analysis for TextTiling-Based Automatic Story Segmentation of Chinese Broadcast News
    Yulian Yang and Lei Xie
    In ISCSLP, 2008
  2. PCM
    Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News
    Lei Xie and Yulian Yang
    In PCM, 2008
  3. AIRS
    Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News
    Lei Xie, Jia Zeng, and Wei Feng
    In AIRS, 2008
  4. Multimedia Syst
    Discovering Salient Prosodic Cues and Their Interactions for Automatic Story Segmentation in Mandarin Broadcast News
    Lei Xie
    Multimedia Systems, 2008

2007

  1. ICME
    Noise Robust Features for Speech/Music Discrimination in Real-Time Telecommunication
    Zhonghua Fu, Jhing-Fa Wang, and Lei Xie
    In ICME, 2007
  2. NCMMSC
    Classification of Music and Speech in Mandarin News Broadcasts
    Chuan Liu, Lei Xie, and Helen Meng
    In NCMMSC, 2007
  3. Interspeech
    Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation
    Shing-Kai Chan, Lei Xie, and Helen Mei-Ling Meng
    In Interspeech, 2007

2006

  1. ICPR
    Speech Animation Using Coupled Hidden Markov Model
    Lei Xie and Zhi-Qiang Liu
    In ICPR, 2006
  2. ICSMC
    Lip assistant: Visualize speech for hearing impaired people in multin
    Lei Xie and Zhi-Qiang Liu
    In ICSMC, 2006
  3. ISCSLP
    A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion
    Lei Xie, Helen Meng, and Zhi-Qiang Liu
    In ISCSLP, 2006
  4. ICMLC
    Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition
    Lei Xie and Zhi-Qiang Liu
    In ICMLC, 2006
  5. NAACL
    Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News
    Lei Xie, Chuan Liu, and Helen Meng
    In NAACL, 2006
  6. ICASSP
    An Articulatory Approach to Video-Realistic Mouth Animation
    Lei Xie and Zhi-Qiang Liu
    In ICASSP, 2006
  7. TMM
    Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling
    Lei Xie and Zhi-Qiang Liu
    IEEE Transactions on Multimedia, 2006
  8. PR
    A Coupled HMM Approach for Video-Realistic Speech Animation
    Lei Xie and Zhi-Qiang Liu
    Pattern Recognition, 2006