Lei Xie Lei Xie

Ph.D, Professor,Senior Member IEEE

Audio, Speech & Language Processing Group (ASLP)

Shaanxi Provincial Key Laboratory of Speech & Image Information Processing (SAIIP)

Dean Assistant,

School of Computer Science, Northwestern Polytechnical University, Xi'an, China

E-mail: lxie (at) nwpu.edu.cn, xielei21st (at) gmail.com, lxie (at) nwpu-aslp.org (Convert AT to @)

Want to be my student or interested in my research? Please don't hesitate to contact me by xielei21st (at) gmail.com!

Biosketch

Dr. Lei Xie is currently a Professor in the Audio, Speech & Language Processing Group (ASLP), Shaanxi Provincial Key Laboratory of Speech & Image Information Processing, School of Computer Science, Northwestern Polytechnical Univeristy. Dr. Xie obtained his Ph.D. degree in Computer Science from Northwestern Polytechnical University, China in 2004. Dr. Lei Xie has worked in several oversea universities. From 2001 to 2002, he has worked with Professor Hichem Sahli at the Department of Electronics and Informatics, Vrije Universiteit Brussel (VUB), Brussels, Belgium as a Visiting Scientist. From 2004 to 2006, he has worked with Professor Zhi-Qiang Liu as a Postdoctoral Researcher in the Center for Media Technology (RCMT), School of Creative Media, City University of Hong Kong, Hong Kong SAR. From 2006 to 2007, Dr. Xie has worked with Professor Helen Meng as a Postdoctoral Fellow and a Project Lead in the Human-Computer Communications Laboratory (HCCL), Department of Systems Engineering & Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR. He was also a visiting professor in the University of East Anglia (UEA), United Kingdom in 2011.

Dr. Xie's general research interests include audio, speech and languagpe processing, multimedia information processing, human-computer interaction, pattern recognition and machine learning. Current research topics include spoken content analysis, automatic speech recognition and synthesis, audio-visual and multimodal signal processing, virtural auditory, spoken dialogue system and multimedia applications. He has published over 80 papers in refereed journals and major conferences, including IEEE Transactions on Multimedia, IEEE Transactions on Audio, Speech and Language Processing, Pattern Recognition, ACM/Springer Multimedia Systems Journal, Information Sciences, ICASSP, Interspeech, ICPR, NAACL-HLT. Dr Xie has served as program/organizing chairs, committee members and reviewers of various international conferences. He serves as an editor of Open Journal of Acoustics and reviewers for IEEE Transactions on Multimedia, IEEE Transactions on Audio, Speech and Language Processing, IEEE Transactions on Visualization and Computer Graphics, Pattern Recognition and Information Sciences, etc. Dr. Xie has participated in many research projects as principal investigator (PI) and co-investigator (Co-I), supported by National Natural Science Foundation of China (NSFC), Ministry of Education, and Research Grant Council (RCG) of Hong Kong SAR. He is a Senior Member of IEEE, a member of ISCA, a member of ACM, a member of APSIPA, a member of SIG-CSLP and a senior member of China Computer Federation (CCF). He is a Board-of-Governor of the Chinese Information Processing Society of China (CIPSC), a board member of the APSIPA Speech, Language and Audio (SLA) technical committee, a board member of the multimedia technical committee of CCF, a board member of the multimedia technical committee of China Society of Image and Graphics (CSIG). a standing committee member of NCMMSC. He has enrolled in the Program for New Century Excellent Talents in University in 2008, supported by the Ministry of Education (MOE) of China. He was a recepient of  Fok Ying Tung Education Foundation Grant (Year 2012).

Research Interests

  • Audio, Speech and Language Processing
  • Multimedia Information Processing
  • Pattern Recogntion and Machine Learning
  • Human Computer Interaction

Recent Selected Publications

Lei Xie, Lilei Zheng, Zihan Liu and Yanning Zhang, "Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp 264-277, January 2012. PDF Bib

Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Acoustic Texttiling For Story Segmentation Of Spoken Documents", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2012), March 25 - 30, Kyoto, Japan, 2012. PDF Bib

Lei Xie, Yulian Yang and Zhi-Qiang Liu, "On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News", Information Sciences, 181(13):2873–2891, Elsevier, 2011. PDF Bib

Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation", Interspeech2011, Florence, Italy, August, 2011. (Interspeech Grant) PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng and Haizhou Li, "Broadcast News Story Segmentation Using Conditional Random Fields and Multi-modal Features", IEICE Transactions on Information and Systems, invited paper, to appear, 2012. PDF Bib

Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011. PDF Bib

Xiaoyu Chen, Zhonghua Fu and Lei Xie, "Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Jian Zhang, Zhonghua Fu and Lei Xie, "A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo,"Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News", ACM/Springer Multimedia Systems Journal, 17(2):101-112 , 2011. PDF Bib

LI Bingfeng, XIE Lei, ZHOU Xiangzeng, FU Zhonghua and ZHANG Yanning, "Real-time speech driven talking avatar", Journal of Tsinghua University, 2011, 51(9):1180-1186. (In Chinese, selected paper from NCMMSC2011, Best Student Paper Nomination Award) PDF Bib

ZHANG Jian, FU Zhonghua, XIE Lei and ZHAO Yali, "Semi-blind dual-microphone noise reduction with known target localization", Journal of Tsinghua University. 2011, 51(9):1215-1219. (In Chinese, selected paper from NCMMSC2011)

ZHENG Li-lei, XIE Lei, LU Mi-mi, WANG Xiao-xuan, YANG Yu-lian and ZHANG Yan-ning, "An Automatic Caption Generator for Mandarin Broadcast News", Chinese Journal of Electronics, 39(3A): 69-74, 2011. PDF Bib

Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei-Jiang, "Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010. PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation", APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010. PDF Bib

Zihan Liu, Lei Xie, Wei Feng, "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. (Interspeech Best Student Paper Award Finalist) PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Phoneme Lattice based TextTiling towards Multilingual Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. PDF Bib

Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, "Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach," International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Zihan Liu, Lei Xie and Lilei Zheng, "Laplacian Eigenmaps for Automatic News Story Segmentation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Lei Xie et al., "Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications," Demo for The 7th International Conference on Ubiquitous Intelligence and Computing(UIC), October 26-29, 2010, Xi'an, China

Xiaohai Tian, Zhonghua Fu and Lei Xie, "An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects," The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26 - 29, September 2010, Beijing, China.

Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, " Minimizing the Expected Complete Influence Time of a Social Network," Information Sciences, 180(13): 2514-2527, 2010.

Wei Feng, Lei Xie and Zhi-Qiang Liu, "Multicue Graph Mincut for Image Segmentation", Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.

Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, "A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.

Wei Feng, Lei Xie, Jia Zeng and Zhi-Qiang Liu, "Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models," Journal of Visual languages and Computing , invited paper, 20(3):188-195, 2009.

Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Cascade Markov random fields for stroke extraction of Chinese characters," Information Sciences, 180(2):301-311, 2009.

Lilei Zheng, Lei Xie, Xiaoxuan Wang, Mimi Lu, Yulian Yang and Yanning Zhang, "An Antomatic Caption Generator for Mandarin Broadcast News," 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009 (Best Paper Award)

Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, "Anchor Labeling System for Broadcast News using Alize toolkit", 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009

Zhonghua Fu, Jhing-Fa Wang and Lei Xie, "Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication", IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.

Lei Xie, "Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news", ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.

Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models" Pattern Recognition, 41, 2008, pp 3636-3643.

Lei Xie and Guangsen Wang, "A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008. PDF Bib

Yulian Yang and Lei Xie, "Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008. (Microsoft Student Grant. This paper is also presented in the 2008 Beijing-Hong Kong International Doctoral Forum, Beijing) PDF Bib

Lei Xie and Yulian Yang, "Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News", Pacific-Rim Conference on Multimedia (PCM2008), LNCS 5353, Springer, pp248-258, 2008.

Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), LNCS 4993, Harbin, China, pp345-355, Springer, 2008.

Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510. PDF Bib

Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007, pp2325-2340. PDF Bib

Lei Xie, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159.

Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference /North American chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL), pp193-196, Rochester, NY, USA, April, 2007.

Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC), Huangshan, Anhui, China, 2007.

Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation" Interspeech, Belgium, 2007. PDF Bib

Lei Xie, and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. I, pp593-596, Toulouse, France, 2006.

Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP2006), LNAI 4274, Singapore, pp627-639, Springer, Dec, 2006.

Lei Xie and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", Advances in Machine Learning and Cybernetics, LNAI 3930, Springer, pp99-114, April, 2006.

Lei Xie, and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR), vol. I, pp1128-1131, Hong Kong, 2006.

Lei Xie, and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services" , International Conference on System, Man and Cybernetics (ICSMC) , pp4331-4336, Taipei, Taiwan, 2006.

......

WARNING: This page contains links to pdf files whose contents may be covered by copyright. You may browse them at your convenience in the same spirit as you may read a journal or a conference proceedings article in a public library. Retrieving, copying, or distributing these files, however, may violate international copyright protection law. 

Recent Professional Activities

What's New:

Serving as a Reviewer or a Program Committee Member for:

  • IEEE Transactions on Audio, Speech and Language Processing
  • IEEE Transactions on Multimedia
  • IEEE Transactions on Visualization and Computer Graphics
  • IEEE Transactions on Fuzzy Systems
  • ACM Transactions on Embedded Computing Systems
  • Pattern Recognition
  • Soft Computing
  • Information Sciences
  • International Journal on Computational Intelligence and Applications (IJCAI)
  • Journal of Ambient Intelligence and Humanized Computing (AIHC)
  • APSIPA Transactions on Signal and Information Processing
  • 清华大学学报
  • 华南理工大学学报
  • ACL 2012
  • HHME 2011
  • APSIPA ASC 2011
  • 2011 ACM Conference on Information and Knowledge Management (CIKM)
  • International Conference On Audio, Language And Image Processing (ICALIP2010)
  • International Symposium on Chinese Spoken Language Processing (ISCSLP2010)
  • HHME 2010
  • IEEE Tencon 2009
  • International Conference On Audio, Language And Image Processing (ICALIP2008)
  • The 18th International Conference on Pattern Recognition (ICPR2006)
  • The 8th ICAISC Int'l Conference on Artificial Intelligence and Soft Computing (ICAISC2006)
  • Int'l Conference on Machine Learning and Cybernetics (ICMLC2006)
  • Int'l conference on Systems, Man and Cybernetics (ICSMC2006)
  • Int'l Conference of Computational Intelligence and Multimedia Applications (ICCIMA2005)
  • Int'l Conference on Machine Learning and Cybernetics (ICMLC2005)
  • Int'l Symposium on Modeling Decisions for Artificial Intelligence (MDAI2005)
  • Asia-Pacific Workshop on Visual Information Processing (VIP2005)
  • The 18th International FLAIRS Conference (FLAIRS2005)
  • ......

Presentations/Slides:

Topic Segmentation: a summary of recent approaches, talk in University of East Anglia, Norwich, United Kingdom, Dec, 2011

Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps, APSIPA ASC2011, Xi'an, China, 2011

Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation, Interspeech2011, Florence, Italy, 2011

Maximum Lexical Cohesion for Fine-Grained News Story Segmentation, Interspeech2010, Makuhari, Japan, 2010

Phoneme Lattice based TextTiling towards Multilingual Story Segmentation, Interspeech2010, Makuhari, Japan, 2010

A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News, Asia Information Retrieval Symposium (AIRS2009), Sapporo, Japan, 2009

Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, International Symposium on Chinese Spoken Language Processing (ISCSLP), Yunnan, China, 2008

Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, 2008 Beijing-Hong Kong International Doctoral Forum, Beijing, China

A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting, International Symposium on Chinese Spoken Language Processing (ISCSLP), Yunnan, China, 2008

Speech-driven Talking Face for Interactive Human-Computer Communication, National Cheng Kung University (NCKU), Tainan, Taiwan, 2008

Automatic Story Segmentation of Chinese Broadcast News based on Special Features of Chinese Language, Academic Annual Meeting for Postgraduates, NWPU, Xi'an, China, Nov 2007.

Classification of Music and Speech in Mandarin News Broadcast, NCMMSC2007, Huangshan, China, Oct 2007.

Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modeling, NWPU, Xi'an, China, July 2007.

Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation, Interspeech2007, Anterwerp, Belgium, Aug 2007.

Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News, HLT2007, NY, USA, April 2007.

A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion, ISCSLP2006, Singapore, Dec 2006.

Links:

ASLP@NPU   Audio, Speech and Language Processing Group, Northwestern Polytechnical University, China
HCCL   Professor Helen Meng, Human Computer Communications Laboratory, The Chinese University of Hong Kong
CSAIL-MIT
  Computer Science and Artificial Intelligence Laboratory, MIT
SLS-MIT   Spoken Language Systems, MIT; Victor Zue, Stephanie Seneff, Jim Class...
HTK   Hidden Markov Toolkit, Cambirdge Univ., UK
ASLP@UEA   Audio, Speech and Language Processing Lab., University of East Anglia, UK
Cambridge Machine Intelligence Laboratory   Steve Young, P.C. Woodland, Mark Gales,...
LTI @ CMU   Language Technologies Institute, CMU
Speech @ CMU   Speech Technologies at CMU
Fred Juang   Professor Biing-Hwang (Fred) Juang at Georgia Tech
C. H. Lee   Professor Chin-Hui Lee at Georgia Tech
Speech Tech @ MS Research   Speech Technology at Microsoft Research
CSLP @ JHU   Center for Speech and Language Processing, Johns Hopkins University
L.-S. Lee   Professor Lee Lin-Shan at National Taiwan University, Taiwan
Chiu-yu Tseng   Dr. Chiu-yu Tseng at  Institute of Linguistic, Academia Sinica, Taiwan