Lei Xie
|
Ph.D., Professor, Senior Member, IEEE. Audio, Speech & Language Processing Group (ASLP), Shaanxi Provincial Key Laboratory of Speech & Image Information Processing (SAIIP). Dean Assistant, School of Computer Science, Northwestern Polytechnical University, Xi'an, China. E-mail: lxie (at) nwpu.edu.cn, xielei21st (at) gmail.com, lxie (at) nwpu-aslp.org (replace "(at)" with @) |
Dr. Lei Xie is currently a Professor in the Audio, Speech & Language Processing Group (ASLP), Shaanxi Provincial Key Laboratory of Speech & Image Information Processing, School of Computer Science, Northwestern Polytechnical University. Dr. Xie obtained his Ph.D. degree in Computer Science from Northwestern Polytechnical University, China, in 2004. He has worked at several overseas universities. From 2001 to 2002, he worked with Professor Hichem Sahli at the Department of Electronics and Informatics, Vrije Universiteit Brussel (VUB), Brussels, Belgium, as a Visiting Scientist. From 2004 to 2006, he worked with Professor Zhi-Qiang Liu as a Postdoctoral Researcher in the Center for Media Technology (RCMT), School of Creative Media, City University of Hong Kong, Hong Kong SAR. From 2006 to 2007, he worked with Professor Helen Meng as a Postdoctoral Fellow and Project Lead in the Human-Computer Communications Laboratory (HCCL), Department of Systems Engineering & Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR. He was also a visiting professor at the University of East Anglia (UEA), United Kingdom, in 2011.
Dr. Xie's general research interests include audio, speech and language processing, multimedia information processing, human-computer interaction, pattern recognition and machine learning. Current research topics include spoken content analysis, automatic speech recognition and synthesis, audio-visual and multimodal signal processing, virtual auditory, spoken dialogue systems and multimedia applications. He has published over 80 papers in refereed journals and major conferences, including IEEE Transactions on Multimedia, IEEE Transactions on Audio, Speech and Language Processing, Pattern Recognition, ACM/Springer Multimedia Systems Journal, Information Sciences, ICASSP, Interspeech, ICPR and NAACL-HLT. Dr. Xie has served as a program/organizing chair, committee member and reviewer for various international conferences. He serves as an editor of the Open Journal of Acoustics and as a reviewer for IEEE Transactions on Multimedia, IEEE Transactions on Audio, Speech and Language Processing, IEEE Transactions on Visualization and Computer Graphics, Pattern Recognition and Information Sciences, etc. Dr. Xie has participated in many research projects as principal investigator (PI) and co-investigator (Co-I), supported by the National Natural Science Foundation of China (NSFC), the Ministry of Education, and the Research Grants Council (RGC) of Hong Kong SAR. He is a Senior Member of IEEE, a member of ISCA, a member of ACM, a member of APSIPA, a member of SIG-CSLP and a senior member of the China Computer Federation (CCF). He is a member of the Board of Governors of the Chinese Information Processing Society of China (CIPSC), a board member of the APSIPA Speech, Language and Audio (SLA) technical committee, a board member of the multimedia technical committee of CCF, a board member of the multimedia technical committee of the China Society of Image and Graphics (CSIG), and a standing committee member of NCMMSC.
He was enrolled in the Program for New Century Excellent Talents in University in 2008, supported by the Ministry of Education (MOE) of China. He was a recipient of the Fok Ying Tung Education Foundation Grant (Year 2012).
Lei Xie, Lilei Zheng, Zihan Liu and Yanning Zhang, "Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp 264-277, January 2012. PDF Bib
Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Acoustic Texttiling For Story Segmentation Of Spoken Documents", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2012), March 25 - 30, Kyoto, Japan, 2012. PDF Bib
Lei Xie, Yulian Yang and Zhi-Qiang Liu, "On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News", Information Sciences, 181(13):2873–2891, Elsevier, 2011. PDF Bib
Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation", Interspeech2011, Florence, Italy, August, 2011. (Interspeech Grant) PDF Bib
Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng and Haizhou Li, "Broadcast News Story Segmentation Using Conditional Random Fields and Multi-modal Features", IEICE Transactions on Information and Systems, invited paper, to appear, 2012. PDF Bib
Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011. PDF Bib
Xiaoyu Chen, Zhonghua Fu and Lei Xie, "Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011
Jian Zhang, Zhonghua Fu and Lei Xie, "A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011
Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo, "Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News", ACM/Springer Multimedia Systems Journal, 17(2):101-112, 2011. PDF Bib
LI Bingfeng, XIE Lei, ZHOU Xiangzeng, FU Zhonghua and ZHANG Yanning, "Real-time speech driven talking avatar", Journal of Tsinghua University, 2011, 51(9):1180-1186. (In Chinese, selected paper from NCMMSC2011, Best Student Paper Nomination Award) PDF Bib
ZHANG Jian, FU Zhonghua, XIE Lei and ZHAO Yali, "Semi-blind dual-microphone noise reduction with known target localization", Journal of Tsinghua University, 2011, 51(9):1215-1219. (In Chinese, selected paper from NCMMSC2011)
ZHENG Li-lei, XIE Lei, LU Mi-mi, WANG Xiao-xuan, YANG Yu-lian and ZHANG Yan-ning, "An Automatic Caption Generator for Mandarin Broadcast News", Chinese Journal of Electronics, 39(3A): 69-74, 2011. PDF Bib
Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei Jiang, "Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010. PDF Bib
Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation", APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010. PDF Bib
Zihan Liu, Lei Xie, Wei Feng, "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. (Interspeech Best Student Paper Award Finalist) PDF Bib
Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Phoneme Lattice based TextTiling towards Multilingual Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. PDF Bib
Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, "Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach," International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.
Zihan Liu, Lei Xie and Lilei Zheng, "Laplacian Eigenmaps for Automatic News Story Segmentation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.
Lei Xie et al., "Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications," Demo for The 7th International Conference on Ubiquitous Intelligence and Computing (UIC), October 26-29, 2010, Xi'an, China.
Xiaohai Tian, Zhonghua Fu and Lei Xie, "An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects," The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26-29 September 2010, Beijing, China.
Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, "Minimizing the Expected Complete Influence Time of a Social Network," Information Sciences, 180(13):2514-2527, 2010.
Wei Feng, Lei Xie and Zhi-Qiang Liu, "Multicue Graph Mincut for Image Segmentation", Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.
Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, "A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.
Wei Feng, Lei Xie, Jia Zeng and Zhi-Qiang Liu, "Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models," Journal of Visual Languages and Computing, invited paper, 20(3):188-195, 2009.
Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Cascade Markov random fields for stroke extraction of Chinese characters," Information Sciences, 180(2):301-311, 2009.
Lilei Zheng, Lei Xie, Xiaoxuan Wang, Mimi Lu, Yulian Yang and Yanning Zhang, "An Automatic Caption Generator for Mandarin Broadcast News," 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009. (Best Paper Award)
Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, "Anchor Labeling System for Broadcast News using Alize toolkit", 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009
Zhonghua Fu, Jhing-Fa Wang and Lei Xie, "Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication", IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.
Lei Xie, "Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news", ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.
Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models", Pattern Recognition, 41, 2008, pp 3636-3643.
Lei Xie and Guangsen Wang, "A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008. PDF Bib
Yulian Yang and Lei Xie, "Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008. (Microsoft Student Grant. This paper is also presented in the 2008 Beijing-Hong Kong International Doctoral Forum, Beijing) PDF Bib
Lei Xie and Yulian Yang, "Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News", Pacific-Rim Conference on Multimedia (PCM2008), LNCS 5353, Springer, pp248-258, 2008.
Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), LNCS 4993, Harbin, China, pp345-355, Springer, 2008.
Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510. PDF Bib
Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007, pp2325-2340. PDF Bib
Lei Xie, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159.
Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL), pp193-196, Rochester, NY, USA, April, 2007.
Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC), Huangshan, Anhui, China, 2007.
Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation", Interspeech, Belgium, 2007. PDF Bib
Lei Xie, and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. I, pp593-596, Toulouse, France, 2006.
Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP2006), LNAI 4274, Singapore, pp627-639, Springer, Dec, 2006.
Lei Xie and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", Advances in Machine Learning and Cybernetics, LNAI 3930, Springer, pp99-114, April, 2006.
Lei Xie, and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR), vol. I, pp1128-1131, Hong Kong, 2006.
Lei Xie, and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services", International Conference on System, Man and Cybernetics (ICSMC), pp4331-4336, Taipei, Taiwan, 2006.
......
WARNING: This page contains links to pdf files whose contents may be covered by copyright. You may browse them at your convenience in the same spirit as you may read a journal or a conference proceedings article in a public library. Retrieving, copying, or distributing these files, however, may violate international copyright protection law.
What's New:
Serving as a Reviewer or a Program Committee Member for:
Topic Segmentation: a summary of recent approaches, talk in University of East Anglia, Norwich, United Kingdom, Dec, 2011
Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps, APSIPA ASC2011, Xi'an, China, 2011
Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation, Interspeech2011, Florence, Italy, 2011
Maximum Lexical Cohesion for Fine-Grained News Story Segmentation, Interspeech2010, Makuhari, Japan, 2010
Phoneme Lattice based TextTiling towards Multilingual Story Segmentation, Interspeech2010, Makuhari, Japan, 2010
A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News, Asia Information Retrieval Symposium (AIRS2009), Sapporo, Japan, 2009
Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, International Symposium on Chinese Spoken Language Processing (ISCSLP), Yunnan, China, 2008
Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, 2008 Beijing-Hong Kong International Doctoral Forum, Beijing, China
A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting, International Symposium on Chinese Spoken Language Processing (ISCSLP), Yunnan, China, 2008
Speech-driven Talking Face for Interactive Human-Computer Communication, National Cheng Kung University (NCKU), Tainan, Taiwan, 2008
Automatic Story Segmentation of Chinese Broadcast News based on Special Features of Chinese Language, Academic Annual Meeting for Postgraduates, NWPU, Xi'an, China, Nov 2007.
Classification of Music and Speech in Mandarin News Broadcast, NCMMSC2007, Huangshan, China, Oct 2007.
Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modeling, NWPU, Xi'an, China, July 2007.
Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation, Interspeech2007, Antwerp, Belgium, Aug 2007.
Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News, HLT2007, NY, USA, April 2007.
A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion, ISCSLP2006, Singapore, Dec 2006.
| ASLP@NPU | Audio, Speech and Language Processing Group, Northwestern Polytechnical University, China | |
| HCCL | Professor Helen Meng, Human Computer Communications Laboratory, The Chinese University of Hong Kong | |
| CSAIL-MIT | Computer Science and Artificial Intelligence Laboratory, MIT | |
| SLS-MIT | Spoken Language Systems, MIT; Victor Zue, Stephanie Seneff, Jim Glass... | |
| HTK | Hidden Markov Model Toolkit, Cambridge Univ., UK | |
| ASLP@UEA | Audio, Speech and Language Processing Lab., University of East Anglia, UK | |
| Cambridge Machine Intelligence Laboratory | Steve Young, P.C. Woodland, Mark Gales,... | |
| LTI @ CMU | Language Technologies Institute, CMU | |
| Speech @ CMU | Speech Technologies at CMU | |
| Fred Juang | Professor Biing-Hwang (Fred) Juang at Georgia Tech | |
| C. H. Lee | Professor Chin-Hui Lee at Georgia Tech | |
| Speech Tech @ MS Research | Speech Technology at Microsoft Research | |
| CLSP @ JHU | Center for Language and Speech Processing, Johns Hopkins University | |
| L.-S. Lee | Professor Lee Lin-Shan at National Taiwan University, Taiwan | |
| Chiu-yu Tseng | Dr. Chiu-yu Tseng at Institute of Linguistics, Academia Sinica, Taiwan |