aslp@npu


Last updated: April 2016

Affiliated with

Collaborators

Baidu

Unisound

Sogou

Huawei

Microsoft

Horizon Robotics

I2R

Harman

AVIC

谢磊 Lei Xie

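Ph.D., Professor, Doctoral Supervisor, Assistant Dean

School of Computer Science, Northwestern Polytechnical University

Shaanxi Provincial Key Laboratory of Speech and Image Information Processing

Audio, Speech and Language Processing Group

Email: lxie (at) nwpu.edu.cn, xielei21st (at) gmail.com, lxie (at) nwpu-aslp.org (replace (at) with @)

Homepage: http://lxie.npu-aslp.org

Want to become my student? Interested in my research? Send me an email!

Latest News

Biography: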

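Lei Xie is a professor and doctoral supervisor. He received his Ph.D. from Northwestern Polytechnical University in 2004 and has held research positions at several universities abroad. From 2001 to 2002, he was a visiting scholar at Vrije Universiteit Brussel, Belgium, working with Professor Hichem Sahli. From 2004 to 2006, he conducted research at the Centre for Media Technology, School of Creative Media, City University of Hong Kong, working with Professor Zhi-Qiang Liu. From 2006 to 2007, he conducted research at the Human-Computer Communications Laboratory, Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, working with Professor Helen Meng. In 2007, he joined the School of Computer Science, Northwestern Polytechnical University, as a recruited overseas talent, and in the same year was selected for the university's first "Aoxiang Star" program. He has been selected for the "Program for New Century Excellent Talents in University" of the Ministry of Education of China, and has received honors including the Shaanxi Province Young Science and Technology Star and the Xi'an Youth Science and Technology Award.

Dr. Xie's research interests include audio, speech and language processing, multimedia information processing, pattern recognition and machine learning, and human-computer interaction. His current research topics include speech content analysis, speech recognition and synthesis, audio-visual and multimodal information processing, virtual auditory technology, natural human-machine interfaces and spoken dialogue, and multimedia application systems. He has published more than 120 papers in major journals and conferences, including IEEE Transactions on Multimedia, IEEE Transactions on Audio, Speech and Language Processing, Pattern Recognition, ACM/Springer Multimedia Systems Journal, Information Sciences, ACL, ACM Multimedia, Interspeech and ICASSP. He has led and participated in a number of projects funded by the National Natural Science Foundation of China, provincial and ministerial programs, and the Hong Kong Research Grants Council, as well as major international collaborative projects, and has carried out extensive technical collaboration with well-known companies such as Baidu, Microsoft, IBM, Huawei, ZTE and Sogou.

He has served as Program Committee Chair of the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), Program Committee Chair of the 11th National Conference on Man-Machine Speech Communication, Local Organizing Chair of the 3rd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011), Publication Chair of the 15th Annual Conference of the International Speech Communication Association (Interspeech2014), and program committee member or session chair of many other international and national conferences. Professor Xie is a Senior Member of IEEE, a council member of the Chinese Information Processing Society of China, Vice Director of its Speech Information Technical Committee, a member of the APSIPA Speech, Language and Audio Technical Committee, Workgroup Chair of the Special Interest Group on Chinese Spoken Language Processing (SIG-CSLP), a member of the NCMMSC standing committee, a member of the Multimedia Technical Committee of the China Computer Federation, a member of the Multimedia Technical Committee of the China Society of Image and Graphics, and a Senior Member of the China Computer Federation.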

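Research Interests:

  • Audio, speech and language processing
  • Multimedia information processing
  • Pattern recognition and machine learning
  • Human-computer interaction

Recent Publications: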

Jingyong Hou, Lei Xie, Zhonghua Fu, "Investigating Neural Network based Query-by-Example Keyword Spotting Approach for Personalized Wake-up Word Detection in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Changhao Shan, Lei Xie, Kaisheng Yao, "A Bi-directional LSTM Approach for Polyphone Disambiguation in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Kaituo Xu, Lei Xie, Kaisheng Yao, "Investigating LSTM for Punctuation Prediction", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Zhengchen Zhang, Mei Li, Yuchao Zhang, Weini Zhang, Yang Liu, Shan Yang, Yanfeng Lu, Van Tung Pham, Lei Xie, Minghui Dong, "The I2R-NWPU-NTU Text-to-Speech System at Blizzard Challenge 2016", Blizzard Challenge 2016 Workshop, September 16, 2016, Apple Inc., Cupertino, CA, USA PDF

Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Haizhou Li, "An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity", the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13-15, 2016, Sunnyvale, CA, USA PDF

Mei Li, Zhizheng Wu, Lei Xie, "On the impact of phoneme alignment in DNN-based speech synthesis", the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13-15, 2016, Sunnyvale, CA, USA PDF

Jie Wu, Zhizheng Wu, Lei Xie, "On the Use of I-vectors and Average Voice Model for Voice Conversion without Parallel Data", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea

Shan Yang, Zhizheng Wu, Lei Xie, "On the training of DNN-based average voice model for speech synthesis", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea

Zhen Wei, Zhizheng Wu, Lei Xie, "Predicting Articulatory Movement from Text Using Deep Architecture with Stacked Bottleneck Features", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea

Xiong Xiao, Chenglin Xu, Zhaofeng Zhang, Shengkui Zhao, Sining Sun, Shinji Watanabe, Longbiao Wang, Lei Xie, Douglas L. Jones, Eng Siong Chng, Haizhou Li, "Investigation of Neural Networks Based Beamforming Approaches for Speech Recognition: The NTU Systems for CHiME-4 Evaluation", the 4th International Workshop on Speech Processing in Everyday Environments (CHiME), San Francisco, September 13, 2016 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Learning Neural Network Representations using Cross-lingual Bottleneck Features with Word-pair Information", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng and Haizhou Li, "A DNN-HMM Approach to Story Segmentation", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong and Haizhou Li, "Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li, "Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang and Mingli Song, "Deep neural network derived bottleneck features for accurate audio classification", ICME2016, July 11-15, 2016, Seattle, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Shaofei Zhang, Minghui Dong and Haizhou Li, "Exemplar-based Sparse Representation of Timbre and Prosody for Voice Conversion", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Approximate Search of Audio Queries using DTW with Phone Time Boundary and Data Augmentation", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Chuang Ding, Lei Xie, Jie Yan, Weini Zhang and Yang Liu, "Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features", 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2015), Dec 13-17, 2015, Scottsdale, Arizona PDF

Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Eng Siong Chng, Haizhou Li, "The NNI Query-by-Example System for MediaEval 2015", MediaEval 2015 Workshop, Wurzen, Germany, Sept 14-15, 2015 PDF (Best performing system in the MediaEval2015 QUESST Evaluation)

Xiangzeng Zhou, Lei Xie, Peng Zhang and Yanning Zhang, "Online Object Tracking based on CNN with Metropolis-Hasting Re-sampling", ACM Multimedia 2015, Brisbane, Australia, Oct 26-30, 2015 PDF

Bo Fan, Lei Xie, Shan Yang, Lijuan Wang and Frank K. Soong, "A Deep Bidirectional LSTM Approach for Video-Realistic Talking Head", Multimedia Tools and Applications, Springer, 2015 PDF

Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie and Minghui Dong, "A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li, "A Density Peak Clustering Approach to Unsupervised Acoustic Subword Units Discovery", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li and Minghui Dong, "Non-negative Matrix Factorization using Stable Alternating Direction Method of Multipliers for Source Separation", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study", Interspeech2015, September 6-10, Dresden, Germany PDF (Interspeech2015 Zerospeech Challenge Best Paper Award)

Huaiping Ming, Dongyan Huang, Lei Xie, Haizhou Li and Minghui Dong, "An Alternating Optimization Approach for Phase Retrieval", Interspeech2015, September 6-10, Dresden, Germany PDF

Pengcheng Zhu, Lei Xie, Yunlin Chen, "Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks and Word/Phone Embeddings", Interspeech2015, September 6-10, Dresden, Germany PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong, "Regularized Non-negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source Separation", Interspeech2015, September 6-10, Dresden, Germany PDF

Xiangzeng Zhou, Lei Xie, Qiang Huang, Stephen Cox and Yanning Zhang, "Tennis Ball Tracking using a Two-Layered Data Association Approach", IEEE Transactions on Multimedia, 2014 PDF

Bo Fan, Lijuan Wang, Frank K. Soong and Lei Xie, "Photo-real Talking Head with Deep Bidirectional LSTM", ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, "Language Independent Query-by-Example Spoken Term Detection using N-Best Phone Sequences and Partial Matching", ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, "The NNI Query-by-Example System for MediaEval 2014", MediaEval 2014 Workshop, Barcelona, Spain, Oct 16-17, 2014 PDF

Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li, "Multi-View Features in a DNN-CRF Model for Improved Sentence Unit Detection on English Broadcast News", APSIPA ASC 2014, Siem Reap, Cambodia, December 9-12, 2014

Chuang Ding, Pengcheng Zhu, Lei Xie, Dongmei Jiang and Zhonghua Fu, "Speech-Driven Head Motion Synthesis Using Neural Networks," Interspeech, Singapore, 14-18, September 2014 PDF

Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng and Haizhou Li, "A Deep Neural Network Approach for Sentence Boundary Detection in Broadcast News," Interspeech, Singapore, 14-18, September 2014 PDF

Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Intrinsic Spectral Analysis Based on Temporal Context Features for Query by Example Spoken Term Detection," Interspeech, Singapore, 14-18, September 2014 (Best Student Paper Finalist) PDF

Zhong-hua Fu, Lei Xie, "Stereo Acoustic Echo Suppression Using Widely Linear Filtering in the Frequency Domain," Interspeech, Singapore, 14-18, September 2014

Shaofei Zhang, Lei Xie, Zhong-hua Fu, "A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency," ISCSLP, Singapore, 12-14, September 2014

Zhong-hua Fu, Lei Xie, "Experimental Study on Dereverberation and Noise Reduction for Distant Speech Recognition," ISCSLP, Singapore, 12-14, September 2014

Hongjie Chen, Lei Xie, Wei Feng, Lilei Zheng and Yanning Zhang, "Topic Segmentation on Spoken Documents Using Self-Validated Acoustic Cuts," Soft Computing, Springer, accepted, June 2014

Xiangzeng Zhou, Lei Xie, Peng Zhang, Yanning Zhang, "An Ensemble of Deep Neural Networks for Object Tracking", ICIP2014, October 27-30, 2014, Paris, France PDF

Chuang Ding, Lei Xie, Pengcheng Zhu, "Head Motion Synthesis From Speech Using Deep Neural Networks", Multimedia Tools and Applications, Springer, accepted, 2014

Chao Yang, Lei Xie and Xiangzeng Zhou, "Unsupervised Broadcast News Story Segmentation Using Distance Dependent Chinese Restaurant Processes", ICASSP2014, May 4-9, 2014, Florence, Italy PDF

Huaiping Ming, Dongyan Huang, Lei Xie and Haizhou Li, "Learning Optimal Features for Music Transcription", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Chenglin Xu, Lei Xie and Zhonghua Fu, "Sentence Boundary Detection in Chinese Broadcast News using Conditional Random Fields and Prosodic Features", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Huaiping Ming, Lei Xie and Haizhou Li, "Filter Bank Design for Automatic Music Transcription", the 2013 Young Engineers and Scientists Conference on Multimedia, Communication and Mobile Application Technologies (YES2013), Nov. 8, 2013, Singapore

Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions", ACL2013, 4-9 August, 2013, Sofia, Bulgaria. PDF

Jianwei Niu, Lei Xie, Lei Jia and Na Hu, "Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013. PDF

Haoran Liang, Mingli Song, Lei Xie and Ronghua Liang, "Personalized 3-D Facial Expression Synthesis based on Landmark Constraint", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Ling Tang, Zhong-Hua Fu and Lei Xie, "Numerical Calculation of the Head-Related Transfer Functions with Chinese Dummy Head", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Lei Xie, Zhigang Deng and Stephen Cox, "Multimodal joint information processing in human machine interaction: recent advances", Multimedia Tools and Applications, Guest Editorial, Springer, November, 2013.

Lei Xie, Naicai Sun and Bo Fan, "A Statistical Parametric Approach to Video-Realistic Text-driven Talking Avatar", Multimedia Tools and Applications, Springer, August 2013.

Peng Yang, Lei Xie, Qiao Luan and Wei Feng, "A Tighter Lower Bound Estimate for Dynamic Time Warping", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiangzeng Zhou, Qiang Huang, Lei Xie and Stephen Cox, "A Two Layered Data Association Approach for Ball Tracking", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Broadcast News Story Segmentation Using Latent Topics on Data Manifold", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xuecheng Nie, Wei Feng, Liang Wan, Lei Xie, "Measuring Similarity by Contextual Word Connections in Chinese News Story Segmentation", ICASSP2013, May 26-31, 2013, Vancouver, Canada

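Bingfeng Li, Lei Xie, Pengcheng Zhu, Bo Fan, "Natural Head Motion Generation for Speech-Driven Talking Avatars" (in Chinese), the 12th National Conference on Man-Machine Speech Communication, Journal of Tsinghua University, 2013, No. 6 PDF

Peng Yang, Lei Xie, Hongjie Chen, "Chinese Speech Pattern Discovery Based on SDTW and Posterior Features" (in Chinese), the 12th National Conference on Man-Machine Speech Communication, Journal of Tsinghua University, 2013, No. 6 PDF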

Lei Xie, Lilei Zheng, Zihan Liu and Yanning Zhang, "Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp 264-277, January 2012. PDF Bib

Lei Xie, Yinqing Xu, Lilei Zheng, Qiang Huang and Bingfeng Li, "Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Yali Zhao, Lei Xie and Zhonghua Fu, "A Two Stage Mask Estimation Approach to Robust Speaker Verification", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Wei Feng, Xuecheng Nie, Liang Wan, Lei Xie and Jianmin Jiang, "Lexical Story Co-Segmentation of Chinese Broadcast News", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib

Lei Xie, Chenglin Xu and Xiaoxuan Wang, "Prosody-based Sentence Boundary Detection in Chinese Broadcast News", The 8th International Symposium on Chinese Spoken Language Processing (ISCSLP2012), Hong Kong, China, December 5-8, 2012 PDF Bib

Qiang Huang, Stephen Cox, Xiangzeng Zhou and Lei Xie, "Detection of Ball Hits in a Tennis Game Using Audio and Visual Information", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Yang Liang, Mingli Song, Lei Xie, Jiajun Bu and Chun Chen, "Face Sketch-to-Photo Synthesis from Simple Line Drawing", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Acoustic Texttiling For Story Segmentation Of Spoken Documents", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2012), March 25 - 30, Kyoto, Japan, 2012. PDF Bib Poster

Yali Zhao, Zhong-Hua Fu, Lei Xie, Jian Zhang, Yanning Zhang, "Dual-microphone based binary mask estimation for robust speaker verification", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16 -18, 2012.

Dan Li, Zhong-Hua Fu and Lei Xie, "Comprehensive Comparison of the Least Mean Square Algorithm and the Fast Deconvolution Algorithm for Crosstalk Cancellation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16 -18, 2012.

Lei Xie, Yulian Yang and Zhi-Qiang Liu, "On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News", Information Sciences, 181(13):2873–2891, Elsevier, 2011. PDF Bib

Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation", Interspeech2011, Florence, Italy, August, 2011. (Interspeech Grant) PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng and Haizhou Li, "Broadcast News Story Segmentation Using Conditional Random Fields and Multi-modal Features", IEICE Transactions on Information and Systems, invited paper, to appear, 2012. PDF Bib

Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011. PDF Bib

Xiaoyu Chen, Zhonghua Fu and Lei Xie, "Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Jian Zhang, Zhonghua Fu and Lei Xie, "A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

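Bingfeng Li, Lei Xie, Xiangzeng Zhou, Zhonghua Fu, Yanning Zhang, "Real-Time Speech-Driven Talking Avatar" (in Chinese), the 11th National Conference on Man-Machine Speech Communication (Journal of Tsinghua University), 2011 PDF Bib

Jian Zhang, Zhonghua Fu, Lei Xie, Yali Zhao, "Dual-Microphone Noise Suppression with Known Target Source Direction" (in Chinese), the 11th National Conference on Man-Machine Speech Communication (Journal of Tsinghua University), 2011

Yali Zhao, Zhonghua Fu, Lei Xie, Jian Zhang, Yanning Zhang, "Robust Speaker Verification Combining Dual-Microphone Speech Enhancement and Mixed-Condition Model Training" (in Chinese), the 11th National Conference on Man-Machine Speech Communication, 2011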

Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo, "Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News", ACM/Springer Multimedia Systems Journal, 17(2):101-112, 2011. PDF Bib

Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei Jiang, "Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010. PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation", APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010. PDF Bib

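Lilei Zheng, Lei Xie, Mimi Lu, Xiaoxuan Wang, Yulian Yang, Yanning Zhang, "Design and Implementation of a Fully Automatic Caption Generation System for Chinese News" (in Chinese), Acta Electronica Sinica, 2011 PDF Bib

Zihan Liu, Lei Xie, Wei Feng, "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. (Best Student Paper Finalist) PDF Bib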

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Phoneme Lattice based TextTiling towards Multilingual Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. PDF Bib

Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, "Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach," International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Zihan Liu, Lei Xie and Lilei Zheng, "Laplacian Eigenmaps for Automatic News Story Segmentation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Lei Xie et al., "Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications," Demo for The 7th International Conference on Ubiquitous Intelligence and Computing(UIC), October 26-29, 2010, Xi'an, China

Xiaohai Tian, Zhonghua Fu and Lei Xie, "An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects," The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26 - 29, September 2010, Beijing, China.

Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, "Minimizing the Expected Complete Influence Time of a Social Network," Information Sciences, 180(13): 2514-2527, 2010.

Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, "A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.

Wei Feng, Lei Xie and Zhi-Qiang Liu, "Multicue Graph Mincut for Image Segmentation", Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.

Wei Feng, Lei Xie, Jia Zeng and Zhi-Qiang Liu, "Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models," Journal of Visual Languages and Computing, invited paper, 20(3):188-195, 2009.

Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Cascade Markov random fields for stroke extraction of Chinese characters," Information Sciences, 180(2):301-311, 2009.

Lilei Zheng, Lei Xie, Xiaoxuan Wang, Mimi Lu, Yulian Yang and Yanning Zhang, "An Automatic Caption Generator for Mandarin Broadcast News," 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009 (Best Paper Award)

Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, "Anchor Labeling System for Broadcast News using Alize toolkit", 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009

Zhonghua Fu, Jhing-Fa Wang and Lei Xie, "Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication", IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.

Lei Xie, "Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news", ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.

Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models", Pattern Recognition, 41, 2008, pp 3636-3643.

Lei Xie and Guangsen Wang, "A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008. PDF Bib

Yulian Yang and Lei Xie, "Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008. (Microsoft Student Grant. This paper is also presented in the 2008 Beijing-Hong Kong International Doctoral Forum, Beijing) PDF Bib

Lei Xie and Yulian Yang, "Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News", Pacific-Rim Conference on Multimedia (PCM2008), LNCS 5353, Springer, pp248-258, 2008.

Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), LNCS 4993, Harbin, China, pp345-355, Springer, 2008.

Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510. PDF Bib

Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007, pp2325-2340. PDF Bib

Lei Xie, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159.

Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference /North American chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL), pp193-196, Rochester, NY, USA, April, 2007.

Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC), Huangshan, Anhui, China, 2007.

Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation", Interspeech, Belgium, 2007. PDF Bib

Lei Xie, and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. I, pp593-596, Toulouse, France, 2006.

Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP2006), LNAI 4274, Singapore, pp627-639, Springer, Dec, 2006.

Lei Xie and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", Advances in Machine Learning and Cybernetics, LNAI 3930, Springer, pp99-114, April, 2006.

Lei Xie, and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR), vol. I, pp1128-1131, Hong Kong, 2006.

Lei Xie, and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services", International Conference on System, Man and Cybernetics (ICSMC), pp4331-4336, Taipei, Taiwan, 2006.

Collaborators

  • Haizhou Li, Department Head, Principal Scientist, Institute for Infocomm Research (I2R), A*STAR, Singapore
  • Helen Meng, Professor, The Chinese University of Hong Kong, Hong Kong SAR
  • Hichem Sahli, Professor, Vrije Universiteit Brussel (VUB), Belgium
  • Stephen Cox, Professor, University of East Anglia, UK
  • Zhi-Qiang Liu, Professor, City University of Hong Kong, Hong Kong SAR
  • Jhing-Fa Wang, Professor, National Cheng Kung University, Taiwan
  • Frank Soong, Principal Researcher, Microsoft Research Asia, China
  • Lijuan Wang, Researcher, Microsoft Research Asia, China
  • Eng Siong Chng, Professor, Nanyang Technological University, Singapore
  • Xiong Xiao, Senior Research Scientist, Nanyang Technological University, Singapore
  • Bin Ma, Senior Scientist, Institute for Infocomm Research (I2R), A*STAR, Singapore
  • Cheung-Chi Leung, Scientist, Institute for Infocomm Research (I2R), A*STAR, Singapore
  • Lee Siu Wa, Scientist, Institute for Infocomm Research (I2R), A*STAR, Singapore
  • Dongyan Huang, Scientist, Institute for Infocomm Research (I2R), A*STAR, Singapore
  • Lei Jia, Baidu, China
  • Zhigang Deng, Professor, University of Houston, USA
  • Qiang Huang, University of Edinburgh, UK
  • Wei Feng, Professor, Tianjin University, China
  • Jia Zeng, Professor, Suzhou University, China
  • Yi Wang, Tencent Co.
  • Zhonghua Fu, Professor, ASLP, Northwestern Polytechnical University, China
  • Dongmei Jiang, Professor, ASLP, Northwestern Polytechnical University, China
  • Yanning Zhang, Professor, ASLP, Northwestern Polytechnical University, China

Recent Academic Activities:

Recent:

Reviewer (or program committee member) for the following international journals and conferences:

  • IEEE Transactions on Audio, Speech and Language Processing
  • IEEE Transactions on Multimedia
  • IEEE Transactions on Visualization and Computer Graphics
  • IEEE Transactions on Fuzzy Systems
  • ACM Transactions on Embedded Computing Systems
  • ACM Transactions on Asian Language Information Processing
  • Pattern Recognition
  • Eurasip Journal on Audio, Speech and Music Processing
  • Multimedia Systems
  • Multimedia Tools and Applications
  • Soft Computing
  • Information Sciences
  • International Journal on Computational Intelligence and Applications (IJCIA)
  • Journal of Ambient Intelligence and Humanized Computing (AIHC)
  • APSIPA Transactions on Signal and Information Processing
  • Journal of Tsinghua University
  • Journal of South China University of Technology
  • ACL 2012
  • ISCSLP2012
  • APSIPA ASC2012
  • HHME 2011
  • APSIPA ASC 2011
  • 2011 ACM Conference on Information and Knowledge Management (CIKM)
  • International Conference On Audio, Language And Image Processing (ICALIP2010)
  • International Symposium on Chinese Spoken Language Processing (ISCSLP2010)
  • HHME 2010
  • IEEE Tencon 2009
  • International Conference On Audio, Language And Image Processing (ICALIP2008)
  • The 18th International Conference on Pattern Recognition (ICPR2006)
  • The 8th ICAISC Int'l Conference on Artificial Intelligence and Soft Computing (ICAISC2006)
  • Int'l Conference on Machine Learning and Cybernetics (ICMLC2006)
  • Int'l conference on Systems, Man and Cybernetics (ICSMC2006)
  • Int'l Conference of Computational Intelligence and Multimedia Applications (ICCIMA2005)
  • Int'l Conference on Machine Learning and Cybernetics (ICMLC2005)
  • Int'l Symposium on Modeling Decisions for Artificial Intelligence (MDAI2005)
  • Asia-Pacific Workshop on Visual Information Processing (VIP2005)
  • The 18th International FLAIRS Conference (FLAIRS2005)
  • ......

Talks:

Topic Segmentation: a summary of recent approaches, talk at the University of East Anglia, UK, December 2011

Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps, APSIPA ASC2011, Xi'an, China, 2011

Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation, Interspeech2011, Florence, Italy, 2011

Maximum Lexical Cohesion for Fine-Grained News Story Segmentation, Interspeech2010, Chiba, Japan, September 2010

Phoneme Lattice based TextTiling towards Multilingual Story Segmentation, Interspeech2010, Chiba, Japan, September 2010

A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News, Asia Information Retrieval Symposium (AIRS2009), Sapporo, Japan, October 2009

Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, the 6th International Symposium on Chinese Spoken Language Processing (ISCSLP), Kunming, China, December 2008

Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, 2008 Beijing-Hong Kong International Doctoral Forum, Beijing, China, December 2008

A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting, the 6th International Symposium on Chinese Spoken Language Processing (ISCSLP), Kunming, China, December 2008

Speech-driven Talking Face for Interactive Human-Computer Communication, invited talk at National Cheng Kung University, Tainan, Taiwan, November 2008

Automatic Story Segmentation of Chinese Broadcast News based on Special Features of Chinese Language, talk at the Northwestern Polytechnical University Graduate Academic Annual Conference, November 2007.

Classification of Music and Speech in Mandarin News Broadcast, the 9th National Conference on Man-Machine Speech Communication, Huangshan, China, October 2007.

Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modeling, seminar at the School of Computer Science, Northwestern Polytechnical University, July 2007.

Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation, Interspeech2007, Antwerp, Belgium, August 2007.

Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News, HLT2007, New York State, USA, April 2007.

A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion, International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, December 2006.

Links:

ASLP@NPU   Audio, Speech and Language Processing Group, Northwestern Polytechnical University, China
HCCL   Professor Helen Meng, Human Computer Communications Laboratory, The Chinese University of Hong Kong
CSAIL-MIT   Computer Science and Artificial Intelligence Laboratory, MIT
SLS-MIT   Spoken Language Systems, MIT; Victor Zue, Stephanie Seneff, Jim Glass...
HTK   Hidden Markov Model Toolkit, Cambridge Univ., UK
ASLP@UEA   Audio, Speech and Language Processing Lab., University of East Anglia, UK
Cambridge Machine Intelligence Laboratory   Steve Young, P.C. Woodland, Mark Gales,...
LTI @ CMU   Language Technologies Institute, CMU
Speech @ CMU   Speech Technologies at CMU
Fred Juang   Professor Biing-Hwang (Fred) Juang at Georgia Tech
C. H. Lee   Professor Chin-Hui Lee at Georgia Tech
Speech Tech @ MS Research   Speech Technology at Microsoft Research
CSLP @ JHU   Center for Speech and Language Processing, Johns Hopkins University
L.-S. Lee   Professor Lee Lin-Shan at National Taiwan University, Taiwan
Chiu-yu Tseng   Dr. Chiu-yu Tseng at Institute of Linguistics, Academia Sinica, Taiwan
