Kuan-Yu (Menphis) Chen
Dept. of CSIE
National Taiwan University
Taiwan
[email protected]
http://www.iis.sinica.edu.tw/~kychen/
(Last Update: 2015/1/15)

RESEARCH INTERESTS & SKILLS

Speech Recognition
    Language Modeling: Topic Modeling (Latent Dirichlet Allocation, Probabilistic Latent Semantic Analysis, Word-based Topic Modeling, etc.), Relevance-based Language Modeling, Neural Network-based Language Modeling and Representations
    Decoding and Search
Natural Language Processing
    Information Retrieval: Pseudo-relevance Feedback, Language Modeling, Retrieval Models, Spoken Content Retrieval, Diversification Search, Spelling Check
    Summarization: Sentence Modeling, Ranking Models, Supervised Learning, Spoken Document Summarization
Machine Learning & Pattern Recognition
    Discriminative Training (Global Conditional Log-linear Models, Minimum Error Rate Training, Minimum Classification Error Training, etc.), Low-rank Matrix Factorization, Deep Neural Network

EDUCATION

2010.9 ~ Present: Ph.D. Candidate, Natural Language Processing Lab., Department of Computer Science and Information Engineering, National Taiwan University
2007.9 ~ 2010.7: Master, Spoken Language Processing Lab., Department of Computer Science and Information Engineering, National Taiwan Normal University
2003.9 ~ 2007.6: Bachelor, Department of Information and Computer Education, National Taiwan Normal University

JOB EXPERIENCES

2009.4 ~ Present: Research Assistant, with advisor Prof. Hsin-Min Wang, Institute of Information Science, Academia Sinica, Taiwan
2014.6 ~ 2014.9: Research Intern, IBM T.J. Watson Research Center, Yorktown Heights, New York, USA

RESEARCH EXPERIENCES

Speech Recognition System
I developed a speech recognition system (designed in particular for the Mandarin initial-final phone set) with tree-copy search, three different acoustic look-ahead methods, and a naïve n-best generator, using the standard C++ STL.

Language Modeling
Well-established topic models revolve around the discovery of “word-document” co-occurrence dependence. Orthogonal to these models, we proposed a word vicinity model (WVM) to explore the “word-word” co-occurrence relationship and the long-span latent topic information for IR and speech recognition.
The language modeling framework is sensitive to its training data and is vulnerable in cross-domain applications. To mitigate this deficiency, we proposed relevance-based dynamic language model adaptation for IR, summarization, and speech recognition. This approach provides a flexible generative framework to render the lexical and topical relationships between the observations and the predictions.
The state-of-the-art i-vector framework, which reduces a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, has demonstrated great performance improvements in language identification and speaker recognition. We adapted this concept to i-vector based language modeling for information retrieval.

Document Summarization
Language modeling has been used for unsupervised summarization. However, how to formulate the sentence models and estimate their parameters for each document to be summarized remains a big challenge. We proposed a novel recurrent neural network language modeling approach to render word usage cues and long-span structural information of word co-occurrence relationships within documents.

Information Retrieval
Language modeling has been widely used for information retrieval. However, this approach faces two major challenges: 1) a query is often a vague expression of an underlying information need, and 2) there can be a word usage mismatch between a query and a document even when they are topically related to each other. To mitigate these problems, we proposed several relevance-based language models that use different objective functions to reformulate the original queries.
Pseudo-relevance feedback is by far the most commonly used paradigm for query reformulation. In general, the top-ranked documents obtained from the initial retrieval are used for query modeling (reformulation). However, this approach does not work well when the top-ranked documents contain much redundant or non-relevant information. We proposed to glean useful cues from the top-ranked documents to achieve a more accurate query representation.
Latent semantic analysis (LSA) and its extensions (such as PLSA and LDA) have proven their capacity for IR. However, the number of non-occurring words is usually much larger than the number of occurring words in a document. Treating the occurring and non-occurring words with equal importance can be a disadvantage of LSA because the non-occurring words can dominate the estimation of model parameters. We proposed a weighted matrix factorization framework to properly modulate the impact of occurring and non-occurring words.

Spelling Check
Chinese spelling check is still an open problem today. We extended the widely used n-gram based language model by gleaning extra semantic clues and Web resources to enhance performance within an unsupervised framework.

PUBLICATIONS

Journal Articles
1. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Ea-Ee Jan, Wen-Lian Hsu, and Hsin-Hsi Chen, "Extractive Broadcast News Summarization Leveraging Recurrent Neural Network Language Modeling Techniques," submitted to IEEE Transactions on Audio, Speech, and Language Processing. (under revision)
2. Kuan-Yu Chen, Hsin-Min Wang, and Hsin-Hsi Chen, "A Probabilistic Framework with Topic Language Modeling for Chinese Spelling Check," submitted to ACM Transactions on Asian Language Information Processing.
(under revision)
3. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Combining Relevance Language Modeling and Clarity Measure for Extractive Speech Summarization," submitted to IEEE Transactions on Audio, Speech, and Language Processing. (under revision)
4. Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Kuen-Tyng Yu, "Enhancing Query Formulation for Spoken Document Retrieval," Special Issue on Emerging Technologies and Applications of Artificial Intelligence, Journal of Information Science and Engineering, Vol. 30, No. 3, pp. 553-569, May, 2014.
5. Hsuan-Sheng Chiu, Kuan-Yu Chen, and Berlin Chen, "Leveraging Topical and Positional Cues for Language Modeling in Speech Recognition," Multimedia Tools and Applications, Vol. 72, No. 2, pp. 1465-1481, September, 2014.
6. Berlin Chen, and Kuan-Yu Chen, "Leveraging Relevance Cues for Language Modeling in Speech Recognition," Information Processing & Management, Vol. 49, No. 4, pp. 807-816, July, 2013.
7. Berlin Chen, Kuan-Yu Chen, Pei-Ning Chen, and Yi-Wen Chen, "Spoken Document Retrieval with Unsupervised Query Modeling Techniques," IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, No. 9, pp. 2602-2612, November, 2012.
8. Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques," Special Section: Recent Advances in Multimedia Signal Processing Techniques and Applications, IEICE Transactions on Information and Systems, Vol. E95-D, No. 5, pp. 1195-1205, May, 2012.

Conference Papers (International Track)
1. Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "I-vector Based Language Modeling for Query Representation," the 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), Brisbane, Australia, April 19-24, 2015.
2. Kuan-Yu Chen, Ea-Ee Jan, and Tsuyoshi Ide, "Probabilistic Text Analytics Framework for Information Technology Service Desk Tickets," the IFIP/IEEE International Symposium on Integrated Network Management (IM 2015), Ottawa, Canada, May 11-15, 2015. (Short Paper)
3. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Ea-Ee Jan, Hsin-Min Wang, Wen-Lian Hsu, and Hsin-Hsi Chen, "Leveraging Effective Query Modeling Techniques for Speech Recognition and Summarization," the Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), pp. 1474-1480, Doha, Qatar, October 25-29, 2014. (Short Paper)
4. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Wen-Lian Hsu and Hsin-Hsi Chen, "A Recurrent Neural Network Language Modeling Framework for Extractive Speech Summarization," IEEE International Conference on Multimedia and Expo (ICME 2014), pp. 569-574, Chengdu, China, July 14-18, 2014. (Full Paper)
5. Ea-Ee Jan, Kuan-Yu Chen, and Tsuyoshi Ide, "A Probabilistic Concept Annotation for IT Service Desk Tickets," the Seventh International Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR 2014), pp. 21-23, Shanghai, China, November 7, 2014.
6. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Ea-Ee Jan, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "A Margin-Based Discriminative Modeling Approach for Extractive Speech Summarization," the APSIPA Annual Summit and Conference (APSIPA 2014), Angkor Wat, Cambodia, December 9-12, 2014.
7. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen and Wen-Lian Hsu, "Enhanced Language Modeling for Extractive Speech Summarization with Sentence Relatedness Information," the Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Max Atria, Singapore, Sep 14-18, 2014.
8. Kuan-Yu Chen, Hung-Shin Lee, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "I-vector Based Language Modeling for Spoken Document Retrieval," the 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 4-9, 2014.
9. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Effective Pseudo-relevance Feedback for Language Modeling in Extractive Speech Summarization," the 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 4-9, 2014.
10. Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, and Ea-Ee Jan, "Effective Pseudo-Relevance Feedback for Language Modeling in Speech Recognition," IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2013), pp. 13-18, Olomouc, Czech Republic, December 8-12, 2013.
11. Kuan-Yu Chen, Hung-Shin Lee, Chung-Han Lee, Hsin-Min Wang and Hsin-Hsi Chen, "A Study of Language Modeling for Chinese Spelling Check," the 7th SIGHAN Workshop on Chinese Language Processing (SIGHAN-7), pp. 79-83, Nagoya, Japan, Oct 14, 2013.
12. How Jing, Yu Tsao, Kuan-Yu Chen, and Hsin-Min Wang, "Semantic Naive Bayes Classifier for Document Classification," the 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 1117-1123, Nagoya, Japan, Oct 14-18, 2013.
13. Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "Weighted Matrix Factorization for Spoken Document Retrieval," the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 8530-8534, Vancouver, Canada, May 26-31, 2013.
14. Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Effective Pseudo-Relevance Feedback for Spoken Document Retrieval," the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp. 8535-8539, Vancouver, Canada, May 26-31, 2013.
15. Yi-Wen Chen, Bo-Han Hao, Kuan-Yu Chen, and Berlin Chen, "Incorporating Proximity Information for Relevance Language Modeling in Speech Recognition," the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), pp. 2683-2687, Lyon, France, August 25-29, 2013.
16. Berlin Chen, Hao-Chin Chang, and Kuan-Yu Chen, "Sentence Modeling for Extractive Speech Summarization," IEEE International Conference on Multimedia and Expo (ICME 2013), pp. 1-6, San Jose, California, USA, July 15-19, 2013.
17. Kuan-Yu Chen, Hao-Chin Chang, Berlin Chen, and Hsin-Min Wang, "Word Relevance Modeling for Speech Recognition," the 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), pp. 999-1002, Portland, Oregon, USA, September 9-13, 2012.
18. Berlin Chen, Pei-Ning Chen, and Kuan-Yu Chen, "Query Modeling for Spoken Document Retrieval," IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2011), pp. 389-394, Hawaii, USA, December 11-15, 2011.
19. Pei-Ning Chen, Kuan-Yu Chen, and Berlin Chen, "Leveraging Relevance Cues for Improved Spoken Document Retrieval," the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), pp. 929-932, Florence, Italy, August 28-31, 2011.
20. Kuan-Yu Chen, and Berlin Chen, "Relevance Language Modeling for Speech Recognition," the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), pp. 5568-5571, Prague, Czech Republic, May 22-27, 2011.
21. Kuan-Yu Chen, and Berlin Chen, "A Study of Topic Modeling Techniques for Spoken Document Retrieval," APSIPA Annual Summit and Conference (APSIPA 2010), pp. 237-242, Biopolis, Singapore, December 14-17, 2010.
22. Kuan-Yu Chen, Hsuan-Sheng Chiu, and Berlin Chen, "Latent Topic Modeling of Word Vicinity Information for Speech Recognition," the 35th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 5394-5397, Dallas, Texas, USA, March 14-19, 2010.
23. Hsuan-Sheng Chiu, Kuan-Yu Chen, Chun-Jen Lee, and Berlin Chen, "Position Information for Language Modeling in Speech Recognition," the 6th International Symposium on Chinese Spoken Language Processing (ISCSLP 2008), pp. 1-4, Kunming, China, December 16-19, 2008.

Conference Papers (Domestic Track)
1. Shih-Hung Liu, Kuan-Yu Chen, Hsin-Min Wang, Wen-Lian Hsu, and Berlin Chen, "Improved Sentence Modeling Techniques for Extractive Speech Summarization," ROCLING XXV: Conference on Computational Linguistics and Speech Processing (ROCLING 2013), 2013.
2. Bo-Han Hao, Yi-Wen Chen, Kuan-Yu Chen, and Berlin Chen, "An Empirical Study of Exploring Proximity Information for Improved Language Modeling," 2013 National Computer Symposium (NCS 2013), 2013.
3. Shih-Hung Liu, Kuan-Yu Chen, Hsin-Min Wang, Wen-Lian Hsu, and Berlin Chen, "An Empirical Study of Extractive Speech Summarization Techniques," 2013 National Computer Symposium (NCS 2013), 2013.
4. Yi-Wen Chen, Jun-Yu Chen, Kuan-Yu Chen, and Berlin Chen, "Empirical Comparisons of Various Pseudo-relevant Document Selection Methods for Improved Spoken Document Retrieval," the 17th Conference on Technologies and Applications of Artificial Intelligence (TAAI 2012), November 16-18, 2012.
5. Bang-Xuan Huang, Hank Hao, Kuan-Yu Chen, and Berlin Chen, "Recurrent Neural Network-based Language Modeling with Relevance Information," ROCLING XXIV: Conference on Computational Linguistics and Speech Processing (ROCLING 2012), 2012.
6. Berlin Chen, Min-Hsuan Lai, Kuan-Yu Chen, and Bang-Xuan Huang, "A Survey on Discriminative Language Modeling for Speech Recognition," ACLCLP Newsletter, Vol. 22, 2011.
7. Min-Hsuan Lai, Bang-Xuan Huang, Kuan-Yu Chen, and Berlin Chen, "Empirical Comparisons of Various Discriminative Language Models for Speech Recognition," ROCLING XXIII: Conference on Computational Linguistics and Speech Processing (ROCLING 2011), 2011.
8. Kuan-Yu Chen, Min-Hsuan Lai, and Berlin Chen, "A Study on Using Word Vicinity Information for Speech Recognition," the 15th Conference on Technologies and Applications of Artificial Intelligence (TAAI 2010), November 18-20, 2010.
9. Feng-Ping Liu, Kuan-Yu Chen, Chia-Wen Liu, Yu-Mei Chang, and Berlin Chen, "On the Use of Discriminative Language Modeling Adaptation for Large Vocabulary Continuous Speech Recognition," the 14th Conference on Technologies and Applications of Artificial Intelligence (TAAI 2009), October 30-31, 2009.
10. Kuan-Yu Chen, and Berlin Chen, "On the Use of Topic Models for Large Vocabulary Continuous Speech Recognition," ROCLING XXI: Conference on Computational Linguistics and Speech Processing (ROCLING 2009), September 1-2, 2009.
11. Ting-Wei Hsu, Kuan-Yu Chen, and Berlin Chen, "An Initial Study on English Continuous Speech Recognition," the 12th Conference on Technologies and Applications of Artificial Intelligence (TAAI 2007), November 16-17, 2007.

HONORS AND AWARDS

1. IEEE ICASSP spoken language processing student travel grant, supported by Drs. XD Huang, Alex Acero and Hsiao-Wuen Hon with proceeds from royalties of their book Spoken Language Processing (Prentice Hall, 2001), 2014.
2. Student travel grant, supported by Ministry of Science and Technology, Executive Yuan, Taiwan, 2014.
3. Student travel grant, supported by Ministry of Science and Technology, Executive Yuan, Taiwan, 2013.
4. Best student paper award, "Improved Sentence Modeling Techniques for Extractive Speech Summarization," ROCLING XXV: Conference on Computational Linguistics and Speech Processing, 2013.
5. Best student paper award, "Recurrent Neural Network-based Language Modeling with Relevance Information," ROCLING XXIV: Conference on Computational Linguistics and Speech Processing, 2012.
6. Best student paper award, "Empirical Comparisons of Various Discriminative Language Models for Speech Recognition," ROCLING XXIII: Conference on Computational Linguistics and Speech Processing, 2011.

REFERENCES

Hsin-Hsi Chen ([email protected])
Professor, Department of Computer Science and Information Engineering, National Taiwan University, Taiwan.

Hsin-Min Wang ([email protected])
Research Fellow, Institute of Information Science, Academia Sinica, Taiwan.

Berlin Chen ([email protected])
Professor, Department of Computer Science and Information Engineering, National Taiwan Normal University, Taiwan.