
Important Announcements

[2012 Academic Talk 1] Audio Recognition Technology and Its Applications - Game-Oriented Speech and Music Learning

Title: Audio Recognition Technology and Its Applications - Game-Oriented Speech and Music Learning

Speaker: Prof. Jyh-Shing Roger Jang

Department of Computer Science, National Tsing Hua University, Taiwan

Time: February 29, 2012, 10:00 a.m. to 12:00 noon

Location: FIT1-312

Contact: 邬晓钧 (Wu Xiaojun), 62771954

About the speaker: Jyh-Shing Roger Jang (張智星) received the B.S. degree in Electrical Engineering from National Taiwan University in 1984 and the Ph.D. degree from the EECS Department at the University of California, Berkeley, in 1992. He worked for The MathWorks, Inc. from 1993 to 1995 and coauthored the Fuzzy Logic Toolbox. Since 1995, he has been with the Department of Computer Science, National Tsing Hua University, Taiwan. His best-known book is "Neuro-Fuzzy and Soft Computing" (1997, Prentice Hall), which had 4,632 Google citations as of February 2012. He also maintains a machine learning toolbox and two online tutorials, "Data Clustering and Pattern Recognition" and "Audio Signal Processing and Recognition". His research interests include machine learning and pattern recognition, with applications to speech recognition/assessment/synthesis, music analysis/retrieval, and image identification/retrieval. More information about Dr. Jang can be found at http://mirlab.org/jang.

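Abstract: This talk describes how various audio recognition technologies can be applied to speech and music learning. These technologies include speech recognition, text-to-speech conversion, speech assessment, query by singing/humming, pitch tracking, beat tracking, and music genre classification. Because each technique has its own characteristics, those characteristics must be taken into account when applying it to game-oriented learning in order to build engaging applications. Demonstrations will be interspersed throughout the talk so that the audience can experience the strengths and weaknesses of each technique, and the talk will explain how process and creativity can be used to achieve the greatest learning effect.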

 

[Published: 2012-03-02]