Extracting Key Semantic Terms from Chinese Speech Query for Web Searches

Gang WANG, National University of Singapore, wanggang sh@hotmail.com
Tat-Seng CHUA, National University of Singapore, chuats@comp.nus.edu.sg
Yong-Cheng WANG, Shanghai Jiao Tong University, China 200030, ycwang@mail.sjtu.edu.cn

Abstract

This paper discusses the challenges of performing information retrieval on the Web using Chinese natural language speech queries and proposes a solution. The main contribution of this research is a divide-and-conquer strategy to alleviate speech recognition errors. It uses a query model to facilitate the extraction of the core semantic string (CSS) from the Chinese natural language speech query. It then breaks the CSS into basic components corresponding to phrases and uses a multi-tier strategy to map the basic components to known phrases in order to further eliminate errors. The resulting system has been found to be effective.

1 Introduction

We are entering an information era in which information has become one of the major resources in our daily activities. With its widespread adoption, the Internet has become the largest store of information for all to share. Currently, most Chinese search engines support only term-based information retrieval, in which users must enter queries directly through a keyboard in front of a computer.
However, there is a large segment of the population in China and the rest of the world who are illiterate and do not have the skills to use a computer. They are thus unable to take advantage of the vast amount of freely available information. Since almost everyone can speak and understand spoken language, research on Chinese natural language speech query retrieval would enable the average person to access information through current search engines without special computer skills or training. They could simply access the search engine using common devices that they are familiar with, such as the .