TAILIEUCHUNG - Báo cáo khoa học: "User Simulations for context-sensitive speech recognition in Spoken Dialogue Systems"

We use a machine learner trained on a combination of acoustic and contextual features to predict the accuracy of incoming n-best automatic speech recognition (ASR) hypotheses to a spoken dialogue system (SDS). Our novel approach is to use a simple statistical User Simulation (US) for this task, which measures the likelihood that the user would say each hypothesis in the current context. Such US models are now common in machine learning approaches to SDS, are trained on real dialogue data, and are related to theories of “alignment” in psycholinguistics. We use a US to predict the user’s next dialogue. | User Simulations for context-sensitive speech recognition in Spoken Dialogue Systems Oliver Lemon Edinburgh University olemon@ Ioannis Konstas University of Glasgow konstas@ Abstract We use a machine learner trained on a combination of acoustic and contextual features to predict the accuracy of incoming n-best automatic speech recognition ASR hypotheses to a spoken dialogue system SDS . Our novel approach is to use a simple statistical User Simulation US for this task which measures the likelihood that the user would say each hypothesis in the current context. Such US models are now common in machine learning approaches to SDS are trained on real dialogue data and are related to theories of alignment in psycholinguistics. We use a US to predict the user s next dialogue move and thereby re-rank n-best hypotheses of a speech recognizer for a corpus of 2564 user utterances. The method achieved a significant relative reduction of Word Error Rate WER of 5 this is 44 of the possible WER improvement on this data and 62 of the possible semantic improvement Dialogue Move Accuracy compared to the baseline policy of selecting the topmost ASR hypothesis. The majority of the improvement is attributable to the User Simulation feature as shown by Information Gain analysis. 1 Introduction A crucial problem in the design of spoken dialogue systems SDS is to decide for incoming recognition hypotheses whether a system should accept consider correctly recognized reject assume misrecognition or ignore classify as noise or speech not directed to the system them. Obviously incorrect decisions at this point can have serious negative effects on system usability and user satisfaction. On the one hand accept ing misrecognized hypotheses leads to misunderstandings and unintended system behaviors which are usually difficult to recover from. On the other hand users might get frustrated with a system that behaves too cautiously and rejects or ignores too many utterances.

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.