TAILIEUCHUNG - Báo cáo khoa học: "Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems"

We use machine learners trained on a combination of acoustic confidence and pragmatic plausibility features computed from dialogue context to predict the accuracy of incoming n-best recognition hypotheses to a spoken dialogue system. Our best results show a 25% weighted f-score improvement over a baseline system that implements a “grammar-switching” approach to context-sensitive speech recognition. | Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems Malte Gabsdil Department of Computational Linguistics Saarland University Germany gabsdil@ Oliver Lemon School of Informatics Edinburgh University Scotland olemon@ Abstract We use machine learners trained on a combination of acoustic confidence and pragmatic plausibility features computed from dialogue context to predict the accuracy of incoming n-best recognition hypotheses to a spoken dialogue system. Our best results show a 25 weighted f-score improvement over a baseline system that implements a grammar-switching approach to context-sensitive speech recognition. 1 Introduction A crucial problem in the design of spoken dialogue systems is to decide for incoming recognition hypotheses whether a system should accept consider correctly recognized reject assume misrecognition or ignore classify as noise or speech not directed to the system them. In addition a more sophisticated dialogue system might decide whether to clarify or confirm certain hypotheses. Obviously incorrect decisions at this point can have serious negative effects on system usability and user satisfaction. On the one hand accepting misrecognized hypotheses leads to misunderstandings and unintended system behaviors which are usually difficult to recover from. On the other hand users might get frustrated with a system that behaves too cautiously and rejects or ignores too many utterances. Thus an important feature in dialogue system engineering is the tradeoff between avoiding task failure due to misrecognitions and promoting overall dialogue efficiency flow and naturalness. In this paper we investigate the use of machine learners trained on a combination of acoustic confidence and pragmatic plausibility features . computed from dialogue context to predict the quality of incoming n-best recognition hypotheses to a spoken dialogue system. These predictions are then used .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.