TAILIEUCHUNG - Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings"

This paper presents the application of WordNet-based semantic relatedness measures to Automatic Speech Recognition (ASR) in multi-party meetings. Different word-utterance context relatedness measures and utterance-coherence measures are defined and applied to the rescoring of N best lists. No significant improvements in terms of Word-Error-Rate (WER) are achieved compared to a large word-based ngram baseline model. We discuss our results and the relation to other work that achieved an improvement with such models for simpler tasks. . | WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings Michael Pucher Telecommunications Research Center Vienna Vienna Austria Speech and Signal Processing Lab TU Graz Graz Austria pucher@ Abstract This paper presents the application of WordNet-based semantic relatedness measures to Automatic Speech Recognition ASR in multi-party meetings. Different word-utterance context relatedness measures and utterance-coherence measures are defined and applied to the rescoring of N-best lists. No significant improvements in terms of Word-Error-Rate WER are achieved compared to a large word-based ngram baseline model. We discuss our results and the relation to other work that achieved an improvement with such models for simpler tasks. 1 Introduction As Pucher 2005 has shown different WordNet-based measures and contexts are best for word prediction in conversational speech. The JCN Section measure performs best for nouns using the noun-context. The LESK Section measure performs best for verbs and adjectives using a mixed word-context. Text-based semantic relatedness measures can improve word prediction on simulated speech recognition hypotheses as Demetriou et al. 2000 have shown. Demetriou et al. 2000 generated N-best lists from phoneme confusion data acquired from a speech recognizer and a pronunciation lexicon. Then sentence hypotheses of varying Word-Error-Rate WER were generated based on sentences from different genres from the British National Corpus BNC . It was shown by them that the semantic 129 model can improve recognition where the amount of improvement varies with context length and sentence length. Thereby it was shown that these models can make use of long-term information. In this paper the best performing measures from Pucher 2005 which outperform baseline models on word prediction for conversational telephone speech are used for Automatic Speech Recognition ASR in multi-party meetings. Thereby we want to .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.