TAILIEUCHUNG - Báo cáo khoa học: "Summarizing multiple spoken documents: finding evidence from untranscribed audio"

This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availability of transcripts, the model modifies a recently proposed unsupervised algorithm to detect re-occurring acoustic patterns in speech and uses them to estimate similarities between utterances, which are in turn used to identify salient utterances and remove redundancies. This model is of interest due to its independence from spoken language transcription, an error-prone and resource-intensive process, its ability to integrate multiple sources of information on the same topic, and its novel use of acoustic patterns that extends previous work on low-level prosodic feature detection. . | Summarizing multiple spoken documents finding evidence from untranscribed audio Xiaodan Zhu Gerald Penn and Frank Rudzicz University of Toronto 10 King s College Rd Toronto M5S 3G4 ON Canada xzhu gpenn frank @ Abstract This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availability of transcripts the model modifies a recently proposed unsupervised algorithm to detect re-occurring acoustic patterns in speech and uses them to estimate similarities between utterances which are in turn used to identify salient utterances and remove redundancies. This model is of interest due to its independence from spoken language transcription an error-prone and resource-intensive process its ability to integrate multiple sources of information on the same topic and its novel use of acoustic patterns that extends previous work on low-level prosodic feature detection. We compare the performance of this model with that achieved using manual and automatic transcripts and find that this new approach is roughly equivalent to having access to ASR transcripts with word error rates in the 33-37 range without actually having to do the ASR plus it better handles utterances with out-ofvocabulary words. 1 Introduction Summarizing spoken documents has been extensively studied over the past several years Penn and Zhu 2008 Maskey and Hirschberg 2005 Murray et al. 2005 Christensen et al. 2004 Zechner 2001 . Conventionally called speech summarization although speech connotes more than spoken documents themselves it is motivated by the demand for better ways to navigate spoken content and the natural difficulty in doing so speech is inherently more linear or sequential than text in its traditional delivery. Previous research on speech summarization has addressed several important problems in this field see Section . All of this work however has focused on single-document summarization and the integration of fairly simplistic .

TÀI LIỆU MỚI ĐĂNG
337    151    2    23-01-2025
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.