TAILIEUCHUNG - Scientific report: "Summarizing multiple spoken documents: finding evidence from untranscribed audio"

Xiaodan Zhu, Gerald Penn and Frank Rudzicz
University of Toronto, 10 King's College Rd., Toronto, M5S 3G4, ON, Canada
{xzhu, gpenn, frank}@

Abstract

This paper presents a model for summarizing multiple untranscribed spoken documents. Without assuming the availability of transcripts, the model modifies a recently proposed unsupervised algorithm to detect re-occurring acoustic patterns in speech and uses them to estimate similarities between utterances, which are in turn used to identify salient utterances and remove redundancies. This model is of interest due to its independence from spoken language transcription (an error-prone and resource-intensive process), its ability to integrate multiple sources of information on the same topic, and its novel use of acoustic patterns that extends previous work on low-level prosodic feature detection. We compare the performance of this model with that achieved using manual and automatic transcripts, and find that this new approach is roughly equivalent to having access to ASR transcripts with word error rates in the 33-37% range without actually having to do the ASR; in addition, it better handles utterances with out-of-vocabulary words.
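The selection step described in the abstract (utterance similarities used to pick salient utterances while removing redundancy) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the selection rule, so the greedy trade-off between centrality and redundancy below, the function name `select_utterances`, and the weight `lam` are all assumptions made for illustration. The similarity matrix is taken as given; in the paper it would be estimated from re-occurring acoustic patterns rather than from transcripts.

```python
from typing import List, Sequence


def select_utterances(
    sim: Sequence[Sequence[float]], k: int, lam: float = 0.7
) -> List[int]:
    """Greedily pick up to k utterance indices from a similarity matrix.

    Hypothetical scoring (an assumption, not taken from the paper):
      score(i) = lam * salience(i) - (1 - lam) * redundancy(i)
    where salience is the mean similarity of utterance i to all others and
    redundancy is its highest similarity to an already selected utterance.
    """
    n = len(sim)
    # Salience as centrality: average similarity to every other utterance.
    salience = [
        (sum(row) - row[i]) / (n - 1) if n > 1 else 0.0
        for i, row in enumerate(sim)
    ]
    selected: List[int] = []
    candidates = set(range(n))

    while candidates and len(selected) < k:

        def score(i: int) -> float:
            # Penalize candidates that resemble something already chosen.
            redundancy = max((sim[i][j] for j in selected), default=0.0)
            return lam * salience[i] - (1.0 - lam) * redundancy

        # Iterating in sorted order makes ties resolve to the lowest index.
        best = max(sorted(candidates), key=score)
        selected.append(best)
        candidates.remove(best)

    return selected
```

With a matrix in which utterances 0 and 1 are near duplicates, a call such as `select_utterances(sim, 2, lam=0.7)` tends to keep only one of the pair and spend the second slot on a different utterance; setting `lam=1.0` disables the redundancy penalty and ranks by salience alone.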
1 Introduction

Summarizing spoken documents has been extensively studied over the past several years (Penn and Zhu, 2008; Maskey and Hirschberg, 2005; Murray et al., 2005; Christensen et al., 2004; Zechner, 2001). Conventionally called speech summarization, although speech connotes more than spoken documents themselves, it is motivated by the demand for better ways to navigate spoken content and the natural difficulty in doing so: speech is inherently more linear or sequential than text in its traditional delivery. Previous research on speech summarization has addressed several important problems in this field (see Section ). All of this work, however, has focused on single-document summarization and the integration of fairly simplistic ...
