Scientific report: "Question Detection in Spoken Conversations Using Textual Conversations"

Anna Margolis and Mari Ostendorf
Department of Electrical Engineering, University of Washington, Seattle, WA, USA
amargoli mo @

Abstract

We investigate the use of textual Internet conversations for detecting questions in spoken conversations. We compare the text-trained model with models trained on manually-labeled, domain-matched spoken utterances with and without prosodic features. Overall, the text-trained model achieves over 90% of the performance (measured in Area Under the Curve) of the domain-matched model including prosodic features, but does especially poorly on declarative questions. We describe efforts to utilize unlabeled spoken utterances and prosodic features via domain adaptation.

1 Introduction

Automatic speech recognition systems, which transcribe words, are often augmented by subsequent processing for inserting punctuation or labeling speech acts. Both prosodic features extracted from the acoustic signal and lexical features extracted from the word sequence have been shown to be useful for these tasks (Shriberg et al., 1998; Kim and Woodland, 2003; Ang et al., 2005). However, access to labeled speech training data is generally required in order to use prosodic features. On the other hand, the Internet contains large quantities of textual data that is already labeled with punctuation, and which can be used to train a system using lexical features.

In this work, we focus on question detection in the Meeting Recorder Dialog Act corpus (MRDA) (Shriberg et al., 2004), using text sentences with question marks in Wikipedia talk pages. We compare the performance of a question detector trained on the text domain using lexical features with one trained on MRDA using lexical features and/or prosodic features. In addition, we experiment with two unsupervised domain adaptation methods to incorporate unlabeled MRDA utterances into the text-based question detector. The goal is to use the unlabeled .
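The text-trained setup described above can be illustrated with a minimal sketch. It assumes that question labels come from a trailing question mark in the text, that punctuation is then removed so the features resemble a speech transcript, and that the lexical features are word unigrams and bigrams. The excerpt does not say which classifier the authors use, so the smoothed log-odds scorer here is only a stand-in, and every function name (`label_and_strip`, `ngrams`, `train`, `score`, `auc`) is hypothetical.

```python
import math
import re
from collections import Counter

def label_and_strip(sentence):
    """Return (tokens, is_question); the label comes from a trailing '?'."""
    is_question = sentence.strip().endswith("?")
    # Lowercase and drop punctuation so text resembles a speech transcript.
    tokens = re.findall(r"[a-z']+", sentence.lower())
    return tokens, is_question

def ngrams(tokens):
    """Unigrams plus bigrams, with a start marker so initial words stand out."""
    padded = ["<s>"] + tokens
    return padded[1:] + [a + " " + b for a, b in zip(padded, padded[1:])]

def train(sentences):
    """Stand-in classifier (an assumption): add-one smoothed log-odds per n-gram."""
    counts = {True: Counter(), False: Counter()}
    for s in sentences:
        tokens, is_q = label_and_strip(s)
        counts[is_q].update(ngrams(tokens))
    vocab = set(counts[True]) | set(counts[False])
    total_q = sum(counts[True].values()) + len(vocab)
    total_n = sum(counts[False].values()) + len(vocab)
    return {g: math.log((counts[True][g] + 1) / total_q)
               - math.log((counts[False][g] + 1) / total_n) for g in vocab}

def score(weights, tokens):
    """Higher means more question-like; unseen n-grams contribute 0."""
    return sum(weights.get(g, 0.0) for g in ngrams(tokens))

def auc(scores, labels):
    """Area under the ROC curve: the probability that a question
    outscores a non-question, with ties counted as one half."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy text-domain training data (invented for illustration only).
text_sentences = [
    "Do you agree?",
    "What is the source?",
    "I agree with this.",
    "The source is listed.",
]
weights = train(text_sentences)

# Toy "spoken" utterances: no punctuation, labels supplied separately.
utterances = [["do", "you", "have", "it"], ["i", "have", "it"]]
labels = [True, False]
scores = [score(weights, u) for u in utterances]
print(auc(scores, labels))
```

A declarative question such as "you agree with this" carries no question-initial word cues, which suggests why a purely lexical, text-trained detector would struggle on that class, as the abstract reports.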
