TAILIEUCHUNG - Báo cáo khoa học: "Which words are hard to recognize? Prosodic, lexical, and disﬂuency factors that increase ASR error rates"

Many factors are thought to increase the chances of misrecognizing a word in ASR, including low frequency, nearby disﬂuencies, short duration, and being at the start of a turn. However, few of these factors have been formally examined. This paper analyzes a variety of lexical, prosodic, and disﬂuency factors to determine which are likely to increase ASR error rates. Findings include the following. | Which words are hard to recognize Prosodic lexical and disfluency factors that increase ASR error rates Sharon Goldwater Dan Jurafsky and Christopher D. Manning Department of Linguistics and Computer Science Stanford University sgwater jurafsky manning @ Abstract Many factors are thought to increase the chances of misrecognizing a word in ASR including low frequency nearby disfluencies short duration and being at the start of a turn. However few of these factors have been formally examined. This paper analyzes a variety of lexical prosodic and disfluency factors to determine which are likely to increase ASR error rates. Findings include the following. 1 For disfluencies effects depend on the type of disfluency errors increase by up to 15 absolute for words near fragments but decrease by up to absolute for words near repetitions. This decrease seems to be due to longer word duration. 2 For prosodic features there are more errors for words with extreme values than words with typical values. 3 Although our results are based on output from a system with speaker adaptation speaker differences are a major factor influencing error rates and the effects of features such as frequency pitch and intensity may vary between speakers. 1 Introduction In order to improve the performance of automatic speech recognition ASR systems on conversational speech it is important to understand the factors that cause problems in recognizing words. Previous work on recognition of spontaneous monologues and dialogues has shown that infrequent words are more likely to be misrecognized Fosler-Lussier and Morgan 1999 Shinozaki and Furui 2001 and that fast speech increases error rates Siegler and Stern 1995 Fosler-Lussier and Morgan 1999 Shinozaki and Furui 2001 . Siegler and Stern 1995 and Shinozaki and Furui 2001 also found higher error rates in very slow speech. Word length in phones has also been found to be a useful predictor of higher error rates Shinozaki and Furui 2001 . In

Nhã Uyên 104 9 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Which words are hard to recognize? Prosodic, lexical, and disﬂuency factors that increase ASR error rates"

9 84 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462343 61

Giới thiệu :Lập trình mã nguồn mở

14 26104 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11350 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10553 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9844 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8891 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8507 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8101 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7765 1793

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7274 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 229 3 29-12-2024

Đóng mới oto 8 chỗ ngồi part 9

10 179 3 29-12-2024

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 156 3 29-12-2024

Hướng dẫn chế độ dinh dưỡng cho người bệnh viêm khớp

5 168 2 29-12-2024

báo cáo hóa học:" Quality of data collection in a large HIV observational clinic database in sub-Saharan Africa: implications for clinical research and audit of care"

7 154 4 29-12-2024

CHƯƠNG 2: RỦI RO THÂM HỤT TÀI KHÓA

28 160 1 29-12-2024

Giáo án điện tử tiểu học môn lịch sử: Cách mạng mùa thu

39 165 1 29-12-2024

Valve Selection Handbook - Fourth Edition

337 146 2 29-12-2024

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 214 1 29-12-2024

The Ombudsman Enterprise and Administrative Justice

309 143 0 29-12-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8101 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7765 1793

Ebook Chào con ba mẹ đã sẵn sàng

112 4409 1371

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6305 1268

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8891 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3843 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3920 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4719 565

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11350 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4511 490