TAILIEUCHUNG - Báo cáo khoa học: "N-Best Rescoring Based on Pitch-accent Patterns"

In this paper, we adopt an n-best rescoring scheme using pitch-accent patterns to improve automatic speech recognition (ASR) performance. The pitch-accent model is decoupled from the main ASR system, thus allowing us to develop it independently. N-best hypotheses from recognizers are rescored by additional scores that measure the correlation of the pitch-accent patterns between the acoustic signal and lexical cues. To test the robustness of our algorithm, we use two different data sets and recognition setups: the ﬁrst one is English radio news data that has pitch accent labels, but the recognizer is trained from a small amount of. | N-Best Rescoring Based on Pitch-accent Patterns Je Hun Jeon1 Wen Wang2 Yang Liu1 department of Computer Science The University of Texas at Dallas USA 2Speech Technology and Research Laboratory SRI International USA jhjeon yangl @ wwang@ Abstract In this paper we adopt an n-best rescoring scheme using pitch-accent patterns to improve automatic speech recognition ASR performance. The pitch-accent model is decoupled from the main ASR system thus allowing us to develop it independently. N-best hypotheses from recognizers are rescored by additional scores that measure the correlation of the pitch-accent patterns between the acoustic signal and lexical cues. To test the robustness of our algorithm we use two different data sets and recognition setups the first one is English radio news data that has pitch accent labels but the recognizer is trained from a small amount of data and has high error rate the second one is English broadcast news data using a state-of-the-art SRI recognizer. Our experimental results demonstrate that our approach is able to reduce word error rate relatively by about 3 . This gain is consistent across the two different tests showing promising future directions of incorporating prosodic information to improve speech recognition. 1 Introduction Prosody refers to the suprasegmental features of natural speech such as rhythm and intonation since it normally extends over more than one phoneme segment. Speakers use prosody to convey paralin-guistic information such as emphasis intention attitude and emotion. Humans listening to speech with natural prosody are able to understand the content with low cognitive load and high accuracy. However most modern ASR systems only use an acous 732 tic model and a language model. Acoustic information in ASR is represented by spectral features that are usually extracted over a window length of a few tens of milliseconds. They miss useful information contained in the prosody of the speech

Thủy Trang 43 10 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461860 55

Giới thiệu :Lập trình mã nguồn mở

14 22617 59

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10883 529

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10061 446

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9516 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8275 1125

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8226 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7863 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6671 253

Vật lý hạt cơ bản (1)

29 5767 85

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Sáng tạo trong thuật toán và lập trình với ngôn ngữ Pascal và C# Tập 2 - Chương 4

47 246 1 25-04-2024

extremetech Hacking BlackBerry phần 9

31 248 0 25-04-2024

Bibliography on Medieval Women, Gender, and Medicine 1980-2009

82 207 0 25-04-2024

Bơm máy nén quạt trong công nghệ part 1

20 249 2 25-04-2024

Trading Strategies Profit Making Techniques For Stock_8

23 174 0 25-04-2024

Anh văn bằng C-124

8 172 0 25-04-2024

MySQL Database Usage & Administration PHẦN 7

37 155 0 25-04-2024

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 144 0 25-04-2024

Khurana et al. Journal of Orthopaedic Surgery and Research 2010, 5:23

7 133 0 25-04-2024

New Trends and Developments in Automotive Industry Part 7

35 94 0 25-04-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7863 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5699 1356

Ebook Chào con ba mẹ đã sẵn sàng

112 3764 1231

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5315 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8275 1125

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3493 642

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10883 529

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3680 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4042 514

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4123 480