TAILIEUCHUNG - Báo cáo khoa học: "Automatic Recognition of Intonation Patterns"

This paper is a progress report on a project in linguistically based automatic speech recognition, The domain of this project is English intonation. The system I will describe analyzes fundamental frequency contours (F0 contours) of speech in terms of the theory of melody laid out in Pierrehumbert (1980). Experiments discussed in Liberman and Pierrehumbert (1983) support the assumptions made about intonational phonetics, and an F0 synthesis program based on a precursor to the present theory is described in Pierrehumbert (1981). . | Automatic Recognition of Intonation Patterns Janet B. Pierrehumbert Bell Laboratories Murray Hill New Jersey 07974 1. Introduction This paper is a progress report on a project in linguistically based automatic speech recognition The domain of this project is English intonation. The system I will describe analyzes fundamental frequency contours F0 contours of speech in terms of the theory of melody laid out in Pierrehumbert 1980 . Experiments discussed in Liberman and Pierrehumbert 1983 support the assumptions made about intonational phonetics and an F0 synthesis program based on a precursor to the present theory is described in Pierrehumbert 1981 . One aim of the project is to investigate the descriptive adequacy of this theory of English melody. A second motivation is to characterize cases where F0 may provide useful information about stress and phrasing. The third and to my mind the most important motivation depends on the observation that English intonation is in itself a small language complete with a syntax and phonetics. Building a recognizer for this small language is a relatively tractable problem which still presents some of the interesting features of the general speech recognition problem. In particular the FO contour like other measurements of speech is a continuously varying time function without overt segmentation. Its transcription is in terms of a sequence of discrete elements whose relation to the quantitative level of description is not transparent. An analysis of a contour thus relates heterogeneous levels of description one quantitative and one symbolic. In developing speech recognizers we wish to exploit achievements in symbolic computation. At the same time we wish to avoid forcing into a symbolic framework properties which could more insightfully or simply be treated as quantitative. In the case of intonation our experimental results suggest both a division of labor between these two levels of description and principles for their interaction.

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.