Scientific report: "Learning Phonological Rule Probabilities from Speech Corpora with Exploratory Computational Phonology"

Gary Tajchman, Daniel Jurafsky, and Eric Fosler
International Computer Science Institute and University of California at Berkeley
{tajchman, jurafsky, fosler}@icsi.

Abstract

This paper presents an algorithm for learning the probabilities of optional phonological rules from corpora. The algorithm is based on using a speech recognition system to discover the surface pronunciations of words in speech corpora; using an automatic system obviates expensive phonetic labeling by hand. We describe the details of our algorithm and show the probabilities the system has learned for ten common phonological rules which model reductions and coarticulation effects. These probabilities were derived from a corpus of 7203 sentences of read speech from the Wall Street Journal, and are shown to be a reasonably close match to probabilities from phonetically hand-transcribed data (TIMIT). Finally, we analyze the probability differences between rule use in male versus female speech, and suggest that the differences are caused by differing average rates of speech.

1 Introduction

Phonological rules have formed the basis of phonological theory for decades, although their form and their coverage of the data have changed over the years. Until recently, however, it was difficult to determine the relationship between hand-written phonological rules and actual speech data.
The current availability of large speech corpora and pronunciation dictionaries has allowed us to connect rules and speech in much tighter ways. For example, a number of algorithms have recently been proposed which automatically induce phonological rules from dictionaries or corpora (Gasser 1993; Ellison 1992; Daelemans et al. 1994). While such algorithms have successfully induced syllabicity or harmony constraints, or simple obligatory ...

Footnote: Currently at Voice Processing Corp, 1 Main St, Cambridge, MA 02142, tajchman@
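The abstract's central idea, estimating the probability of an optional rule from the pronunciations a recognizer selects, can be illustrated with a simple relative-frequency count. The sketch below is only an illustration under stated assumptions, not the authors' implementation: the function name `estimate_rule_probabilities`, the rule labels, and the toy observations are all hypothetical, and it assumes each observation already records whether a rule applied at a site where its context matched.

```python
from collections import Counter


def estimate_rule_probabilities(observations):
    """Relative-frequency estimate of optional-rule probabilities.

    `observations` is an iterable of (rule_name, applied) pairs, one pair
    per site where the rule's context matched. `applied` is True when the
    surface pronunciation chosen for that site shows the rule's output.
    (Hypothetical input format, assumed for this sketch.)
    """
    applicable = Counter()  # sites where the rule could have applied
    applied = Counter()     # sites where it actually applied
    for rule, was_applied in observations:
        applicable[rule] += 1
        if was_applied:
            applied[rule] += 1
    # P(rule) = times applied / times applicable
    return {rule: applied[rule] / applicable[rule] for rule in applicable}


# Invented toy data; rule labels are placeholders, not taken from the paper.
observations = [
    ("rule_a", True), ("rule_a", True), ("rule_a", False), ("rule_a", True),
    ("rule_b", False), ("rule_b", True),
]
probs = estimate_rule_probabilities(observations)
print(probs)
```

A per-speaker-group comparison, such as the male versus female analysis mentioned in the abstract, could reuse the same function by calling it once on each group's observations.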
