TAILIEUCHUNG - Báo cáo khoa học: "Analysis of Unknown Words through Morphological Decomposition"

This paper describes a method of analysing words through morphological decomposition when the lexicon is incomplete. The method is used within a text-to-speech system to help generate pronunciations of unknown words. The method is achieved within a general morphological analyser system using Koskenniemi twolevel rules. | Analysis of Unknown Words through Morphological Decomposition Alan w Black Dept of Artificial Intelligence University of Edinburgh 80 South Bridge Edinburgh EH1 1HN Scotland UK. Joke van de Plassche NICI University of Nijmegen Montessorilaan 3 6525 HR Nijmegen The Netherlands Briony Williams Centre for Speech Technology University of Edinburgh 80 South Bridge Edinburgh EH1 1HN Scotland UK. brionyOcs Abstract This paper describes a method of analysing words through morphological decomposition when the lexicon is incomplete. The method is used within a text-to-speech system to help generate pronunciations of unknown words. The method is achieved within a general morphological analyser system using Koskenniemi two-level rules. Keywords Morphology incomplete lexicon text-to-speech systems Background When a text-to-speech synthesis system is used it is likely that the text being processed will contain a few words which do not appear in the lexicon as entries in theữ own right. If the lexicon consists only of whole-word entries then the method for producing a pronunciation for such unknown words is simply to pass them through a set of letter-to-sound rules followed by word stress assignment rules and vowel reduction rules. The resulting pronunciation may well be inaccurate particularly in English which often shows a poor relationship between spelling and pronunciation . In addition the default set of word classes assigned to the word noun verb adjective will be too general to be of much help to the syntactic parsing module. However if the lexicon contains individual morphemes both bound and free an unknown word can be analysed into its constituent morphemes. Stress assignment rules will then be more likely to yield the correct pronunciation and any characteristic suffix that may be present will allow for the assignment of a more accurate word class or classes eg. nesa denotes a noun ly an adverb . Morphological .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.