TAILIEUCHUNG - Scientific report: "Bootstrapping a Unified Model of Lexical and Phonetic Acquisition"

Micha Elsner (melsner0@), Sharon Goldwater (sgwater@)
ILCC, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK

Jacob Eisenstein (jacobe@)
School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA 30308, USA

Abstract

During early language acquisition, infants must learn both a lexicon and a model of phonetics that explains how lexical items can vary in pronunciation; for instance, “the” might be realized as [Di] or [D@]. Previous models of acquisition have generally tackled these problems in isolation, yet behavioral evidence suggests infants acquire lexical and phonetic knowledge simultaneously. We present a Bayesian model that clusters together phonetic variants of the same lexical item while learning both a language model over lexical items and a log-linear model of pronunciation variability based on articulatory features. The model is trained on transcribed surface pronunciations and learns by bootstrapping, without access to the true lexicon. We test the model using a corpus of child-directed speech with realistic phonetic variation and either gold standard or automatically induced word boundaries. In both cases, modeling variability improves the accuracy of the learned lexicon over a system that assumes each lexical item has a unique pronunciation.

1 Introduction

Infants acquiring their first language confront two difficult cognitive problems: building a lexicon of word forms and learning basic phonetics and phonology.
The two tasks are closely related: knowing what sounds can substitute for one another helps in clustering together variant pronunciations of the same word, while knowing the environments in which particular words can occur helps determine which sound changes are meaningful and which are not (Feldman ...).

[Figure: (a) intended: juwantwAn, wantekuki; (b) surface: jo wa WAn, wan a kuki; (c) ...]
