TAILIEUCHUNG - Báo cáo khoa học: "Prototype-Driven Grammar Induction"

We investigate prototype-driven learning for primarily unsupervised grammar induction. Prior knowledge is specified declaratively, by providing a few canonical examples of each target phrase type. This sparse prototype information is then propagated across a corpus using distributional similarity features, which augment an otherwise standard PCFG model. We show that distributional features are effective at distinguishing bracket labels, but not determining bracket locations. To improve the quality of the induced trees, we combine our PCFG induction with the CCM model of Klein and Manning (2002) | Prototype-Driven Grammar Induction Aria Haghighi Computer Science Division University of California Berkeley aria42@ Dan Klein Computer Science Division University of California Berkeley klein@ Abstract We investigate prototype-driven learning for primarily unsupervised grammar induction. Prior knowledge is specified declaratively by providing a few canonical examples of each target phrase type. This sparse prototype information is then propagated across a corpus using distributional similarity features which augment an otherwise standard PCFG model. We show that distributional features are effective at distinguishing bracket labels but not determining bracket locations. To improve the quality of the induced trees we combine our PCFG induction with the CCM model of Klein and Manning 2002 which has complementary stengths it identifies brackets but does not label them. Using only a handful of prototypes we show substantial improvements over naive PCFG induction for English and Chinese grammar induction. 1 Introduction There has been a great deal of work on unsupervised grammar induction with motivations ranging from scientific interest in language acquisition to engineering interest in parser construction Carroll and Charniak 1992 Clark 2001 . Recent work has successfully induced unlabeled grammatical structure but has not successfully learned labeled tree structure Klein and Manning 2002 Klein and Manning 2004 Smith and Eisner 2004 . In this paper our goal is to build a system capable of producing labeled parses in a target grammar with as little total effort as possible. We investigate a prototype-driven approach to grammar induction in which one supplies canonical examples of each target concept. For example we might specify that we are interested in trees which use the symbol NP and then list several examples of prototypical NPs determiner noun pronouns etc. see figure 1 for a sample prototype list . This prototype information is .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.