TAILIEUCHUNG - Báo cáo khoa học: "Unlexicalised Hidden Variable Models of Split Dependency Grammars∗"

This paper investigates transforms of split dependency grammars into unlexicalised context-free grammars annotated with hidden symbols. Our best unlexicalised grammar achieves an accuracy of 88% on the Penn Treebank data set, that represents a 50% reduction in error over previously published results on unlexicalised dependency parsing. | Unlexicalised Hidden Variable Models of Split Dependency Grammars Gabriele Antonio Musillo Department of Computer Science and Department of Linguistics University of Geneva 1211 Geneva 4 Switzerland musillo4@ Paola Merlo Department of Linguistics University of Geneva 1211 Geneva 4 Switzerland merlo@ Abstract This paper investigates transforms of split dependency grammars into unlexicalised context-free grammars annotated with hidden symbols. Our best unlexicalised grammar achieves an accuracy of 88 on the Penn Treebank data set that represents a 50 reduction in error over previously published results on unlexicalised dependency parsing. 1 Introduction Recent research in natural language parsing has extensively investigated probabilistic models of phrase-structure parse trees. As well as being the most commonly used probabilistic models of parse trees probabilistic context-free grammars PCFGs are the best understood. As shown in Klein and Manning 2003 the ability of PCFG models to disambiguate phrases crucially depends on the expressiveness of the symbolic backbone they use. Treebank-specific heuristics have commonly been used both to alleviate inadequate independence assumptions stipulated by naive PCFGs Collins 1999 Charniak 2000 . Such methods stand in sharp contrast to partially supervised techniques that have recently been proposed to induce hidden grammatical representations that are finer-grained than those that can be read off the parsed sentences in treebanks Henderson 2003 Matsuzaki et al. 2005 Prescher 2005 Petrov et al. 2006 . Part of this work was done when Gabriele Musillo was visiting the MIT Computer Science and Artificial Intelligence Laboratory funded by a grant from the Swiss NSF PBGE2-117146 . Many thanks to Michael Collins and Xavier Carreras for their insightful comments on the work presented here. This paper presents extensions of such grammar induction techniques to dependency grammars. Our extensions rely on .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.