TAILIEUCHUNG - Báo cáo khoa học: "Faster Parsing by Supertagger Adaptation"

We propose a novel self-training method for a parser which uses a lexicalised grammar and supertagger, focusing on increasing the speed of the parser rather than its accuracy. The idea is to train the supertagger on large amounts of parser output, so that the supertagger can learn to supply the supertags that the parser will eventually choose as part of the highestscoring derivation. Since the supertagger supplies fewer supertags overall, the parsing speed is increased. | Faster Parsing by Supertagger Adaptation Jonathan K. Kummerfeld a Jessika Roesnerb TimDawborn a James Haggerty a James R. Curran a Stephen Clarkc School of Information Technologies0 Department of Computer Scienceb University of Sydney University of Texas at Austin NSW 2006 Australia james@ Computer Laboratoryc University of Cambridge Cambridge CB3 0FD Uk c Austin TX USA Abstract We propose a novel self-training method for a parser which uses a lexicalised grammar and supertagger focusing on increasing the speed of the parser rather than its accuracy. The idea is to train the supertagger on large amounts of parser output so that the supertagger can learn to supply the supertags that the parser will eventually choose as part of the highest-scoring derivation. Since the supertagger supplies fewer supertags overall the parsing speed is increased. We demonstrate the effectiveness of the method using a CCG supertagger and parser obtaining significant speed increases on newspaper text with no loss in accuracy. We also show that the method can be used to adapt the CCG parser to new domains obtaining accuracy and speed improvements for Wikipedia and biomedical text. 1 Introduction In many NLP tasks and applications . distributional similarity Curran 2004 and question answering Dumais et al. 2002 large volumes of text and detailed syntactic information are both critical for high performance. To avoid a tradeoff between these two we need to increase parsing speed but without losing accuracy. Parsing with lexicalised grammar formalisms such as Lexicalised Tree Adjoining Grammar and Combinatory Categorial Grammar CCG Steedman 2000 can be made more efficient using a supertagger. Bangalore and Joshi 1999 call supertagging almost parsing because of the significant reduction in ambiguity which occurs once the supertags have been assigned. In this paper we focus on the CCG parser and supertagger described in Clark and Curran 2007 . Since

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.