TAILIEUCHUNG - Báo cáo khoa học: "The Second Release of the RASP System"

We describe the new release of the RASP (robust accurate statistical parsing) system, designed for syntactic annotation of free text. The new version includes a revised and more semantically-motivated output representation, an enhanced grammar and part-of-speech tagger lexicon, and a more flexible and semi-supervised training method for the structural parse ranking model. We evaluate the released version on the WSJ using a relational evaluation scheme, and describe how the new release allows users to enhance performance using (in-domain) lexical information. . | The Second Release of the RASP System Ted Briscoey John Carrollz Rebecca Watsony Computer Laboratory University of Cambridge Cambridge CB3 OFD UK Department of Informatics University of Sussex Brighton BN1 9QH UK Abstract We describe the new release of the RASP robust accurate statistical parsing system designed for syntactic annotation of free text. The new version includes a revised and more semantically-motivated output representation an enhanced grammar and part-of-speech tagger lexicon and a more flexible and semi-supervised training method for the structural parse ranking model. We evaluate the released version on the WSJ using a relational evaluation scheme and describe how the new release allows users to enhance performance using in-domain lexical information. 1 Introduction The first public release of the RASP system Briscoe Carroll 2002 has been downloaded by over 120 sites and used in diverse natural language processing tasks such as anaphora resolution word sense disambiguation identifying rhetorical relations resolving metonymy detecting compositionality in phrasal verbs and diverse applications such as topic and sentiment classification text anonymisation summarisation information extraction and open domain question answering. Briscoe Carroll 2002 give further details about the first release. Briscoe 2006 provides references and more information about extant use of RASP and fully describes the modifications discussed more briefly here. The new release which is free for all noncommercial use1 is designed to address several weaknesses of the extant toolkit. Firstly all modules have been incrementally improved to cover a greater range of text types. Secondly the part-of-speech tagger lexicon has been semi-automatically enhanced to better deal with rare or unseen behaviour of known words. Thirdly better facilities have been provided for user customisation. 1 See http .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.