TAILIEUCHUNG - Scientific report: "A Debug Tool for Practical Grammar Development"

Akane Yakushiji†, Yuka Tateisi‡, Yusuke Miyao†, Naoki Yoshinaga†, Jun'ichi Tsujii†‡
†Department of Computer Science, University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033 JAPAN
‡CREST, JST (Japan Science and Technology Corporation), Honcho 4-1-8, Kawaguchi-shi, Saitama 332-0012 JAPAN
{akane, yucca, yusuke, yoshinag, tsujii}@

Abstract

We have developed willex, a tool that helps grammar developers work efficiently by using annotated corpora and recording parsing errors. Willex has two major new functions. First, it decreases the ambiguity of parsing results by comparing them to an annotated corpus and removing wrong partial results, both automatically and manually. Second, willex accumulates parsing errors as data so that developers can identify the defects of the grammar statistically. We applied willex to a large-scale HPSG-style grammar as an example.

1 Introduction

There is an increasing need for syntactic parsers for practical uses such as information extraction. For example, Yakushiji et al. (2001) extracted argument structures from biomedical papers using a parser based on XHPSG (Tateisi et al., 1998), which is a large-scale HPSG.

Although large-scale, general-purpose grammars have been developed, they suffer from limited coverage. The limits derive from deficiencies in the grammars themselves. For example, XHPSG cannot treat coordinations of verbs (ex. "Molybdate slowed but did not prevent the conversion.") nor reduced relatives (ex. "Rb mutants derived from patients with retinoblastoma."). Finding these grammar defects and correcting them requires tremendous human effort.

Hence, we have developed willex, a tool that helps to improve general-purpose grammars. Willex has two major functions. First, it reduces the human workload of improving a general-purpose grammar by using the language intuition encoded in syntactically tagged corpora in XML format. Second, it records data of grammar defects to allow developers to have a
