TAILIEUCHUNG - Báo cáo khoa học: "Supporting Annotation Layers for Natural Language Processing"

We demonstrate a system for flexible querying against text that has been annotated with the results of NLP processing. The system supports self-overlapping and parallel layers, integration of syntactic and ontological hierarchies, flexibility in the format of returned results, and tight integration with SQL. We present a query language and its use on examples taken from the NLP literature. | Supporting Annotation Layers for Natural Language Processing Preslav Nakov Ariel Schwartz Brian Wolf Marti Hearst Computer Science Division University of California Berkeley Berkeley CA 94720 nakov sariel @ SIMS University of California Berkeley Berkeley CA 94720 hearst@ Abstract We demonstrate a system for flexible querying against text that has been annotated with the results of NLP processing. The system supports self-overlapping and parallel layers integration of syntactic and ontological hierarchies flexibility in the format of returned results and tight integration with SQL. We present a query language and its use on examples taken from the NLP literature. 1 Introduction Today most natural language processing NLP algorithms make use of the results of previous processing steps. For example a word sense disambiguation algorithm may combine the output of a to-kenizer a part-of-speech tagger a phrase boundary recognizer and a module that classifies noun phrases into semantic categories. Currently there is no standard way to represent and store the results of such processing for efficient retrieval. We propose a framework for annotating text with the results of NLP processing and then querying against those annotations in flexible ways. The framework includes a query language and an indexing architecture for efficient retrieval built on top of a relational database management system RDBMS . The model allows for both hierarchical and overlapping layers of annotation as well as for querying at multiple levels of description. In the remainder of the paper we describe related work illustrate the annotation model and the query language and describe the indexing architecture and the experimental results thus showing the feasibility of the approach for a variety of NLP tasks. 2 Related Work There are several specialized tools for indexing and querying treebanks. See Bird et al. 2005 for an overview and critical comparisons. TGrep2r is a a

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.