Scientific report: "A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation"

Christoph Müller
EML Research gGmbH
Villa Bosch, Schloß-Wolfsbrunnenweg 33
69118 Heidelberg, Germany
mueller@

Abstract

We present an implemented XML data model and a new, simplified query language for multi-level annotated corpora. The new query language involves automatic conversion of queries into the underlying, more complicated MMAXQL query language. It supports queries for sequential and hierarchical, but also associative (e.g. coreferential) relations. The simplified query language has been designed with non-expert users in mind.

1 Introduction

Growing interest in richly annotated corpora is a driving force for the development of annotation tools that can handle multiple levels of annotation. We find it crucial, in order to make full use of the potential of multi-level annotation, that individual annotation levels be treated as self-contained modules which are independent of other annotation levels. This independence should also include the storing of each level in a separate file. If these principles are observed, annotation data management (incl. level addition, removal and replacement, but also conversion into and from other formats) is greatly facilitated.

The way to keep individual annotation levels independent of each other is by defining each with direct reference to the underlying basedata, i.e. the text or transcribed speech. Both sequential and hierarchical (i.e. embedding or dominance) relations between markables on different levels are thus only expressed implicitly, viz. by means of the relations of their basedata elements.

While it has become common practice to use the stand-off mechanism to relate several annotation levels to one basedata file, it is also not uncommon to find this mechanism applied for relating markables to other markables (on a different or the same level) directly, expressing the relation between them explicitly. We argue that this is unfavourable not ...
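
The idea that relations between markables are expressed only implicitly, through their basedata elements, can be illustrated with a minimal Python sketch. It is not the data model or API described in the paper: the names `BASEDATA`, `Markable`, `embeds`, `precedes` and `embedded_in`, the two example levels, and the restriction to contiguous token spans are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical basedata: the tokenized text every annotation level points to.
BASEDATA = ["The", "old", "man", "saw", "his", "dog"]


@dataclass(frozen=True)
class Markable:
    """An annotated unit, defined only by the basedata tokens it covers."""
    level: str   # name of the annotation level (illustrative)
    start: int   # index of the first basedata token, inclusive
    end: int     # index of the last basedata token, inclusive
    label: str   # annotation value on this level

    def text(self) -> str:
        return " ".join(BASEDATA[self.start:self.end + 1])


def embeds(outer: Markable, inner: Markable) -> bool:
    """Hierarchical relation: inner's token span lies within outer's span."""
    return outer.start <= inner.start and inner.end <= outer.end


def precedes(a: Markable, b: Markable) -> bool:
    """Sequential relation: a ends before b starts."""
    return a.end < b.start


# Two independent levels; neither one refers to the other.
syntax = [Markable("syntax", 0, 2, "NP"), Markable("syntax", 4, 5, "NP")]
pos = [Markable("pos", 1, 1, "ADJ"), Markable("pos", 3, 3, "VERB")]


def embedded_in(outer_level, inner_level):
    """Derive cross-level embedding purely from basedata positions."""
    return [(o, i) for o in outer_level for i in inner_level if embeds(o, i)]


pairs = embedded_in(syntax, pos)
for outer, inner in pairs:
    print(f"{inner.label} '{inner.text()}' is embedded in "
          f"{outer.label} '{outer.text()}'")
```

Because each level stores only token positions, either list could be removed or replaced without touching the other, which is the kind of independence the introduction argues for.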
