Scientific report: "Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study"

Nathan Schneider, Behrang Mohit, Kemal Oflazer, Noah A. Smith
School of Computer Science, Carnegie Mellon University
Doha, Qatar and Pittsburgh, PA 15213, USA
nschneid@cs. behrang@ ko@cs. nasmith@cs.

Abstract

“Lightweight” semantic annotation of text calls for a simple representation, ideally without requiring a semantic lexicon to achieve good coverage in the language and domain. In this paper, we repurpose WordNet’s supersense tags for annotation, developing specific guidelines for nominal expressions and applying them to Arabic Wikipedia articles in four topical domains. The resulting corpus has high coverage and was completed quickly with reasonable inter-annotator agreement.

1 Introduction

The goal of “lightweight” semantic annotation of text, particularly in scenarios with limited resources and expertise, presents several requirements for a representation: simplicity; adaptability to new languages, topics, and genres; and coverage. This paper describes coarse lexical semantic annotation of Arabic Wikipedia articles subject to these constraints.

Traditional lexical semantic representations are either narrow in scope, like named entities,¹ or make reference to a full-fledged lexicon/ontology, which may insufficiently cover the language/domain of interest or require prohibitive expertise and effort to [...] We therefore turn to supersense tags (SSTs), 40 coarse lexical semantic classes (25 for nouns, 15 for verbs) originating in WordNet. Previously these served as groupings of English lexicon [...]

¹ Some ontologies, like those in Sekine et al. (2002) and BBN Identifinder (Bikel et al., 1999), include a large selection of classes, which tend to be especially relevant to proper names.

[...] a WordNet (Fellbaum, 1998) sense annotation effort reported by Passonneau et al. (2010) found considerable inter-annotator variability for some lexemes; FrameNet (Baker et al., 1998) is limited in coverage, even for English; and PropBank (Kingsbury and [...]
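
To make the size of the supersense inventory concrete (40 classes: 25 for nouns, 15 for verbs), the following is a minimal sketch of how such a tag set might be represented and checked. The tag names are taken from WordNet's lexicographer file names, which is an assumption on our part since this excerpt does not list them; the helper functions and the example tokens are hypothetical and are not drawn from the annotated corpus.

```python
# Illustrative sketch only. Tag names follow WordNet's lexicographer file
# names (an assumption: the excerpt itself does not enumerate the tags).

NOUN_SUPERSENSES = (
    "act", "animal", "artifact", "attribute", "body", "cognition",
    "communication", "event", "feeling", "food", "group", "location",
    "motive", "object", "person", "phenomenon", "plant", "possession",
    "process", "quantity", "relation", "shape", "state", "substance", "time",
)

VERB_SUPERSENSES = (
    "body", "change", "cognition", "communication", "competition",
    "consumption", "contact", "creation", "emotion", "motion",
    "perception", "possession", "social", "stative", "weather",
)


def all_supersenses():
    """Return every tag qualified by part of speech, e.g. 'noun.person'."""
    nouns = [f"noun.{name}" for name in NOUN_SUPERSENSES]
    verbs = [f"verb.{name}" for name in VERB_SUPERSENSES]
    return nouns + verbs


def is_valid_tag(tag):
    """Check that a proposed annotation uses a tag from the inventory."""
    return tag in set(all_supersenses())


# Hypothetical nominal annotations, invented purely for illustration.
example_annotation = [
    ("university", "noun.group"),
    ("Doha", "noun.location"),
    ("article", "noun.communication"),
]

if __name__ == "__main__":
    print(len(all_supersenses()), "supersense tags in total")
    for token, tag in example_annotation:
        print(token, tag, is_valid_tag(tag))
```

Because the inventory is a small closed set, an annotation tool could validate every label against it without consulting a full lexicon, which is in the spirit of the lightweight requirement stated above.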
