Scientific report: "Managing Uncertainty in Semantic Tagging"

Silvie Cinkova, Martin Holub and Vincent Kriz
Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics
cinkova holub @

Abstract

Low interannotator agreement (IAA) is a well-known issue in manual semantic tagging (sense tagging). IAA correlates with the granularity of word senses and they both correlate with the amount of information they give as well as with its reliability. We compare different approaches to semantic tagging in WordNet, FrameNet, PropBank and OntoNotes with a small tagged data sample based on the Corpus Pattern Analysis to present the reliable information gain (RG), a measure used to optimize the semantic granularity of a sense inventory with respect to its reliability indicated by the IAA in the given data set. RG can also be used as feedback for lexicographers and as a supporting component of automatic semantic classifiers, especially when dealing with a very fine-grained set of semantic categories.
1 Introduction

The term semantic tagging is used in two divergent areas: (1) recognizing objects of semantic importance, such as entities, events and polarity, often tailored to a restricted domain, or (2) relating occurrences of words in a corpus to a lexicon and selecting the most appropriate semantic categories (such as synsets, semantic frames, word senses, semantic patterns or framesets). We are concerned with the second case, which seeks to make lexical semantics tractable for computers. Lexical semantics, as opposed to propositional semantics, focuses on the meaning of lexical items. The disciplines that focus on lexical semantics are lexicology and lexicography rather than logic. By semantic tagging we mean a process of assigning semantic categories to target words in given contexts. This process can be either manual or automatic. Traditionally, semantic tagging relies on the tacit assumption that various uses of polysemous words can be sorted
