TAILIEUCHUNG - Báo cáo khoa học: "Using Lexical Dependency and Ontological Knowledge to Improve a Detailed Syntactic and Semantic Tagger of English"

This paper presents a detailed study of the integration of knowledge from both dependency parses and hierarchical word ontologies into a maximum-entropy-based tagging model that simultaneously labels words with both syntax and semantics. Our findings show that information from both these sources can lead to strong improvements in overall system accuracy: dependency knowledge improved performance over all classes of word, and knowledge of the position of a word in an ontological hierarchy increased accuracy for words not seen in the training data. . | Using Lexical Dependency and Ontological Knowledge to Improve a Detailed Syntactic and Semantic Tagger of English Andrew Finch NiCPATRt Kyoto Japan @ Ezra Black Epimenides Corp. New York UsA @ Young-Sook Hwang ETRI Seoul Korea yshwang7 @ Eiichiro Sumita NiCT-ATR Kyoto Japan @ Abstract This paper presents a detailed study of the integration of knowledge from both dependency parses and hierarchical word ontologies into a maximum-entropy-based tagging model that simultaneously labels words with both syntax and semantics. Our findings show that information from both these sources can lead to strong improvements in overall system accuracy dependency knowledge improved performance over all classes of word and knowledge of the position of a word in an ontological hierarchy increased accuracy for words not seen in the training data. The resulting tagger offers the highest reported tagging accuracy on this tagset to date. 1 Introduction Part-of-speech POS tagging has been one of the fundamental areas of research in natural language processing for many years. Most of the prior research has focussed on the task of labeling text with tags that reflect the words syntactic role in the sentence. In parallel to this the task of word sense disambiguation WSD the process of deciding in which semantic sense the word is being used has been actively researched. This paper addresses a combination of these two fields that is labeling running words with tags that comprise in addition to their syntactic function a broad semantic class that signifies the semantics of the word in the context of the sentence but does not necessarily provide information that is sufficiently finegrained as to disambiguate its sense. This differs National Institute of Information and Communications Technology ATR Spoken Language Communication Research Labs from what is commonly meant by WSD in that although each word may have many senses

Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.