TOWARDS THE AUTOMATIC IDENTIFICATION OF ADJECTIVAL SCALES: CLUSTERING ADJECTIVES ACCORDING TO MEANING

Vasileios Hatzivassiloglou, Kathleen R. McKeown
Department of Computer Science
450 Computer Science Building
Columbia University
New York 10027
Internet: vh@ kathy@

ABSTRACT

In this paper we present a method to group adjectives according to their meaning, as a first step towards the automatic identification of adjectival scales. We discuss the properties of adjectival scales and of groups of semantically related adjectives and how they imply sources of linguistic knowledge in text corpora. We describe how our system exploits this linguistic knowledge to compute a measure of similarity between two adjectives, using statistical techniques and without having access to any semantic information about the adjectives. We also show how a clustering algorithm can use these similarities to produce the groups of adjectives, and we present results produced by our system for a sample set of adjectives. We conclude by presenting evaluation methods for the task at hand and analyzing the significance of the results obtained.

1. INTRODUCTION

As natural language processing systems become more oriented towards solving real-world problems like machine translation or spoken language understanding in a limited domain, their need for access to vast amounts of knowledge increases.
While a model of the general rules of the language at various levels (morphological, syntactic, etc.) can be hand-encoded, knowledge which pertains to each specific word is harder to encode manually, if only because of the size of the lexicon. Most systems currently rely on human linguists or lexicographers who compile lexicon entries by hand. This approach requires significant amounts of time and effort for expanding the system's lexicon. Furthermore, if the compiled information depends in any way on the domain of the application, the acquisition of lexical knowledge must be repeated.
