TAILIEUCHUNG - Báo cáo khoa học: "Similarity between Words Computed by Spreading Activation on an English Dictionary"

This paper proposes a method for measuring semantic similarity between words as a new tool for text analysis. The similarity is measured on a semantic network constructed systematically from a subset of the English dictionary, LDOCE (Long-man Dictionary of Contemporary English). Spreading activation on the network can directly compute the similarity between any two words in the Longman Defining Vocabulary, and indirectly the similarity of all the other words in LDOCE. | Similarity between Words Computed by Spreading Activation on an English Dictionary Hideki Kozima Course in Computer Science and Information Mathematics Graduate School University of Electro-Communications 1-5-1 Chofugaoka Chofu Tokyo 182 Japan Teiji Furugori Department of Computer Science and Information Mathematics University of Electro-Communications 1-5-1 Chofugaoka Chofu Tokyo 182 Japan Tel. 81-424-83-2161 Abstract This paper proposes a method for measuring semantic similarity between words as a new tool for text analysis. The similarity is measured on a semantic network constructed systematically from a subset of the English dictionary LDOCE Longman Dictionary of Contemporary English . Spreading activation on the network can directly compute the similarity between any two words in the Longman Defining Vocabulary and indirectly the similarity of all the other words in LDOCE. The similarity represents the strength of lexical cohesion or semantic relation and also provides valuable information about similarity and coherence of texts. 1 Introduction A text is not just a sequence of words but it also has coherent structure. The meaning of each word in a text depends on the structure of the text. Recognizing the structure of text is an essential task in text understanding. Grosz and Sidner 1986 One of the valuable indicators of the structure of text is lexical cohesion. Halliday and Hasan 1976 Lexical cohesion is the relationship between words classified as follows 1. Reiteration Molly likes cats. She keeps a cat. 2. Semantic relation a. Desmond saw a cat. It was Molly s pet. . b. Molly goes to the north. Not east. c. Desmond goes to a theatre. He likes films. Reiteration of words is easy to capture by morphological analysis. Semantic relation between words which is the focus of this paper is hard to recognize by computers. We consider lexical cohesion as semantic similarity between words. Similarity

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.