TAILIEUCHUNG - Báo cáo khoa học: "Homonymy and Polysemy in Information Retrieval"

This paper discusses research on distinguishing word meanings in the context of information retrieval systems. We conducted experiments with three sources of evidence for making these distinctions: morphology, part-of-speech, and phrases. We have focused on the distinction between h o m o n y m y and polysemy (unrelated vs. related meanings). Our results support the need to distinguish h o m o n y m y and p o l y semy. We found: 1) grouping morphological variants makes a significant improvement in retrieval performance, 2) that more than half of all words in a dictionary that differ. | Homonymy and Polysemy in Information Retrieval Robert Krovetz NEC Research Institute 4 Independence Way Princeton NJ. 08540 krovetz@ . Abstract This paper discusses research on distinguishing word meanings in the context of information retrieval systems. We conducted experiments with three sources of evidence for making these distinctions morphology part-of-speech and phrases. We have focused on the distinction between homonymy and polysemy unrelated vs. related meanings . Our results support the need to distinguish homonymy and polysemy. We found 1 grouping morphological variants makes a significant improvement in retrieval performance 2 that more than half of all words in à dictionary that differ in part-of-speech are related in meaning and 3 that it is crucial to assign credit to the component words of a phrase. These experiments provide a better understanding of word-based methods and suggest where natural language processing can provide further improvements in retrieval performance. 1 Introduction Lexical ambiguity is a fundamental problem in natural language processing but relatively little quantitative information is available about the extent of the problem or about the impact that it has on specific applications. We report on our experiments to resolve lexical ambiguity in the context of information retrieval IR . Our approach to disambiguation is to treat the information associated with dictionary This paper is based on work that Weis done at the Center for Intelligent Information Retrieval at the University of Massachusetts. It was supported by the National Science Foundation Library of Congress and Department of Commerce under cooperative agreement number EEC-9 209623. I am grateful for their support. senses morphology part of speech and phrases as multiple sources of Experiments were designed to test each source of evidence independently and to identify areas of interaction. Our hypothesis is Hypothesis 1 Resolving lexical

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.