TAILIEUCHUNG - Báo cáo khoa học: "Extracting Word Sets with Non-Taxonomical Relation"

At least two kinds of relations exist among related words: taxonomical relations and thematic relations. Both relations identify related words useful to language understanding and generation, information retrieval, and so on. However, although words with taxonomical relations are easy to identify from linguistic resources such as dictionaries and thesauri, words with thematic relations are difficult to identify because they are rarely maintained in linguistic resources. | Extracting Word Sets with Non-Taxonomical Relation Eiko Yamamoto Hitoshi Isahara Computational Linguistics Group National Institute of Information and Communications Technology 3-5 Hikaridai Seika-cho Soraku-gun Kyoto 619-0289 Japan eiko isahara @ Abstract At least two kinds of relations exist among related words taxonomical relations and thematic relations. Both relations identify related words useful to language understanding and generation information retrieval and so on. However although words with taxonomical relations are easy to identify from linguistic resources such as dictionaries and thesauri words with thematic relations are difficult to identify because they are rarely maintained in linguistic resources. In this paper we sought to extract thematically non-taxonomically related word sets among words in documents by employing case-marking particles derived from syntactic analysis. We then verified the usefulness of word sets with non-taxonomical relation that seems to be a thematic relation for information retrieval. 1. Introduction Related word sets are useful linguistic resources for language understanding and generation information retrieval and so on. In previous research on natural language processing many methodologies for extracting various relations from corpora have been developed such as the is-a relation Hearst 1992 part-of relation Berland and Charniak 1999 causal relation Girju 2003 and entailment relation Geffet and Dagan 2005 . Related words can be used to support retrieval in order to lead users to high-quality information. One simple method is to provide additional words related to the key words users have input such as an input support function within the Google search engine. What kind of relation between the key words that have been input and the additional word is effective for information retrieval As for the relations among words at least two kinds of relations exist the taxonomical relation and the thematic relation. The

TÀI LIỆU MỚI ĐĂNG
11    177    2    08-01-2025
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.