TAILIEUCHUNG - Báo cáo khoa học: "A Re-examination of Query Expansion Using Lexical Resources"

Query expansion is an effective technique to improve the performance of information retrieval systems. Although hand-crafted lexical resources, such as WordNet, could provide more reliable related terms, previous studies showed that query expansion using only WordNet leads to very limited performance improvement. One of the main challenges is how to assign appropriate weights to expanded terms. In this paper, we re-examine this problem using recently proposed axiomatic approaches and find that, with appropriate term weighting strategy, we are able to exploit the information from lexical resources to significantly improve the retrieval performance. . | A Re-examination of Query Expansion Using Lexical Resources Hui Fang Department of Computer Science and Engineering The Ohio State University Columbus OH 43210 hfang@ Abstract Query expansion is an effective technique to improve the performance of information retrieval systems. Although hand-crafted lexical resources such as WordNet could provide more reliable related terms previous studies showed that query expansion using only WordNet leads to very limited performance improvement. One of the main challenges is how to assign appropriate weights to expanded terms. In this paper we re-examine this problem using recently proposed axiomatic approaches and find that with appropriate term weighting strategy we are able to exploit the information from lexical resources to significantly improve the retrieval performance. Our empirical results on six TREC collections show that query expansion using only hand-crafted lexical resources leads to significant performance improvement. The performance can be further improved if the proposed method is combined with query expansion using co-occurrence-based resources. 1 Introduction Most information retrieval models Salton et al. 1975 Fuhr 1992 Ponte and Croft 1998 Fang and Zhai 2005 compute relevance scores based on matching of terms in queries and documents. Since various terms can be used to describe a same concept it is unlikely for a user to use a query term that is exactly the same term as used in relevant documents. Clearly such vocabulary gaps make the retrieval performance non-optimal. Query expansion Voorhees 1994 Mandala et al. 1999a Fang and Zhai 2006 Qiu and Frei 1993 Bai et al. 2005 Cao et al. 2005 is a commonly used strategy to bridge the vocabulary gaps by expanding original queries with related terms. Expanded terms are often selected from either co-occurrence-based thesauri Qiu and Frei 1993 Bai et al. 2005 Jing and Croft 1994 Peat and Willett 1991 Smeaton and van Rijsbergen 1983 Fang and Zhai .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.