Scientific report: "Word or Phrase? Learning Which Unit to Stress for Information Retrieval∗"

Young-In Song, Jung-Tae Lee, and Hae-Chang Rim
Microsoft Research Asia, Beijing, China
Dept. of Computer and Radio Communications Engineering, Korea University, Seoul, Korea
yosong@ jtlee rim @

Abstract

The use of phrases in retrieval models has been proven to be helpful in the literature, but no particular research addresses the problem of discriminating phrases that are likely to degrade retrieval performance from the ones that do not. In this paper, we present a retrieval framework that utilizes both words and phrases flexibly, followed by a general learning-to-rank method for learning the potential contribution of a phrase in retrieval. We also present useful features that reflect the compositionality and discriminative power of a phrase and its constituent words for optimizing the weights of phrase use in phrase-based retrieval models. Experimental results on the TREC collections show that our proposed method is effective.

1 Introduction

Various studies have improved the quality of information retrieval by relaxing the traditional bag-of-words assumption with the use of phrases. Miller et al. (1999) and Song and Croft (1999) explore the use of n-grams in retrieval models. Fagan (1987), Gao et al. (2004), Metzler and Croft (2005), and Tao and Zhai (2007) use statistically captured term dependencies within a query. Strzalkowski et al. (1994), Kraaij and Pohlmann (1998), and Arampatzis et al. (2000) study the utility of various kinds of syntactic phrases.
Although the use of phrases clearly helps, there still exists a fundamental but unsolved question: do all phrases contribute an equal increase in the performance of information retrieval models? Let us consider the search query "World Bank Criticism", which has the following phrases: "world bank" and "bank criticism". Intuitively, the former ...

∗ This work was done while Young-In Song was with the Dept. of Computer and Radio Communications Engineering, Korea University.
