Scientific report: "Exploring Distributional Similarity Based Models for Query Spelling Correction"

Mu Li
Microsoft Research Asia, 5F Sigma Center, Zhichun Road, Haidian District, Beijing, China, 100080
muli@

Yang Zhang
School of Computer Science and Technology, Tianjin University, Tianjin, China, 300072
yangzhang@

Muhua Zhu
School of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, China, 110004
zhumh@

Ming Zhou
Microsoft Research Asia, 5F Sigma Center, Zhichun Road, Haidian District, Beijing, China, 100080
mingzhou@

Abstract

A query speller is crucial to a search engine in improving web search relevance. This paper describes novel methods for using distributional similarity estimated from query logs to learn improved query spelling correction models. The key to our methods is a property of distributional similarity between two terms: it is high between a frequently occurring misspelling and its correction, and low between two unrelated terms that merely have similar spellings. We present two models that take advantage of this property. Experimental results demonstrate that the distributional similarity based models significantly outperform their baseline systems on the web query spelling correction task.

1 Introduction

Investigations into query log data reveal that more than 10% of queries sent to search engines contain misspelled terms (Cucerzan and Brill, 2004). Such statistics indicate that a good query speller is crucial to a search engine in improving web search relevance, because a search engine has little chance of retrieving much relevant content for misspelled terms. The problem of designing a spelling correction program for web search queries, however, poses special technical challenges and cannot be solved well by general-purpose spelling correction methods. Cucerzan and Brill (2004) discussed in detail the particularities and difficulties of a query spell checker and illustrated why the existing ...
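
The property stated in the abstract (high distributional similarity between a misspelling and its correction, low similarity between unrelated terms with similar spellings) can be illustrated with a small sketch. The excerpt does not say how the paper defines or estimates distributional similarity, so everything below is an assumption made for illustration: terms are represented by counts of the other words they co-occur with in a query, similarity is the cosine of those count vectors, and the query log is a hypothetical toy list rather than real data.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(queries):
    """Map each term to a Counter of the other terms sharing a query with it."""
    vectors = defaultdict(Counter)
    for query in queries:
        terms = query.lower().split()
        for i, term in enumerate(terms):
            for j, other in enumerate(terms):
                if i != j:
                    vectors[term][other] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy query log (assumption, not data from the paper).
toy_log = [
    "britney spears lyrics",
    "britny spears lyrics",
    "britney spears songs",
    "britny spears songs",
    "brittany ferry timetable",
    "brittany travel guide",
]

vectors = context_vectors(toy_log)

# A misspelling and its correction appear in the same contexts.
sim_misspelling = cosine(vectors["britny"], vectors["britney"])

# Two valid but unrelated terms with similar spellings share no context.
sim_unrelated = cosine(vectors["britney"], vectors["brittany"])

print(f"britny vs britney:   {sim_misspelling:.3f}")
print(f"britney vs brittany: {sim_unrelated:.3f}")
```

In this toy log the misspelling "britny" shares all of its contexts with "britney", while "brittany" shares none, so the first score is much higher than the second. A real query log would be far noisier, and the actual models in the paper may weight or combine this signal differently.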
