Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Optimizing Language Model Information Retrieval System with Expectation Maximization Algorithm"

Hồng Minh 64 9 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

Statistical language modeling (SLM) has been used in many different domains for decades and has also been applied to information retrieval (IR) recently. Documents retrieved using this approach are ranked according their probability of generating the given query. In this paper, we present a novel approach that employs the generalized Expectation Maximization (EM) algorithm to improve language models by representing their parameters as observation probabilities of Hidden Markov Models (HMM). | Optimizing Language Model Information Retrieval System with Expectation Maximization Algorithm Justin Liang-Te Chiu Department of Computer Science and Information Engineering National Taiwan University 1 Roosevelt Rd. Sec. 4 Taipei Taiwan 106 ROC b94902009@ntu.edu.tw Jyun-Wei Huang Department of Computer Science and Engineering Yuan Ze University 135 Yuan-Tung Road Chungli Taoyuan Taiwan ROC s976017 @mail.yzu.edu.tw Abstract Statistical language modeling SLM has been used in many different domains for decades and has also been applied to information retrieval IR recently. Documents retrieved using this approach are ranked according their probability of generating the given query. In this paper we present a novel approach that employs the generalized Expectation Maximization EM algorithm to improve language models by representing their parameters as observation probabilities of Hidden Markov Models HMM . In the experiments we demonstrate that our method outperforms standard SLM-based and tf.idf-based methods on TREC 2005 HARD Track data. 1 Introduction In 1945 soon after the computer was invented Vannevar Bush wrote a famous article--- As we may think V. Bush 1996 which formed the basis of research into Information Retrieval IR . The pioneers in IR developed two models for ranking the vector space model G. Salton and M. J. McGill 1986 and the probabilistic model S. E. Robertson and S. Jones 1976 . Since then the research of classical probabilistic models of relevance has been widely studied. For example Robertson S. E. Robertson and S. Walker 1994 S. E. Robertson 1977 modeled word occurrences into relevant or non-relevant classes and ranked documents according to the probabilities they belong to the relevant one. In 1998 Ponte and Croft 1998 proposed a language modeling framework which opens a new point of view in IR. In this approach they gave up the model of relevance instead they treated query generation as random sampling from every document model. The retrieval

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Optimizing Story Link Detection is not Equivalent to Optimizing New Event Detection"

Báo cáo khoa học: "Optimizing Question Answering Accuracy by Maximizing Log-Likelihood"

Báo cáo khoa học: "Jointly optimizing a two-step conditional random ﬁeld model for machine transliteration and its fast decoding algorithm"

Báo cáo khoa học: "Optimizing Informativeness and Readability for Sentiment Summarization"

Báo cáo khoa học: "Optimizing Language Model Information Retrieval System with Expectation Maximization Algorithm"

Báo cáo khoa học: "Optimizing Word Alignment Combination For Phrase Table Training"

Báo cáo khoa học: "Optimizing Grammars for Minimum Dependency Length"

Báo cáo khoa học: "Optimizing Typed Feature Structure Grammar Parsing through Non-Statistical Indexing"

Báo cáo khoa học: "OPTIMIZING THE COMPUTATION ALL EXICALIZATION OF LARGE GRAMMARS"

cáo khoa học: " Barriers and facilitators to the dissemination of DECISION+, a continuing medical education program for optimizing decisions about antibiotics for acute respiratory infections in primary care: A study protocol"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.