Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
We introduce a probabilistic noisychannel model for question answering and we show how it can be exploited in the context of an end-to-end QA system. Our noisy-channel system outperforms a stateof-the-art rule-based QA system that uses similar resources. We also show that the model we propose is flexible enough to accommodate within one mathematical framework many QA-specific resources and techniques, which range from the exploitation of WordNet, structured, and semi-structured databases to reasoning, and paraphrasing. impossible) to understand what contributes to the performance of a system and what doesn’t. . | A Noisy-Channel Approach to Question Answering Abdessamad Echihabi and Daniel Marcu Information Sciences Institute Department of Computer Science University of Southern California 4676 Admiralty Way Suite 1001 Marina Del Rey CA 90292 echihabi marcu @isi.edu Abstract We introduce a probabilistic noisy-channel model for question answering and we show how it can be exploited in the context of an end-to-end QA system. Our noisy-channel system outperforms a state-of-the-art rule-based QA system that uses similar resources. We also show that the model we propose is flexible enough to accommodate within one mathematical framework many QA-specific resources and techniques which range from the exploitation of WordNet structured and semi-structured databases to reasoning and paraphrasing. 1 Introduction Current state-of-the-art Question Answering QA systems are extremely complex. They contain tens of modules that do everything from information retrieval sentence parsing Ittycheriah and Roukos 2002 Hovy et al. 2001 Moldovan et al 2002 question-type pinpointing Ittycheriah and Roukos 2002 Hovy et al. 2001 Moldovan et al 2002 semantic analysis Xu et al. Hovy et al. 2001 Moldovan et al 2002 and reasoning Moldovan et al 2002 . They access external resources such as the WordNet Hovy et al. 2001 Pasca and Harabagiu 2001 Prager et al. 2001 the web Brill et al. 2001 structured and semistructured databases Katz et al. 2001 Lin 2002 Clarke 2001 . They contain feedback loops ranking and re-ranking modules. Given their complexity it is often difficult and sometimes impossible to understand what contributes to the performance of a system and what doesn t. In this paper we propose a new approach to QA in which the contribution of various resources and components can be easily assessed. The fundamental insight of our approach which departs significantly from the current architectures is that at its core a QA system is a pipeline of only two modules An IR engine that retrieves a set of M .