Deep Read: A Reading Comprehension System

Lynette Hirschman, Marc Light, Eric Breck, John D. Burger
The MITRE Corporation
202 Burlington Road
Bedford, MA, USA 01730
{lynette, light, ebreck, john}@mitre.org

Abstract

This paper describes initial work on Deep Read, an automated reading comprehension system that accepts arbitrary text input (a story) and answers questions about it. We have acquired a corpus of 60 development and 60 test stories of 3rd to 6th grade material; each story is followed by short-answer questions (an answer key was also provided). We used these to construct and evaluate a baseline system that uses pattern matching (bag-of-words) techniques augmented with additional automated linguistic processing (stemming, name identification, semantic class identification, and pronoun resolution). This simple system retrieves the sentence containing the answer 30-40% of the time.

1 Introduction

This paper describes our initial work exploring reading comprehension tests as a research problem and an evaluation method for language understanding systems. Such tests can take the form of standardized multiple-choice diagnostic reading skill tests, as well as fill-in-the-blank and short-answer tests.
Typically, such tests ask the student to read a story or article and to demonstrate her/his understanding of that article by answering questions about it. For an example, see Figure 1.

Reading comprehension tests are interesting because they constitute "found" test material: these tests are created in order to evaluate children's reading skills, and therefore test materials, scoring algorithms, and human performance measures already exist. Furthermore, human performance measures provide a more intuitive way of assessing the capabilities of a given system than current measures of precision, recall, F-measure, operating curves, etc. In addition, reading comprehension tests are written to test a range of skill levels. With proper choice of test material, it should be possible to challenge systems to .