Evaluation of Importance of Sentences based on Connectivity to Title

Takehiko Yoshimi, Toshiyuki Okunishi, Takahiro Yamaji and Yoji Fukumochi
Software Business Development Center, SHARP Corporation
492 Minosho-cho, Yamatokoriyama, Nara, Japan

Abstract

This paper proposes a method of selecting important sentences from a text by evaluating the connectivity between sentences using surface information. We assume that the title of a text is the most concise statement of its most essential information, and that the more closely a sentence relates to an important sentence, the more important that sentence is. The importance of a sentence is defined as the connectivity between the sentence and the title. The connectivity between two sentences is measured based on coreference between a pronoun and a preceding noun or pronoun, and on lexical cohesion between lexical items. In an experiment with 80 English texts averaging 29.0 sentences each, the proposed method achieved a recall of 78.2% and a precision of 57.7% at a selection ratio of 25%. These recall and precision values surpass those achieved by conventional methods, which means that our method is more effective in abridging relatively short texts.

[The Japanese body text that follows did not survive extraction. Citations still legible in it: Luhn 1958; Edmundson 1969; Salton et al. 1994; Brandow et al. 1995; Watanabe 1996; Zechner 1996; Ono et al. 1994.]
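The idea stated in the abstract (a sentence is important to the extent that it connects to the title, directly or through another important sentence) can be illustrated with a minimal Python sketch. This is not the authors' formula, which is not recoverable from the text here. It assumes a hypothetical connectivity measure (Jaccard overlap of content words, standing in for lexical cohesion), ignores coreference entirely, and assumes that importance propagates as the best product of a preceding unit's importance and its connectivity to the current sentence. The function names, the stopword list, and the tie-breaking rule are all illustrative choices.

```python
import re

# Small illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"a", "an", "the", "of", "on", "in", "into", "to", "and",
             "is", "are", "was", "were", "it", "this", "that"}


def content_words(sentence):
    """Lowercased alphabetic tokens with stopwords removed."""
    return {w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOPWORDS}


def connectivity(a, b):
    """Hypothetical stand-in for lexical cohesion: Jaccard word overlap."""
    wa, wb = content_words(a), content_words(b)
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def importance_scores(title, sentences):
    """Score each sentence by its best connection to the title.

    The title has importance 1.0. Each sentence takes the maximum, over
    the title and all preceding sentences, of (importance of that unit)
    times (connectivity to it). This propagation rule is an assumption.
    """
    units = [(title, 1.0)]
    scores = []
    for sentence in sentences:
        score = max(imp * connectivity(unit, sentence) for unit, imp in units)
        scores.append(score)
        units.append((sentence, score))
    return scores


def select_sentences(title, sentences, ratio=0.25):
    """Return indices of the top-scoring sentences, in text order."""
    scores = importance_scores(title, sentences)
    k = max(1, round(len(sentences) * ratio))
    ranked = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    return sorted(ranked[:k])


def recall_precision(selected, reference):
    """Recall and precision of selected indices against a reference set."""
    hits = len(set(selected) & set(reference))
    recall = hits / len(reference) if reference else 0.0
    precision = hits / len(selected) if selected else 0.0
    return recall, precision
```

Calling `select_sentences(title, sentences, ratio=0.25)` mirrors the 25% selection ratio used in the reported experiment, and `recall_precision` compares the result with a human-chosen set of important sentences. A faithful implementation would also need the coreference component that the abstract mentions.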