TAILIEUCHUNG - Báo cáo khoa học: "A Bottom-up Approach to Sentence Ordering for Multi-document Summarization"

Ordering information is a difficult but important task for applications generating natural-language text. We present a bottom-up approach to arranging sentences extracted for multi-document summarization. To capture the association and order of two textual segments (eg, sentences), we define four criteria, chronology, topical-closeness, precedence, and succession. These criteria are integrated into a criterion by a supervised learning approach. We repeatedly concatenate two textual segments into one segment based on the criterion until we obtain the overall segment with all sentences arranged. . | A Bottom-up Approach to Sentence Ordering for Multi-document Summarization Danushka Bollegala Naoaki Okazaki Mitsuru Ishizuka Graduate School of Information Science and Technology The University of Tokyo 7-3-1 Hongo Bunkyo-ku Tokyo 113-8656 Japan danushka okazaki @ ishizuka@ Abstract Ordering information is a difficult but important task for applications generating natural-language text. We present a bottom-up approach to arranging sentences extracted for multi-document summarization. To capture the association and order of two textual segments eg sentences we define four criteria chronology topical-closeness precedence and succession. These criteria are integrated into a criterion by a supervised learning approach. We repeatedly concatenate two textual segments into one segment based on the criterion until we obtain the overall segment with all sentences arranged. Our experimental results show a significant improvement over existing sentence ordering strategies. 1 Introduction Multi-document summarization MDS Radev and McKeown 1999 tackles the information overload problem by providing a condensed version of a set of documents. Among a number of sub-tasks involved in MDS eg sentence extraction topic detection sentence ordering information extraction sentence generation etc. most MDS systems have been based on an extraction method which identifies important textual segments eg sentences or paragraphs in source documents. It is important for such MDS systems to determine a coherent arrangement of the textual segments extracted from multi-documents in order to reconstruct the text structure for summarization. Ordering information is also essential for Research Fellow of the Japan Society for the Promotion of Science JSPS other text-generation applications such as Question Answering. A summary with improperly ordered sentences confuses the reader and degrades the qual-ity reliability of the summary itself. Barzi-lay 2002 has provided

TÀI LIỆU MỚI ĐĂNG
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.