TAILIEUCHUNG - Báo cáo khoa học: "An Integrated Multi-document Summarization Approach based on Word Hierarchical Representation"

This paper introduces a novel hierarchical summarization approach for automatic multidocument summarization. By creating a hierarchical representation of the words in the input document set, the proposed approach is able to incorporate various objectives of multidocument summarization through an integrated framework. The evaluation is conducted on the DUC 2007 data set. | An Integrated Multi-document Summarization Approach based on Word Hierarchical Representation You Ouyang Wenji Li Qin Lu Department of Computing The Hong Kong Polytechnic University csyouyang cswjli csluqin @ Abstract This paper introduces a novel hierarchical summarization approach for automatic multidocument summarization. By creating a hierarchical representation of the words in the input document set the proposed approach is able to incorporate various objectives of multidocument summarization through an integrated framework. The evaluation is conducted on the DUC 2007 data set. 1 Introduction and Background Multi-document summarization requires creating a short summary from a set of documents which concentrate on the same topic. Sometimes an additional query is also given to specify the information need of the summary. Generally an effective summary should be relevant concise and fluent. It means that the summary should cover the most important concepts in the original document set contain less redundant information and should be well-organized. Currently most successful multi-document summarization systems follow the extractive summarization framework. These systems first rank all the sentences in the original document set and then select the most salient sentences to compose summaries for a good coverage of the concepts. For the purpose of creating more concise and fluent summaries some intensive post-processing approaches are also appended on the extracted sentences. For example redundancy removal Carbonell and Goldstein 1998 and sentence compression Knight and Marcu 2000 approaches are used to make the summary more concise. Sentence re-ordering approaches Barzilay et al. 2002 are used to make the summary more fluent. In most systems these approaches are treated as independent steps. A sequential process is usually adopted in their implementation applying the various approaches one after another. In this paper we suggest a new summarization

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.