Scientific report: "An Empirical Study of Information Synthesis Tasks"

An Empirical Study of Information Synthesis Tasks

Enrique Amigó, Julio Gonzalo, Víctor Peinado, Anselmo Peñas, Felisa Verdejo
Departamento de Lenguajes y Sistemas Informáticos
Universidad Nacional de Educación a Distancia
c/ Juan del Rosal, 16 - 28040 Madrid - Spain
{enrique, julio, victor, anselmo, felisa}@

Abstract

This paper describes an empirical study of the “Information Synthesis” task, defined as the process of (given a complex information need) extracting, organizing and inter-relating the pieces of information contained in a set of relevant documents, in order to obtain a comprehensive, non-redundant report that satisfies the information need. Two main results are presented: a) the creation of an Information Synthesis testbed with 72 reports manually generated by nine subjects for eight complex topics with 100 relevant documents each; and b) an empirical comparison of similarity metrics between reports, under the hypothesis that the best metric is the one that best distinguishes between manual and automatically generated reports. A metric based on key concepts overlap gives better results than metrics based on n-gram overlap (such as ROUGE) or sentence overlap.

1 Introduction

A classical Information Retrieval (IR) system helps the user find relevant documents in a given text collection. On most occasions, however, this is only the first step towards fulfilling an information need. The next steps consist of extracting, organizing and relating the relevant pieces of information, in order to obtain a comprehensive, non-redundant report that satisfies the information need. In this paper, we will refer to this process as Information Synthesis. It is normally understood as an intellectually challenging human task, and perhaps the Google Answer Service is the best general-purpose illustration of how it works. In this service, users send complex queries which cannot be answered simply by inspecting the first two or three documents returned by a search engine. These are a couple of .
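
The abstract contrasts two families of report-similarity metrics: n-gram overlap (as in ROUGE) and key concepts overlap. The sketch below shows, under simplifying assumptions, what each family measures. It is not the paper's implementation: the excerpt does not define its key-concept metric, so `concept_overlap` is a hypothetical stand-in based on plain set overlap, and `ngram_recall` is a simplified ROUGE-N-style recall that uses whitespace tokenization and a single reference report. All function names are illustrative.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_recall(candidate, reference, n=1):
    """ROUGE-N-style recall: share of reference n-grams also found in the candidate.

    Assumption: lowercased whitespace tokenization, one reference report.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Clip each match by the candidate count so repeated n-grams are not over-credited.
    matched = sum(min(count, cand[gram]) for gram, count in ref.items())
    return matched / total


def concept_overlap(candidate_concepts, reference_concepts):
    """Hypothetical key-concept metric: share of reference concepts present in the candidate."""
    ref = set(reference_concepts)
    if not ref:
        return 0.0
    return len(ref & set(candidate_concepts)) / len(ref)


if __name__ == "__main__":
    manual = "the cat sat on the mat"
    automatic = "the cat sat"
    print(ngram_recall(automatic, manual, n=1))
    print(ngram_recall(automatic, manual, n=2))
    print(concept_overlap({"cat", "mat"}, {"cat", "mat", "dog", "rug"}))
```

The two functions differ in what they compare: the n-gram score depends on surface wording, while the concept score depends only on which concepts a report mentions, however they are phrased.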
