Scientific report: "Evaluation challenges in large-scale document summarization"

Dragomir R. Radev (U. of Michigan, radev@), Wai Lam (Chinese U. of Hong Kong, wlam@), Arda Celebi (USC/ISI, ardax@), Simone Teufel (U. of Cambridge), John Blitzer (U. of Pennsylvania, blitzer@), Danyu Liu (U. of Alabama, liudy@), Horacio Saggion (U. of Sheffield), Hong Qi (U. of Michigan, hqi@), Elliott Drabek (Johns Hopkins U., edrabek@)

Abstract

We present a large-scale meta-evaluation of eight evaluation measures for both single-document and multi-document summarizers. To this end we built a corpus consisting of (a) 100 million automatic summaries using six summarizers and baselines at ten summary lengths in both English and Chinese, (b) more than 10,000 manual abstracts and extracts, and (c) 200 million automatic document and summary retrievals using 20 queries. We present both qualitative and quantitative results showing the strengths and drawbacks of all evaluation methods and how they rank the different summarizers.

1 Introduction

Automatic document summarization is a field that has seen increasing attention from the NLP community in recent years. In part this is because summarization incorporates many important aspects of both natural language understanding and natural language generation. In part it is because effective automatic summarization would be useful in a variety of areas. Unfortunately, evaluating automatic summarization in a standard and inexpensive way is a difficult task (Mani et al., 2001).
Traditional large-scale evaluations are either too simplistic (using measures like precision, recall, and percent agreement, which (1) don't take chance agreement into account and (2) don't account for the fact that human judges don't agree which sentences should be in a summary) or too expensive (an approach using manual judgements can scale up to a few hundred summaries but not to tens or hundreds of thousands). In this …
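To make the criticism concrete, the following sketch (not the paper's code; function names and the example extracts are illustrative) computes precision and recall between an automatic extract and a reference extract, treating summaries as sets of selected sentence indices, and contrasts raw percent agreement with Cohen's kappa, one standard way to correct two judges' agreement for chance:

```python
def precision_recall(system, reference):
    """system, reference: sets of sentence indices selected for the extract."""
    overlap = len(system & reference)
    precision = overlap / len(system) if system else 0.0
    recall = overlap / len(reference) if reference else 0.0
    return precision, recall

def cohens_kappa(judge_a, judge_b, n_sentences):
    """Chance-corrected agreement between two judges' sentence selections."""
    # Observed agreement: fraction of sentences both judges label the same way.
    agree = sum((i in judge_a) == (i in judge_b) for i in range(n_sentences))
    p_o = agree / n_sentences
    # Expected chance agreement from each judge's selection rate.
    pa = len(judge_a) / n_sentences
    pb = len(judge_b) / n_sentences
    p_e = pa * pb + (1 - pa) * (1 - pb)
    return (p_o - p_e) / (1 - p_e) if p_e < 1 else 1.0

# Hypothetical 10-sentence document: a system extract vs. one reference,
# and two human judges who pick mostly different sentences.
p, r = precision_recall({0, 2, 5}, {0, 3, 5})   # → (0.667, 0.667)
kappa = cohens_kappa({0, 2, 5}, {0, 3, 4}, 10)   # far below percent agreement
```

Note how two judges who each select 3 of 10 sentences can show 60% raw agreement yet a kappa near zero, which is exactly the chance-agreement problem the paragraph above points out.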
