Scientific report: "Language Independent Extractive Summarization"

Rada Mihalcea
Department of Computer Science and Engineering
University of North Texas
rada@

Abstract

We demonstrate TextRank, a system for unsupervised extractive summarization that relies on the application of iterative graph-based ranking algorithms to graphs encoding the cohesive structure of a text. An important characteristic of the system is that it does not rely on any language-specific knowledge resources or any manually constructed training data, and thus it is highly portable to new languages or domains.

1 Introduction

Given the overwhelming amount of information available today, on the Web and elsewhere, techniques for efficient automatic text summarization are essential to improve access to such information. Algorithms for extractive summarization are typically based on techniques for sentence extraction, and attempt to identify the set of sentences that are most important for the understanding of a given document.

Some of the most successful approaches to extractive summarization consist of supervised algorithms that attempt to learn what makes a good summary by training on collections of summaries built for a relatively large number of training documents (Hirao et al., 2002; Teufel and Moens, 1997). However, the price paid for the high performance of such supervised algorithms is their inability to easily adapt to new languages or domains, as new training data are required for each new type of data.

TextRank (Mihalcea and Tarau, 2004; Mihalcea, 2004) is specifically designed to address this problem by using an extractive summarization technique that does not require any training data or any language-specific knowledge sources. TextRank can be effectively applied to the summarization of documents in different languages without any modifications of the algorithm and without any requirements for additional data. Moreover, results from experiments performed on standard data sets have demonstrated that ...
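The idea of ranking sentences with an iterative graph-based algorithm can be illustrated with a minimal Python sketch. It is not the system's actual implementation: the function names are hypothetical, and the word-overlap similarity, the regular-expression sentence splitter, the damping factor of 0.85, and the fixed iteration count are all assumptions of this sketch that the text above does not specify. Sentences are vertices, edges are weighted by shared words, and a PageRank-style update is run over the weighted graph, with no language-specific resources involved.

```python
import math
import re


def split_sentences(text):
    """Naive splitter on ., ! or ? followed by whitespace (assumption)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def similarity(a, b):
    """Shared-word count normalized by log sentence lengths (assumption)."""
    wa = re.findall(r"\w+", a.lower())
    wb = re.findall(r"\w+", b.lower())
    if not wa or not wb:
        return 0.0
    denom = math.log(len(wa)) + math.log(len(wb))
    if denom <= 0:  # both sentences have a single word
        return 0.0
    return len(set(wa) & set(wb)) / denom


def rank_sentences(sentences, damping=0.85, iterations=50):
    """PageRank-style iteration over a weighted, undirected sentence graph."""
    n = len(sentences)
    weights = [
        [similarity(sentences[i], sentences[j]) if i != j else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            # Each neighbor j passes on a share of its score proportional
            # to the weight of the edge (j, i).
            incoming = sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            new_scores.append((1 - damping) + damping * incoming)
        scores = new_scores
    return scores


def summarize(text, k=2):
    """Return the k highest-ranked sentences in their original order."""
    sentences = split_sentences(text)
    scores = rank_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(document_text, k=3)` would return three sentences from `document_text`. Because the sketch depends only on tokenization and word overlap, the same code can be run unchanged on text in another language, provided the naive sentence splitter is adequate for it.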


