Scientific report: "Enriching the Output of a Parser Using Memory-Based Learning"

Valentin Jijkoun and Maarten de Rijke
Informatics Institute, University of Amsterdam
jijkoun mdr @

Abstract

We describe a method for enriching the output of a parser with information available in a corpus. The method is based on graph rewriting using memory-based learning, applied to dependency structures. This general framework allows us to accurately recover both grammatical and semantic information as well as non-local dependencies. It also facilitates dependency-based evaluation of phrase structure parsers. Our method is largely independent of the choice of parser and corpus, and shows state-of-the-art performance.

1 Introduction

We describe a method to automatically enrich the output of parsers with information that is present in existing treebanks but usually not produced by the parsers themselves. Our motivation is two-fold. First, and most important, for applications requiring information extraction or semantic interpretation of text, it is desirable to have parsers produce grammatically and semantically rich output. Second, to facilitate dependency-based comparison and evaluation of different parsers, their outputs may need to be transformed into specific rich dependency formalisms. The method allows us to automatically transform the output of a parser into structures as they are annotated in a dependency treebank.
For a phrase structure parser, we first convert the produced phrase structures into dependency graphs in a straightforward way, and then apply a sequence of graph transformations: changing dependency labels, adding new nodes, and adding new dependencies. A memory-based learner trained on a dependency corpus is used to detect which modifications should be performed. For a dependency corpus derived from the Penn Treebank and the parsers we considered, these transformations correspond to adding Penn functional tags (such as -SBJ, -TMP, -LOC), empty nodes (such as NP PRO), and non-local dependencies.
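
The first of these transformations, changing dependency labels, can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `MemoryBasedClassifier` and `DepGraph`, the feature set in `edge_features`, and the three training instances are all assumptions invented for illustration, and the learner is a plain k-nearest-neighbour classifier with an overlap distance standing in for whatever memory-based learner is actually used.

```python
from collections import Counter


class MemoryBasedClassifier:
    """Toy memory-based learner: stores all instances, predicts by k nearest neighbours."""

    def __init__(self, k=1):
        self.k = k
        self.memory = []  # list of (feature tuple, outcome)

    def train(self, features, outcome):
        self.memory.append((tuple(features), outcome))

    def predict(self, features):
        # Overlap distance: number of feature positions that differ.
        def distance(instance):
            return sum(a != b for a, b in zip(instance[0], features))

        nearest = sorted(self.memory, key=distance)[: self.k]
        return Counter(outcome for _, outcome in nearest).most_common(1)[0][0]


class DepGraph:
    """Dependency graph: node id -> word, and (head, dependent) -> label."""

    def __init__(self):
        self.words = {}
        self.edges = {}

    def add_node(self, word):
        node_id = len(self.words)  # ids follow the order in which nodes are added
        self.words[node_id] = word
        return node_id

    def add_edge(self, head, dep, label):
        self.edges[(head, dep)] = label


def edge_features(graph, head, dep):
    """Hypothetical features: dependent word, its side of the head, current label."""
    side = "before" if dep < head else "after"
    return (graph.words[dep], side, graph.edges[(head, dep)])


def relabel_edges(graph, classifier):
    """Label-changing step: the learner picks a (possibly richer) label per edge."""
    for head, dep in list(graph.edges):
        graph.edges[(head, dep)] = classifier.predict(edge_features(graph, head, dep))


# Invented training instances standing in for a dependency corpus.
labeler = MemoryBasedClassifier(k=1)
labeler.train(("dog", "before", "NP"), "NP-SBJ")
labeler.train(("today", "after", "NP"), "NP-TMP")
labeler.train(("here", "after", "ADVP"), "ADVP-LOC")

# Assumed parser output for "she left yesterday", with bare phrase labels.
graph = DepGraph()
she, left, yesterday = (graph.add_node(w) for w in ("she", "left", "yesterday"))
graph.add_edge(left, she, "NP")
graph.add_edge(left, yesterday, "NP")

relabel_edges(graph, labeler)
print(graph.edges)
```

The same decide-then-rewrite pattern would extend to the other two transformations: a classifier decides whether an empty node or a new dependency is needed, and `add_node` or `add_edge` performs the rewrite.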
