TAILIEUCHUNG - Báo cáo khoa học: "What is the Minimal Set of Fragments that Achieves Maximal Parse Accuracy?"

We aim at finding the minimal set of fragments which achieves maximal parse accuracy in Data Oriented Parsing. Experiments with the Penn Wall Street Journal treebank show that counts of almost arbitrary fragments within parse trees are important, leading to improved parse accuracy over previous models tested on this treebank (a precis ion of and a recall of ). We isolate some dependency relations which previous models neglect but which contribute to higher parse accuracy. | What is the Minimal Set of Fragments that Achieves Maximal Parse Accuracy Rens Bod School of Computing University of Leeds Leeds LS2 9JT Institute for Logic Language and Computation University of Amsterdam Spuistraat 134 1012 VB Amsterdam rens@ Abstract We aim at finding the minimal set of fragments which achieves maximal parse accuracy in Data Oriented Parsing. Experiments with the Penn Wall Street Journal treebank show that counts of almost arbitrary fragments within parse trees are important leading to improved parse accuracy over previous models tested on this treebank a precis -ion of and a recall of . We isolate some dependency relations which previous models neglect but which contribute to higher parse accuracy. 1 Introduction One of the goals in statistical natural language parsing is to find the minimal set of statistical dependencies between words and syntactic structures that achieves maximal parse accuracy. Many stochastic parsing models use linguistic intuitions to find this minimal set for example by restricting the statistical dependencies to the locality of headwords of constituents Collins 1997 1999 Eisner 1997 leaving it as an open question whether there exist important statistical dependencies that go beyond linguistically motivated dependencies. The Data Oriented Parsing DOP model on the other hand takes a rather extreme view on this issue given an annotated corpus all fragments . subtrees seen in that corpus regardless of size and lexicalization are in principle taken to form a grammar see Bod 1993 1998 Goodman 1998 Sima an 1999 . The set of subtrees that is used is thus very large and extremely redundant. Both from a theoretical and from a computational perspective we may wonder whether it is possible to impose constraints on the subtrees that are used in such a way that the accuracy of the model does not deteriorate or perhaps even improves. That is the main question addressed in this paper. We report on .

Minh Hạnh 78 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

After reading this chapter, you should be able to answer the following questions: What are generally accepted accounting principle? What kind of information is reported on each financial statement and how are the financial statements related? What are transactions? What is the meaning and usefulness of the accounting equation? What are meanings of the captions in the financial statements?...

23 64 1

Tiếng Anh lớp 1, 2 - Lesson twenty (Bài 20) WHAT...? - WHAT IS YOUR JOB? - WHAT TIME? - WHAT cOLOUR?

9 78 0

Báo cáo khoa học: "MACHINE TRANSLATION : WHAT TYPE OF POST-EDITING ON WHAT TYPE OF DOCUMENTSFOR WHAT TYPE OF USERS"

3 80 0

“What you say is what you get”..KỸ NĂNG PR & EVENT.Diễn giả Quách Tuấn Khanh.www.quachtuankhanh.net...“What you say is what you get”..KỸ NĂNG QUAN HỆ BÁO CHÍ.www.quachtuankhanh.net...SƠ ĐỒ TRUYỀN THÔNG..Người truyền..Thông điệp..Người nhận..Mã hóa..Giải m

33 82 0

Ebook What is weather

10 72 0

Firefox 3 Revealed - What’s New, What’s Hot & What’s Not

30 68 0

INDUSTRY SURVEY - What future animators say about what they are looking for in a school, and what professional animators say are the most important things to look for.

14 93 0

Ebook Habitats

10 92 0

Ebook What do i have

1 111 0

Ebook I like what i see

1 73 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462351 61

Giới thiệu :Lập trình mã nguồn mở

14 26682 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10567 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9855 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8518 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7920 1821

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7290 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Data Structures and Algorithms - Chapter 8: Heaps

41 195 5 09-01-2025

Báo cáo nghiên cứu khoa học " HÃY LÀM CHO HUẾ XANH HƠN VÀ ĐẸP HƠN "

6 187 3 09-01-2025

báo cáo hóa học:" Quality of data collection in a large HIV observational clinic database in sub-Saharan Africa: implications for clinical research and audit of care"

7 163 4 09-01-2025

Báo cáo " Thẩm quyền quản lí nhà nước đối với hoạt động quảng cáo thực trạng và hướng hoàn thiện "

7 217 7 09-01-2025

Bệnh sán lá gan trên gia súc và cách phòng trị

3 170 1 09-01-2025

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 218 1 09-01-2025

IT Audit: EMC’s Journey to the Private Cloud

13 163 1 09-01-2025

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 143 1 09-01-2025

Báo cáo lâm nghiệp: "Assessment of the effects of below-zero temperatures on photosynthesis and chlorophyll a fluorescence in leaf discs of Eucalyptus globulu"

4 152 0 09-01-2025

TRẮC NGHIỆM - CÁC BỆNH THIẾU DINH DƯỠNG THƯỜNG GẶP

32 221 2 09-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7920 1821

Ebook Chào con ba mẹ đã sẵn sàng

112 4436 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6360 1276

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3859 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3930 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4778 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4533 490