Scientific report: "Unlexicalised Hidden Variable Models of Split Dependency Grammars∗"

Gabriele Antonio Musillo
Department of Computer Science and Department of Linguistics, University of Geneva, 1211 Geneva 4, Switzerland
musillo4@

Paola Merlo
Department of Linguistics, University of Geneva, 1211 Geneva 4, Switzerland
merlo@

Abstract

This paper investigates transforms of split dependency grammars into unlexicalised context-free grammars annotated with hidden symbols. Our best unlexicalised grammar achieves an accuracy of 88% on the Penn Treebank data set, which represents a 50% reduction in error over previously published results on unlexicalised dependency parsing.

1 Introduction

Recent research in natural language parsing has extensively investigated probabilistic models of phrase-structure parse trees. As well as being the most commonly used probabilistic models of parse trees, probabilistic context-free grammars (PCFGs) are the best understood. As shown in (Klein and Manning, 2003), the ability of PCFG models to disambiguate phrases crucially depends on the expressiveness of the symbolic backbone they use. Treebank-specific heuristics have commonly been used to alleviate the inadequate independence assumptions stipulated by naive PCFGs (Collins, 1999; Charniak, 2000). Such methods stand in sharp contrast to partially supervised techniques that have recently been proposed to induce hidden grammatical representations that are finer-grained than those that can be read off the parsed sentences in treebanks (Henderson, 2003; Matsuzaki et al., 2005; Prescher, 2005; Petrov et al., 2006).

This paper presents extensions of such grammar induction techniques to dependency grammars. Our extensions rely on ...

∗ Part of this work was done when Gabriele Musillo was visiting the MIT Computer Science and Artificial Intelligence Laboratory, funded by a grant from the Swiss NSF (PBGE2-117146). Many thanks to Michael Collins and Xavier Carreras for their insightful comments on the work presented here.
