TAILIEUCHUNG - Báo cáo khoa học: "Boosting-based parse reranking with subtree features"

This paper introduces a new application of boosting for parse reranking. Several parsers have been proposed that utilize the all-subtrees representation (., tree kernel and data oriented parsing). This paper argues that such an all-subtrees representation is extremely redundant and a comparable accuracy can be achieved using just a small set of subtrees. We show how the boosting algorithm can be applied to the all-subtrees representation and how it selects a small and relevant feature set efﬁciently. Two experiments on parse reranking show that our method achieves comparable or even better performance than kernel methods and also improves the. | Boosting-based parse reranking with subtree features Taku Kudo Jun Suzuki Hideki Isozaki NTT Communication Science Laboratories. 2-4 Hikaridai Seika-cho Soraku Kyoto Japan taku jun isozaki @ Abstract This paper introduces a new application of boosting for parse reranking. Several parsers have been proposed that utilize the all-subtrees representation . tree kernel and data oriented parsing . This paper argues that such an all-subtrees representation is extremely redundant and a comparable accuracy can be achieved using just a small set of subtrees. We show how the boosting algorithm can be applied to the all-subtrees representation and how it selects a small and relevant feature set efficiently. Two experiments on parse reranking show that our method achieves comparable or even better performance than kernel methods and also improves the testing efficiency. 1 Introduction Recent work on statistical natural language parsing and tagging has explored discriminative techniques. One of the novel discriminative approaches is reranking where discriminative machine learning algorithms are used to rerank the n-best outputs of generative or conditional parsers. The discriminative reranking methods allow us to incorporate various kinds of features to distinguish the correct parse tree from all other candidates. With such feature design flexibility it is nontrivial to employ an appropriate feature set that has a good discriminative ability for parse reranking. In early studies feature sets were given heuristically by simply preparing task-dependent feature templates Collins 2000 Collins 2002 . These ad-hoc solutions might provide us with reasonable levels of per Currently Google Japan Inc. taku@ formance. However they are highly task dependent and require careful design to create the optimal feature set for each task. Kernel methods offer an elegant solution to these problems. They can work on a potentially huge or even infinite number of .

Việt Thông 46 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461887 55

Giới thiệu :Lập trình mã nguồn mở

14 22723 61

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10906 530

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10083 447

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9540 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8302 1127

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8248 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7867 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6713 253

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5795 1391

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Giáo án mầm non chương trình đổi mới: Đề tài: Ôn xác định vị trí trên – dưới, trước- sau của đối tượng khác.

8 355 3 01-05-2024

Đánh giá hao mòn và độ tin cậy của chi tiết và kết cấu trên đầu máy diezel part 3

12 315 0 01-05-2024

Báo cáo nghiên cứu khoa học " KẾT QUẢ NGHIÊN CỨU BƯỚC ĐẦU VỀ THIÊN ĐỊCH CHÂN KHỚP TRÊN CÂY THANH TRÀ Ở THỪA THIÊN HUẾ "

7 176 0 01-05-2024

HƯỚNG DẪN SỬ DỤNG PHẦN MỀM CAITA part 9

18 130 0 01-05-2024

GIÁO TRÌNH MÁY ĐIỆN KHÍ CỤ ĐIỆN - PHẦN I MÁY ĐIỆN - CHƯƠNG 1

46 131 2 01-05-2024

XỬ TRÍ CHẤN THƯƠNG SỌ NÃO KÍN

1 116 1 01-05-2024

Hệ thống làm lạnh và điều hòa không khí

21 127 0 01-05-2024

Fecal Incontinence Diagnosis and Treatment - part 8

35 103 0 01-05-2024

Gastroenterology an illustrated colour text - part 10

10 89 0 01-05-2024

Quy Trình Canh Tác Cây Bông Vải

8 110 0 01-05-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7867 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5795 1391

Ebook Chào con ba mẹ đã sẵn sàng

112 3772 1233

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5334 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8302 1127

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3518 644

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10906 530

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3695 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4071 516

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4136 480