TAILIEUCHUNG - Báo cáo khoa học: " New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron"

This paper introduces new learning algorithms for natural language processing based on the perceptron algorithm. We show how the algorithms can be efﬁciently applied to exponential sized representations of parse trees, such as the “all subtrees” (DOP) representation described by (Bod 1998), or a representation tracking all sub-fragments of a tagged sentence. We give experimental results showing signiﬁcant improvements on two tasks: parsing Wall Street Journal text, and namedentity extraction from web data. . | Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics ACL Philadelphia July 2002 pp. 263-270. New Ranking Algorithms for Parsing and Tagging Kernels over Discrete Structures and the Voted Perceptron Michael Collins AT T Labs-Research Florham Park New Jersey. mcollins@ Nigel Duffy iKuni Inc. 3400 Hillview Ave. Building 5 Palo Alto CA 94304. nigeduff@ Abstract This paper introduces new learning algorithms for natural language processing based on the perceptron algorithm. We show how the algorithms can be efficiently applied to exponential sized representations of parse trees such as the all subtrees DOP representation described by Bod 1998 or a representation tracking all sub-fragments of a tagged sentence. We give experimental results showing significant improvements on two tasks parsing Wall Street Journal text and named-entity extraction from web data. 1 Introduction The perceptron algorithm is one of the oldest algorithms in machine learning going back to Rosenblatt 1958 . It is an incredibly simple algorithm to implement and yet it has been shown to be competitive with more recent learning methods such as support vector machines - see Freund Schapire 1999 for its application to image classification for example. This paper describes how the perceptron and voted perceptron algorithms can be used for parsing and tagging problems. Crucially the algorithms can be efficiently applied to exponential sized representations of parse trees such as the all subtrees DOP representation described by Bod 1998 or a representation tracking all sub-fragments of a tagged sentence. It might seem paradoxical to be able to efficiently learn and apply a model with an exponential number of The key to our algorithms is the Although see Goodman 1996 for an efficient algorithm for the DOP model which we discuss in section 7 of this paper. kernel trick Cristianini and Shawe-Taylor 2000 discuss kernel methods at length .

Ðình Cường 75 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: " New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron"

8 66 0

Project Leadership – Step by Step: Part I A Handbook on How to Master Small- and Medium-Sized Projects – SMPs

108 35 0

Project Leadership – Step by Step: Part II A Handbook on How to Master Small- and Medium-Sized Projects – SMPs

120 55 1

A New Approach for Ranking Efficient Units in Data Envelopment Analysis and Application to a Sample of Vietnamese Agricultural Bank Branches

11 71 0

Ideal Profile Method: A comparison between rating and ranking technique

7 74 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462307 61

Giới thiệu :Lập trình mã nguồn mở

14 25017 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11301 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10515 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9800 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8879 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8469 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8093 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7501 1765

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7200 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Hướng dẫn chế độ dinh dưỡng cho người bệnh viêm khớp

5 161 2 02-12-2024

Color Atlas of Ophthamology

165 135 2 02-12-2024

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 136 1 02-12-2024

5 thói quen ăn uống hủy hoại hàm răng đẹp

5 161 1 02-12-2024

Báo cáo lâm nghiệp: "Assessment of the effects of below-zero temperatures on photosynthesis and chlorophyll a fluorescence in leaf discs of Eucalyptus globulu"

4 132 0 02-12-2024

Sinh thái học nông nghiệp : Sinh thái học và sự phát triển Nông nghiệp part 8

8 132 0 02-12-2024

LINUX DEVICE DRIVERS 3rd edition phần 8

64 126 0 02-12-2024

Món ngon ngày lễ tết part 2

16 128 1 02-12-2024

Báo cáo lâm nghiệp: " Influence de l’élagage sur la duraminisation, la production de bois de tension et quelques autres propriétés du bois de peuplierI 214"

13 104 0 02-12-2024

Giáo trình điều dưỡng khoa ngoại part 7

22 150 2 02-12-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8093 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7501 1765

Ebook Chào con ba mẹ đã sẵn sàng

112 4370 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6169 1260

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8879 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3801 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3912 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4629 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11301 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4463 490