TAILIEUCHUNG - Báo cáo khoa học: "Phrase Chunking using Entropy Guided Transformation Learning"

Entropy Guided Transformation Learning (ETL) is a new machine learning strategy that combines the advantages of decision trees (DT) and Transformation Based Learning (TBL). In this work, we apply the ETL framework to four phrase chunking tasks: Portuguese noun phrase chunking, English base noun phrase chunking, English text chunking and Hindi text chunking. In all four tasks, ETL shows better results than Decision Trees and also than TBL with hand-crafted templates. | Phrase Chunking using Entropy Guided Transformation Learning Ruy L. Milidiu Cicero Nogueira dos Santos Julio C. Duarte Departamento de Informatica Departamento de Informatica Centro Tecnologico do Exercito PUC-Rio PUC-Rio Rio de Janeiro Brazil Rio de Janeiro Brazil nogueira@ jduarte@ milidiu@ Abstract Entropy Guided Transformation Learning ETL is a new machine learning strategy that combines the advantages of decision trees DT and Transformation Based Learning TBL . In this work we apply the ETL framework to four phrase chunking tasks Portuguese noun phrase chunking English base noun phrase chunking English text chunking and Hindi text chunking. In all four tasks ETL shows better results than Decision Trees and also than TBL with hand-crafted templates. ETL provides a new training strategy that accelerates transformation learning. For the English text chunking task this corresponds to a factor of five speedup. For Portuguese noun phrase chunking ETL shows the best reported results for the task. For the other three linguistic tasks ETL shows state-of-the-art competitive results and maintains the advantages of using a rule based system. 1 Introduction Phrase Chunking is a Natural Language Processing NLP task that consists in dividing a text into syntactically correlated parts of words. Theses phrases are non-overlapping . a word can only be a member of one chunk Sang and Buchholz 2000 . It provides a key feature that helps on more elaborated NLP tasks such as parsing and information extraction. Since the last decade many high-performance chunking systems were proposed such as SVM-based Kudo and Matsumoto 2001 Wu et al. 2006 Winnow Zhang et al. 2002 voted-perceptrons Carreras and Marquez 2003 Transformation-Based Learning TBL Ramshaw and Marcus 1999 Megyesi 2002 and Hidden Markov Model HMM Molina and Pla 2002 Memory-based Sang 2002 . State-of-the-art systems for English base noun phrase chunking and text chunking are based in .

Uyên My 63 9 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Automatic Evaluation Method for Machine Translation using Noun-Phrase Chunking"

10 59 0

Báo cáo khoa học: "Phrase Chunking using Entropy Guided Transformation Learning"

9 50 0

Báo cáo khoa học: "Noun Phrase Chunking in Hebrew Influence of Lexical and Morphological Features"

8 46 0

Báo cáo khoa học: "A Uniﬁed Single Scan Algorithm for Japanese Base Phrase Chunking and Dependency Parsing"

4 70 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462291 61

Giới thiệu :Lập trình mã nguồn mở

14 24914 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11286 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10511 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9790 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8467 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8090 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7471 1763

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7188 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Giáo án mầm non chương trình đổi mới: Gia đình vui nhộn

4 374 3 26-11-2024

Đóng mới oto 8 chỗ ngồi part 9

10 171 3 26-11-2024

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 150 3 26-11-2024

Giáo trình phân tích phương trình vi phân viết dưới dạng thuật toán đặc tính của hệ thống p1

5 149 1 26-11-2024

CHƯƠNG 2: RỦI RO THÂM HỤT TÀI KHÓA

28 152 1 26-11-2024

Báo cáo " Thẩm quyền quản lí nhà nước đối với hoạt động quảng cáo thực trạng và hướng hoàn thiện "

7 196 7 26-11-2024

ETHICAL CODE HANDBOOK: Demonstrate your commitment to high standards

7 140 1 26-11-2024

Word Games with English 1

65 130 1 26-11-2024

Báo cáo nghiên cứu khoa học " Đại hội XVI thông qua điều lệ Đảng cộng sản Trung Quốc những sửa đổi bổ sung mới "

4 155 1 26-11-2024

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 133 1 26-11-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8090 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7471 1763

Ebook Chào con ba mẹ đã sẵn sàng

112 4364 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6155 1258

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3789 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3909 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4617 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11286 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4454 490