TAILIEUCHUNG - Báo cáo khoa học: "Empirical Lower Bounds on the Complexity of Translational Equivalence ∗"

This paper describes a study of the patterns of translational equivalence exhibited by a variety of bitexts. The study found that the complexity of these patterns in every bitext was higher than suggested in the literature. These ﬁndings shed new light on why “syntactic” constraints have not helped to improve statistical translation models, including ﬁnitestate phrase-based models, tree-to-string models, and tree-to-tree models. | Empirical Lower Bounds on the Complexity of Translational Equivalence Benjamin Wellington Computer Science Dept. New York University New York NY 10003 lastname @ Sonjia Waxmonsky Computer Science Dept. University of Chicago 1 Chicago IL 60637 wax@ I. Dan Melamed Computer Science Dept. New York University New York NY 10003 lastname @ Abstract This paper describes a study of the patterns of translational equivalence exhibited by a variety of bitexts. The study found that the complexity of these patterns in every bitext was higher than suggested in the literature. These findings shed new light on why syntactic constraints have not helped to improve statistical translation models including finite-state phrase-based models tree-to-string models and tree-to-tree models. The paper also presents evidence that inversion transduction grammars cannot generate some translational equivalence relations even in relatively simple real bitexts in syntactically similar languages with rigid word order. Instructions for replicating our experiments are at http GenPar ACL06 1 Introduction Translational equivalence is a mathematical relation that holds between linguistic expressions with the same meaning. The most common explicit representations of this relation are word alignments between sentences that are translations of each other. The complexity of a given word alignment can be measured by the difficulty of decomposing it into its atomic units under certain constraints detailed in Section 2. This paper describes a study of the distribution of alignment complexity in a variety of bitexts. The study considered word alignments both in isolation and in combination with independently generated parse trees for one or both sentences in each pair. Thus the study Thanks to David Chiang Liang Huang the anonymous reviewers and members of the NYU Proteus Project for helpful feedback. This research was supported by NSF grant s 0238406 and .

Sơn Lâm 72 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Empirical Lower Bounds on the Complexity of Translational Equivalence ∗"

8 50 0

Determinants of Public Debt in Lower-Middle Income Countries

7 174 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462307 61

Giới thiệu :Lập trình mã nguồn mở

14 25017 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11301 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10515 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9800 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8879 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8469 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8093 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7501 1765

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7200 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Báo cáo nghiên cứu nông nghiệp " Biofertiliser inoculant technology for the growth of rice in Vietnam: Developing technical infrastructure for quality assurance and village production for farmers "

12 134 2 02-12-2024

Bảng màu theo chữ cái – V

11 156 2 02-12-2024

Hướng dẫn chế độ dinh dưỡng cho người bệnh viêm khớp

5 161 2 02-12-2024

Báo cáo " Thẩm quyền quản lí nhà nước đối với hoạt động quảng cáo thực trạng và hướng hoàn thiện "

7 198 7 02-12-2024

Bệnh sán lá gan trên gia súc và cách phòng trị

3 158 1 02-12-2024

báo cáo khoa học: "Malignant peripheral nerve sheath tumor arising from the greater omentum: Case report"

4 136 1 02-12-2024

IT Audit: EMC’s Journey to the Private Cloud

13 152 1 02-12-2024

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 136 1 02-12-2024

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 136 1 02-12-2024

Business English Lesson – Advanced Level's archiveFinance (1)

8 111 0 02-12-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8093 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7501 1765

Ebook Chào con ba mẹ đã sẵn sàng

112 4370 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6169 1260

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8879 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3801 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3912 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4629 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11301 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4463 490