TAILIEUCHUNG - Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training"

Word alignment methods can gain valuable guidance by ensuring that their alignments maintain cohesion with respect to the phrases speciﬁed by a monolingual dependency tree. However, this hard constraint can also rule out correct alignments, and its utility decreases as alignment models become more complex. We use a publicly available structured output SVM to create a max-margin syntactic aligner with a soft cohesion constraint. | Soft Syntactic Constraints for Word Alignment through Discriminative Training Colin Cherry Department of Computing Science University of Alberta Edmonton AB Canada T6G2E8 colinc@ Dekang Lin Google Inc. 1600 Amphitheatre Parkway Mountain View CA USA 94043 lindek@ Abstract Word alignment methods can gain valuable guidance by ensuring that their alignments maintain cohesion with respect to the phrases specified by a monolingual dependency tree. However this hard constraint can also rule out correct alignments and its utility decreases as alignment models become more complex. We use a publicly available structured output SVM to create a max-margin syntactic aligner with a soft cohesion constraint. The resulting aligner is the first to our knowledge to use a discriminative learning method to train an ITG bitext parser. 1 Introduction Given a parallel sentence pair or bitext bilingual word alignment finds word-to-word connections across languages. Originally introduced as a byproduct of training statistical translation models in Brown et al. 1993 word alignment has become the first step in training most statistical translation systems and alignments are useful to a host of other tasks. The dominant IBM alignment models Och and Ney 2003 use minimal linguistic intuitions sentences are treated as flat strings. These carefully designed generative models are difficult to extend and have resisted the incorporation of intuitively useful features such as morphology. There have been many attempts to incorporate syntax into alignment we will not present a complete list here. Some methods parse two flat strings at once using a bitext grammar Wu 1997 . Others parse one of the two strings before alignment begins and align the resulting tree to the remaining string Yamada and Knight 2001 . The statistical models associated with syntactic aligners tend to be very different from their IBM counterparts. They model operations that are meaningful at a syntax level .

Hữu Nghĩa 65 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Soft Syntactic Constraints for Hierarchical Phrased-Based Translation"

9 45 0

Báo cáo khoa học: "Soft Syntactic Constraints for Word Alignment through Discriminative Training"

8 52 0

Báo cáo khoa học: "Discriminative Sentence Compression with Soft Syntactic Evidence"

8 51 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462291 61

Giới thiệu :Lập trình mã nguồn mở

14 24918 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11286 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10511 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9790 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8467 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8090 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7471 1763

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7188 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Báo cáo nghiên cứu khoa học " KẾT QUẢ NGHIÊN CỨU BƯỚC ĐẦU VỀ THIÊN ĐỊCH CHÂN KHỚP TRÊN CÂY THANH TRÀ Ở THỪA THIÊN HUẾ "

7 261 4 26-11-2024

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 211 4 26-11-2024

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 213 3 26-11-2024

Data Structures and Algorithms - Chapter 8: Heaps

41 172 5 26-11-2024

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 150 3 26-11-2024

Báo cáo nghiên cứu nông nghiệp " Field control of pest fruit flies in Vietnam "

14 181 4 26-11-2024

Báo cáo y học: "The Factors Influencing Depression Endpoints Research (FINDER) study: final results of Italian patients with depressio"

9 139 1 26-11-2024

ETHICAL CODE HANDBOOK: Demonstrate your commitment to high standards

7 140 1 26-11-2024

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 206 1 26-11-2024

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 133 1 26-11-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8090 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7471 1763

Ebook Chào con ba mẹ đã sẵn sàng

112 4364 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6156 1258

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3790 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3909 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4618 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11286 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4454 490