Scientific report: "Word Alignment via Submodular Maximization over Matroids"

Hui Lin
Dept. of Electrical Engineering, University of Washington, Seattle, WA 98195, USA
hlin@

Jeff Bilmes
Dept. of Electrical Engineering, University of Washington, Seattle, WA 98195, USA
bilmes@

Abstract

We cast the word alignment problem as maximizing a submodular function under matroid constraints. Our framework is able to express complex interactions between alignment components while remaining computationally efficient, thanks to the power and generality of submodular functions. We show that submodularity naturally arises when modeling word fertility. Experiments on the English-French Hansards alignment task show that our approach achieves lower alignment error rates compared to conventional matching-based approaches.

1 Introduction

Word alignment is a key component in most statistical machine translation systems. While classical approaches for word alignment are based on generative models such as the IBM models (Brown et al., 1993) and HMM (Vogel et al., 1996), word alignment can also be viewed as a matching problem, where each word pair is associated with a score reflecting the desirability of aligning that pair, and the alignment is then the highest-scored matching under some constraints.

Several matching-based approaches have been proposed in the past. Melamed (2000) introduces the competitive linking algorithm, which greedily constructs matchings under the one-to-one mapping assumption. In Matusov et al. (2004), matchings are found using an algorithm for constructing a maximum weighted bipartite graph matching (Schrijver, 2003), where word pair scores come from alignment posteriors of generative models. Similarly, Taskar et al. (2005) cast word alignment as a maximum weighted matching problem and propose a framework for learning word pair scores as a function of arbitrary features of that pair. These approaches, however, have two potentially substantial limitations: words have fertility of at most ...
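To make the matching view concrete, the following is a minimal sketch, not the authors' method and not a faithful reimplementation of any cited system. It contrasts a greedy one-to-one linking step (in the spirit of competitive linking) with an exhaustive maximum-weight one-to-one matching on a tiny score matrix. The function names and the score values are hypothetical and chosen only for illustration; the exhaustive search is a stand-in for a proper bipartite matching algorithm and is only practical for very short sentences.

```python
from itertools import permutations


def greedy_one_to_one(scores):
    """Greedily link the highest-scoring word pairs, using each word at most once."""
    # Sort all (score, source index, target index) triples by descending score;
    # ties are broken by index so the result is deterministic.
    pairs = sorted(
        ((s, i, j) for i, row in enumerate(scores) for j, s in enumerate(row)),
        key=lambda t: (-t[0], t[1], t[2]),
    )
    used_src, used_tgt, alignment = set(), set(), []
    for _, i, j in pairs:
        if i not in used_src and j not in used_tgt:
            alignment.append((i, j))
            used_src.add(i)
            used_tgt.add(j)
    return sorted(alignment)


def best_one_to_one(scores):
    """Exhaustive maximum-weight one-to-one matching (square matrix, tiny inputs only)."""
    n = len(scores)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(scores[i][p[i]] for i in range(n)),
    )
    return [(i, best[i]) for i in range(n)]


def total(scores, alignment):
    """Sum of the scores of all linked word pairs."""
    return sum(scores[i][j] for i, j in alignment)


# Hypothetical word pair scores: rows are source words, columns are target words.
scores = [
    [0.9, 0.8, 0.1],
    [0.8, 0.1, 0.1],
    [0.1, 0.1, 0.7],
]

greedy = greedy_one_to_one(scores)
optimal = best_one_to_one(scores)
print("greedy :", greedy, round(total(scores, greedy), 2))
print("optimal:", optimal, round(total(scores, optimal), 2))
```

On this toy input the greedy pass commits to the single best pair first and ends with a lower total score than the exhaustive matching. In both cases every word is linked at most once, which is the one-to-one restriction discussed above.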
