Scientific report: "A Localized Prediction Model for Statistical Machine Translation"

Christoph Tillmann and Tong Zhang
IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, USA
ctill, tzhang @

Abstract

In this paper, we present a novel training method for a localized phrase-based prediction model for statistical machine translation (SMT). The model predicts blocks with orientation to handle local phrase re-ordering. We use a maximum likelihood criterion to train a log-linear block bigram model that uses real-valued features (e.g. a language model score) as well as binary features based on the block identities themselves (e.g. block bigram features). Our training algorithm can easily handle millions of features. The best system obtains an improvement over the baseline on a standard Arabic-English translation task.

1 Introduction

In this paper, we present a block-based model for statistical machine translation. A block is a pair of phrases that are translations of each other. For example, Fig. 1 shows an Arabic-English translation example that uses 4 blocks. During decoding, we view translation as a block segmentation process, where the input sentence is segmented from left to right and the target sentence is generated from bottom to top, one block at a time. A monotone block sequence is generated, except that a pair of neighboring blocks may be swapped. We use an orientation model similar to the lexicalized block re-ordering model in (Tillmann, 2004; Och et al., 2004) to generate a block $b$ with orientation $o$ relative to its predecessor block $b'$. During decoding, we compute the probability $P(b_1^n, o_1^n)$ of a block sequence $b_1^n$ with orientation $o_1^n$ as a product of block bigram probabilities:

$$P(b_1^n, o_1^n) \approx \prod_{i=1}^{n} p(b_i, o_i \mid b_{i-1}, o_{i-1})$$

Figure 1: An Arabic-English block translation example, where the Arabic words are romanized. The following orientation sequence is generated: $o_1 = N$, $o_2 = L$, $o_3 = N$, $o_4 = \dots$
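The product of block bigram probabilities described above can be illustrated with a short sketch. It is not the authors' implementation. It assumes that each local probability is a log-linear (softmax) score normalized over a small candidate set of successor events. The feature names, the `candidates_for` callback, the `lm` scoring function, and the start event are all hypothetical and exist only for this example.

```python
import math
from typing import Callable, Dict, List, Tuple

Block = Tuple[str, str]    # (source phrase, target phrase)
Event = Tuple[Block, str]  # (block, orientation label such as "N" or "L")


def features(prev: Event, cur: Event, lm_score: float) -> Dict[str, float]:
    """Hypothetical feature map: one real-valued and two binary features."""
    (prev_block, _), (block, orient) = prev, cur
    return {
        "lm": lm_score,                          # real-valued feature
        f"bigram:{prev_block}->{block}": 1.0,    # binary block bigram feature
        f"orient:{orient}|{block}": 1.0,         # binary orientation feature
    }


def local_prob(prev: Event, cur: Event, candidates: List[Event],
               weights: Dict[str, float],
               lm: Callable[[Event], float]) -> float:
    """p(b, o | b', o') as a softmax over the candidate successors of prev.

    Assumes cur is one of the candidates.
    """
    def score(ev: Event) -> float:
        feats = features(prev, ev, lm(ev))
        return sum(weights.get(name, 0.0) * value
                   for name, value in feats.items())

    scores = [score(ev) for ev in candidates]
    top = max(scores)  # subtract the maximum for numerical stability
    norm = sum(math.exp(s - top) for s in scores)
    return math.exp(score(cur) - top) / norm


def sequence_log_prob(events: List[Event],
                      candidates_for: Callable[[Event], List[Event]],
                      weights: Dict[str, float],
                      lm: Callable[[Event], float],
                      start: Event) -> float:
    """Sum of log p(b_i, o_i | b_{i-1}, o_{i-1}) over the block sequence."""
    total = 0.0
    prev = start
    for cur in events:
        total += math.log(
            local_prob(prev, cur, candidates_for(prev), weights, lm))
        prev = cur
    return total
```

Working in log space turns the product into a sum, which avoids underflow on long block sequences. With all weights at zero, every candidate successor is equally likely, which is a convenient sanity check for the normalization.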
