Scientific report: "Better Alignments = Better Translations?"


Kuzman Ganchev, Computer & Information Science, University of Pennsylvania, kuzman@cis.upenn.edu
Joao V. Graca, L2f INESC-ID, Lisboa, Portugal, javg@l2f.inesc-id.pt
Ben Taskar, Computer & Information Science, University of Pennsylvania, taskar@cis.upenn.edu

Abstract

Automatic word alignment is a key step in training statistical machine translation systems. Despite much recent work on word alignment methods, alignment accuracy increases often produce little or no improvements in machine translation quality. In this work we analyze a recently proposed agreement-constrained EM algorithm for unsupervised alignment models. We attempt to tease apart the effects that this simple but effective modification has on alignment precision and recall trade-offs, and how rare and common words are affected across several language pairs. We propose and extensively evaluate a simple method for using alignment models to produce alignments better suited for phrase-based MT systems, and show significant gains (as measured by BLEU score) in end-to-end translation systems for six language pairs used in recent MT competitions.

1 Introduction

The typical pipeline for a machine translation (MT) system starts with a parallel sentence-aligned corpus and proceeds to align the words in every sentence pair.
The word alignment problem has received much recent attention, but improvements in standard measures of word alignment performance often do not result in better translations. Fraser and Marcu (2007) note that none of the tens of papers published over the last five years has shown that significant decreases in alignment error rate (AER) result in significant increases in translation performance. In this work we show that by changing the way the word alignment models are trained and used, we can get not only improvements in alignment performance, but also in the performance of the MT system that uses those alignments. We present extensive experimental results evaluating a new training
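
The excerpt refers to alignment precision, recall, and alignment error rate (AER) without defining them. As a rough illustration only, the sketch below assumes the commonly used definition from Och and Ney (2003), in which gold alignments are annotated with "sure" links S and "possible" links P (with S a subset of P), and a predicted alignment A is scored as precision = |A ∩ P| / |A|, recall = |A ∩ S| / |S|, and AER = 1 − (|A ∩ S| + |A ∩ P|) / (|A| + |S|). The function name `alignment_scores` and the toy links are hypothetical and are not taken from the paper.

```python
from typing import Set, Tuple

# A link pairs a source-word index with a target-word index.
Link = Tuple[int, int]


def alignment_scores(
    predicted: Set[Link], sure: Set[Link], possible: Set[Link]
) -> Tuple[float, float, float]:
    """Return (precision, recall, AER) for one predicted alignment.

    Assumes the sure/possible annotation scheme; this is an illustrative
    sketch, not the evaluation code used by the authors.
    """
    # By convention every sure link also counts as a possible link.
    possible = possible | sure
    if not predicted or not sure:
        raise ValueError("predicted and sure link sets must be non-empty")

    hits_possible = len(predicted & possible)
    hits_sure = len(predicted & sure)

    precision = hits_possible / len(predicted)
    recall = hits_sure / len(sure)
    aer = 1.0 - (hits_sure + hits_possible) / (len(predicted) + len(sure))
    return precision, recall, aer


# Toy example with made-up links (hypothetical data).
sure_links = {(0, 0), (1, 1)}
possible_links = {(2, 1)}
predicted_links = {(0, 0), (2, 1), (2, 2)}

precision, recall, aer = alignment_scores(
    predicted_links, sure_links, possible_links
)
print(f"precision={precision:.3f} recall={recall:.3f} AER={aer:.3f}")
```

In this toy case two of the three predicted links are at least possible and one of the two sure links is recovered, so a lower AER reflects a mix of both effects. That mixing is one reason the precision and recall trade-off is worth examining separately, as the abstract sets out to do.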

