TAILIEUCHUNG - Báo cáo khoa học: "Asynchronous Binarization for Synchronous Grammars"

Binarization of n-ary rules is critical for the efficiency of syntactic machine translation decoding. Because the target side of a rule will generally reorder the source side, it is complex (and sometimes impossible) to find synchronous rule binarizations. However, we show that synchronous binarizations are not necessary in a two-stage decoder. Instead, the grammar can be binarized one way for the parsing stage, then rebinarized in a different way for the reranking stage. Each individual binarization considers only one monolingual projection of the grammar, entirely avoiding the constraints of synchronous binarization and allowing binarizations that are separately optimized for. | Asynchronous Binarization for Synchronous Grammars John DeNero Adam Pauls and Dan Klein Computer Science Division University of California Berkeley denero adpauls klein @ Abstract Binarization of n-ary rules is critical for the efficiency of syntactic machine translation decoding. Because the target side of a rule will generally reorder the source side it is complex and sometimes impossible to find synchronous rule binarizations. However we show that synchronous binarizations are not necessary in a two-stage decoder. Instead the grammar can be binarized one way for the parsing stage then rebinarized in a different way for the reranking stage. Each individual binarization considers only one monolingual projection of the grammar entirely avoiding the constraints of synchronous binarization and allowing binarizations that are separately optimized for each stage. Compared to n-ary forest reranking even simple target-side binarization schemes improve overall decoding accuracy. 1 Introduction Syntactic machine translation decoders search over a space of synchronous derivations scoring them according to both a weighted synchronous grammar and an n-gram language model. The rewrites of the synchronous translation grammar are typically flat n-ary rules. Past work has synchronously binarized such rules for efficiency Zhang et al. 2006 Huang et al. 2008 . Unfortunately because source and target orders differ synchronous binarizations can be highly constrained and sometimes impossible to find. Recent work has explored two-stage decoding which explicitly decouples decoding into a source parsing stage and a target language model integration stage Huang and Chiang 2007 . Because translation grammars continue to increase in size and complexity both decoding stages require efficient approaches DeNero et al. 2009 . In this paper we show how two-stage decoding enables independent binarizations for each stage. The source-side binarization guarantees cubictime .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.