Scientific report: "Reordering Constraint Based on Document-Level Context"

Takashi Onishi and Masao Utiyama and Eiichiro Sumita
Multilingual Translation Laboratory, MASTAR Project
National Institute of Information and Communications Technology
3-5 Hikaridai, Keihanna Science City, Kyoto, JAPAN
mutiyama @

Abstract

One problem with phrase-based statistical machine translation is long-distance reordering when translating between languages with different word orders, such as Japanese-English. In this paper, we propose a method of imposing reordering constraints using document-level context. As the document-level context, we use noun phrases that occur significantly in the context documents containing the source sentences. Given a source sentence, zones which cover the noun phrases are used as reordering constraints. Then, in decoding, reorderings which violate the zones are restricted. Experimental results for patent translation tasks show significant improvements in BLEU for both Japanese-English and English-Japanese translation.

1 Introduction

Phrase-based statistical machine translation is useful for translating between languages with similar word orders. However, it has problems with long-distance reordering when translating between languages with different word orders, such as Japanese-English. These problems are especially crucial when translating long sentences, such as patent sentences, because the many possible combinations of word orders cause high computational costs and low translation quality.
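The zone constraint mentioned in the abstract can be illustrated with a minimal sketch. This excerpt does not specify how the decoder enforces zones, so the reading below is an assumption: a zone is an inclusive span of source word positions, and a translation order violates it if the decoder starts translating words inside the zone and then moves to a word outside it before finishing the zone. The function name `violates_zones` and the data layout are hypothetical and chosen for illustration only.

```python
from typing import List, Tuple

# Hypothetical representation: a zone is an inclusive span of
# source word positions, e.g. the span covering one noun phrase.
Zone = Tuple[int, int]


def violates_zones(order: List[int], zones: List[Zone]) -> bool:
    """Return True if the translation order interrupts any zone.

    `order` lists source word positions in the order they are translated.
    Assumed rule: the words of a zone must be translated as one
    uninterrupted block, although they may be reordered inside the block
    and the block as a whole may move.
    """
    for start, end in zones:
        # Decoding steps at which a word belonging to this zone is translated.
        steps = [i for i, pos in enumerate(order) if start <= pos <= end]
        # The steps must be consecutive; a gap means a word outside the
        # zone was translated in between.
        if steps and steps[-1] - steps[0] + 1 != len(steps):
            return True
    return False
```

Under this assumed rule, a decoder could prune any hypothesis whose partial order already makes `violates_zones` return True, which reduces the number of word-order combinations it has to explore for long sentences.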
In order to address these problems, various methods which use syntactic information have been proposed. These include methods where source sentences are divided into syntactic chunks or clauses and the translations are merged later (Koehn and Knight, 2003; Sudoh et al., 2010), methods where syntactic constraints or penalties for reordering are added to a decoder (Yamamoto et al., 2008; Cherry, 2008; Marton and Resnik, 2008; Xiong et al., 2010)
