TAILIEUCHUNG - Báo cáo khoa học: "Forest-to-String Statistical Translation Rules"

In this paper, we propose forest-to-string rules to enhance the expressive power of tree-to-string translation models. A forestto-string rule is capable of capturing nonsyntactic phrase pairs by describing the correspondence between multiple parse trees and one string. To integrate these rules into tree-to-string translation models, auxiliary rules are introduced to provide a generalization level. | Forest-to-String Statistical Translation Rules Yang Liu Yun Huang Qun Liu and Shouxun Lin Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences PO. Box 2704 Beijing 100080 China yliu huangyun liuqun sxlin @ Abstract In this paper we propose forest-to-string rules to enhance the expressive power of tree-to-string translation models. A forest-to-string rule is capable of capturing nonsyntactic phrase pairs by describing the correspondence between multiple parse trees and one string. To integrate these rules into tree-to-string translation models auxiliary rules are introduced to provide a generalization level. Experimental results show that on the NIST 2005 Chinese-English test set the tree-to-string model augmented with forest-to-string rules achieves a relative improvement of in terms of BLEU score over the original model which allows tree-to-string rules only. 1 Introduction The past two years have witnessed the rapid development of linguistically syntax-based translation models Quirk et al. 2005 Galley et al. 2006 Marcu et al. 2006 Liu et al. 2006 which induce tree-to-string translation rules from parallel texts with linguistic annotations. They demonstrated very promising results when compared with the state of the art phrase-based system Och and Ney 2004 in the NIST 2006 machine translation evaluation 1. While Galley et al. 2006 and Marcu et al. 2006 put emphasis on target language analysis Quirk et al. 2005 and Liu et al. 2006 show benefits from modeling the syntax of source language. 1See http speech tests mt 704 One major problem with linguistically syntaxbased models however is that tree-to-string rules fail to syntactify non-syntactic phrase pairs because they require a syntax tree fragment over the phrase to be syntactified. Here we distinguish between syntactic and non-syntactic phrase pairs. By syntactic we mean that the phrase pair is subsumed by some syntax tree .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.