TAILIEUCHUNG - Báo cáo khoa học: "Constituency to Dependency Translation with Forests"

Tree-to-string systems (and their forestbased extensions) have gained steady popularity thanks to their simplicity and efficiency, but there is a major limitation: they are unable to guarantee the grammaticality of the output, which is explicitly modeled in string-to-tree systems via targetside syntax. We thus propose to combine the advantages of both, and present a novel constituency-to-dependency translation model, which uses constituency forests on the source side to direct the translation, and dependency trees on the target side (as a language model) to ensure grammaticality. . | Constituency to Dependency Translation with Forests Haitao Mi and Qun Liu Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences . Box 2704 Beijing 100190 China htmi liuqun @ Abstract Tree-to-string systems and their forestbased extensions have gained steady popularity thanks to their simplicity and efficiency but there is a major limitation they are unable to guarantee the grammaticality of the output which is explicitly modeled in string-to-tree systems via targetside syntax. We thus propose to combine the advantages of both and present a novel constituency-to-dependency translation model which uses constituency forests on the source side to direct the translation and dependency trees on the target side as a language model to ensure grammaticality. Medium-scale experiments show an absolute and statistically significant improvement of BLEU points over a state-of-the-art forest-based tree-to-string system even with fewer rules. This is also the first time that a tree-to-tree model can surpass tree-to-string counterparts. 1 Introduction Linguistically syntax-based statistical machine translation models have made promising progress in recent years. By incorporating the syntactic annotations of parse trees from both or either side s of the bitext they are believed better than phrasebased counterparts in reorderings. Depending on the type of input these models can be broadly divided into two categories see Table 1 the stringbased systems whose input is a string to be simultaneously parsed and translated by a synchronous grammar and the tree-based systems whose input is already a parse tree to be directly converted into a target tree or string. When we also take into account the type of output tree or string the treebased systems can be divided into tree-to-string and tree-to-tree efforts. tree on examples partial fast gram. BLEU source Liu06 Huang06 - target Galley06 Shen08 - both Ding05 Liu09

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.