TAILIEUCHUNG - Báo cáo khoa học: "ADP based Search Algorithm for Statistical Machine Translation"

We introduce a novel search algorithm for statistical machine translation based on dynamic programming (DP). During the search process two statistical knowledge sources are combined: a translation model and a bigram language model. This search algorithm expands hypotheses along the positions of the target string while guaranteeing progressive coverage of the words in the source string. We present experimental results on the Verbmobil task. | A DP based Search Algorithm for Statistical Machine Translation s. Niefien s. Vogel H. Ney and c. Tillmann Lehrstuhl fur Informatik VI RWTH Aachen - University of Technology D-52056 Aachen Germany Email niessen@informatik. rwth-aachen. de Abstract We introduce a novel search algorithm for statistical machine translation based on dynamic programming DP . During the search process two statistical knowledge sources are combined a translation model and a bigram language model. This search algorithm expands hypotheses along the positions of the target string while guaranteeing progressive coverage of the words in the source string. We present experimental results on the Verbmobil task. 1 Introduction In this paper we address the problem of finding the most probable target language representation of a given source language string. In our approach we use a DP based search algorithm which sequentially visits the target string positions while progressively considering the source string words. The organization of the paper is as follows. After reviewing the statistical approach to machine translation we first describe the statistical knowledge sources used during the search process. We then present our DP based search algorithm in detail. Finally experimental results for a bilingual corpus are reported. Statistical Machine Translation In statistical machine translation the goal of the search strategy can be formulated as follows We are given a source language French string f 1 fj which is to be translated into a target language English string e ei. .ej with the unknown length I. Every English string is considered as a possible translation for the input string. If we assign a probability Pr e If to each pair of strings ef fl then we have to choose the length I opt and the English string ê that maximize Pr e for a given French string f . According to Bayes decision rule lopt and Ể1 P can be found by Upt ẻi p i argmax Pr e I v 7 Z eỉ argmax Pr e -Pr fief . 1 Fr e is the .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.