TAILIEUCHUNG - Báo cáo khoa học: "Enriching Morphologically Poor Languages for Statistical Machine Translation"

We address the problem of translating from morphologically poor to morphologically rich languages by adding per-word linguistic information to the source language. We use the syntax of the source sentence to extract information for noun cases and verb persons and annotate the corresponding words accordingly. In experiments, we show improved performance for translating from English into Greek and Czech. For English–Greek, we reduce the error on the verb conjugation from 19% to and noun case agreement from 9% to 6%. . | Enriching Morphologically Poor Languages for Statistical Machine Translation Eleftherios Avramidis Philipp Koehn pkoehn@ School of Informatics University of Edinburgh 2 Baccleuch Place Edinburgh EH8 9LW UK Abstract We address the problem of translating from morphologically poor to morphologically rich languages by adding per-word linguistic information to the source language. We use the syntax of the source sentence to extract information for noun cases and verb persons and annotate the corresponding words accordingly. In experiments we show improved performance for translating from English into Greek and Czech. For English-Greek we reduce the error on the verb conjugation from 19 to and noun case agreement from 9 to 6 . 1 Introduction Traditional statistical machine translation methods are based on mapping on the lexical level which takes place in a local window of a few words. Hence they fail to produce adequate output in many cases where more complex linguistic phenomena play a role. Take the example of morphology. Predicting the correct morphological variant for a target word may not depend solely on the source words but require additional information about its role in the sentence. Recent research on handling rich morphology has largely focused on translating from rich morphology languages such as Arabic into English Habash and Sadat 2006 . There has been less work on the opposite case translating from English into morphologically richer languages. In a study of translation quality for languages in the Europarl corpus Koehn 2005 reports that translating into morphologically richer languages is more difficult than translating from them. There are intuitive reasons why generating richer morphology from morphologically poor languages is harder. Take the example of translating noun phrases from English to Greek or German Czech etc. . In English a noun phrase is rendered the same if it is the subject or the object. However .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.