TAILIEUCHUNG - Scientific paper: "Bitext Dependency Parsing with Bilingual Subtree Constraints"

Bitext Dependency Parsing with Bilingual Subtree Constraints

Wenliang Chen, Jun'ichi Kazama, and Kentaro Torisawa
Language Infrastructure Group, MASTAR Project
National Institute of Information and Communications Technology
3-5 Hikari-dai, Seika-cho, Soraku-gun, Kyoto, Japan, 619-0289
{chenwl, kazama, torisawa}@

Abstract

This paper proposes a dependency parsing method that uses bilingual constraints to improve the accuracy of parsing bilingual texts (bitexts). In our method, a target-side tree fragment that corresponds to a source-side tree fragment is identified via word alignment and automatically learned mapping rules. The fragment is then verified against a subtree list collected from large-scale, automatically parsed data on the target side. Our method thus requires gold-standard trees only on the source side of a bilingual corpus in the training phase, unlike the joint parsing model, which requires gold-standard trees on both sides. Compared to the reordering constraint model, which requires the same training data as ours, our method achieved higher accuracy because of richer bilingual constraints. Experiments on the translated portion of the Chinese Treebank show that our system outperforms monolingual parsers by points for Chinese and points for English.
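The identify-then-verify step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `map_to_target` and `has_verified_constraint`, the toy sentences, and the subtree list are all hypothetical, and the learned mapping rules are replaced by the simplest possible assumption (head maps to head, dependent maps to dependent through a one-to-one word alignment).

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical representation: a two-word subtree is a (head_word, dependent_word) pair.
Subtree = Tuple[str, str]


def map_to_target(
    src_head: int,
    src_dep: int,
    alignment: Dict[int, int],
    tgt_words: List[str],
) -> Optional[Subtree]:
    """Project a source-side head/dependent pair onto the target side.

    Assumption: the mapping rule is the identity (head -> head, dep -> dep).
    The paper learns its mapping rules automatically; this sketch does not.
    """
    if src_head not in alignment or src_dep not in alignment:
        return None  # an unaligned word yields no target-side fragment
    return (tgt_words[alignment[src_head]], tgt_words[alignment[src_dep]])


def has_verified_constraint(
    src_head: int,
    src_dep: int,
    alignment: Dict[int, int],
    tgt_words: List[str],
    subtree_list: Set[Subtree],
) -> bool:
    """Return True if the projected fragment appears in the target-side subtree list."""
    candidate = map_to_target(src_head, src_dep, alignment, tgt_words)
    return candidate is not None and candidate in subtree_list


# Toy data (illustrative only, not taken from the paper).
tgt_words = ["he", "eats", "apples"]
alignment = {0: 0, 1: 1, 2: 2}  # source index -> target index
subtree_list = {("eats", "apples"), ("eats", "he")}  # stands in for auto-parsed data

supported = has_verified_constraint(1, 2, alignment, tgt_words, subtree_list)
unsupported = has_verified_constraint(2, 0, alignment, tgt_words, subtree_list)
```

In this sketch the boolean result would serve as a bilingual feature for a candidate source-side dependency: `supported` is true because the projected pair occurs in the subtree list, while `unsupported` is false because the pair ("apples", "he") does not.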
1 Introduction

Parsing bilingual texts (bitexts) is crucial for training machine translation systems that rely on syntactic structures on the source side, the target side, or both (Ding and Palmer, 2005; Nakazawa et al., 2006). Bitexts can provide more information useful for parsing than ordinary monolingual texts. This additional information can be called bilingual constraints, and we expect it to yield more accurate parsing results that can be used effectively in training MT systems. With this motivation, several studies have aimed at highly accurate bitext parsing (Smith and Smith, 2004; Burkett and Klein, 2008; Huang et al., 2009). This paper proposes a dependency parsing method which uses ...
