Scientific report: "Empirical Lower Bounds on the Complexity of Translational Equivalence ∗"



Benjamin Wellington
Computer Science Dept., New York University, New York, NY 10003
{lastname}@cs.nyu.edu

Sonjia Waxmonsky
Computer Science Dept., University of Chicago, Chicago, IL 60637
wax@cs.uchicago.edu

I. Dan Melamed
Computer Science Dept., New York University, New York, NY 10003
{lastname}@cs.nyu.edu

Abstract

This paper describes a study of the patterns of translational equivalence exhibited by a variety of bitexts. The study found that the complexity of these patterns in every bitext was higher than suggested in the literature. These findings shed new light on why "syntactic" constraints have not helped to improve statistical translation models, including finite-state phrase-based models, tree-to-string models, and tree-to-tree models. The paper also presents evidence that inversion transduction grammars cannot generate some translational equivalence relations, even in relatively simple real bitexts in syntactically similar languages with rigid word order. Instructions for replicating our experiments are at http://nlp.cs.nyu.edu/GenPar/ACL06

1 Introduction

Translational equivalence is a mathematical relation that holds between linguistic expressions with the same meaning. The most common explicit representations of this relation are word alignments between sentences that are translations of each other. The complexity of a given word alignment can be measured by the difficulty of decomposing it into its atomic units under certain constraints detailed in Section 2.
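As a rough illustration of what decomposing an alignment can mean, the sketch below handles only the simplest case: a one-to-one alignment written as a permutation of target positions. It tests whether the permutation can be built by repeatedly merging adjacent source spans whose target spans are also adjacent, which is the kind of alignment a binary inversion transduction grammar can generate. The function name `is_binarizable` and the permutation encoding are assumptions made for this example; this is not the complexity measure the paper defines in Section 2, which also covers alignments that are not one-to-one.

```python
def is_binarizable(perm):
    """Return True if a one-to-one alignment can be decomposed into binary,
    contiguous merges (straight or inverted).

    perm[i] is the target position aligned to source position i, so perm
    must be a permutation of range(len(perm)).  Hypothetical helper written
    for illustration only.
    """
    if sorted(perm) != list(range(len(perm))):
        raise ValueError("perm must be a permutation of 0..n-1")

    stack = []  # each entry is a (low, high) span of target positions
    for target in perm:
        stack.append((target, target))
        # Reduce while the two topmost spans are adjacent on the target side.
        while len(stack) >= 2:
            lo1, hi1 = stack[-2]
            lo2, hi2 = stack[-1]
            if hi1 + 1 == lo2 or hi2 + 1 == lo1:
                stack[-2:] = [(min(lo1, lo2), max(hi1, hi2))]
            else:
                break
    # A single remaining span means every step was a binary merge.
    return len(stack) <= 1


# Two adjacent swaps decompose cleanly into binary merges.
print(is_binarizable([1, 0, 3, 2]))
# The "inside-out" pattern (2, 4, 1, 3 in 1-based notation) does not.
print(is_binarizable([1, 3, 0, 2]))
```

The shift-reduce loop is used here because it needs only one pass over the sentence; an alignment that fails this test needs at least one merge step that joins more than two spans or leaves a gap.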
This paper describes a study of the distribution of alignment complexity in a variety of bitexts. The study considered word alignments both in isolation and in combination with independently generated parse trees for one or both sentences in each pair. Thus the study ...

∗ Thanks to David Chiang, Liang Huang, the anonymous reviewers, and members of the NYU Proteus Project for helpful feedback. This research was supported by NSF grants 0238406 and ...

