Scientific report: "Cross-lingual Parse Disambiguation based on Semantic Correspondence"

Lea Frermann
Department of Computational Linguistics, Saarland University
frermann@

Francis Bond
Linguistics and Multilingual Studies, Nanyang Technological University
bond@

Abstract

We present a system for cross-lingual parse disambiguation, exploiting the assumption that the meaning of a sentence remains unchanged during translation and the fact that different languages have different ambiguities. We simultaneously reduce ambiguity in multiple languages in a fully automatic way. Evaluation shows that the system reliably discards dispreferred parses from the raw parser output, which results in a pre-selection that can speed up manual treebanking.

1 Introduction

Treebanks, sets of parsed sentences annotated with a syntactic structure, are an important resource in NLP. The manual construction of treebanks, where a human annotator selects a gold parse from all parses returned by a parser, is a tedious and error-prone process.

We present a system for simultaneous and accurate partial parse disambiguation of multiple languages. Using the pre-selected set of parses returned by the system, the treebanking process for multiple languages can be sped up.

The system operates on an aligned parallel corpus. The languages of the parallel corpus are considered as mutual semantic tags: as the meaning of a sentence stays constant during translation, we are able to resolve ambiguities which exist in only one of the languages by only accepting those interpretations which are licensed by the other language. In particular, we select one language as the target language, translate the other language's semantics for every parse into the target language, and thus align maximally similar semantic representations. The parses with the most overlapping semantics are selected as preferred parses.

As an example, consider the English sentence "They closed the shop at five", which has the following two .
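
The selection step described above (translate each parse's semantics into the target language, then keep the parses whose semantics overlap most) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: it assumes each parse's semantics can be reduced to a set of (relation, head, dependent) triples, that relation names are shared across languages, and that a bilingual lexicon is available. The Jaccard score, the function names, the `w_` prefixed source-language words and the two toy readings are all hypothetical.

```python
from itertools import product


def translate(semantics, lexicon):
    """Map the words of each (relation, head, dependent) triple into the target vocabulary."""
    return frozenset(
        (rel, lexicon.get(head, head), lexicon.get(dep, dep))
        for rel, head, dep in semantics
    )


def overlap(a, b):
    """Jaccard overlap between two sets of semantic triples (an assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def preferred_parses(target_parses, source_parses, lexicon):
    """Return the target and source parse ids that take part in a best-scoring pair."""
    scored = [
        (overlap(t_sem, translate(s_sem, lexicon)), t_id, s_id)
        for (t_id, t_sem), (s_id, s_sem) in product(
            target_parses.items(), source_parses.items()
        )
    ]
    if not scored:
        return set(), set()
    best = max(score for score, _, _ in scored)
    keep_target = {t_id for score, t_id, _ in scored if score == best}
    keep_source = {s_id for score, _, s_id in scored if score == best}
    return keep_target, keep_source


# Toy data (hypothetical): two target-language readings that differ only in
# what "at five" attaches to, and one unambiguous source-language parse.
target_parses = {
    "verb_attach": frozenset({
        ("arg1", "close", "they"), ("arg2", "close", "shop"), ("at", "close", "five"),
    }),
    "noun_attach": frozenset({
        ("arg1", "close", "they"), ("arg2", "close", "shop"), ("at", "shop", "five"),
    }),
}
source_parses = {
    "p1": frozenset({
        ("arg1", "w_close", "w_they"), ("arg2", "w_close", "w_shop"), ("at", "w_close", "w_five"),
    }),
}
lexicon = {"w_close": "close", "w_they": "they", "w_shop": "shop", "w_five": "five"}

kept_target, kept_source = preferred_parses(target_parses, source_parses, lexicon)
print(kept_target, kept_source)
```

In this toy setting, the reading that the other language does not license scores lower and is discarded. All parses tied for the best score are kept, so the result is a pre-selection rather than a single gold parse.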
