Scientific report: "Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora"

Shiqi Zhao1, Haifeng Wang2, Ting Liu1, Sheng Li1
1 Harbin Institute of Technology, Harbin, China
{zhaosq, tliu, lisheng}@
2 Toshiba (China) Research and Development Center, Beijing, China
wanghaifeng@

Abstract

Paraphrase patterns are useful in paraphrase recognition and generation. In this paper, we present a pivot approach for extracting paraphrase patterns from bilingual parallel corpora, whereby the English paraphrase patterns are extracted using the sentences in a foreign language as pivots. We propose a log-linear model to compute the paraphrase likelihood of two patterns and exploit feature functions based on maximum likelihood estimation (MLE) and lexical weighting (LW). Using the presented method, we extract over 1,000,000 pairs of paraphrase patterns from 2M bilingual sentence pairs, the precision of which exceeds 67%. The evaluation results show that: (1) The pivot approach is effective in extracting paraphrase patterns and significantly outperforms the conventional method DIRT. In particular, the log-linear model with the proposed feature functions achieves high performance. (2) The coverage of the extracted paraphrase patterns is high, at above 84%. (3) The extracted paraphrase patterns can be classified into 5 types, which are useful in various applications.
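The pivot idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes that each English pattern has already been aligned to a foreign-language pivot pattern, uses only two MLE-style features (log p(pivot | e1) and log p(e2 | pivot)), and leaves out the lexical weighting features entirely. The function names, the weights, and the pivot identifiers `F1` and `F2` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def mle_table(pairs):
    """Relative-frequency estimate p(target | source) from (source, target) pairs."""
    pair_counts = Counter(pairs)
    source_counts = Counter(src for src, _ in pairs)
    table = defaultdict(dict)
    for (src, tgt), n in pair_counts.items():
        table[src][tgt] = n / source_counts[src]
    return table


def paraphrase_scores(aligned, weights=(1.0, 1.0)):
    """Score English pattern pairs that share a foreign pivot pattern.

    `aligned` is a list of (english_pattern, pivot_pattern) observations.
    Each pivot contributes exp(sum_i weight_i * feature_i), a log-linear
    combination of the two MLE features (an assumed, simplified form).
    """
    p_pivot_given_en = mle_table(aligned)
    p_en_given_pivot = mle_table([(f, e) for e, f in aligned])
    scores = defaultdict(float)
    for e1, pivots in p_pivot_given_en.items():
        for f, p1 in pivots.items():
            for e2, p2 in p_en_given_pivot[f].items():
                if e2 == e1:
                    continue  # a pattern is not its own paraphrase
                features = (math.log(p1), math.log(p2))
                scores[(e1, e2)] += math.exp(
                    sum(w * h for w, h in zip(weights, features))
                )
    return dict(scores)


# Hypothetical toy alignments; F1 and F2 stand in for foreign pivot patterns.
aligned = [
    ("X solves Y", "F1"),
    ("X solves Y", "F1"),
    ("X resolves Y", "F1"),
    ("X solves Y", "F2"),
    ("X works out Y", "F2"),
]
scores = paraphrase_scores(aligned)
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 3))
```

With both weights set to 1.0, each pivot's contribution reduces to the product p(pivot | e1) * p(e2 | pivot), so the score is the familiar sum over shared pivots. Other weights change the relative influence of the two features.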
1 Introduction

Paraphrases are different expressions that convey the same meaning. Paraphrases are important in many natural language processing (NLP) applications, such as question answering (QA) (Lin and Pantel, 2001; Ravichandran and Hovy, 2002), machine translation (MT) (Kauchak and Barzilay, 2006; Callison-Burch et al., 2006), multi-document summarization (McKeown et al., 2002), and natural language generation (Iordanskaja et al., 1991). Paraphrase patterns are sets of semantically equivalent patterns, in which a pattern generally contains two parts: the pattern words and slots. For example, in the pattern "X solves Y", "solves" is the ...
