TAILIEUCHUNG - Báo cáo khoa học: "Combining Multiple Resources to Improve SMT-based Paraphrasing Model∗"

This paper proposes a novel method that exploits multiple resources to improve statistical machine translation (SMT) based paraphrasing. In detail, a phrasal paraphrase table and a feature function are derived from each resource, which are then combined in a log-linear SMT model for sentence-level paraphrase generation. Experimental results show that the SMT-based paraphrasing model can be enhanced using multiple resources. The phrase-level and sentence-level precision of the generated paraphrases are above 60% and 55%, respectively. In addition, the contribution of each resource is evaluated, which indicates that all the exploited resources are useful for generating paraphrases of high quality | Combining Multiple Resources to Improve SMT-based Paraphrasing Model Shiqi Zhao1 Cheng Niu2 Ming Zhou2 Ting Liu1 Sheng Li1 1Harbin Institute of Technology Harbin China zhaosq tliu lisheng @ 2Microsoft Research Asia Beijing China chengniu mingzhou @ Abstract This paper proposes a novel method that exploits multiple resources to improve statistical machine translation SMT based paraphrasing. In detail a phrasal paraphrase table and a feature function are derived from each resource which are then combined in a log-linear SMT model for sentence-level paraphrase generation. Experimental results show that the SMT-based paraphrasing model can be enhanced using multiple resources. The phrase-level and sentence-level precision of the generated paraphrases are above 60 and 55 respectively. In addition the contribution of each resource is evaluated which indicates that all the exploited resources are useful for generating paraphrases of high quality. 1 Introduction Paraphrases are alternative ways of conveying the same meaning. Paraphrases are important in many natural language processing NLP applications such as machine translation MT question answering QA information extraction IE multidocument summarization MDS and natural language generation NLG . This paper addresses the problem of sentencelevel paraphrase generation which aims at generating paraphrases for input sentences. An example of sentence-level paraphrases can be seen below S1 The table was set up in the carriage shed. S2 The table was laid under the cart-shed. This research was finished while the first author worked as an intern in Microsoft Research Asia. Paraphrase generation can be viewed as monolingual machine translation Quirk et al. 2004 which typically includes a translation model and a language model. The translation model can be trained using monolingual parallel corpora. However acquiring such corpora is not easy. Hence data sparseness is a key problem for the SMT-based .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.