TAILIEUCHUNG - Báo cáo khoa học: "Detecting Erroneous Sentences using Automatically Mined Sequential Patterns"

This paper studies the problem of identifying erroneous/correct sentences. The problem has important applications, ., providing feedback for writers of English as a Second Language, controlling the quality of parallel bilingual sentences mined from the Web, and evaluating machine translation results. In this paper, we propose a new approach to detecting erroneous sentences by integrating pattern discovery with supervised learning models. Experimental results show that our techniques are promising. . | Detecting Erroneous Sentences using Automatically Mined Sequential Patterns Guihua Sun Chongqing University sunguihua5018@ Zhongyang Xiong Chongqing University zyxiong@ Xiaohua Liu Gao Cong Ming Zhou Microsoft Research Asia xiaoliu gaocong mingzhou @ John Lee t Chin-Yew Lin MIT Microsoft Research Asia jsylee@ cyl@ Abstract This paper studies the problem of identifying erroneous correct sentences. The problem has important applications . providing feedback for writers of English as a Second Language controlling the quality of parallel bilingual sentences mined from the Web and evaluating machine translation results. In this paper we propose a new approach to detecting erroneous sentences by integrating pattern discovery with supervised learning models. Experimental results show that our techniques are promising. 1 Introduction Detecting erroneous correct sentences has the following applications. First it can provide feedback for writers of English as a Second Language ESL as to whether a sentence contains errors. Second it can be applied to control the quality of parallel bilingual sentences mined from the Web which are critical sources for a wide range of applications such as statistical machine translation Brown et al. 1993 and cross-lingual information retrieval Nie et al. 1999 . Third it can be used to evaluate machine translation results. As demonstrated in Corston-Oliver et al. 2001 Gamon et al. 2005 the better human reference translations can be distinguished from machine translations by a classification model the worse the machine translation system is. Work done while the author was a visiting student at MSRA Work done while the author was a visiting student at MSRA The previous work on identifying erroneous sentences mainly aims to find errors from the writing of ESL learners. The common mistakes Yukio et al. 2001 Gui and Yang 2003 made by ESL learners include spelling lexical collocation sentence .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.