Scientific report: "How Are Spelling Errors Generated and Corrected?"

How Are Spelling Errors Generated and Corrected? A Study of Corrected and Uncorrected Spelling Errors Using Keystroke Logs

Yukino Baba
The University of Tokyo

Abstract

This paper presents a comparative study of spelling errors that are corrected as you type vs. those that remain uncorrected. First, we generate naturally occurring online error correction data by logging users' keystrokes and by automatically deriving pre- and post-correction strings from them. We then perform an analysis of this data against the errors that remain in the final text, as well as across languages. Our analysis shows a clear distinction between the types of errors that are generated and those that remain uncorrected, as well as across languages.

1 Introduction

When we type text using a keyboard, we generate many spelling errors, both typographical (caused by the keyboard layout and hand/finger movement) and cognitive (caused by phonetic or orthographic similarity) (Kukich, 1992). When the errors are caught during typing, they are corrected on the fly, but unnoticed errors will persist in the final text. Previous research on spelling correction has focused on the latter type (which we call uncorrected errors), presumably because the errors that are corrected on the spot (referred to here as corrected errors) are not recorded in the form of a text. However, studying corrected errors is important for at least three reasons.

First, such data encapsulates the spelling mistake and correction by the author, in contrast to the case of uncorrected errors, in which the intended correction is typically assigned by a third person (an annotator) or by an automatic method (Whitelaw et al., 2009; Aramaki et al., 2010). Secondly, data on corrected errors will enable us to build a spelling correction application that targets correction on the fly, which directly reduces the number of keystrokes in typing. This is crucial for languages that use transliteration-based text input methods, such as Chinese and Japanese.
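
The abstract mentions deriving pre- and post-correction strings from logged keystrokes. As a rough illustration only, and not the procedure used in the paper, the Python sketch below replays a keystroke sequence and pairs the text as it stood before a run of backspaces with the text that replaced it. It assumes the log is a plain string in which `\b` stands for the backspace key, and that a correction ends at the next space or at the end of the input; the names `BACKSPACE` and `derive_corrections` are hypothetical.

```python
BACKSPACE = "\b"  # assumed encoding of the backspace key in the log


def derive_corrections(keystrokes):
    """Replay keystrokes and return (final_text, [(pre, post), ...]).

    pre  = text from the start of the affected word, as typed before
           the first backspace of a correction
    post = the same span after retyping, taken at the next space or
           at the end of the input
    """
    buffer = []          # characters currently on screen
    corrections = []
    snapshot = None      # text just before the correction started
    lowest = 0           # shortest buffer length reached while deleting

    def close_correction():
        nonlocal snapshot
        if snapshot is None:
            return
        text = "".join(buffer)
        # Characters before index `lowest` were never deleted, so the
        # word start found in the snapshot is valid in both strings.
        start = snapshot.rfind(" ", 0, lowest) + 1
        pre, post = snapshot[start:], text[start:]
        if pre != post:
            corrections.append((pre, post))
        snapshot = None

    for key in keystrokes:
        if key == BACKSPACE:
            if snapshot is None:
                snapshot = "".join(buffer)
                lowest = len(buffer)
            if buffer:
                buffer.pop()
            lowest = min(lowest, len(buffer))
        elif key == " ":
            close_correction()
            buffer.append(key)
        else:
            buffer.append(key)

    close_correction()
    return "".join(buffer), corrections


final_text, pairs = derive_corrections("teh\b\bhe cat")
print(final_text, pairs)
```

Closing a correction at the next space is a simplification: an edit that reaches back across several words would be cut off at the first retyped word, and a real keystroke log would also need to handle cursor movement and input-method conversion keys.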
