Scientific report: "A Morphological Analysis Based Method for Spelling Correction"

Aduriz I., Agirre E., Alegria I., Arregi X., Arriola Ariola X., Diaz de Ilarraza A., Ezeiza N., Maritxalar M., Sarasola K., Urkia M.
Informatika Fakultatea Basque Country University. . 649. 20080 DONOSTIA Basque Country . Aldapeta DONOSTIA Basque Country

1 Introduction

Xuxen is a spelling checker/corrector for Basque which is going to be commercialized next year. The checker recognizes a word-form if a correct morphological breakdown is allowed. The morphological analysis is based on two-level morphology.

The correction method distinguishes between orthographic errors and typographical errors.
• Typographical errors (or mistypings) are non-cognitive errors which do not follow linguistic criteria.
• Orthographic errors are cognitive errors which occur when the writer does not know or has forgotten the correct spelling of a word. They are more persistent because of their cognitive nature, they leave a worse impression and, finally, their treatment is an interesting application for language standardization purposes.

2 Correction Method in Xuxen

The main problems found in designing the checking/correction strategy were:
• Due to the high level of inflection of Basque, it is impossible to store every word-form in a dictionary; therefore, the mainstream checking/correction methods were not suitable.
• Because of the recent standardization and widespread dialectal use of Basque, orthographic errors are more likely, and therefore their treatment becomes critical.
• The word-forms which are generated without linguistic knowledge must be fed into the spelling checker to check whether they are correct or not.

In order to face these issues, the strategy used is basically the following (see also Figure 1).

Handling orthographic errors

The treatment of orthographic errors is based on the parallel use of a two-level subsystem designed to detect previously typified misspellings. This subsystem has two main components: Additional two-level .
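The interplay between the checker and the generation of correction proposals can be illustrated with a minimal Python sketch. It is not Xuxen's implementation: the two-level morphological analysis is replaced here by a plain stem-plus-suffix split, the tiny lexicon (STEMS, SUFFIXES) is a hypothetical sample, and the function names are invented for illustration. It only shows the general idea that a word-form is accepted when some morphological breakdown exists, and that candidates generated without linguistic knowledge are fed back into the checker and kept only if they are accepted.

```python
import string

# Hypothetical toy lexicon, for illustration only (not Xuxen's lexicon).
STEMS = {"etxe", "mendi"}
SUFFIXES = {"", "a", "ak", "an", "ra", "tik"}


def is_word_form(word):
    """Accept a word if it splits into a known stem plus a known suffix.

    Assumption: this split stands in for a real two-level analysis.
    """
    return any(
        word[:i] in STEMS and word[i:] in SUFFIXES
        for i in range(1, len(word) + 1)
    )


def single_edit_variants(word, alphabet=string.ascii_lowercase):
    """Generate every string one deletion, transposition, substitution
    or insertion away from `word`, using no linguistic knowledge."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = {a + b[1:] for a, b in splits if b}
    transposes = {a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1}
    substitutions = {a + c + b[1:] for a, b in splits if b for c in alphabet}
    inserts = {a + c + b for a, b in splits for c in alphabet}
    return (deletes | transposes | substitutions | inserts) - {word}


def propose_corrections(word):
    """Return the generated variants that the checker itself accepts."""
    if is_word_form(word):
        return []  # already a valid word-form, nothing to correct
    return sorted(v for v in single_edit_variants(word) if is_word_form(v))


if __name__ == "__main__":
    print(is_word_form("etxean"))
    print(propose_corrections("etxen"))
```

Because validity is decided by segmentation rather than by lookup in a list of full word-forms, the sketch never has to store every inflected form, which is the constraint the highly inflected nature of Basque imposes. The separate subsystem for typified orthographic errors is not modelled here.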
