Scientific report: "Combined One Sense Disambiguation of Abbreviations"

Yaakov HaCohen-Kerner
Department of Computer Science, Jerusalem College of Technology (Machon Lev)
21 Havaad Haleumi St. . 16031 91160 Jerusalem Israel
kerner@

Ariel Kass
Department of Computer Science, Jerusalem College of Technology (Machon Lev)
21 Havaad Haleumi St. . 16031 91160 Jerusalem Israel

Ariel Peretz
Department of Computer Science, Jerusalem College of Technology (Machon Lev)
21 Havaad Haleumi St. . 16031 91160 Jerusalem Israel
relperetz@

Abstract

A process that attempts to solve abbreviation ambiguity is presented. Various context-related features and statistical features have been explored. Almost all features are domain independent and language independent. The application domain is Jewish Law documents written in Hebrew. Such documents are known to be rich in ambiguous abbreviations. Various implementations of the one sense per discourse hypothesis are used, improving the features with new variants. An accuracy of [...] has been achieved by SVM.

1 Introduction

An abbreviation is a letter or sequence of letters that is a shortened form of a word or a sequence of words; the full form is called the sense of the abbreviation. Abbreviation disambiguation means choosing the correct sense for a specific context. Jewish Law documents written in Hebrew are known to be rich in ambiguous abbreviations (HaCohen-Kerner et al., 2004). They can therefore serve as an excellent test-bed for the development of models for abbreviation disambiguation.

Unlike the documents investigated by previous systems, Jewish Law documents usually do not contain the senses of their abbreviations in the same discourse. The abbreviations are therefore regarded as more difficult to disambiguate. This research defines features and experiments with several variants of the one sense per discourse hypothesis. The developed process considers other languages and does not define pre-execution [...]
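To make the one sense per discourse hypothesis concrete, the following is a minimal sketch of one possible reading of it: within a single document, every occurrence of an abbreviation is relabeled with the sense predicted most often for that abbreviation. This is an illustrative assumption, not the implementation described by the authors, whose variants are not detailed in this excerpt. The function name `apply_one_sense_per_discourse`, the input format, and the placeholder abbreviations and senses are all hypothetical.

```python
from collections import Counter, defaultdict


def apply_one_sense_per_discourse(predictions):
    """Relabel per-occurrence sense predictions within ONE document.

    predictions: list of (abbreviation, predicted_sense) pairs, in document
    order. The predictions could come from any classifier (hypothetical input).
    Returns a list of the same length in which each occurrence carries the
    majority sense predicted for its abbreviation in this document.
    """
    # Count how often each sense was predicted for each abbreviation.
    votes = defaultdict(Counter)
    for abbreviation, sense in predictions:
        votes[abbreviation][sense] += 1

    # Pick the most frequent sense per abbreviation.
    majority = {
        abbreviation: counts.most_common(1)[0][0]
        for abbreviation, counts in votes.items()
    }

    # Propagate the majority sense to every occurrence.
    return [(abbreviation, majority[abbreviation]) for abbreviation, _ in predictions]


if __name__ == "__main__":
    # Placeholder abbreviations and senses, not taken from the paper.
    document_predictions = [
        ("AB", "sense_1"),
        ("AB", "sense_2"),
        ("AB", "sense_1"),
        ("CD", "sense_3"),
    ]
    for abbreviation, sense in apply_one_sense_per_discourse(document_predictions):
        print(abbreviation, sense)
```

A majority vote is only one way to enforce a single sense per document; other choices, such as weighting occurrences by classifier confidence, would fit the same interface.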
