TAILIEUCHUNG - Báo cáo khoa học: "Non-Literal Word Sense Identification Through Semantic Network Path Schemata"

When computer programs disambiguate words in a sentence, they often encounter non-literal or novel usages not included in their lexicon. In a recent study, Georgia Green (personal communication) estimated that 17% to 20% of the content word senses encountered in various types of normal English text are not fisted in the dictionary. While these novel word senses are generally valid, they occur in such great numbers, and with such little individual frequency that it is impractical to explicity include them all within the lexicon. . | Non-Literal Word Sense Identification Through Semantic Network Path Schemata Eric Iverson Stephen Helmreich Computing Research Lab and Computer Science Department Box 3OOO1 3CRL New Mexico State University Las Croces. NM 88003-0001 When computer programs disambiguate words in a sentence they often encounter non-literal or novel usages not included in then lexicon. In a recent study Georgia Green personal communication estimated that 17 to 20 of the content word senses encountered in various types of normal English text are not listed in the dictionary. While these novel word senses are generally valid they occur in such great numbers and with such little individual frequency that it is impractical to explicitly include them all within the lexicon. Instead mechanisms are needed which can derive novel senses from existing ones thus allowing a program to recognize a significant set of potential word senses while keeping its lexicon within a reasonable size. Spreading activation is a mechanism that allows us to do this. Here the program follows paths from existing word senses stored in a semantic network to other closely associated word senses. By examining tile shape of the resultant path we can determine the relationship between the senses contained in the path thus deriving novel composite meanings not contained within any of the original lexical entries. This process is similar to the spreading activation and marker passing techniques of Hirst 1988 Chamiak 1986 and Norvig 1989 and is embodied in the Prolog program metallel based on Pass program meta5 Fass 1988 . Metallel s lexicon is written as a series of sense frames each containing information about a particular word sense. A sense frame can be broken into two main parts genera and differentiae. Genera are the genus terms that function as the ancestors of a word sense. Differentiae denote the qualities that distinguish a particular sense from other senses of the same genus. Differentiae can be broken down into .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.