NOUN CLASSIFICATION FROM PREDICATE-ARGUMENT STRUCTURES

Donald Hindle
AT&T Bell Laboratories
600 Mountain Avenue
Murray Hill, NJ 07974

ABSTRACT

A method of determining the similarity of nouns on the basis of a metric derived from the distribution of subject, verb, and object in a large text corpus is described. The resulting quasi-semantic classification of nouns demonstrates the plausibility of the distributional hypothesis, and has potential application to a variety of tasks, including automatic indexing, resolving nominal compounds, and determining the scope of modification.

1. INTRODUCTION

A variety of linguistic relations apply to sets of semantically similar words. For example, modifiers select semantically similar nouns, selectional restrictions are expressed in terms of the semantic class of objects, and semantic type restricts the possibilities for noun compounding. Therefore, it is useful to have a classification of words into semantically similar sets. Standard approaches to classifying nouns in terms of an "is-a" hierarchy have proven hard to apply to unrestricted language. Is-a hierarchies are expensive to acquire by hand for anything but highly restricted domains, while attempts to automatically derive these hierarchies from existing dictionaries have been only partially successful (Chodorow, Byrd, and Heidorn 1985).
This paper describes an approach to classifying English words according to the predicate-argument structures they show in a corpus of text. The general idea is straightforward: in any natural language there are restrictions on what words can appear together in the same construction, and in particular on what can be arguments of what predicates. For any noun, there is a restricted set of verbs that it appears as subject of or object of. For example, wine may be drunk, produced, and sold, but not pruned. Each noun may therefore be characterized according to the verbs that it occurs with. Nouns may then be grouped according to the extent to which they appear in ...
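The idea of characterizing a noun by the verbs it occurs with can be pictured with a minimal sketch. Everything in it is an assumption made for illustration: the triples are invented toy data rather than corpus output, the names `TRIPLES`, `noun_profiles`, and `cosine` are hypothetical, and plain cosine similarity over raw counts stands in for the paper's similarity metric, which this excerpt does not define.

```python
from collections import Counter, defaultdict
from math import sqrt

# Hypothetical toy data: (verb, relation, noun) triples of the kind a parser
# might extract from text. These are invented, not taken from any corpus.
TRIPLES = [
    ("drink", "obj", "wine"),
    ("produce", "obj", "wine"),
    ("sell", "obj", "wine"),
    ("drink", "obj", "beer"),
    ("sell", "obj", "beer"),
    ("brew", "obj", "beer"),
    ("prune", "obj", "tree"),
    ("plant", "obj", "tree"),
    ("sell", "obj", "tree"),
]


def noun_profiles(triples):
    """Map each noun to a Counter of the (verb, relation) contexts it occurs in."""
    profiles = defaultdict(Counter)
    for verb, relation, noun in triples:
        profiles[noun][(verb, relation)] += 1
    return profiles


def cosine(a, b):
    """Cosine similarity between two context-count profiles.

    Assumption: a simple stand-in for illustration, not the paper's metric.
    """
    dot = sum(a[key] * b[key] for key in set(a) & set(b))
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


profiles = noun_profiles(TRIPLES)
wine_beer = cosine(profiles["wine"], profiles["beer"])
wine_tree = cosine(profiles["wine"], profiles["tree"])
```

On this toy data, wine and beer share two verb contexts (drink, sell) while wine and tree share only one (sell), so `wine_beer` comes out higher than `wine_tree`. Raw counts over-reward very frequent verbs, which is one reason a real system would weight contexts rather than count them directly.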
