TAILIEUCHUNG - Báo cáo khoa học: "EXTRACTING SEMANTIC HIERARCHIES FROM A LARGE ON-LINE DICTIONARY"

The goal of this research is to extract semantic information from standard dictionary definitions, for use in constructing lexicons for natural language processing systems. Although dictionaries contain finely detailed semantic knowledge, the systematic organization of that knowledge has not heretofore been exploited in such a way as to make the information available for computer applications. | EXTRACTING SEMANTIC HIERARCHIES FROM A LARGE ON-LINE DICTIONARY Martin s. Chodorow Department of Psychology Hunter College of CUNY and . Thomas J. Watson Research Center Yorktown Heights New York 10598 Roy J. Byrd George E. Heidom . Thomas J. Watson Research Center Yorktown Heights New York 10598 ABSTRACT 1. Introduction. Dictionaries are rich sources of detailed semantic information. but in order to use the information for natural language processing it must be organized systematically. This paper describes automatic and semi-automatic procedures for extracting and organizing semantic feature information implicit in dictionary definitions. Two head-finding heuristics are described for locating the genus terms in noun and verb definitions. The assumption is that the genus term represents inherent features of the word it defines. The two heuristics have been used to process definitions of 40 000 nouns and 8 000 verbs producing indexes in which each genus term is associated with the words it defined. The Sprout program interactively grows a taxonomic tree from any specified root feature by consulting the genus index. Its output is a tree in which all of the nodes have the root feature for at least one of their senses. The Filter program uses an inverted form of the genus index. Filtering begins with an initial filter file consisting of words that have a given feature . human in all of their senses. The program then locates in the index words whose genus terms all appear in the filter file. The output is a list of new words that have the given feature in all of theứ senses. The goal of this research is to extract semantic information from standard dictionary definitions for use in constructing lexicons for natural language processing systems. Although dictionaries contain finely detailed semantic knowledge the systematic organization of that knowledge has not heretofore been exploited in such a way as to make the information available for computer .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.