TAILIEUCHUNG - Báo cáo khoa học: "Building Accurate Semantic Taxonomies from Monolingual MRDs"

Our aim is to use conventional MRDs, with no explicit semantic coding, to obtain a comparable accuracy. The system we propose is capable of 1) performing fully automatic extraction (with a counterpart in terms of both recall and precision fall) of taxonomic links of dictionary senses and 2) ranking the extracted relations in a w a y that selective manual refinement is allowed. | Building Accurate Semantic Taxonomies from Monolingual MRDs German Rigau and Horacio Rodríguez Departament de LSI. Universitat Politècnica de Catalunya. Barcelona. Catalonia. horacio @ Abstract This paper presents a method that conbines a set of unsupervised algorithms in order to accurately build large taxonomies from any machine-readable dictionary MRD . Our aim is to profit from conventional MRDs with no explicit semantic coding. We propose a system that 1 performs fully automatic extraction of taxonomic links from MRD entries and 2 ranks the extracted relations in a way that selective manual refinement is allowed. Tested accuracy can reach around 100 depending on the degree of coverage selected showing that taxonomy building is not limited to structured dictionaries such as LDOCE. 1 Introduction There is no doubt about the increasing need of owning accurate and broad coverage general lexical semantic resources for developing NL applications. These resources include Lexicons Lexical Databases Lexical Knowledge Bases LKBs Ontologies etc. Many researchers believe that for effective NLP it is necessary to build a LKB which contain class subclass relations and mechanisms for the inheritance of properties as well as other inferences. The work presented here attempts to lay out some solutions to overcome or alleviate the lexical bottleneck problem Briscoe 91 providing a methodology to build large scale LKBs from conventional dictionaries in any language. Starting with the seminal work of Amsler 81 many systems have followed this approach . Bruce et al. 92 Richardson 97 . Why should we propose another one Regarding the resources used we must point out that most of the systems built until now refer to English only and use rather rich well structured controlled and explicitly semantically coded dictionaries . LDOCE 87 . This is not the case for most of the available sources for languages Eneko Agirre Lengoia eta Informatikoak saila. Euskal Erriko

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.