TAILIEUCHUNG - Báo cáo khoa học: "Man* vs. Machine: A Case Study in Base Noun Phrase Learning"

A great deal of work has been done demonstrating the ability of machine learning algorithms to automatically extract linguistic knowledge from annotated corpora. Very little work has gone into quantifying the difference in ability at this task between a person and a machine. This paper is a first step in that direction. | Man vs. Machine A Case Study in Base Noun Phrase Learning Eric Brill and Grace Ngai Department of Computer Science The Johns Hopkins University Baltimore MD 21218 USA Email brill gyn Abstract A great deal of work has been done demonstrating the ability of machine learning algorithms to automatically extract linguistic knowledge from annotated corpora. Very little work has gone into quantifying the difference in ability at this task between a person and a machine. This paper is a first step in that direction. 1 Introduction Machine learning has been very successful at solving many problems in the field of natural language processing. It has been amply demonstrated that a wide assortment of machine learning algorithms are quite effective at extracting linguistic information from manually annotated corpora. Among the machine learning algorithms studied rule based systems have proven effective on many natural language processing tasks including part-of-speech tagging Brill 1995 Ramshaw and Marcus 1994 spelling correction Mangu and Brill 1997 word-sense disambiguation Gale et al. 1992 message understanding Day et al. 1997 discourse tagging Samuel et al. 1998 accent restoration Yarowsky 1994 prepositional-phrase attachment Brill and Resnik 1994 and base noun phrase identification Ramshaw and Marcus In Press Cardie and Pierce 1998 Veenstra 1998 Argamon et al. 1998 . Many of these rule based systems learn a short list of simple rules typically on the order of 50-300 which are easily understood by humans. Since these rule-based systems achieve good performance while learning a small list of simple rules it raises the question of whether peo- and Woman. pie could also derive an effective rule list manually from an annotated corpus. In this paper we explore how quickly and effectively relatively untrained people can extract linguistic generalities from a corpus as compared to a machine. There are a number of reasons for doing this. We would like to understand the

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.