TAILIEUCHUNG - Báo cáo khoa học: "Guiding Semi-Supervision with Constraint-Driven Learning"

Over the last few years, two of the main research directions in machine learning of natural language processing have been the study of semi-supervised learning algorithms as a way to train classifiers when the labeled data is scarce, and the study of ways to exploit knowledge and global information in structured learning tasks. In this paper, we suggest a method for incorporating domain knowledge in semi-supervised learning algorithms. Our novel framework unifies and can exploit several kinds of task specific constraints. . | Guiding Semi-Supervision with Constraint-Driven Learning Ming-Wei Chang Lev Ratinov Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign Urbana IL 61801 mchang21 ratinov2 danr @ Abstract Over the last few years two of the main research directions in machine learning of natural language processing have been the study of semi-supervised learning algorithms as a way to train classihers when the labeled data is scarce and the study of ways to exploit knowledge and global information in structured learning tasks. In this paper we suggest a method for incorporating domain knowledge in semi-supervised learning algorithms. Our novel framework unihes and can exploit several kinds of task specific constraints. The experimental results presented in the information extraction domain demonstrate that applying constraints helps the model to generate better feedback during learning and hence the framework allows for high performance learning with significantly less training data than was possible before on these tasks. 1 Introduction Natural Language Processing NLP systems typically require large amounts of knowledge to achieve good performance. Acquiring labeled data is a dif-hcult and expensive task. Therefore an increasing attention has been recently given to semi-supervised learning where large amounts of unlabeled data are used to improve the models learned from a small training set Collins and Singer 1999 Thelen and Riloff 2002 . The hope is that semi-supervised or even unsupervised approaches when given enough 280 knowledge about the structure of the problem will be competitive with the supervised models trained on large training sets. However in the general case semi-supervised approaches give mixed results and sometimes even degrade the model performance Nigam et al. 2000 . In many cases improving semi-supervised models was done by seeding these models with domain information taken from dictionaries or ontology Cohen and Sarawagi .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.