TAILIEUCHUNG - Báo cáo khoa học: "Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification"

Automatic sentiment classification has been extensively studied and applied in recent years. However, sentiment is expressed differently in different domains, and annotating corpora for every possible domain of interest is impractical. We investigate domain adaptation for sentiment classifiers, focusing on online reviews for different types of products. First, we extend to sentiment classification the recently-proposed structural correspondence learning (SCL) algorithm, reducing the relative error due to adaptation between domains by an average of 30% over the original SCL algorithm and 46% over a supervised baseline. . | Biographies Bollywood Boom-boxes and Blenders Domain Adaptation for Sentiment Classification John Blitzer Mark Dredze Fernando Pereira Department of Computer and Information Science University of Pennsylvania blitzer mdredze pereria@ Abstract Automatic sentiment classification has been extensively studied and applied in recent years. However sentiment is expressed differently in different domains and annotating corpora for every possible domain of interest is impractical. We investigate domain adaptation for sentiment classifiers focusing on online reviews for different types of products. First we extend to sentiment classification the recently-proposed structural correspondence learning SCL algorithm reducing the relative error due to adaptation between domains by an average of 30 over the original SCL algorithm and 46 over a supervised baseline. Second we identify a measure of domain similarity that correlates well with the potential for adaptation of a classifier from one domain to another. This measure could for instance be used to select a small set of domains to annotate whose trained classifiers would transfer well to many other domains. 1 Introduction Sentiment detection and classification has received considerable attention recently Pang et al. 2002 Turney 2002 Goldberg and Zhu 2004 . While movie reviews have been the most studied domain sentiment analysis has extended to a number of new domains ranging from stock message boards to congressional floor debates Das and Chen 2001 Thomas et al. 2006 . Research results have been 440 deployed industrially in systems that gauge market reaction and summarize opinion from Web pages discussion boards and blogs. With such widely-varying domains researchers and engineers who build sentiment classification systems need to collect and curate data for each new domain they encounter. Even in the case of market analysis if automatic sentiment classification were to be used across a wide range of domains the .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.