Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
the sentiment words are expressed. Extracting the topic lexicon from a specific domain is important because users not only care about the overall sentiment polarity of a review but also care about which aspects are mentioned in review. Note that, similar to sentiment lexicons, different domains may have very different topic lexicons. | Cross-Domain Co-Extraction of Sentiment and Topic Lexicons Fangtao Li Sinno Jialin Pan Ou Jin Qiang Yang and Xiaoyan Zhu Department of Computer Science and Technology Tsinghua University Beijing China fangtao06@gmail.com zxy-dcs@tsinghua.edu.cn Institute for Infocomm Research Singapore tjspan@i2r.a-star.edu.sg Hong Kong University of Science and Technology Hong Kong China Í kingomiga@gmail.com qyang@cse.ust.hk Abstract Extracting sentiment and topic lexicons is important for opinion mining. Previous works have showed that supervised learning methods are superior for this task. However the performance of supervised methods highly relies on manually labeled training data. In this paper we propose a domain adaptation framework for sentiment- and topic- lexicon co-extraction in a domain of interest where we do not require any labeled data but have lots of labeled data in another related domain. The framework is twofold. In the first step we generate a few high-confidence sentiment and topic seeds in the target domain. In the second step we propose a novel Relational Adaptive bootstraPping RAP algorithm to expand the seeds in the target domain by exploiting the labeled source domain data and the relationships between topic and sentiment words. Experimental results show that our domain adaptation framework can extract precise lexicons in the target domain without any annotation. 1 Introduction In the past few years opinion mining and sentiment analysis have attracted much attention in Natural Language Processing NLP and Information Retrieval IR Pang and Lee 2008 Liu 2010 . Sentiment lexicon construction and topic lexicon extraction are two fundamental subtasks for opinion mining Qiu et al. 2009 . A sentiment lexicon is a list of sentiment expressions which are used to indicate sentiment polarity e.g. positive or negative . The sentiment lexicon is domain dependent as users may use different sentiment words to express their opinion in different domains e.g. different .