Scientific report: "Generating a Non-English Subjectivity Lexicon: Relations That Matter"

Valentin Jijkoun and Katja Hofmann
ISLA, University of Amsterdam, Amsterdam, The Netherlands
jijkoun @

Abstract

We describe a method for creating a non-English subjectivity lexicon based on an English lexicon, an online translation service, and a general-purpose thesaurus: Wordnet. We use a PageRank-like algorithm to bootstrap from the translation of the English lexicon and rank the words in the thesaurus by polarity using the network of lexical relations in Wordnet. We apply our method to the Dutch language. The best results are achieved when using synonymy and antonymy relations only, and ranking positive and negative words simultaneously. Our method achieves an accuracy of [...] at the top 3,000 negative words and [...] at the top 3,000 positive words.

1 Introduction

One of the key tasks in subjectivity analysis is the automatic detection of subjective (as opposed to objective, factual) statements in written documents (Mihalcea and Liu, 2006). This task is essential for applications such as online marketing research, where companies want to know what customers say about the companies, their products, and specific products' features, and whether the comments made are positive or negative. Another application is in political research, where public opinion could be assessed by analyzing user-generated online data (blogs, discussion forums, etc.).

Most current methods for subjectivity identification rely on subjectivity lexicons, which list words that are usually associated with positive or negative sentiments or opinions (i.e., words with polarity). Such a lexicon can be used, e.g., to classify individual sentences or phrases as subjective or not, and as bearing positive or negative sentiments (Pang et al., 2002; Kim and Hovy, 2004; Wilson et al., 2005a). For English, manually created subjectivity lexicons have been available for a while, but for many other languages such resources are still missing. We describe a [...]
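
The ranking step summarized in the abstract (bootstrapping from translated seed words and propagating polarity over synonymy and antonymy links, with positive and negative words ranked simultaneously) can be illustrated with a small sketch. This is not the authors' implementation: the function name `rank_polarity`, the update rule, the damping factor, and the four-word Dutch toy graph are all assumptions chosen for illustration. Synonym edges pass a neighbor's score on unchanged, antonym edges flip its sign, and seed words are re-injected at every iteration in the manner of personalized PageRank.

```python
from collections import defaultdict


def rank_polarity(edges, seeds, damping=0.85, iterations=50):
    """Propagate seed polarity over a signed word graph (illustrative only).

    edges: iterable of (word_a, word_b, sign), where sign is +1 for a
           synonymy link and -1 for an antonymy link (hypothetical format).
    seeds: dict mapping a word to its seed polarity in [-1, 1], e.g. taken
           from a translated English lexicon.
    Returns a dict mapping every word to a signed polarity score.
    """
    neighbors = defaultdict(list)
    for a, b, sign in edges:
        # Lexical relations are treated as undirected here (an assumption).
        neighbors[a].append((b, sign))
        neighbors[b].append((a, sign))

    words = set(neighbors) | set(seeds)
    score = {w: seeds.get(w, 0.0) for w in words}

    for _ in range(iterations):
        updated = {}
        for w in words:
            # Each neighbor splits its score evenly over its own links;
            # an antonym link contributes the score with the sign flipped.
            spread = sum(
                sign * score[n] / len(neighbors[n])
                for n, sign in neighbors.get(w, [])
            )
            updated[w] = (1 - damping) * seeds.get(w, 0.0) + damping * spread
        score = updated
    return score


# Toy data, invented for this example: one positive seed word.
toy_edges = [
    ("goed", "prima", +1),     # synonyms
    ("goed", "slecht", -1),    # antonyms
    ("slecht", "beroerd", +1), # synonyms
]
toy_seeds = {"goed": 1.0}

scores = rank_polarity(toy_edges, toy_seeds)
for word in sorted(scores, key=scores.get, reverse=True):
    print(f"{word:10s} {scores[word]:+.3f}")
```

Sorting the resulting scores in descending order gives a ranking of candidate positive words, and sorting in ascending order gives the candidate negative words, so a single run produces both lists. With the toy graph, the seed "goed" and its synonym "prima" receive positive scores, while "slecht" and "beroerd" receive negative ones, even though no negative seed was supplied.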
