TAILIEUCHUNG - Scientific report: "Classifying author personality from weblog text"

Whose thumb is it anyway? Classifying author personality from weblog text

Jon Oberlander, School of Informatics, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW
Scott Nowson, School of Informatics, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW

Abstract

We report initial results on the relatively novel task of automatic classification of author personality. Using a corpus of personal weblogs, or ‘blogs’, we investigate the accuracy that can be achieved when classifying authors on four important personality traits. We explore both binary and multiple classification, using differing sets of n-gram features. Results are promising for all four traits examined.

1 Introduction

There is now considerable interest in affective language processing. Work focusses on analysing subjective features of text or speech, such as sentiment, opinion, emotion or point of view (Pang et al., 2002; Turney, 2002; Dave et al., 2003; Liu et al., 2003; Pang and Lee, 2005; Shanahan et al., 2005). Discussing affective computing in general, Picard (1997) notes that phenomena vary in duration, ranging from short-lived feelings, through emotions, to moods, and ultimately to long-lived, slowly-changing personality characteristics. Within computational linguistics, most work has focussed on sentiment and opinion concerning specific entities or events, and on binary classifications of these.
For instance, both Pang and Lee (2002) and Turney (2002) consider the thumbs up/thumbs down decision: is a film review positive or negative? However, Pang and Lee (2005) point out that ranking items or comparing reviews will benefit from finer-grained classifications over multiple ordered classes: is a film review two-, three- or four-star? At the same time, some work now considers longer-term affective states. For example, Mishne (2005) aims to classify the primary mood of weblog postings; the study encompasses both fine-grained, but non-ordered, multiple classification (frustrated, loved, etc.) ...
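
As a rough illustration of the terms used above, the following Python sketch shows word n-gram counting together with the difference between a binary decision and a classification over multiple ordered classes. It is not the authors' method: the function names, the whitespace tokenisation, the thresholds and the star labels are all hypothetical choices made for the example.

```python
from collections import Counter


def word_ngrams(text, n):
    """Count word n-grams (as tuples) in lowercased, whitespace-tokenised text.

    Whitespace tokenisation is an assumption made for this sketch.
    """
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def binary_label(score, threshold):
    """Binary classification target: 'high' above the threshold, else 'low'.

    The threshold is a hypothetical parameter, not a value from the paper.
    """
    return "high" if score > threshold else "low"


def ordered_label(score, cutoffs, labels):
    """Map a score onto multiple ordered classes.

    `cutoffs` are ascending upper boundaries and `labels` has one more
    entry than `cutoffs`; both are hypothetical inputs for illustration.
    """
    for cutoff, label in zip(cutoffs, labels):
        if score <= cutoff:
            return label
    return labels[-1]


if __name__ == "__main__":
    print(word_ngrams("is a film review positive or negative", 2).most_common(3))
    print(binary_label(0.7, threshold=0.5))
    print(ordered_label(3.2, [2.5, 3.5], ["two-star", "three-star", "four-star"]))
```

The binary function corresponds to a thumbs up/thumbs down style decision, while the ordered function corresponds to the two-, three- or four-star case, where the classes have a natural ranking.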
