TAILIEUCHUNG - Báo cáo khoa học: "Kinds of Features for Chinese Opinionated Information Retrieval"

This paper presents the results of experiments in which we tested different kinds of features for retrieval of Chinese opinionated texts. We assume that the task of retrieval of opinionated texts (OIR) can be regarded as a subtask of general IR, but with some distinct features. The experiments showed that the best results were obtained from the combination of character-based processing, dictionary look up (maximum matching) and a negation check. | Kinds of Features for Chinese Opinionated Information Retrieval Taras Zagibalov Department of Informatics University of Sussex United Kingdom Abstract This paper presents the results of experiments in which we tested different kinds of features for retrieval of Chinese opinionated texts. We assume that the task of retrieval of opinionated texts OIR can be regarded as a subtask of general IR but with some distinct features. The experiments showed that the best results were obtained from the combination of character-based processing dictionary look up maximum matching and a negation check. 1 Introduction The extraction of opinionated information has recently become an important research topic. Business and governmental institutions often need to have information about how their products or actions are perceived by people. Individuals may be interested in other people s opinions on various topics ranging from political events to consumer products. At the same time globalization has made the whole world smaller and a notion of the world as a global village does not surprise people nowadays. In this context we assume information in Chinese to be of particular interest as the Chinese world the mainland China Taiwan Hong Kong Singapore and numerous Chinese communities all over the world is getting more and more influential over the world economy and politics. We therefore believe that a system capable of providing access to opinionated information in other languages especially in Chinese might be of great use for individuals as well as for institutions in volved in international trade or international relations. The sentiment classification experiments presented in this paper were done in the context of Opinionated Information Retrieval which is planned to be a module in a Cross-Language Opinion Extraction system CLOE . The main goal of this system is to provide access to opinionated information on any topic ad-hoc in a language different to the .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.