TAILIEUCHUNG - Báo cáo khoa học: "Hierarchical Text Classification with Latent Concepts"

Recently, hierarchical text classification has become an active research topic. The essential idea is that the descendant classes can share the information of the ancestor classes in a predefined taxonomy. In this paper, we claim that each class has several latent concepts and its subclasses share information with these different concepts respectively. Then, we propose a variant Passive-Aggressive (PA) algorithm for hierarchical text classification with latent concepts. | Hierarchical Text Classification with Latent Concepts Xipeng Qiu Xuanjing Huang Zhao Liu and Jinlong Zhou School of Computer Science Fudan University xpqiu xjhuang @ abc9703 @ Abstract Recently hierarchical text classification has become an active research topic. The essential idea is that the descendant classes can share the information of the ancestor classes in a predefined taxonomy. In this paper we claim that each class has several latent concepts and its subclasses share information with these different concepts respectively. Then we propose a variant Passive-Aggressive PA algorithm for hierarchical text classification with latent concepts. Experimental results show that the performance of our algorithm is competitive with the recently proposed hierarchical classification algorithms. 1 Introduction Text classification is a crucial and well-proven method for organizing the collection of large scale documents. The predefined categories are formed by different criterions . Entertainment Sports and Education in news classification Junk Email and Ordinary Email in email classification. In the literature many algorithms Sebastiani 2002 Yang and Liu 1999 Yang and Pedersen 1997 have been proposed such as Support Vector Machines SVM k-Nearest Neighbor kNN Naive Bayes NB and so on. Empirical evaluations have shown that most of these methods are quite effective in traditional text classification applications. In past serval years hierarchical text classification has become an active research topic in database area Koller and Sahami 1997 Weigend et al. 1999 and machine learning area Rousu et al. 2006 Cai and Hofmann 2007 . Different with traditional classification the document collections are organized 598 as hierarchical class structure in many application fields web taxonomies . the Yahoo Directory http and the Open Directory Project ODP http email folders and product catalogs. The approaches of hierarchical .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.