TAILIEUCHUNG - An enhanced learning topic model for exploiting document relationships using classification and regression

This model identifies underlying semantic structure of a document collection from word to topic and topic to document quickly, effectively without human intervention. Initially, this model creates the topics based on Dirichlet Distribution using vector space model. | ISSN: 2249-5789 Adapala Spurthi, International Journal of Computer Science & Communication Networks,Vol 8(5),46-49 AN ENHANCED LEARNING TOPIC MODEL FOR EXPLOITING DOCUMENT RELATIONSHIPS USING CLASSIFICATION AND REGRESSION ADAPALA SPURTHI M. Tech, Department of CSE, Shri Vishnu Engineering College for Women (A), Vishnupur, Bhimavaram, West Godavari District, Andhra Pradesh Abstract- Massive influx of information technology produces high volumes of electronic documents and this repository has become potentially high dimensional, complex, and diversified in nature. The growing need of analyzing documents, discovering Latent data and identifying relationships among the documents in such large repositories, got the attention of present researchers towards the most powerful era of topic modeling. As the data in the repository growing dynamically and determined by higher variances, the conventional techniques modeled on Multinomial distributions, Dirichlet distributions in the topic models are limited in performing tasks like organizing, searching, and indexing. In handling such data and overcoming the limitations of previous techniques it is necessary to use the most popular method Latent Dirichlet Allocation (LDA) that can find the relationships among the data by representing as generative probabilistic distributions over latent topics. Towards this, the proposed work namely “AN Enhanced Learning Topic Model for Exploiting Document Relationships Using Classification and Regression” which is designed based on the principles of generative probabilistic topic models. This model identifies underlying semantic structure of a document collection from word to topic and topic to document quickly, effectively without human intervention. Initially, this model creates the topics based on Dirichlet Distribution using vector space model. Subsequently, the proposed model chooses the suitable annotators by distributing topics for each document to conduct fundamental validation in .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.