Đang chuẩn bị liên kết để tải về tài liệu:
Chapter 6: Unsupervised Learning – Clustering

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ

Chapter 6: Unsupervised Learning – Clustering Introduction to unsupervised learning and clustering, Partitional clustering (k-Means algorithm), Hierarchical clustering, Expectation Maximization (EM) algorithm, Incremental Clustering. | Chapter 6 Unsupervised Learning – Clustering Assoc. Prof. Dr. Duong Tuan Anh Faculty of Computer Science and Engineering, HCMC Univ. of Technology 3/2015 Outline 1 Introduction to unsupervised learning and clustering 2 Partitional clustering (k-Means algorithm) 3 Hierarchical clustering 4 Expectation Maximization (EM) algorithm 5 Incremental Clustering 1. Introduction to clustering Clustering is the process of grouping a set of patterns. It generates a partition consisting of groups or clusters from a given collection of patterns. Representations or descriptions of the clusters formed are used in decision making – classification, prediction, outlier detection. A clustering-based classification scheme is very useful in solving large-scale pattern classification problems in data mining. Patterns to be clustered are either labeled or unlabeled. We have: Clustering algorithms which group sets of unlabeled patterns. These types of approaches are so popular that clustering is viewed as | Chapter 6 Unsupervised Learning – Clustering Assoc. Prof. Dr. Duong Tuan Anh Faculty of Computer Science and Engineering, HCMC Univ. of Technology 3/2015 Outline 1 Introduction to unsupervised learning and clustering 2 Partitional clustering (k-Means algorithm) 3 Hierarchical clustering 4 Expectation Maximization (EM) algorithm 5 Incremental Clustering 1. Introduction to clustering Clustering is the process of grouping a set of patterns. It generates a partition consisting of groups or clusters from a given collection of patterns. Representations or descriptions of the clusters formed are used in decision making – classification, prediction, outlier detection. A clustering-based classification scheme is very useful in solving large-scale pattern classification problems in data mining. Patterns to be clustered are either labeled or unlabeled. We have: Clustering algorithms which group sets of unlabeled patterns. These types of approaches are so popular that clustering is viewed as an unsupervised learning of unlabeled patterns. Algorithms which cluster labeled patterns. These types of approaches are practically important and are called supervised clustering. Supervised clustering is helpful in identifying clusters within collections of labeled patterns. Abstractions in the form of cluster representatives/ descriptions which are useful for efficient classification (e.g., in data reduction for classification). Clustering The process of clustering is carried out so that patterns in the same cluster are similar in some sense and patterns in different clusters are dissimilar in a corresponding sense. Figure 6.1 The Euclidean distance between any two points characterizes similarity: intra-cluster distance is small and inter-cluster distance is large. Centroid and medoid Clustering is useful for generating data abstraction. A cluster of points is represented by its centroid or its medoid. A centroid stands for the sample mean of the points in cluster C; it is .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.