Đang chuẩn bị liên kết để tải về tài liệu:
A Fast Parallel Algorithm for Discovering Frequent Patterns

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ

Department of Computer Science and Information Engineering National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan, R.O.C. kim-x@yahoo.com.tw [2] (Apriori-like) approach and 2) the frequent pattern growth approach [6] (FP-growth-like). The Apriori-like methods iteratively generate candidate itemset of size (k+1) from frequent itemset of size k and scan the database repetitively to test the frequency of each candidate itemset. | A Fast Parallel Algorithm for Discovering Frequent Patterns Kawuu w. Lin Department of Computer Science and Information Engineering National Kaohsiung University of Applied Sciences Kaohsiung Taiwan R.o.c. linwc@cc.kuas.edu.tw Abstract Fast discovery of frequent patterns is the most extensively discussed problem in data mining fields due to its wide applications. As the size of database increases the computation time and the required memory increase severely. The difficulty of mining large database launched the research of designing parallel and distributed algorithms to solve the problem. Most of the past studies tried to parallelize the computation by dividing the database and distribute the divided database to other nodes for mining. This approach might leak data out and evidently is not suitable to be applied to sensitive domains like health-care. In this paper we propose a novel data mining algorithm named FD-Mine that is able to efficiently utilize the nodes to discover frequent patterns in cloud computing environments with data privacy preserved. Through empirical evaluations on various simulation conditions the proposed FD-Mine delivers excellent performance in terms of scalability and execution time. Keywords Data mining cloud computing association rule mining frequent pattern mining privacy preserved I. Introduction With the progress of information technology data mining techniques have been extensively applied to many applications in various domains. The goal of data mining is to discover the hidden useful information from large databases. The discovered information could help the decision processes aid the commercial promotion and so forth. The data mining includes four main topics association rule mining 2 sequential pattern mining 3 clustering 11 and classification 5 . Among the data mining studies the problem of frequent pattern mining i.e. association rule mining and sequential pattern mining is mostly discussed due to its wide applications. The .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.