TAILIEUCHUNG - Data Streams Models and Algorithms- P3

Data Streams Models and Algorithms- P3: In recent years, the progress in hardware technology has made it possible for organizations to store and record large streams of transactional data. Such data sets which continuously and rapidly grow over time are referred to as data streams. In addition, the development of sensor technology has resulted in the possibility of monitoring many events in real time. | 44 DATA STREAMS MODELSAND ALGORITHMS Technique Definition Pros Cons Sampling Choosing a data subset for analysis Error Bounds Guaranteed Poor for anomaly detection Load Shedding Ignoring a chunk of data Efficient for queries Very poor for anomaly detection Sketching Random projection on feature set Extremely Efficient May ignore Relevant features Synopsis Structure Quick Transformation Analysis Task Independent Not sufficient for very fast stream Aggregation Compiling summary statistics Analysis Task Independent May ignore Relevant features Table . Data Based Techniques Technique Definition Pros Cons Approximation Algorithms Algorithms with Error Bounds Efficient Resource adaptivity with data rates not always possible Sliding Window Analyzing most recent streams General Ignores part of stream Algorithm Output Granularity Highly Resource aware technique with memory and fluctuating data rates General Cost overhead of resource aware component Table . Task Based Techniques the task-based techniques. Each table provides a definition advantages and disadvantages of each technique. While the methods in Tables and provide an overview of the broad methods which can be used to adapt conventional methods to classification it is more useful to study specific techniques which are expressly designed for the purpose of classification. In the next section we will provide a review of these methods. 4. Classification Techniques This section reviews the state-of-the-art of data stream classification techniques. We have provided an overview of some of the key methods how well they address the research problems discussed earlier. lease purchase PDF Split-Merge on to remove this watermark. A Survey of Classification Methods in Data Streams 45 Ensemble Based Classification Wang et al. 30 have proposed a generic framework for mining concept drifting data streams. The framework is based on the observation that many data stream mining algorithms do not .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.