TAILIEUCHUNG - Data Streams Models and Algorithms- P6

Data Streams Models and Algorithms- P6: In recent years, the progress in hardware technology has made it possible for organizations to store and record large streams of transactional data. Such data sets which continuously and rapidly grow over time are referred to as data streams. In addition, the development of sensor technology has resulted in the possibility of monitoring many events in real time. | 136 DATA STREAMS MODELSAND ALGORITHMS Placement of Load Shedders. For now assume that we have guessed the right value of emax so that we know the exact effective sampling rate Pi for each query. In fact this assumption is unnecessary as we will explain below. Then our task is reduced to solving the following problem Given a data flow diagram along with a set of target effective sampling rates Pi for each query q modify the diagram by inserting load shedding operators and set their sampling rates so that the effective sampling rate for each query qi is equal to Pi and the total processing time is minimized. If there is no sharing of operators among queries it is straightforward to see that the optimal solution is to introduce a load shedder with sampling rate Pi Pi before the first operator in the query path for each query qi. Introducing a load shedder as early in the query path as possible reduces the effective input rate for all downstream operators and conforms to the general query optimization principle of pushing selection conditions down. Introducing load shedders and setting their sampling rates is more complicated when there is sharing among query plans. Suppose that two queries qi and Q2 share the first portion of their query paths but have different effective sampling rate targets Pi and P2. Since a load shedder placed at the shared beginning of the query path will affect the effective sampling rates for both queries it is not immediately clear how to simultaneously achieve both effective sampling rate targets in the most efficient manner though clearly any solution will necessarily involve the introduction of load shedding at intermediate points in the query paths. We will define a shared segment in the data flow diagram as follows Suppose we label each operator with the set of all queries that contain the operator in their query paths. Then the set of all operators having the same label is a shared segment. Observation In the optimal solution load .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.