Load shedding for multi-way stream joins based on arrival order patterns
Citations

WEB OF SCIENCE

4
Citations

SCOPUS

8

초록

We address the problem of load shedding for continuous multi-way join queries over multiple data streams. When the arrival rates of tuples from data streams exceed the system capacity, a load shedding algorithm drops some subset of input tuples to avoid system overloads. To decide which tuples to drop among the input tuples, most existing load shedding algorithms determine the priority of each input tuple based on the frequency or some historical statistics of its join attribute value, and then drop tuples with the lowest priority. However, those value-based algorithms cannot determine the priorities of tuples properly in environments where join attribute values are unique and each join attribute value occurs at most once in each data stream. In this paper, we propose a load shedding algorithm specifically designed for such environments. The proposed load shedding algorithm determines the priority of each tuple based on the order of streams in which its join attribute value appears, rather than its join attribute value itself. Consequently, the priorities of tuples can be determined effectively in environments where join attribute values are unique and do not repeat. The experimental results show that the proposed algorithm outperforms the existing algorithms in such environments in terms of effectiveness and efficiency.

키워드

Load sheddingMulti-way stream joinStream processingArrival order patternALGORITHM
제목
Load shedding for multi-way stream joins based on arrival order patterns
저자
Kwon, Tae-HyungLee, Ki YongKim, Myoung Ho
DOI
10.1007/s10844-010-0138-z
발행일
2011-10
유형
Article
저널명
Journal of Intelligent Information Systems
37
2
페이지
245 ~ 265