TY - GEN
T1 - Data vitalization
T2 - 16th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2010
AU - Xiong, Zhang
AU - Luo, Wuman
AU - Chen, Lei
AU - Ni, Lionel M.
PY - 2010
Y1 - 2010
N2 - Nowadays, datasets grow enormously both in size and complexity. One of the key issues confronted by large-scale dataset analysis is how to adapt systems to new, unprecedented query loads. Existing systems nail down the data organization scheme once and for all at the beginning of the system design, thus inevitably will see the performance goes down when user requirements change. In this paper, we propose a new paradigm, Data Vitalization, for large-scale dataset analysis. Our goal is to enable high flexibility such that the system is adaptive to complex analytical applications. Specifically, data are organized into a group of vitalized cells, each of which is a collection of data coupled with computing power. As user requirements change over time, cells evolve spontaneously to meet the potential new query loads. Besides basic functionality of Data Vitalization, we also explore an envisioned architecture of Data Vitalization including possible approaches for query processing, data evolution, as well as its tight-coupled mechanism for data storage and computing.
AB - Nowadays, datasets grow enormously both in size and complexity. One of the key issues confronted by large-scale dataset analysis is how to adapt systems to new, unprecedented query loads. Existing systems nail down the data organization scheme once and for all at the beginning of the system design, thus inevitably will see the performance goes down when user requirements change. In this paper, we propose a new paradigm, Data Vitalization, for large-scale dataset analysis. Our goal is to enable high flexibility such that the system is adaptive to complex analytical applications. Specifically, data are organized into a group of vitalized cells, each of which is a collection of data coupled with computing power. As user requirements change over time, cells evolve spontaneously to meet the potential new query loads. Besides basic functionality of Data Vitalization, we also explore an envisioned architecture of Data Vitalization including possible approaches for query processing, data evolution, as well as its tight-coupled mechanism for data storage and computing.
KW - Data analysis
KW - Data vitalization
KW - Large-scale dataset
KW - Vitalized data cell
UR - http://www.scopus.com/inward/record.url?scp=79951735976&partnerID=8YFLogxK
U2 - 10.1109/ICPADS.2010.102
DO - 10.1109/ICPADS.2010.102
M3 - Conference contribution
AN - SCOPUS:79951735976
SN - 9780769543079
T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
SP - 251
EP - 258
BT - Proceedings - 16th International Conference on Parallel and Distributed Systems, ICPADS 2010
Y2 - 8 December 2010 through 10 December 2010
ER -