Study on an Ensemble Classification Algorithm for Data Streams with Cloud Computing
-
Abstract
According to comprehensive analysis on data streams classification algorithms and the basic theory of cloud computing,it is proposed an ensemble classification algorithm for data streams running on Hadoop framework,and it takes MapReduce parallel programming model to improve traditional dynamic weight-based ensemble,finally speed up classification efficiency.Results show that the algorithm for high speed massive data stream has much better running efficiency than traditional ensemble algorithm.
-
-