Research and Implementation of Parallel FP-Growth Algorithm Based on Hadoop
-
Abstract
This paper,we propose a load-balanced and parallel FP-Growth algorithm based on Map/Reduce, which evenly parallelizes FP-Growth in the MapReduce approach. LBPFP(Load-Balanced Parallel FP-Growth) adds into PFP(Parallel FP-Growth) the load balance feature and the effectively pruning strategy, which improves parallelization and thereby improves performance. Finally,the experimental result shows the algorithm has good effect in the large data processing. It is proved the feasibility of the algorithm.
-
-