Balanced Parallel Algorithm for Mining Frequent Pattern from Data Stream
Keywords:
Association Rule Mining (ARM), Frequent itemset, FP-growth, Directed Graph, Massive Data Stream, MapReduce, Hadoop, Partitioning data.Abstract
The frequent pattern mining methods play very important role to generate association rules from massive data stream such as include customer click streams, network monitoring data, etc. The continuous, unbounded and high-speed characteristics of massive data stream are a huge challenge for the current frequent pattern mining approach. The complexities related to finding frequent itemset for mining association rules from a massive data stream in this work can be minimized by using modified FP-growth algorithm and parallelizing the mining task with MapReduce technique in Hadoop framework, improves performance by using balanced load technique, which exploits correlations among transactions. In this paper, we introduce (Balanced Parallel Graph Frequent Pattern BPGFP-growth), a modified FP-growth with one-pass scan based on directed graph, Hadoop framework, partitioning and balancing load strategy in order to reduce the execution time for the massive dynamic database and the volume of data exchanged between computational nodes (computers). The algorithm was tested, our experimental results demonstrated that the proposed algorithm could scale well and efficiently process large dynamic datasets. In addition, it achieves improvement in memory consumption to store frequent patterns and time complexity.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 �ttps://creativecommons.org/licenses/by-nc-sa/4.0/

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
The authors retain the copyright and grant the right to publish in the magazine for the first time with the transfer of the commercial right to Tishreen University Journal for Research and Scientific Studies - Engineering Sciences Series
Under a CC BY- NC-SA 04 license that allows others to share the work with of the work's authorship and initial publication in this journal. Authors can use a copy of their articles in their scientific activity, and on their scientific websites, provided that the place of publication is indicted in Tishreen University Journal for Research and Scientific Studies - Engineering Sciences Series . The Readers have the right to send, print and subscribe to the initial version of the article, and the title of Tishreen University Journal for Research and Scientific Studies - Engineering Sciences Series Publisher
journal uses a CC BY-NC-SA license which mean
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material
- The licensor cannot revoke these freedoms as long as you follow the license terms.
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- NonCommercial — You may not use the material for commercial purposes.
- ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.