Towards scalable rough set based attribute subset selection for intrusion detection using parallel genetic algorithm in MapReduce

El Sayed M. El-Alfy*, Mashaan A. Alshammari

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

42 Scopus citations

Abstract

Attribute subset selection based on rough sets is a crucial preprocessing step in data mining and pattern recognition to reduce the modeling complexity. To cope with the new era of big data, new approaches need to be explored to address this problem effectively. In this paper, we review recent work related to attribute subset selection in decision-theoretic rough set models. We also introduce a scalable implementation of a parallel genetic algorithm in Hadoop MapReduce to approximate the minimum reduct which has the same discernibility power as the original attribute set in the decision table. Then, we focus on intrusion detection in computer networks and apply the proposed approach on four datasets with varying characteristics. The results show that the proposed model can be a powerful tool to boost the performance of identifying attributes in the minimum reduct in large-scale decision systems.

Original languageEnglish
Pages (from-to)18-29
Number of pages12
JournalSimulation Modelling Practice and Theory
Volume64
DOIs
StatePublished - May 2016

Bibliographical note

Publisher Copyright:
© 2016

Keywords

  • Attribute subset selection
  • Big data
  • Hybrid methods
  • MapReduce
  • Minimum reduct
  • Parallel genetic algorithms
  • Rough sets

ASJC Scopus subject areas

  • Software
  • Modeling and Simulation
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Towards scalable rough set based attribute subset selection for intrusion detection using parallel genetic algorithm in MapReduce'. Together they form a unique fingerprint.

Cite this