Skip to main navigation Skip to search Skip to main content

A parallel data mining algorithm for pagerank computation

  • Massinissa Saoudi
  • , Massinissa Lounis
  • , Ahcène Bounceur
  • , Reinhardt Euler
  • , Tahar Kechadi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We study the utility of graphics processing units (GPUs) for an acceleration of the data mining PageRank algorithm and a reduction of the memory size of the web graph. We first present a new web graph representation using a compressed format in order to reduce the memory allocation of the web graph. Then, this web graph is simply partitioned into small chunks to be processed on the GPUs' device. The basic steps of the algorithm are then split up into parallel oper- ations allowing to exploit the computing power of GPUs in the CUDA language as best as possible. In the experiments, we have tested the algorithm using GPUs with a set of real web data, and compared the computation with a CPU-based one. The obtained results show that the proposed PageR- ank computation on GPUs outperforms the CPU version by a factor of 100, reducing at the same time the web graph memory storage by 93; 928%.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016
EditorsDjallel Eddine Boubiche, Hani Hamdan, Ahcene Bounceur
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450347792
DOIs
StatePublished - 10 Nov 2016
Externally publishedYes
Event2016 International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016 - Blagoevgrad, Bulgaria
Duration: 10 Nov 201611 Nov 2016

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2016 International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016
Country/TerritoryBulgaria
CityBlagoevgrad
Period10/11/1611/11/16

Bibliographical note

Publisher Copyright:
© 2016 ACM.

Keywords

  • Big Data
  • CUDA
  • Data Mining
  • PageRank
  • Parallel Computation

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'A parallel data mining algorithm for pagerank computation'. Together they form a unique fingerprint.

Cite this