Abstract
We study the utility of graphics processing units (GPUs) for an acceleration of the data mining PageRank algorithm and a reduction of the memory size of the web graph. We first present a new web graph representation using a compressed format in order to reduce the memory allocation of the web graph. Then, this web graph is simply partitioned into small chunks to be processed on the GPUs' device. The basic steps of the algorithm are then split up into parallel oper- ations allowing to exploit the computing power of GPUs in the CUDA language as best as possible. In the experiments, we have tested the algorithm using GPUs with a set of real web data, and compared the computation with a CPU-based one. The obtained results show that the proposed PageR- ank computation on GPUs outperforms the CPU version by a factor of 100, reducing at the same time the web graph memory storage by 93; 928%.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016 |
| Editors | Djallel Eddine Boubiche, Hani Hamdan, Ahcene Bounceur |
| Publisher | Association for Computing Machinery |
| ISBN (Electronic) | 9781450347792 |
| DOIs | |
| State | Published - 10 Nov 2016 |
| Externally published | Yes |
| Event | 2016 International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016 - Blagoevgrad, Bulgaria Duration: 10 Nov 2016 → 11 Nov 2016 |
Publication series
| Name | ACM International Conference Proceeding Series |
|---|
Conference
| Conference | 2016 International Conference on Big Data and Advanced Wireless Technologies, BDAW 2016 |
|---|---|
| Country/Territory | Bulgaria |
| City | Blagoevgrad |
| Period | 10/11/16 → 11/11/16 |
Bibliographical note
Publisher Copyright:© 2016 ACM.
Keywords
- Big Data
- CUDA
- Data Mining
- PageRank
- Parallel Computation
ASJC Scopus subject areas
- Software
- Human-Computer Interaction
- Computer Vision and Pattern Recognition
- Computer Networks and Communications
Fingerprint
Dive into the research topics of 'A parallel data mining algorithm for pagerank computation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver