Abstract
Cache coherence has been studied extensively in the context of chip multiprocessors (CMP). It is well known that conventional directory-based and snooping coherence protocols generate considerable coherence traffic as the number of hardware thread contexts increase. Since GPUs support hundreds or even thousands of threads, conventional coherence mechanisms when applied to GPUs will exacerbate the the bandwidth constraints that GPUs already face. Recognizing this constraint, prior work has proposed time-based coherence protocols. The main idea is to assign a lease period to the accessed cache block, and after the lease expires the cache block is self-invalidated. However, time-based coherence protocols require global synchronized clocks. Furthermore, this approach may increase execution stalls since threads have to wait to access data with an unexpired lease. Tardis is timestamp-based coherence protocol that has been proposed recently to alleviate the need for global clocks in CPUs. This paper builds on this prior work and proposes G-TSC, a novel cache coherence protocol for GPUs that is based on timestamp ordering. G-TSC conducts its coherence transactions in logical time. This work demonstrates the challenges in adopting timestamp coherence for GPUs which support massive thread parallelism and have unique microarchitecture features. This work then presents a number of solutions that tackle GPU-centric challenges. Evaluation of G-TSC implemented in the GPGPU-Sim simulation framework shows that G-TSC outperforms time-based coherence by 38% with release consistency.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 24th IEEE International Symposium on High Performance Computer Architecture, HPCA 2018 |
| Publisher | IEEE Computer Society |
| Pages | 403-415 |
| Number of pages | 13 |
| ISBN (Electronic) | 9781538636596 |
| DOIs | |
| State | Published - 27 Mar 2018 |
| Externally published | Yes |
Publication series
| Name | Proceedings - International Symposium on High-Performance Computer Architecture |
|---|---|
| Volume | 2018-February |
| ISSN (Print) | 1530-0897 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Cache coherence
- GPU
ASJC Scopus subject areas
- Hardware and Architecture