Compiler transformation to improve memory access time in SIMD systems

  • Mayez Al-Mouhamed*
  • , Lubomir Bic
  • , Hussam Abu-Haimed
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

The serialization of memory accesses and network conflicts are two major limiting factors in lock-step parallel memories. We derive conditions for accessing parallel memories which is free of both network and memory conflicts. This applies to accessing arbitrary sets of linear data patterns. We also combine different access patterns (NP-complete) into one single compiler address transformation. The synthesized storage scheme applies to arbitrary linear patterns, arbitrary multistage networks, and arbitrary number of power-of-2 memories. We propose a new heuristic for synthesizing combined XOR-matrices. Performance of optimized storage schemes is presented for sorting and for combining arbitrary sets of power-of-2 patterns.

Original languageEnglish
Pages (from-to)174-178
Number of pages5
JournalParallel Architectures and Compilation Techniques - Conference Proceedings, PACT
StatePublished - 1996

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Compiler transformation to improve memory access time in SIMD systems'. Together they form a unique fingerprint.

Cite this