Effects of loop fusion and statement migration on the speedup of vector multiprocessors

Mayez Al-Mouhamed*, Lubomir Bic

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Vector multiprocessors rely on both spatial and temporal parallelism for achieving significant speedup. For singly nested loops, we study the effect on the speedup of: 1) loop fusion and, 2) increasing the granule-size of parallel-vector loops using extracted statements from scalar loops. The proposed optimizations migrate vector statements from one loop to another, create new loops, and reduce others. Loops and statements that belong to strongly connected data paths are vertically fused, whenever possible, in order to promote chaining and cache/register reuse. To reduce loop synchronization, horizontal fusion is also used for independent loops having compatible dependence types. Finally, vector operations are scheduled based on knowledge of the timing of arithmetic pipelines, load/store operations, and management of the available resource. Testing is carried out using synthetic Fortran programs on the Convex C240 vector multiprocessor. The proposed loop fusion improves the speedup by 18% to 43% over the C240 commercial optimizing compiler. Chaining-oriented scheduling and allocation yields 9% to 15% improvement over the highest optimization option of the C240 compiler.

Original languageEnglish
Pages (from-to)193-202
Number of pages10
JournalUnknown Journal
Issue numberA-50
StatePublished - 1994

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Effects of loop fusion and statement migration on the speedup of vector multiprocessors'. Together they form a unique fingerprint.

Cite this