TY - GEN
T1 - Development of a simultaneously threaded multi-core processor
AU - Zaghloul, Soha S.
AU - Mudawar, Mohamed
AU - Darwish, Mohamed G.
PY - 2005
Y1 - 2005
N2 - Simultaneous Multithreading (SMT) is becoming one of the major trends in the design of future generations of microarchitectures. Its key strength comes from its ability to exploit both thread-level and instruction-level parallelism; it uses hardware resources efficiently. Nevertheless, SMT has its limitations: contention between threads may cause conflicts; lack of scalability, additional pipeline stages, and inefficient handling of long latency operations. Alternatively, Chip Multiprocessors (CMP) are highly scalable and easy to program. On the other hand, they are expensive and suffer from cache coherence and memory consistency problems. This paper proposes a microarchitecture that exploits parallelism at instruction, thread, and processor levels. It merges both concepts of SMT and CMP. Like CMP, multiple cores are used on a single chip. Hardware resources are replicated in each core except for the secondary-level cache which is shared among all cores. The processor applies the SMT technique within each core to make full use of available hardware resources. Moreover, the communication overhead is reduced due to the inter-independence between cores. Results show that the proposed microarchitecrure outperforms both SMT and CMP. In addition, resources are more evenly distributed among running threads.
AB - Simultaneous Multithreading (SMT) is becoming one of the major trends in the design of future generations of microarchitectures. Its key strength comes from its ability to exploit both thread-level and instruction-level parallelism; it uses hardware resources efficiently. Nevertheless, SMT has its limitations: contention between threads may cause conflicts; lack of scalability, additional pipeline stages, and inefficient handling of long latency operations. Alternatively, Chip Multiprocessors (CMP) are highly scalable and easy to program. On the other hand, they are expensive and suffer from cache coherence and memory consistency problems. This paper proposes a microarchitecture that exploits parallelism at instruction, thread, and processor levels. It merges both concepts of SMT and CMP. Like CMP, multiple cores are used on a single chip. Hardware resources are replicated in each core except for the secondary-level cache which is shared among all cores. The processor applies the SMT technique within each core to make full use of available hardware resources. Moreover, the communication overhead is reduced due to the inter-independence between cores. Results show that the proposed microarchitecrure outperforms both SMT and CMP. In addition, resources are more evenly distributed among running threads.
UR - http://www.scopus.com/inward/record.url?scp=33847715967&partnerID=8YFLogxK
U2 - 10.1109/ITICT.2005.1609676
DO - 10.1109/ITICT.2005.1609676
M3 - Conference contribution
AN - SCOPUS:33847715967
SN - 0780392701
SN - 9780780392700
T3 - ITI 3rd International Conference on Information and Communications Technology, ICICT 2005 - Enabling Technologies for the New Knowledge Society
SP - 913
EP - 928
BT - ITI 3rd International Conference on Information and Communications Technology, ICICT 2005 - Enabling Technologies for the New Knowledge Society
ER -