The search functionality is under construction.

IEICE TRANSACTIONS on Information

Technology Scalable Matrix Architecture for Data Parallel Applications

Mostafa SOLIMAN, Stanislav SEDUKHIN

  • Full Text Views

    0

  • Cite this

Summary :

Within a few years it will be possible to integrate a billion transistors on a single chip operating at frequency more than 10 GHz. At this integration level, we propose using a multi-level ISA to express fine-grain data parallelism to hardware instead of using a huge transistor budget to dynamically extract it. Since the fundamental data structures for a wide variety of data parallel applications are scalar, vector, and matrix, our proposed Trident processor extends a scalar ISA with vector and matrix instruction sets to effectively process matrix formulated applications. Like vector architectures, the Trident processor consists of a set of parallel lanes (each lane contains a set of vector pipelines and a slice of register file) combined with a fast scalar core. However, Trident processor can effectively process on the parallel lanes not only vector but also matrix data. One key point of our architecture is the local communication within and across lanes to overcome the limitations of the future VLSI technology. Another key point is the effective execution of a mixture of scalar, vector, and matrix operations. This paper describes the architecture of the Trident processor and evaluates its performance on BLAS and on the standard matrix bidiagonalization algorithm. The last one is evaluated as an example of an entire application based on a mixture of scalar, vector, and matrix operations. Our results show that many data parallel applications, such as scientific, engineering, multimedia, etc., can be speeded up on the Trident processor. Besides, the scalability of the Trident processor does not require more fetch, decode, or issue bandwidth, but requires only replication of parallel lanes.

Publication
IEICE TRANSACTIONS on Information Vol.E86-D No.9 pp.1549-1559
Publication Date
2003/09/01
Publicized
Online ISSN
DOI
Type of Manuscript
Special Section PAPER (Special Issue on Parallel and Distributed Computing, Applications and Technologies)
Category
Networking and Architectures

Authors

Keyword