Explicit parallel block Cholesky algorithms on the CRAY APP

Bibliographic details
Main author: Nool, Margreet (Author)
Format: Book
Language: English
Published: Amsterdam 1994
Series: Centrum voor Wiskunde en Informatica <Amsterdam> / Afdeling Numerieke Wiskunde: Report NM 1994,25
Summary: Abstract: "In this paper we consider the CRAY APP, the Attached Parallel Processor of the Cray S-MP, which consists of seven buses with each bus supporting up to 12 processing elements. Processing elements on different buses can communicate simultaneously with the shared main memory, but processing elements sharing the same bus cannot, since only one processing element per bus can access memory at a given time. Applications with a high level of data reuse, or, with a high compute intensity, and applications being highly parallel are very suitable to run on the APP. An example of such an algorithm is matrix-matrix multiplication. We illustrate how the data traffic's restriction influences the performance and we discuss the scalability of the CRAY APP. Furthermore, two different algorithms for Cholesky factorization are discussed: a block left-looking algorithm and a block right-looking algorithm. The maximum achievable speed on the CRAY APP is mainly determined by the performance of the matrix-matrix multiplication. Parallelism is applied explicitly over the blocks, which makes it possible to concatenate different block operations in cache. The results obtained on CWI's APP (a machine having twenty-eight processing elements) indicate how block algorithms can be parallelized on machines with hundreds or thousands of processors."
Description: 22 pp.
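
Note: the abstract names a block left-looking and a block right-looking Cholesky algorithm without giving details. The following is a minimal serial sketch of the textbook forms of both variants, written in plain Python for illustration only. It is not the report's implementation and does not model the CRAY APP's buses, caches, or explicit parallelism. All function names and the block size parameter `nb` are assumptions made for this sketch.

```python
import math


def _factor_panel(a, k, ke):
    """Factor the diagonal block a[k:ke, k:ke] in place, then solve for
    the rows below it, so that columns k..ke-1 of `a` hold the final L."""
    n = len(a)
    # Unblocked Cholesky of the diagonal block (lower triangle only).
    for j in range(k, ke):
        d = a[j][j] - sum(a[j][p] ** 2 for p in range(k, j))
        a[j][j] = math.sqrt(d)
        for i in range(j + 1, ke):
            s = a[i][j] - sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = s / a[j][j]
    # Triangular solve for the panel below: L21 = A21 * L11^{-T}.
    for i in range(ke, n):
        for j in range(k, ke):
            s = a[i][j] - sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = s / a[j][j]


def _lower(a):
    """Return a copy of `a` with the strict upper triangle set to zero."""
    n = len(a)
    return [[a[i][j] if j <= i else 0.0 for j in range(n)] for i in range(n)]


def block_cholesky_right(a, nb):
    """Right-looking: after each panel is factored, the whole trailing
    submatrix is updated immediately (A22 -= L21 * L21^T)."""
    n = len(a)
    a = [row[:] for row in a]  # work on a copy
    for k in range(0, n, nb):
        ke = min(k + nb, n)
        _factor_panel(a, k, ke)
        for i in range(ke, n):
            for j in range(ke, i + 1):
                a[i][j] -= sum(a[i][p] * a[j][p] for p in range(k, ke))
    return _lower(a)


def block_cholesky_left(a, nb):
    """Left-looking: each block column is first updated with all block
    columns to its left, and only then factored."""
    n = len(a)
    a = [row[:] for row in a]  # work on a copy
    for k in range(0, n, nb):
        ke = min(k + nb, n)
        for i in range(k, n):
            for j in range(k, min(i + 1, ke)):
                a[i][j] -= sum(a[i][p] * a[j][p] for p in range(k))
        _factor_panel(a, k, ke)
    return _lower(a)


if __name__ == "__main__":
    A = [[4.0, 12.0, -16.0],
         [12.0, 37.0, -43.0],
         [-16.0, -43.0, 98.0]]
    print(block_cholesky_right(A, 2))
    print(block_cholesky_left(A, 2))
```

In both variants the inner sums over `p` are small matrix-matrix products, which is consistent with the abstract's remark that achievable speed is mainly determined by matrix-matrix multiplication. The two variants differ only in when those products are applied: right-looking pushes updates forward to the trailing submatrix, while left-looking pulls them in just before a block column is factored.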