Fast parallel permutation algorithms:

Abstract: "We investigate the problem of permuting n data items, each covering D global memory cells, on an EREW PRAM with n/log n processors and less than Dn additional storage. We present a family of algorithms (A[subscript k])[subscript k], where k = 1,2, ..., log[superscript *] n, such that...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Keller, Jörg (VerfasserIn)
Format: Buch
Sprache:English
Veröffentlicht: Amsterdam 1993
Schriftenreihe:Report CS-R / Centrum voor Wiskunde en Informatica, Department of Computer Science 93,3
Schlagworte:
Zusammenfassung:Abstract: "We investigate the problem of permuting n data items, each covering D global memory cells, on an EREW PRAM with n/log n processors and less than Dn additional storage. We present a family of algorithms (A[subscript k])[subscript k], where k = 1,2, ..., log[superscript *] n, such that A[subscript k] needs time D log n log[superscript (k)]n. Here log[superscript (k)]n denotes k times application of log to n, and [formula]. All algorithms need [theta](n) operations which is optimal. The storage requirements of A[subscript k] are n global bits, n/log[superscript (k-1)]n global memory cells if k [> or =] 2, and D log n/log[superscript (k-1)]n (resp. D) local memory cells per processor if k [> or =] 2 (resp. k = 1)
Hence, the family (A[subscript k])[subscript k] reveals a time- space tradeoff. The result can be generalized to machines with p <n/log n processors such that the run time is (Dn/p)log[superscript (k)]n, (Dn/p)/log[superscript (k-1)]n local cells per processor are needed, and the global storage requirements remain as given above.
Beschreibung:7 S.