Verfügbarkeit: Reducing synchronization and scheduling overhead in parallel loops

Reducing synchronization and scheduling overhead in parallel loops:

Abstract: "This thesis examines the effect of run-time overhead on the performance of parallel loops on shared-memory multiprocessors. Parallel execution time and speedup are examined as a function of both the loop scheduling strategy and the barrier synchronization mechanism. Analytic models f...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	Beckmann, Carl J. (VerfasserIn)
Format:	Buch
Sprache:	English
Veröffentlicht:	Urbana, Ill. 1989
Schriftenreihe:	Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report 922
Schlagworte:	Multiprocessors Parallel processing (Electronic computers) Hochschulschrift
Zusammenfassung:	Abstract: "This thesis examines the effect of run-time overhead on the performance of parallel loops on shared-memory multiprocessors. Parallel execution time and speedup are examined as a function of both the loop scheduling strategy and the barrier synchronization mechanism. Analytic models for parallel loop performance are developed and compared to simulation results, and are found to be in good agreement. The models take into account not only deterministic runtime overhead due to scheduling and synchronization, but also random variations in loop iteration execution times with both normal and exponential distributions These models may be used by an intelligent compiler to determine the optimum scheduling strategy and the optimum number of processors for executing a given parallel loop. Various hardware techniques for supporting efficient synchronization and scheduling operations in multiprocessors are examined. A broadcast network is proposed as a lower-cost alternative to a combining network for supporting efficient synchronization operations. A new special-purpose hardware scheme for supporting ideal barrier synchronization is also presented. This scheme allows very fast barrier synchronization, and is hierarchically expandable to almost arbitrarily large systems.
Beschreibung:	Zugl.: Urbana, Ill., Univ., Diss.
Beschreibung:	XII, 164 S.

Internformat

MARC


LEADER	00000nam a2200000 cb4500
001	BV008949527
003	DE-604
005	00000000000000.0
007	t
008	940206s1989 m\|\|\| 00\|\|\| eng d
035			\|a (OCoLC)21941741
035			\|a (DE-599)BVBBV008949527
040			\|a DE-604 \|b ger \|e rakddb
041	0		\|a eng
049			\|a DE-29T
100	1		\|a Beckmann, Carl J. \|e Verfasser \|4 aut
245	1	0	\|a Reducing synchronization and scheduling overhead in parallel loops \|c by Carl Josef Beckmann
246	1	3	\|a Reportnr. UILU ENG 89 8014
264		1	\|a Urbana, Ill. \|c 1989
300			\|a XII, 164 S.
336			\|b txt \|2 rdacontent
337			\|b n \|2 rdamedia
338			\|b nc \|2 rdacarrier
490	1		\|a Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report \|v 922
500			\|a Zugl.: Urbana, Ill., Univ., Diss.
520	3		\|a Abstract: "This thesis examines the effect of run-time overhead on the performance of parallel loops on shared-memory multiprocessors. Parallel execution time and speedup are examined as a function of both the loop scheduling strategy and the barrier synchronization mechanism. Analytic models for parallel loop performance are developed and compared to simulation results, and are found to be in good agreement. The models take into account not only deterministic runtime overhead due to scheduling and synchronization, but also random variations in loop iteration execution times with both normal and exponential distributions
520	3		\|a These models may be used by an intelligent compiler to determine the optimum scheduling strategy and the optimum number of processors for executing a given parallel loop. Various hardware techniques for supporting efficient synchronization and scheduling operations in multiprocessors are examined. A broadcast network is proposed as a lower-cost alternative to a combining network for supporting efficient synchronization operations. A new special-purpose hardware scheme for supporting ideal barrier synchronization is also presented. This scheme allows very fast barrier synchronization, and is hierarchically expandable to almost arbitrarily large systems.
650		4	\|a Multiprocessors
650		4	\|a Parallel processing (Electronic computers)
655		7	\|0 (DE-588)4113937-9 \|a Hochschulschrift \|2 gnd-content
830		0	\|a Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report \|v 922 \|w (DE-604)BV008930033 \|9 922
999			\|a oai:aleph.bib-bvb.de:BVB01-005905184

Datensatz im Suchindex

_version_	1804123282673238016
any_adam_object
author	Beckmann, Carl J.
author_facet	Beckmann, Carl J.
author_role	aut
author_sort	Beckmann, Carl J.
author_variant	c j b cj cjb
building	Verbundindex
bvnumber	BV008949527
ctrlnum	(OCoLC)21941741 (DE-599)BVBBV008949527
format	Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02480nam a2200349 cb4500</leader><controlfield tag="001">BV008949527</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">940206s1989 m\|\|\| 00\|\|\| eng d</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)21941741</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV008949527</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Beckmann, Carl J.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Reducing synchronization and scheduling overhead in parallel loops</subfield><subfield code="c">by Carl Josef Beckmann</subfield></datafield><datafield tag="246" ind1="1" ind2="3"><subfield code="a">Reportnr. UILU ENG 89 8014</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Urbana, Ill.</subfield><subfield code="c">1989</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 164 S.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report</subfield><subfield code="v">922</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Zugl.: Urbana, Ill., Univ., Diss.</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">Abstract: "This thesis examines the effect of run-time overhead on the performance of parallel loops on shared-memory multiprocessors. Parallel execution time and speedup are examined as a function of both the loop scheduling strategy and the barrier synchronization mechanism. Analytic models for parallel loop performance are developed and compared to simulation results, and are found to be in good agreement. The models take into account not only deterministic runtime overhead due to scheduling and synchronization, but also random variations in loop iteration execution times with both normal and exponential distributions</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">These models may be used by an intelligent compiler to determine the optimum scheduling strategy and the optimum number of processors for executing a given parallel loop. Various hardware techniques for supporting efficient synchronization and scheduling operations in multiprocessors are examined. A broadcast network is proposed as a lower-cost alternative to a combining network for supporting efficient synchronization operations. A new special-purpose hardware scheme for supporting ideal barrier synchronization is also presented. This scheme allows very fast barrier synchronization, and is hierarchically expandable to almost arbitrarily large systems.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Multiprocessors</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Parallel processing (Electronic computers)</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report</subfield><subfield code="v">922</subfield><subfield code="w">(DE-604)BV008930033</subfield><subfield code="9">922</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-005905184</subfield></datafield></record></collection>
genre	(DE-588)4113937-9 Hochschulschrift gnd-content
genre_facet	Hochschulschrift
id	DE-604.BV008949527
illustrated	Not Illustrated
indexdate	2024-07-09T17:27:18Z
institution	BVB
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-005905184
oclc_num	21941741
open_access_boolean
owner	DE-29T
owner_facet	DE-29T
physical	XII, 164 S.
publishDate	1989
publishDateSearch	1989
publishDateSort	1989
record_format	marc
series	Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report
series2	Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report
spelling	Beckmann, Carl J. Verfasser aut Reducing synchronization and scheduling overhead in parallel loops by Carl Josef Beckmann Reportnr. UILU ENG 89 8014 Urbana, Ill. 1989 XII, 164 S. txt rdacontent n rdamedia nc rdacarrier Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report 922 Zugl.: Urbana, Ill., Univ., Diss. Abstract: "This thesis examines the effect of run-time overhead on the performance of parallel loops on shared-memory multiprocessors. Parallel execution time and speedup are examined as a function of both the loop scheduling strategy and the barrier synchronization mechanism. Analytic models for parallel loop performance are developed and compared to simulation results, and are found to be in good agreement. The models take into account not only deterministic runtime overhead due to scheduling and synchronization, but also random variations in loop iteration execution times with both normal and exponential distributions These models may be used by an intelligent compiler to determine the optimum scheduling strategy and the optimum number of processors for executing a given parallel loop. Various hardware techniques for supporting efficient synchronization and scheduling operations in multiprocessors are examined. A broadcast network is proposed as a lower-cost alternative to a combining network for supporting efficient synchronization operations. A new special-purpose hardware scheme for supporting ideal barrier synchronization is also presented. This scheme allows very fast barrier synchronization, and is hierarchically expandable to almost arbitrarily large systems. Multiprocessors Parallel processing (Electronic computers) (DE-588)4113937-9 Hochschulschrift gnd-content Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report 922 (DE-604)BV008930033 922
spellingShingle	Beckmann, Carl J. Reducing synchronization and scheduling overhead in parallel loops Center for Supercomputing Research and Development <Urbana, Ill.>: CSRD report Multiprocessors Parallel processing (Electronic computers)
subject_GND	(DE-588)4113937-9
title	Reducing synchronization and scheduling overhead in parallel loops
title_alt	Reportnr. UILU ENG 89 8014
title_auth	Reducing synchronization and scheduling overhead in parallel loops
title_exact_search	Reducing synchronization and scheduling overhead in parallel loops
title_full	Reducing synchronization and scheduling overhead in parallel loops by Carl Josef Beckmann
title_fullStr	Reducing synchronization and scheduling overhead in parallel loops by Carl Josef Beckmann
title_full_unstemmed	Reducing synchronization and scheduling overhead in parallel loops by Carl Josef Beckmann
title_short	Reducing synchronization and scheduling overhead in parallel loops
title_sort	reducing synchronization and scheduling overhead in parallel loops
topic	Multiprocessors Parallel processing (Electronic computers)
topic_facet	Multiprocessors Parallel processing (Electronic computers) Hochschulschrift
volume_link	(DE-604)BV008930033
work_keys_str_mv	AT beckmanncarlj reducingsynchronizationandschedulingoverheadinparallelloops AT beckmanncarlj reportnruilueng898014

Verfügbarkeit

Es ist kein Print-Exemplar vorhanden.

Fernleihe Bestellen Achtung: Nicht im THWS-Bestand!

MARC

Datensatz im Suchindex

Es ist kein Print-Exemplar vorhanden.

Ähnliche Einträge