Volume driven selection of loop and data transformations for cache coherent parallel processors:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
1999
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Paderborn, Univ., Diss., 1999 |
Beschreibung: | 238 S. graph. Darst. : 21 cm |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV013512404 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | t | ||
008 | 001219s1999 gw d||| m||| 00||| eng d | ||
016 | 7 | |a 960164685 |2 DE-101 | |
035 | |a (OCoLC)47310426 | ||
035 | |a (DE-599)BVBBV013512404 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a gw |c DE | ||
049 | |a DE-739 | ||
100 | 1 | |a Slowik, Adrian |e Verfasser |4 aut | |
245 | 1 | 0 | |a Volume driven selection of loop and data transformations for cache coherent parallel processors |c vorgelegt von Adrian Slowik |
264 | 1 | |c 1999 | |
300 | |a 238 S. |b graph. Darst. : 21 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Paderborn, Univ., Diss., 1999 | ||
650 | 7 | |a Hochschulschrift |2 gtt | |
650 | 0 | 7 | |a Parallelrechner |0 (DE-588)4173280-7 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Programmanalyse |0 (DE-588)4175841-9 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Programmtransformation |0 (DE-588)4263654-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Automatische Parallelisierung |0 (DE-588)4203963-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Cache-Speicher |0 (DE-588)4362843-6 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Programmschleife |0 (DE-588)4343964-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Pufferspeicher |0 (DE-588)4176324-5 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Automatische Parallelisierung |0 (DE-588)4203963-0 |D s |
689 | 0 | 1 | |a Programmschleife |0 (DE-588)4343964-0 |D s |
689 | 0 | 2 | |a Programmanalyse |0 (DE-588)4175841-9 |D s |
689 | 0 | 3 | |a Programmtransformation |0 (DE-588)4263654-1 |D s |
689 | 0 | 4 | |a Parallelrechner |0 (DE-588)4173280-7 |D s |
689 | 0 | 5 | |a Cache-Speicher |0 (DE-588)4362843-6 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Pufferspeicher |0 (DE-588)4176324-5 |D s |
689 | 1 | |8 1\p |5 DE-604 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009223995&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-009223995 | ||
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk |
Datensatz im Suchindex
_version_ | 1804128305046093824 |
---|---|
adam_text | Titel: Volume driven selection of loop and data transformations for cache coherent parallel processors
Autor: Slowik, Adrian
Jahr: 1999
Contents
1 Introduction 10
1.1 Motivation 10
1.2 Introductory Example 15
1.3 Thesis 19
1.4 Contributions 19
1.5 Overview 20
2 Regular Nested Loop Programs 23
2.1 Nested Loop Programs 24
2.2 Iteration Spaces of Nested Loop Programs 28
2.3 Index Spaces of Nested Loop Programs 33
2.4 Index Functions of Nested Loop Programs 36
2.5 Linearly Bounded Lattices 38
2.6 Utilization Patterns 40
3 Volume Computations on Polytopes 43
3.1 Diophantine Equations 44
3.2 Coupling Patterns 46
3.3 Convolution Patterns 49
3.4 Counting Solutions 51
3.5 Application to Polytopes 62
3.6 Related Work 65
4 Cache Miss Candidates 67
4.1 Spatial and Temporal Locality 68
4.2 Set-Associative Cache Memories 69
4.3 Compulsory Misses 72
4.4 Conflict Misses 74
4.5 Coherence Misses 77
4.6 Related Work 82
5 Transformations 86
5.1 Legality of Transformations 86
5.2 Linear Transformations on Polytopes 87
5.3 Non-linear Transformations on Polytopes 92
5.4 Transformations on Multiple Polytopes 93
5.5 Simultaneous Loop and Data Transformations 95
5.6 Related Work 97
6 Selecting Loop Transformations 101
6.1 Parallelizing Computations 103
6.2 Enumerating Computations 115
6.3 Tiling Computations 159
6.4 Grouping References 164
6.5 Putting It All Together 168
6.6 Related Work 169
7 Selecting Data Transformations 173
7.1 Enumerating Array Elements 174
7.2 Tiling Arrays 178
7.3 Grouping Array elements 180
7.4 Further Applications of Data Access Abstractions 180
7.5 Bilinear Equations 183
7.6 Related Work 185
8 Experiments 188
8.1 Prototype Compiler Implementation 188
8.2 Successive Over-Relaxation 191
8.3 Homer 194
8.4 Convolution 197
8.5 ADI-Integration 200
8.6 Householder-Decomposition 204
8.7 Improvements 207
9 Conclusion 209
A Appendix 212
A.I Abbreviations and Notation 212
A.2 Intersection of Linearly Bounded Lattices 214
A.3 Transformation Matrices 215
A.4 Example Ehrhart Polynomial 216
Bibliography 217
Index 234
|
any_adam_object | 1 |
author | Slowik, Adrian |
author_facet | Slowik, Adrian |
author_role | aut |
author_sort | Slowik, Adrian |
author_variant | a s as |
building | Verbundindex |
bvnumber | BV013512404 |
ctrlnum | (OCoLC)47310426 (DE-599)BVBBV013512404 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02182nam a2200517 c 4500</leader><controlfield tag="001">BV013512404</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">001219s1999 gw d||| m||| 00||| eng d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">960164685</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)47310426</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV013512404</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-739</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Slowik, Adrian</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Volume driven selection of loop and data transformations for cache coherent parallel processors</subfield><subfield code="c">vorgelegt von Adrian Slowik</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1999</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">238 S.</subfield><subfield code="b">graph. Darst. : 21 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Paderborn, Univ., Diss., 1999</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Hochschulschrift</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Parallelrechner</subfield><subfield code="0">(DE-588)4173280-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Programmanalyse</subfield><subfield code="0">(DE-588)4175841-9</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Programmtransformation</subfield><subfield code="0">(DE-588)4263654-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Automatische Parallelisierung</subfield><subfield code="0">(DE-588)4203963-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Cache-Speicher</subfield><subfield code="0">(DE-588)4362843-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Programmschleife</subfield><subfield code="0">(DE-588)4343964-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Pufferspeicher</subfield><subfield code="0">(DE-588)4176324-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Automatische Parallelisierung</subfield><subfield code="0">(DE-588)4203963-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Programmschleife</subfield><subfield code="0">(DE-588)4343964-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Programmanalyse</subfield><subfield code="0">(DE-588)4175841-9</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Programmtransformation</subfield><subfield code="0">(DE-588)4263654-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="4"><subfield code="a">Parallelrechner</subfield><subfield code="0">(DE-588)4173280-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="5"><subfield code="a">Cache-Speicher</subfield><subfield code="0">(DE-588)4362843-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Pufferspeicher</subfield><subfield code="0">(DE-588)4176324-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009223995&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-009223995</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV013512404 |
illustrated | Illustrated |
indexdate | 2024-07-09T18:47:07Z |
institution | BVB |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-009223995 |
oclc_num | 47310426 |
open_access_boolean | |
owner | DE-739 |
owner_facet | DE-739 |
physical | 238 S. graph. Darst. : 21 cm |
publishDate | 1999 |
publishDateSearch | 1999 |
publishDateSort | 1999 |
record_format | marc |
spelling | Slowik, Adrian Verfasser aut Volume driven selection of loop and data transformations for cache coherent parallel processors vorgelegt von Adrian Slowik 1999 238 S. graph. Darst. : 21 cm txt rdacontent n rdamedia nc rdacarrier Paderborn, Univ., Diss., 1999 Hochschulschrift gtt Parallelrechner (DE-588)4173280-7 gnd rswk-swf Programmanalyse (DE-588)4175841-9 gnd rswk-swf Programmtransformation (DE-588)4263654-1 gnd rswk-swf Automatische Parallelisierung (DE-588)4203963-0 gnd rswk-swf Cache-Speicher (DE-588)4362843-6 gnd rswk-swf Programmschleife (DE-588)4343964-0 gnd rswk-swf Pufferspeicher (DE-588)4176324-5 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Automatische Parallelisierung (DE-588)4203963-0 s Programmschleife (DE-588)4343964-0 s Programmanalyse (DE-588)4175841-9 s Programmtransformation (DE-588)4263654-1 s Parallelrechner (DE-588)4173280-7 s Cache-Speicher (DE-588)4362843-6 s DE-604 Pufferspeicher (DE-588)4176324-5 s 1\p DE-604 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009223995&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk |
spellingShingle | Slowik, Adrian Volume driven selection of loop and data transformations for cache coherent parallel processors Hochschulschrift gtt Parallelrechner (DE-588)4173280-7 gnd Programmanalyse (DE-588)4175841-9 gnd Programmtransformation (DE-588)4263654-1 gnd Automatische Parallelisierung (DE-588)4203963-0 gnd Cache-Speicher (DE-588)4362843-6 gnd Programmschleife (DE-588)4343964-0 gnd Pufferspeicher (DE-588)4176324-5 gnd |
subject_GND | (DE-588)4173280-7 (DE-588)4175841-9 (DE-588)4263654-1 (DE-588)4203963-0 (DE-588)4362843-6 (DE-588)4343964-0 (DE-588)4176324-5 (DE-588)4113937-9 |
title | Volume driven selection of loop and data transformations for cache coherent parallel processors |
title_auth | Volume driven selection of loop and data transformations for cache coherent parallel processors |
title_exact_search | Volume driven selection of loop and data transformations for cache coherent parallel processors |
title_full | Volume driven selection of loop and data transformations for cache coherent parallel processors vorgelegt von Adrian Slowik |
title_fullStr | Volume driven selection of loop and data transformations for cache coherent parallel processors vorgelegt von Adrian Slowik |
title_full_unstemmed | Volume driven selection of loop and data transformations for cache coherent parallel processors vorgelegt von Adrian Slowik |
title_short | Volume driven selection of loop and data transformations for cache coherent parallel processors |
title_sort | volume driven selection of loop and data transformations for cache coherent parallel processors |
topic | Hochschulschrift gtt Parallelrechner (DE-588)4173280-7 gnd Programmanalyse (DE-588)4175841-9 gnd Programmtransformation (DE-588)4263654-1 gnd Automatische Parallelisierung (DE-588)4203963-0 gnd Cache-Speicher (DE-588)4362843-6 gnd Programmschleife (DE-588)4343964-0 gnd Pufferspeicher (DE-588)4176324-5 gnd |
topic_facet | Hochschulschrift Parallelrechner Programmanalyse Programmtransformation Automatische Parallelisierung Cache-Speicher Programmschleife Pufferspeicher |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=009223995&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT slowikadrian volumedrivenselectionofloopanddatatransformationsforcachecoherentparallelprocessors |