Memory hierarchy management schemes in large scale shared-memory multiprocessors:
Abstract: "The speed of access to shared global memory is one of the most important aspects, if not the most important one, in the performance of any shared-memory multiprocessor. Shared-memory multiprocessors with a multistage interconnection network, systems that are known to be both scalable...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Seattle, Wash.
1989
|
Schriftenreihe: | University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report
89,8,7 |
Schlagworte: | |
Zusammenfassung: | Abstract: "The speed of access to shared global memory is one of the most important aspects, if not the most important one, in the performance of any shared-memory multiprocessor. Shared-memory multiprocessors with a multistage interconnection network, systems that are known to be both scalable and cost-effective, have two important weaknesses: slow access to global memory and the absence of an instantaneous broadcast mechanism. In this dissertation we propose ways of managing memory hierarchies in such an environment to cope with these weaknesses. Slow access to global memory can be alleviated by associating private caches with the processing elements. However, this gives rise to the cache coherence problem We propose a timestamp-based self-invalidation cache coherence scheme that allows the exploration of localities across different computational units and show that it performs significantly better than the previous schemes with no or limited such ability. This scheme does not require any shared resource (e.g., sophisticated memory controllers) and is not affected by the absence of an instantaneous broadcast mechanism. Compile-time optimizations which are aimed at increasing the effectiveness of the memory hierarchy are proposed assuming the presence of the timestamp-based cache coherence scheme. We also propose architectural enhancements in the form of super registers which are registers embedded in the interconnection network These super registers are shared by a proper subset of the processors to reduce the average access time of shared variables with known access patterns. |
Beschreibung: | Zugl.: Washington, DC, Univ., Diss. |
Beschreibung: | X, 200 S. |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV009258297 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | t | ||
008 | 940313s1989 m||| 00||| eng d | ||
035 | |a (OCoLC)23932816 | ||
035 | |a (DE-599)BVBBV009258297 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-29T | ||
100 | 1 | |a Min, Sang L. |e Verfasser |4 aut | |
245 | 1 | 0 | |a Memory hierarchy management schemes in large scale shared-memory multiprocessors |c Sang Lyul Min |
264 | 1 | |a Seattle, Wash. |c 1989 | |
300 | |a X, 200 S. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report |v 89,8,7 | |
500 | |a Zugl.: Washington, DC, Univ., Diss. | ||
520 | 3 | |a Abstract: "The speed of access to shared global memory is one of the most important aspects, if not the most important one, in the performance of any shared-memory multiprocessor. Shared-memory multiprocessors with a multistage interconnection network, systems that are known to be both scalable and cost-effective, have two important weaknesses: slow access to global memory and the absence of an instantaneous broadcast mechanism. In this dissertation we propose ways of managing memory hierarchies in such an environment to cope with these weaknesses. Slow access to global memory can be alleviated by associating private caches with the processing elements. However, this gives rise to the cache coherence problem | |
520 | 3 | |a We propose a timestamp-based self-invalidation cache coherence scheme that allows the exploration of localities across different computational units and show that it performs significantly better than the previous schemes with no or limited such ability. This scheme does not require any shared resource (e.g., sophisticated memory controllers) and is not affected by the absence of an instantaneous broadcast mechanism. Compile-time optimizations which are aimed at increasing the effectiveness of the memory hierarchy are proposed assuming the presence of the timestamp-based cache coherence scheme. We also propose architectural enhancements in the form of super registers which are registers embedded in the interconnection network | |
520 | 3 | |a These super registers are shared by a proper subset of the processors to reduce the average access time of shared variables with known access patterns. | |
650 | 4 | |a Computer storage devices | |
650 | 4 | |a Multiprocessors | |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
810 | 2 | |a Department of Computer Science: Technical report |t University of Washington <Seattle, Wash.> |v 89,8,7 |w (DE-604)BV008930431 |9 89,8,7 | |
999 | |a oai:aleph.bib-bvb.de:BVB01-006160353 |
Datensatz im Suchindex
_version_ | 1804123705550307329 |
---|---|
any_adam_object | |
author | Min, Sang L. |
author_facet | Min, Sang L. |
author_role | aut |
author_sort | Min, Sang L. |
author_variant | s l m sl slm |
building | Verbundindex |
bvnumber | BV009258297 |
ctrlnum | (OCoLC)23932816 (DE-599)BVBBV009258297 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02785nam a2200349 cb4500</leader><controlfield tag="001">BV009258297</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">940313s1989 m||| 00||| eng d</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)23932816</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV009258297</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Min, Sang L.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Memory hierarchy management schemes in large scale shared-memory multiprocessors</subfield><subfield code="c">Sang Lyul Min</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Seattle, Wash.</subfield><subfield code="c">1989</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 200 S.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report</subfield><subfield code="v">89,8,7</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Zugl.: Washington, DC, Univ., Diss.</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">Abstract: "The speed of access to shared global memory is one of the most important aspects, if not the most important one, in the performance of any shared-memory multiprocessor. Shared-memory multiprocessors with a multistage interconnection network, systems that are known to be both scalable and cost-effective, have two important weaknesses: slow access to global memory and the absence of an instantaneous broadcast mechanism. In this dissertation we propose ways of managing memory hierarchies in such an environment to cope with these weaknesses. Slow access to global memory can be alleviated by associating private caches with the processing elements. However, this gives rise to the cache coherence problem</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">We propose a timestamp-based self-invalidation cache coherence scheme that allows the exploration of localities across different computational units and show that it performs significantly better than the previous schemes with no or limited such ability. This scheme does not require any shared resource (e.g., sophisticated memory controllers) and is not affected by the absence of an instantaneous broadcast mechanism. Compile-time optimizations which are aimed at increasing the effectiveness of the memory hierarchy are proposed assuming the presence of the timestamp-based cache coherence scheme. We also propose architectural enhancements in the form of super registers which are registers embedded in the interconnection network</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">These super registers are shared by a proper subset of the processors to reduce the average access time of shared variables with known access patterns.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Computer storage devices</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Multiprocessors</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="810" ind1="2" ind2=" "><subfield code="a">Department of Computer Science: Technical report</subfield><subfield code="t">University of Washington <Seattle, Wash.></subfield><subfield code="v">89,8,7</subfield><subfield code="w">(DE-604)BV008930431</subfield><subfield code="9">89,8,7</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-006160353</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV009258297 |
illustrated | Not Illustrated |
indexdate | 2024-07-09T17:34:01Z |
institution | BVB |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-006160353 |
oclc_num | 23932816 |
open_access_boolean | |
owner | DE-29T |
owner_facet | DE-29T |
physical | X, 200 S. |
publishDate | 1989 |
publishDateSearch | 1989 |
publishDateSort | 1989 |
record_format | marc |
series2 | University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report |
spelling | Min, Sang L. Verfasser aut Memory hierarchy management schemes in large scale shared-memory multiprocessors Sang Lyul Min Seattle, Wash. 1989 X, 200 S. txt rdacontent n rdamedia nc rdacarrier University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report 89,8,7 Zugl.: Washington, DC, Univ., Diss. Abstract: "The speed of access to shared global memory is one of the most important aspects, if not the most important one, in the performance of any shared-memory multiprocessor. Shared-memory multiprocessors with a multistage interconnection network, systems that are known to be both scalable and cost-effective, have two important weaknesses: slow access to global memory and the absence of an instantaneous broadcast mechanism. In this dissertation we propose ways of managing memory hierarchies in such an environment to cope with these weaknesses. Slow access to global memory can be alleviated by associating private caches with the processing elements. However, this gives rise to the cache coherence problem We propose a timestamp-based self-invalidation cache coherence scheme that allows the exploration of localities across different computational units and show that it performs significantly better than the previous schemes with no or limited such ability. This scheme does not require any shared resource (e.g., sophisticated memory controllers) and is not affected by the absence of an instantaneous broadcast mechanism. Compile-time optimizations which are aimed at increasing the effectiveness of the memory hierarchy are proposed assuming the presence of the timestamp-based cache coherence scheme. We also propose architectural enhancements in the form of super registers which are registers embedded in the interconnection network These super registers are shared by a proper subset of the processors to reduce the average access time of shared variables with known access patterns. Computer storage devices Multiprocessors (DE-588)4113937-9 Hochschulschrift gnd-content Department of Computer Science: Technical report University of Washington <Seattle, Wash.> 89,8,7 (DE-604)BV008930431 89,8,7 |
spellingShingle | Min, Sang L. Memory hierarchy management schemes in large scale shared-memory multiprocessors Computer storage devices Multiprocessors |
subject_GND | (DE-588)4113937-9 |
title | Memory hierarchy management schemes in large scale shared-memory multiprocessors |
title_auth | Memory hierarchy management schemes in large scale shared-memory multiprocessors |
title_exact_search | Memory hierarchy management schemes in large scale shared-memory multiprocessors |
title_full | Memory hierarchy management schemes in large scale shared-memory multiprocessors Sang Lyul Min |
title_fullStr | Memory hierarchy management schemes in large scale shared-memory multiprocessors Sang Lyul Min |
title_full_unstemmed | Memory hierarchy management schemes in large scale shared-memory multiprocessors Sang Lyul Min |
title_short | Memory hierarchy management schemes in large scale shared-memory multiprocessors |
title_sort | memory hierarchy management schemes in large scale shared memory multiprocessors |
topic | Computer storage devices Multiprocessors |
topic_facet | Computer storage devices Multiprocessors Hochschulschrift |
volume_link | (DE-604)BV008930431 |
work_keys_str_mv | AT minsangl memoryhierarchymanagementschemesinlargescalesharedmemorymultiprocessors |