Scaling shared-bus multiprocessors with multiple busses and shared caches: a performance study
Abstract: "The main limitation of shared-bus multiprocessors is that the common bus tends to be the primary source for contention, and thus imposes a limit on the number of processors in the system. Alternate architectural features are necessary to reduce the memory bandwidth demands and to inc...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Seattle, Wash.
1991
|
Schriftenreihe: | University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report
91,9,4 |
Schlagworte: | |
Zusammenfassung: | Abstract: "The main limitation of shared-bus multiprocessors is that the common bus tends to be the primary source for contention, and thus imposes a limit on the number of processors in the system. Alternate architectural features are necessary to reduce the memory bandwidth demands and to increase the bus bandwidth. In this paper, we investigate the cost-performance effects of two enhancements: higher bus transaction rates, e.g., through the use of multiple busses, and shared two-level caches. The performance figures are obtained via simulation with loads derived from traces of real applications, some of which show a significant skew in the distribution of memory bank access. A new multiple bus scheme, called multiple interleaved busses, is described and analyzed This scheme is a generalization of previous approaches, and attempts to balance performance and cost tradeoffs in a snoopy-cache multiprocessor environment. The results from simulation show that multiple interleaved busses perform almost as well as multiple independent busses, but with simpler and less costly implementation. Furthermore, multiple interleaved busses are shown to deliver much better performance than interleaved busses when the skew of accesses across the interleaves is large. Shared second-level caches have been shown to be very effective in the design space under consideration. Such systems might offer considerable implementation economies with relatively small design cost We show that depending on the design point in question, bus operation buffers might be useful in shared second level caches by reducing the effects of high skew and greater multiprocessing level. With the presence of these buffers, the uses of shared caches resulted in only a small throughput degradation. |
Beschreibung: | 22 S. |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV009015553 | ||
003 | DE-604 | ||
005 | 19951206 | ||
007 | t | ||
008 | 940206s1991 |||| 00||| eng d | ||
035 | |a (OCoLC)28390720 | ||
035 | |a (DE-599)BVBBV009015553 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-29T | ||
100 | 1 | |a Bertoni, Jonathan |e Verfasser |4 aut | |
245 | 1 | 0 | |a Scaling shared-bus multiprocessors with multiple busses and shared caches |b a performance study |c Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang |
264 | 1 | |a Seattle, Wash. |c 1991 | |
300 | |a 22 S. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report |v 91,9,4 | |
520 | 3 | |a Abstract: "The main limitation of shared-bus multiprocessors is that the common bus tends to be the primary source for contention, and thus imposes a limit on the number of processors in the system. Alternate architectural features are necessary to reduce the memory bandwidth demands and to increase the bus bandwidth. In this paper, we investigate the cost-performance effects of two enhancements: higher bus transaction rates, e.g., through the use of multiple busses, and shared two-level caches. The performance figures are obtained via simulation with loads derived from traces of real applications, some of which show a significant skew in the distribution of memory bank access. A new multiple bus scheme, called multiple interleaved busses, is described and analyzed | |
520 | 3 | |a This scheme is a generalization of previous approaches, and attempts to balance performance and cost tradeoffs in a snoopy-cache multiprocessor environment. The results from simulation show that multiple interleaved busses perform almost as well as multiple independent busses, but with simpler and less costly implementation. Furthermore, multiple interleaved busses are shown to deliver much better performance than interleaved busses when the skew of accesses across the interleaves is large. Shared second-level caches have been shown to be very effective in the design space under consideration. Such systems might offer considerable implementation economies with relatively small design cost | |
520 | 3 | |a We show that depending on the design point in question, bus operation buffers might be useful in shared second level caches by reducing the effects of high skew and greater multiprocessing level. With the presence of these buffers, the uses of shared caches resulted in only a small throughput degradation. | |
650 | 4 | |a Multiprocessors | |
700 | 1 | |a Baer, Jean-Loup |e Verfasser |4 aut | |
700 | 1 | |a Wang, Wen-Hann |e Verfasser |4 aut | |
810 | 2 | |a Department of Computer Science: Technical report |t University of Washington <Seattle, Wash.> |v 91,9,4 |w (DE-604)BV008930431 |9 91,9,4 | |
999 | |a oai:aleph.bib-bvb.de:BVB01-005961128 |
Datensatz im Suchindex
_version_ | 1804123364833361920 |
---|---|
any_adam_object | |
author | Bertoni, Jonathan Baer, Jean-Loup Wang, Wen-Hann |
author_facet | Bertoni, Jonathan Baer, Jean-Loup Wang, Wen-Hann |
author_role | aut aut aut |
author_sort | Bertoni, Jonathan |
author_variant | j b jb j l b jlb w h w whw |
building | Verbundindex |
bvnumber | BV009015553 |
ctrlnum | (OCoLC)28390720 (DE-599)BVBBV009015553 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02948nam a2200337 cb4500</leader><controlfield tag="001">BV009015553</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">19951206 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">940206s1991 |||| 00||| eng d</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)28390720</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV009015553</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Bertoni, Jonathan</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scaling shared-bus multiprocessors with multiple busses and shared caches</subfield><subfield code="b">a performance study</subfield><subfield code="c">Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Seattle, Wash.</subfield><subfield code="c">1991</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">22 S.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report</subfield><subfield code="v">91,9,4</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">Abstract: "The main limitation of shared-bus multiprocessors is that the common bus tends to be the primary source for contention, and thus imposes a limit on the number of processors in the system. Alternate architectural features are necessary to reduce the memory bandwidth demands and to increase the bus bandwidth. In this paper, we investigate the cost-performance effects of two enhancements: higher bus transaction rates, e.g., through the use of multiple busses, and shared two-level caches. The performance figures are obtained via simulation with loads derived from traces of real applications, some of which show a significant skew in the distribution of memory bank access. A new multiple bus scheme, called multiple interleaved busses, is described and analyzed</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">This scheme is a generalization of previous approaches, and attempts to balance performance and cost tradeoffs in a snoopy-cache multiprocessor environment. The results from simulation show that multiple interleaved busses perform almost as well as multiple independent busses, but with simpler and less costly implementation. Furthermore, multiple interleaved busses are shown to deliver much better performance than interleaved busses when the skew of accesses across the interleaves is large. Shared second-level caches have been shown to be very effective in the design space under consideration. Such systems might offer considerable implementation economies with relatively small design cost</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">We show that depending on the design point in question, bus operation buffers might be useful in shared second level caches by reducing the effects of high skew and greater multiprocessing level. With the presence of these buffers, the uses of shared caches resulted in only a small throughput degradation.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Multiprocessors</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Baer, Jean-Loup</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Wang, Wen-Hann</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="810" ind1="2" ind2=" "><subfield code="a">Department of Computer Science: Technical report</subfield><subfield code="t">University of Washington <Seattle, Wash.></subfield><subfield code="v">91,9,4</subfield><subfield code="w">(DE-604)BV008930431</subfield><subfield code="9">91,9,4</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-005961128</subfield></datafield></record></collection> |
id | DE-604.BV009015553 |
illustrated | Not Illustrated |
indexdate | 2024-07-09T17:28:36Z |
institution | BVB |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-005961128 |
oclc_num | 28390720 |
open_access_boolean | |
owner | DE-29T |
owner_facet | DE-29T |
physical | 22 S. |
publishDate | 1991 |
publishDateSearch | 1991 |
publishDateSort | 1991 |
record_format | marc |
series2 | University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report |
spelling | Bertoni, Jonathan Verfasser aut Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang Seattle, Wash. 1991 22 S. txt rdacontent n rdamedia nc rdacarrier University of Washington <Seattle, Wash.> / Department of Computer Science: Technical report 91,9,4 Abstract: "The main limitation of shared-bus multiprocessors is that the common bus tends to be the primary source for contention, and thus imposes a limit on the number of processors in the system. Alternate architectural features are necessary to reduce the memory bandwidth demands and to increase the bus bandwidth. In this paper, we investigate the cost-performance effects of two enhancements: higher bus transaction rates, e.g., through the use of multiple busses, and shared two-level caches. The performance figures are obtained via simulation with loads derived from traces of real applications, some of which show a significant skew in the distribution of memory bank access. A new multiple bus scheme, called multiple interleaved busses, is described and analyzed This scheme is a generalization of previous approaches, and attempts to balance performance and cost tradeoffs in a snoopy-cache multiprocessor environment. The results from simulation show that multiple interleaved busses perform almost as well as multiple independent busses, but with simpler and less costly implementation. Furthermore, multiple interleaved busses are shown to deliver much better performance than interleaved busses when the skew of accesses across the interleaves is large. Shared second-level caches have been shown to be very effective in the design space under consideration. Such systems might offer considerable implementation economies with relatively small design cost We show that depending on the design point in question, bus operation buffers might be useful in shared second level caches by reducing the effects of high skew and greater multiprocessing level. With the presence of these buffers, the uses of shared caches resulted in only a small throughput degradation. Multiprocessors Baer, Jean-Loup Verfasser aut Wang, Wen-Hann Verfasser aut Department of Computer Science: Technical report University of Washington <Seattle, Wash.> 91,9,4 (DE-604)BV008930431 91,9,4 |
spellingShingle | Bertoni, Jonathan Baer, Jean-Loup Wang, Wen-Hann Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study Multiprocessors |
title | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study |
title_auth | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study |
title_exact_search | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study |
title_full | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang |
title_fullStr | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang |
title_full_unstemmed | Scaling shared-bus multiprocessors with multiple busses and shared caches a performance study Jonathan Bertoni ; Jean-Loup Baer ; Wen-Hann Wang |
title_short | Scaling shared-bus multiprocessors with multiple busses and shared caches |
title_sort | scaling shared bus multiprocessors with multiple busses and shared caches a performance study |
title_sub | a performance study |
topic | Multiprocessors |
topic_facet | Multiprocessors |
volume_link | (DE-604)BV008930431 |
work_keys_str_mv | AT bertonijonathan scalingsharedbusmultiprocessorswithmultiplebussesandsharedcachesaperformancestudy AT baerjeanloup scalingsharedbusmultiprocessorswithmultiplebussesandsharedcachesaperformancestudy AT wangwenhann scalingsharedbusmultiprocessorswithmultiplebussesandsharedcachesaperformancestudy |