Computation checkpointing and migration:
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
New York
Nova Science Publ.
2010
|
Schriftenreihe: | Embedded and high performance computing series
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Includes bibliographical references (p. 129-136) and index |
ISBN: | 9781607418405 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV035742640 | ||
003 | DE-604 | ||
005 | 20110426 | ||
007 | t | ||
008 | 090928s2010 |||| 00||| eng d | ||
010 | |a 2009025493 | ||
020 | |a 9781607418405 |c hardcover : alk. paper |9 978-1-60741-840-5 | ||
035 | |a (OCoLC)368015308 | ||
035 | |a (DE-599)GBV603673805 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
049 | |a DE-473 | ||
050 | 0 | |a QA76.88 | |
082 | 0 | |a 004/.35 |2 22 | |
084 | |a ST 151 |0 (DE-625)143595: |2 rvk | ||
100 | 1 | |a Chaudhary, Vipin |e Verfasser |4 aut | |
245 | 1 | 0 | |a Computation checkpointing and migration |c Vipin Chaudhary, John Paul N. Walters and Hai Jiang |
264 | 1 | |a New York |b Nova Science Publ. |c 2010 | |
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Embedded and high performance computing series | |
500 | |a Includes bibliographical references (p. 129-136) and index | ||
650 | 0 | |a High performance computing | |
650 | 0 | |a Fault-tolerant computing | |
650 | 0 | |a Heterogeneous computing | |
650 | 4 | |a Fault-tolerant computing | |
650 | 4 | |a Heterogeneous computing | |
650 | 4 | |a High performance computing | |
650 | 0 | 7 | |a Hochleistungsrechnen |0 (DE-588)4532701-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Heterogenes Rechnernetz |0 (DE-588)4199974-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Fehlertoleranz |0 (DE-588)4123192-2 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Hochleistungsrechnen |0 (DE-588)4532701-4 |D s |
689 | 0 | 1 | |a Heterogenes Rechnernetz |0 (DE-588)4199974-5 |D s |
689 | 0 | 2 | |a Fehlertoleranz |0 (DE-588)4123192-2 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Walters, John Paul N. |e Verfasser |4 aut | |
700 | 1 | |a Jiang, Hai |e Verfasser |0 (DE-588)138181195 |4 aut | |
856 | 4 | 2 | |m Digitalisierung UB Bamberg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=018602832&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-018602832 |
Datensatz im Suchindex
_version_ | 1804140655598895104 |
---|---|
adam_text | Contents
1
Introduction
1
1.1.
Introduction to Checkpointing
........................ 1
1.2.
Background on Checkpointing
........................ 2
1.2.1.
LAM/MPI
.............................. 2
1.2.2.
Checkpointing Distributed Systems
................. 2
1.2.3.
Distributed State and Consistency
.................. 5
2
Application-level Checkpointing/Migration
9
2.1.
Thread Migration
............................... 10
2.2.
Adaptive
DSM
Systems
............................ 11
2.2.1.
Background
.............................. 11
2.2.2.
Strings
................................ 12
2.2.3.
Thread Scheduling
.......................... 13
2.2.4.
DSM
Migration Policy
........................ 15
2.2.5.
Adaptation Points
........................... 15
2.3.
Thread States
................................. 16
2.4.
Compile-time Support
............................ 18
2.4.1.
Function Call Graph
......................... 18
2.4.2.
Data Variables
............................ 19
2.4.3.
Pointers
................................ 20
2.4.4.
Function Parameters
......................... 22
2.4.5.
Program Counter
........................... 22
2.4.6.
Adaptation Positions
......................... 22
2.4.7.
Preprocessor
............................. 23
2.5.
Run-time Support
............................... 24
2.5.1.
Stacks
................................. 25
2.5.2.
Memory Segments in Heaps
..................... 26
2.5.3.
Thread State Transfer
......................... 27
2.5.4.
State Restoration and Pointer Translation
.............. 27
2.6.
Performance Analysis
............................. 29
2.7. Microbenchmarks............................... 31
2.8.
Experimental Results
............................. 33
2.9.
Summary
................................... 37
vi
Contents
Migration
Safety
39
3.1.
Checkpointing/Migration-Unsafe Factors
.................. 39
3.1.1.
Pointer Casting
............................ 40
3.1.2.
Pointers in Unions
.......................... 42
3.1.3.
Library Calls
............................. 43
3.1.4.
State-Carrying Instructions
...................... 44
3.1.5.
Incompatible Data Conversion
.................... 45
3.2.
Pointer Representations in
С
......................... 45
3.2.1.
Data Types in
С
............................ 46
3.2.2.
Data Updating Operations
...................... 46
3.2.3.
Pointer Casting
............................ 47
3.3.
Pointer Inference System
........................... 49
3.3.1.
Pointer Inference Rules
........................ 50
3.3.2.
Static Analysis
............................ 52
3.3.3.
Dynamic Check
............................ 54
3.3.4.
Complexity
.............................. 54
3.4. Microbenchmarks............................... 55
3.5.
Experimental Results
............................. 56
3.6.
Related Research
............................... 58
3.7.
Summary
................................... 59
Heterogeneity Support
61
4.1.
Data Representations in Heterogeneous Environments
............ 62
4.1.1.
Tags
.................................. 62
4.1.2.
Canonical Intermediate Form
.................... 62
4.1.3.
Receiver-Makes-Right (RMR)
.................... 62
4.2.
Data Conversion Issues
............................ 63
4.2.1.
Endianness
.............................. 63
4.2.2.
Character Sets
............................ 63
4.2.3.
Floating Point Standards
....................... 63
4.2.4.
Data Alignment and Padding
..................... 64
4.2.5.
Loss of Precision
........................... 65
4.2.6.
Pointers
................................ 65
4.3.
Coarse-grained Tagged RMR in
Mig
Thread
................. 66
4.3.1.
Tagging and Padding Detection
................... 66
4.3.2.
Data Restoration
........................... 69
4.3.3.
Data Resizing
............................. 70
4.3.4.
Address Resizing
........................... 70
4.3.5.
Plug-and-play
............................. 71
4.4.
The Compile Time Support Module
..................... 72
4.5.
The Run-time Support Module
........................ 77
4.6.
Complexity Analysis
............................. 78
4.7. Microbenchmarks............................... 79
4.8.
Experimental Results
............................. 83
4.9.
Related Research
............................... 86
Contents
vii
4.10.
Summary
................................... 91
5
User-Level Checkpointing with LAM
93
5.1.
User-Lever Checkpoint/Fault Tolerance
................... 93
5.2.
User-level LAM Checkpointing with Arbitrary Restart
Structure
.................................... 93
5.2.1.
Existing Implementations
...................... 93
5.2.2.
Enhancements to LAM s Checkpointing
............... 95
5.3.
Checkpoint Storage, Resilience, and Performance
.............. 97
5.3.1.
Dedicated Checkpoint Servers versus Checkpointing to Network
Storage
................................ 98
5.3.2.
Checkpoint Replication
........................ 100
5.3.3.
The Degree of Replication
...................... 102
5.3.4.
Restarting Computation
....................... 102
5.3.5.
Scalability Studies
.......................... 104
6
HPC and Virtualization
107
6.1.
Virtualization within HPC
.......................... 107
6.2.
Virtualization Background
.......................... 108
6.2.1.
Overview of Test Virtualization Implementations
.......... 109
6.3.
Performance Results
............................. 110
6.3.1.
Network Performance
........................
Ill
6.3.2.
File System Performance
....................... 112
6.3.3.
Single Node Benchmarks
....................... 114
6.3.4.
MPI
Benchmarks
........................... 116
6.4.
Fault Tolerant OpenVZ
............................ 119
6.5.
Checkpointing/Restart System Design
.................... 119
6.5.1.
System Startup
............................ 119
6.5.2.
Checkpointing
............................ 120
6.5.3.
Restarting
............................... 121
6.5.4.
Data Resiliency to Node Failures
................... 122
6.6.
Checkpoint/Replication Analysis
....................... 122
6.7.
Performance Results
............................. 123
6.7.1.
Replication Overhead
......................... 125
References
129
Index
137
|
any_adam_object | 1 |
author | Chaudhary, Vipin Walters, John Paul N. Jiang, Hai |
author_GND | (DE-588)138181195 |
author_facet | Chaudhary, Vipin Walters, John Paul N. Jiang, Hai |
author_role | aut aut aut |
author_sort | Chaudhary, Vipin |
author_variant | v c vc j p n w jpn jpnw h j hj |
building | Verbundindex |
bvnumber | BV035742640 |
callnumber-first | Q - Science |
callnumber-label | QA76 |
callnumber-raw | QA76.88 |
callnumber-search | QA76.88 |
callnumber-sort | QA 276.88 |
callnumber-subject | QA - Mathematics |
classification_rvk | ST 151 |
ctrlnum | (OCoLC)368015308 (DE-599)GBV603673805 |
dewey-full | 004/.35 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 004 - Computer science |
dewey-raw | 004/.35 |
dewey-search | 004/.35 |
dewey-sort | 14 235 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02024nam a2200505 c 4500</leader><controlfield tag="001">BV035742640</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20110426 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">090928s2010 |||| 00||| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">2009025493</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781607418405</subfield><subfield code="c">hardcover : alk. paper</subfield><subfield code="9">978-1-60741-840-5</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)368015308</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)GBV603673805</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-473</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QA76.88</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">004/.35</subfield><subfield code="2">22</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 151</subfield><subfield code="0">(DE-625)143595:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chaudhary, Vipin</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Computation checkpointing and migration</subfield><subfield code="c">Vipin Chaudhary, John Paul N. Walters and Hai Jiang</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">New York</subfield><subfield code="b">Nova Science Publ.</subfield><subfield code="c">2010</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Embedded and high performance computing series</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references (p. 129-136) and index</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">High performance computing</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Fault-tolerant computing</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Heterogeneous computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Fault-tolerant computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Heterogeneous computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">High performance computing</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Hochleistungsrechnen</subfield><subfield code="0">(DE-588)4532701-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Heterogenes Rechnernetz</subfield><subfield code="0">(DE-588)4199974-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Fehlertoleranz</subfield><subfield code="0">(DE-588)4123192-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Hochleistungsrechnen</subfield><subfield code="0">(DE-588)4532701-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Heterogenes Rechnernetz</subfield><subfield code="0">(DE-588)4199974-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Fehlertoleranz</subfield><subfield code="0">(DE-588)4123192-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Walters, John Paul N.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Jiang, Hai</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)138181195</subfield><subfield code="4">aut</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Bamberg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=018602832&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-018602832</subfield></datafield></record></collection> |
id | DE-604.BV035742640 |
illustrated | Not Illustrated |
indexdate | 2024-07-09T22:03:26Z |
institution | BVB |
isbn | 9781607418405 |
language | English |
lccn | 2009025493 |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-018602832 |
oclc_num | 368015308 |
open_access_boolean | |
owner | DE-473 DE-BY-UBG |
owner_facet | DE-473 DE-BY-UBG |
publishDate | 2010 |
publishDateSearch | 2010 |
publishDateSort | 2010 |
publisher | Nova Science Publ. |
record_format | marc |
series2 | Embedded and high performance computing series |
spelling | Chaudhary, Vipin Verfasser aut Computation checkpointing and migration Vipin Chaudhary, John Paul N. Walters and Hai Jiang New York Nova Science Publ. 2010 txt rdacontent n rdamedia nc rdacarrier Embedded and high performance computing series Includes bibliographical references (p. 129-136) and index High performance computing Fault-tolerant computing Heterogeneous computing Hochleistungsrechnen (DE-588)4532701-4 gnd rswk-swf Heterogenes Rechnernetz (DE-588)4199974-5 gnd rswk-swf Fehlertoleranz (DE-588)4123192-2 gnd rswk-swf Hochleistungsrechnen (DE-588)4532701-4 s Heterogenes Rechnernetz (DE-588)4199974-5 s Fehlertoleranz (DE-588)4123192-2 s DE-604 Walters, John Paul N. Verfasser aut Jiang, Hai Verfasser (DE-588)138181195 aut Digitalisierung UB Bamberg application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=018602832&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Chaudhary, Vipin Walters, John Paul N. Jiang, Hai Computation checkpointing and migration High performance computing Fault-tolerant computing Heterogeneous computing Hochleistungsrechnen (DE-588)4532701-4 gnd Heterogenes Rechnernetz (DE-588)4199974-5 gnd Fehlertoleranz (DE-588)4123192-2 gnd |
subject_GND | (DE-588)4532701-4 (DE-588)4199974-5 (DE-588)4123192-2 |
title | Computation checkpointing and migration |
title_auth | Computation checkpointing and migration |
title_exact_search | Computation checkpointing and migration |
title_full | Computation checkpointing and migration Vipin Chaudhary, John Paul N. Walters and Hai Jiang |
title_fullStr | Computation checkpointing and migration Vipin Chaudhary, John Paul N. Walters and Hai Jiang |
title_full_unstemmed | Computation checkpointing and migration Vipin Chaudhary, John Paul N. Walters and Hai Jiang |
title_short | Computation checkpointing and migration |
title_sort | computation checkpointing and migration |
topic | High performance computing Fault-tolerant computing Heterogeneous computing Hochleistungsrechnen (DE-588)4532701-4 gnd Heterogenes Rechnernetz (DE-588)4199974-5 gnd Fehlertoleranz (DE-588)4123192-2 gnd |
topic_facet | High performance computing Fault-tolerant computing Heterogeneous computing Hochleistungsrechnen Heterogenes Rechnernetz Fehlertoleranz |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=018602832&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT chaudharyvipin computationcheckpointingandmigration AT waltersjohnpauln computationcheckpointingandmigration AT jianghai computationcheckpointingandmigration |