Data-intensive workflow management: for clouds and data-intensive and scalable computing environments
Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activi...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
[San Rafael, California]
Morgan & Claypool Publishers
[2019]
|
Schriftenreihe: | Synthesis lectures on data management
#60 |
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions |
Beschreibung: | Part of: Synthesis digital library of engineering and computer science Title from PDF title page (viewed on May 29, 2019) |
Beschreibung: | 1 Online-Resource (xvii, 161 Seiten) Illustrationen |
ISBN: | 9781681735580 |
DOI: | 10.2200/S00915ED1V01Y201904DTM060 |
Internformat
MARC
LEADER | 00000nmm a2200000zcb4500 | ||
---|---|---|---|
001 | BV046427697 | ||
003 | DE-604 | ||
005 | 20211124 | ||
007 | cr|uuu---uuuuu | ||
008 | 200217s2019 |||| o||u| ||||||eng d | ||
020 | |a 9781681735580 |c ebook |9 978-1-68173-558-0 | ||
024 | 7 | |a 10.2200/S00915ED1V01Y201904DTM060 |2 doi | |
035 | |a (ZDB-105-MCS)8715841 | ||
035 | |a (OCoLC)1141158495 | ||
035 | |a (DE-599)BVBBV046427697 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-83 | ||
082 | 0 | |a 004.67/82 |2 23 | |
084 | |a ST 530 |0 (DE-625)143679: |2 rvk | ||
100 | 1 | |a Oliveira, Daniel de |e Verfasser |0 (DE-588)1192516095 |4 aut | |
245 | 1 | 0 | |a Data-intensive workflow management |b for clouds and data-intensive and scalable computing environments |c Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti |
264 | 1 | |a [San Rafael, California] |b Morgan & Claypool Publishers |c [2019] | |
264 | 4 | |c © 2019 | |
300 | |a 1 Online-Resource (xvii, 161 Seiten) |b Illustrationen | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
490 | 1 | |a Synthesis lectures on data management |v #60 | |
500 | |a Part of: Synthesis digital library of engineering and computer science | ||
500 | |a Title from PDF title page (viewed on May 29, 2019) | ||
520 | |a Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. | ||
520 | |a More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. | ||
520 | |a As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions | ||
650 | 4 | |a Cloud computing | |
650 | 4 | |a Database management | |
650 | 0 | 7 | |a Datenverwaltung |0 (DE-588)4011168-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Cloud Computing |0 (DE-588)7623494-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Prozessmanagement |0 (DE-588)4353072-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Big Data |0 (DE-588)4802620-7 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Prozessmanagement |0 (DE-588)4353072-2 |D s |
689 | 0 | 1 | |a Cloud Computing |0 (DE-588)7623494-0 |D s |
689 | 0 | 2 | |a Big Data |0 (DE-588)4802620-7 |D s |
689 | 0 | 3 | |a Datenverwaltung |0 (DE-588)4011168-4 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Liu, Ji |e Verfasser |4 aut | |
700 | 1 | |a Pacitti, Esther |e Verfasser |4 aut | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe, hardcover |z 978-1-68173-559-7 |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe, paperback |z 978-1-68173-557-3 |
830 | 0 | |a Synthesis lectures on data management |v #60 |w (DE-604)BV036731811 |9 60 | |
856 | 4 | 0 | |u https://doi.org/10.2200/S00915ED1V01Y201904DTM060 |x Verlag |z URL des Erstveröffentlichers |3 Volltext |
912 | |a ZDB-105-MCS |a ZDB-105-MCDM | ||
999 | |a oai:aleph.bib-bvb.de:BVB01-031839999 |
Datensatz im Suchindex
_version_ | 1804180976806395904 |
---|---|
any_adam_object | |
author | Oliveira, Daniel de Liu, Ji Pacitti, Esther |
author_GND | (DE-588)1192516095 |
author_facet | Oliveira, Daniel de Liu, Ji Pacitti, Esther |
author_role | aut aut aut |
author_sort | Oliveira, Daniel de |
author_variant | d d o dd ddo j l jl e p ep |
building | Verbundindex |
bvnumber | BV046427697 |
classification_rvk | ST 530 |
collection | ZDB-105-MCS ZDB-105-MCDM |
ctrlnum | (ZDB-105-MCS)8715841 (OCoLC)1141158495 (DE-599)BVBBV046427697 |
dewey-full | 004.67/82 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 004 - Computer science |
dewey-raw | 004.67/82 |
dewey-search | 004.67/82 |
dewey-sort | 14.67 282 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
doi_str_mv | 10.2200/S00915ED1V01Y201904DTM060 |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>04924nmm a2200601zcb4500</leader><controlfield tag="001">BV046427697</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20211124 </controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">200217s2019 |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781681735580</subfield><subfield code="c">ebook</subfield><subfield code="9">978-1-68173-558-0</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.2200/S00915ED1V01Y201904DTM060</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-105-MCS)8715841</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1141158495</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV046427697</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-83</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">004.67/82</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 530</subfield><subfield code="0">(DE-625)143679:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Oliveira, Daniel de</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1192516095</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Data-intensive workflow management</subfield><subfield code="b">for clouds and data-intensive and scalable computing environments</subfield><subfield code="c">Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">[San Rafael, California]</subfield><subfield code="b">Morgan & Claypool Publishers</subfield><subfield code="c">[2019]</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">© 2019</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Resource (xvii, 161 Seiten)</subfield><subfield code="b">Illustrationen</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Synthesis lectures on data management</subfield><subfield code="v">#60</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Part of: Synthesis digital library of engineering and computer science</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Title from PDF title page (viewed on May 29, 2019)</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Cloud computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Database management</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenverwaltung</subfield><subfield code="0">(DE-588)4011168-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Cloud Computing</subfield><subfield code="0">(DE-588)7623494-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Prozessmanagement</subfield><subfield code="0">(DE-588)4353072-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Prozessmanagement</subfield><subfield code="0">(DE-588)4353072-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Cloud Computing</subfield><subfield code="0">(DE-588)7623494-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Datenverwaltung</subfield><subfield code="0">(DE-588)4011168-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Liu, Ji</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Pacitti, Esther</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe, hardcover</subfield><subfield code="z">978-1-68173-559-7</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe, paperback</subfield><subfield code="z">978-1-68173-557-3</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Synthesis lectures on data management</subfield><subfield code="v">#60</subfield><subfield code="w">(DE-604)BV036731811</subfield><subfield code="9">60</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://doi.org/10.2200/S00915ED1V01Y201904DTM060</subfield><subfield code="x">Verlag</subfield><subfield code="z">URL des Erstveröffentlichers</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-105-MCS</subfield><subfield code="a">ZDB-105-MCDM</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-031839999</subfield></datafield></record></collection> |
id | DE-604.BV046427697 |
illustrated | Not Illustrated |
indexdate | 2024-07-10T08:44:19Z |
institution | BVB |
isbn | 9781681735580 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-031839999 |
oclc_num | 1141158495 |
open_access_boolean | |
owner | DE-83 |
owner_facet | DE-83 |
physical | 1 Online-Resource (xvii, 161 Seiten) Illustrationen |
psigel | ZDB-105-MCS ZDB-105-MCDM |
publishDate | 2019 |
publishDateSearch | 2019 |
publishDateSort | 2019 |
publisher | Morgan & Claypool Publishers |
record_format | marc |
series | Synthesis lectures on data management |
series2 | Synthesis lectures on data management |
spelling | Oliveira, Daniel de Verfasser (DE-588)1192516095 aut Data-intensive workflow management for clouds and data-intensive and scalable computing environments Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti [San Rafael, California] Morgan & Claypool Publishers [2019] © 2019 1 Online-Resource (xvii, 161 Seiten) Illustrationen txt rdacontent c rdamedia cr rdacarrier Synthesis lectures on data management #60 Part of: Synthesis digital library of engineering and computer science Title from PDF title page (viewed on May 29, 2019) Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions Cloud computing Database management Datenverwaltung (DE-588)4011168-4 gnd rswk-swf Cloud Computing (DE-588)7623494-0 gnd rswk-swf Prozessmanagement (DE-588)4353072-2 gnd rswk-swf Big Data (DE-588)4802620-7 gnd rswk-swf Prozessmanagement (DE-588)4353072-2 s Cloud Computing (DE-588)7623494-0 s Big Data (DE-588)4802620-7 s Datenverwaltung (DE-588)4011168-4 s DE-604 Liu, Ji Verfasser aut Pacitti, Esther Verfasser aut Erscheint auch als Druck-Ausgabe, hardcover 978-1-68173-559-7 Erscheint auch als Druck-Ausgabe, paperback 978-1-68173-557-3 Synthesis lectures on data management #60 (DE-604)BV036731811 60 https://doi.org/10.2200/S00915ED1V01Y201904DTM060 Verlag URL des Erstveröffentlichers Volltext |
spellingShingle | Oliveira, Daniel de Liu, Ji Pacitti, Esther Data-intensive workflow management for clouds and data-intensive and scalable computing environments Synthesis lectures on data management Cloud computing Database management Datenverwaltung (DE-588)4011168-4 gnd Cloud Computing (DE-588)7623494-0 gnd Prozessmanagement (DE-588)4353072-2 gnd Big Data (DE-588)4802620-7 gnd |
subject_GND | (DE-588)4011168-4 (DE-588)7623494-0 (DE-588)4353072-2 (DE-588)4802620-7 |
title | Data-intensive workflow management for clouds and data-intensive and scalable computing environments |
title_auth | Data-intensive workflow management for clouds and data-intensive and scalable computing environments |
title_exact_search | Data-intensive workflow management for clouds and data-intensive and scalable computing environments |
title_full | Data-intensive workflow management for clouds and data-intensive and scalable computing environments Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti |
title_fullStr | Data-intensive workflow management for clouds and data-intensive and scalable computing environments Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti |
title_full_unstemmed | Data-intensive workflow management for clouds and data-intensive and scalable computing environments Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti |
title_short | Data-intensive workflow management |
title_sort | data intensive workflow management for clouds and data intensive and scalable computing environments |
title_sub | for clouds and data-intensive and scalable computing environments |
topic | Cloud computing Database management Datenverwaltung (DE-588)4011168-4 gnd Cloud Computing (DE-588)7623494-0 gnd Prozessmanagement (DE-588)4353072-2 gnd Big Data (DE-588)4802620-7 gnd |
topic_facet | Cloud computing Database management Datenverwaltung Cloud Computing Prozessmanagement Big Data |
url | https://doi.org/10.2200/S00915ED1V01Y201904DTM060 |
volume_link | (DE-604)BV036731811 |
work_keys_str_mv | AT oliveiradanielde dataintensiveworkflowmanagementforcloudsanddataintensiveandscalablecomputingenvironments AT liuji dataintensiveworkflowmanagementforcloudsanddataintensiveandscalablecomputingenvironments AT pacittiesther dataintensiveworkflowmanagementforcloudsanddataintensiveandscalablecomputingenvironments |