Spark: big data cluster computing in production
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Indianapolis, IN
Wiley
2016
|
Schlagworte: | |
Online-Zugang: | FHI01 FRO01 UBG01 Volltext |
Beschreibung: | Description based upon print version of record Spark™ Big Data Cluster Computing in Production; About the Authors; About the Technical Editors; Credits; Acknowledgments; Contents at a glance; Contents; Introduction; Chapter 1 Finishing Your Spark Job; Installation of the Necessary Components; Native Installation Using a Spark Standalone Cluster; The History of Distributed Computing That Led to Spark; Enter the Cloud; Understanding Resource Management; Using Various Formats for Storage; Text Files; Sequence Files; Avro Files; Parquet Files; Making Sense of Monitoring and Instrumentation; Spark UI; Spark Standalone UI; Metrics REST API Metrics SystemExternal Monitoring Tools; Summary; Chapter 2 Cluster Management; Background; Spark Components; Driver; Workers and Executors; Configuration; Spark Standalone; Architecture; Single-Node Setup Scenario; Multi-Node Setup; YARN; Architecture; Dynamic Resource Allocation; Scenario; Mesos; Setup; Architecture; Dynamic Resource Allocation; Basic Setup Scenario; Comparison; Summary; Chapter 3 Performance Tuning; Spark Execution Model; Partitioning; Controlling Parallelism; Partitioners; Shuffling Data; Shuffling and Data Partitioning; Operators and Shuffling Shuffling Is Not That Bad After AllSerialization; Kryo Registrators; Spark Cache; Spark SQL Cache; Memory Management; Garbage Collection; Shared Variables; Broadcast Variables; Accumulators; Data Locality; Summary; Chapter 4 Security; Architecture; Security Manager; Setup Configurations; ACL; Configuration; Job Submission; Web UI; Network Security; Encryption; Event logging; Kerberos; Apache Sentry; Summary; Chapter 5 Fault Tolerance or Job Execution; Lifecycle of a Spark Job; Spark Master; Spark Driver; Spark Worker; Job Lifecycle; Job Scheduling; Scheduling within an Application Scheduling with External UtilitiesFault Tolerance; Internal and External Fault Tolerance; Service Level Agreements (SLAs); Resilient Distributed Datasets (RDDs); Batch versus Streaming; Testing Strategies; Recommended Configurations; Summary; Chapter 6 Beyond Spark; Data Warehousing; Spark SQL CLI; Thrift JDBC/ODBC Server; Hive on Spark; Machine Learning; DataFrame; MLlib and ML; Mahout on Spark; Hivemall on Spark; External Frameworks; Spark Package; XGBoost; spark-jobserver; Future Works; Integration with the Parameter Server; Deep Learning; Enterprise Usage Collecting User Activity Log with Spark and KafkaReal-Time Recommendation with Spark; Real-Time Categorization of Twitter Bots; Summary; Index; EULA |
Beschreibung: | 1 Online-Ressource (219 Seiten) |
ISBN: | 1119254043 1119254051 1119254809 9781119254041 9781119254058 9781119254805 |
Internformat
MARC
LEADER | 00000nmm a2200000zc 4500 | ||
---|---|---|---|
001 | BV043835594 | ||
003 | DE-604 | ||
005 | 20171129 | ||
007 | cr|uuu---uuuuu | ||
008 | 161020s2016 |||| o||u| ||||||eng d | ||
020 | |a 1119254043 |9 1-119-25404-3 | ||
020 | |a 1119254051 |9 1-119-25405-1 | ||
020 | |a 1119254809 |9 1-119-25480-9 | ||
020 | |a 9781119254041 |9 978-1-119-25404-1 | ||
020 | |a 9781119254058 |9 978-1-119-25405-8 | ||
020 | |a 9781119254805 |c Online |9 978-1-119-25480-5 | ||
024 | 7 | |a 10.1002/9781119254805 |2 doi | |
035 | |a (ZDB-35-WIC)ocn945137904 | ||
035 | |a (OCoLC)951120543 | ||
035 | |a (DE-599)BVBBV043835594 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-861 |a DE-573 | ||
082 | 0 | |a 006.3/12 | |
084 | |a ST 250 |0 (DE-625)143626: |2 rvk | ||
100 | 1 | |a Ganelin, Ilya |e Verfasser |0 (DE-588)1106273389 |4 aut | |
245 | 1 | 0 | |a Spark |b big data cluster computing in production |c Ilya Ganelin ... [et al.] |
264 | 1 | |a Indianapolis, IN |b Wiley |c 2016 | |
300 | |a 1 Online-Ressource (219 Seiten) | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
500 | |a Description based upon print version of record | ||
500 | |a Spark™ Big Data Cluster Computing in Production; About the Authors; About the Technical Editors; Credits; Acknowledgments; Contents at a glance; Contents; Introduction; Chapter 1 Finishing Your Spark Job; Installation of the Necessary Components; Native Installation Using a Spark Standalone Cluster; The History of Distributed Computing That Led to Spark; Enter the Cloud; Understanding Resource Management; Using Various Formats for Storage; Text Files; Sequence Files; Avro Files; Parquet Files; Making Sense of Monitoring and Instrumentation; Spark UI; Spark Standalone UI; Metrics REST API | ||
500 | |a Metrics SystemExternal Monitoring Tools; Summary; Chapter 2 Cluster Management; Background; Spark Components; Driver; Workers and Executors; Configuration; Spark Standalone; Architecture; Single-Node Setup Scenario; Multi-Node Setup; YARN; Architecture; Dynamic Resource Allocation; Scenario; Mesos; Setup; Architecture; Dynamic Resource Allocation; Basic Setup Scenario; Comparison; Summary; Chapter 3 Performance Tuning; Spark Execution Model; Partitioning; Controlling Parallelism; Partitioners; Shuffling Data; Shuffling and Data Partitioning; Operators and Shuffling | ||
500 | |a Shuffling Is Not That Bad After AllSerialization; Kryo Registrators; Spark Cache; Spark SQL Cache; Memory Management; Garbage Collection; Shared Variables; Broadcast Variables; Accumulators; Data Locality; Summary; Chapter 4 Security; Architecture; Security Manager; Setup Configurations; ACL; Configuration; Job Submission; Web UI; Network Security; Encryption; Event logging; Kerberos; Apache Sentry; Summary; Chapter 5 Fault Tolerance or Job Execution; Lifecycle of a Spark Job; Spark Master; Spark Driver; Spark Worker; Job Lifecycle; Job Scheduling; Scheduling within an Application | ||
500 | |a Scheduling with External UtilitiesFault Tolerance; Internal and External Fault Tolerance; Service Level Agreements (SLAs); Resilient Distributed Datasets (RDDs); Batch versus Streaming; Testing Strategies; Recommended Configurations; Summary; Chapter 6 Beyond Spark; Data Warehousing; Spark SQL CLI; Thrift JDBC/ODBC Server; Hive on Spark; Machine Learning; DataFrame; MLlib and ML; Mahout on Spark; Hivemall on Spark; External Frameworks; Spark Package; XGBoost; spark-jobserver; Future Works; Integration with the Parameter Server; Deep Learning; Enterprise Usage | ||
500 | |a Collecting User Activity Log with Spark and KafkaReal-Time Recommendation with Spark; Real-Time Categorization of Twitter Bots; Summary; Index; EULA | ||
630 | 0 | 4 | |a SPARK (Electronic resource) |
650 | 7 | |a SPARK (Electronic resource) |2 fast | |
650 | 7 | |a COMPUTERS / General |2 bisacsh | |
650 | 7 | |a Big data |2 fast | |
650 | 4 | |a Big data | |
650 | 0 | 7 | |a Big Data |0 (DE-588)4802620-7 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a SPARK 2.0 |0 (DE-588)4338029-3 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a SPARK 2.0 |0 (DE-588)4338029-3 |D s |
689 | 0 | 1 | |a Big Data |0 (DE-588)4802620-7 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Orhian, Ema |e Sonstige |0 (DE-588)1106273788 |4 oth | |
700 | 1 | |a Sasaki, Kai |e Sonstige |0 (DE-588)1106274229 |4 oth | |
700 | 1 | |a York, Brennon |e Sonstige |0 (DE-588)1106275314 |4 oth | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe |z 9781119254010 |
776 | 0 | 8 | |i Erscheint auch als |n Druckausgabe |z 978-1-119-25480-5 |
856 | 4 | 0 | |u https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 |x Verlag |z URL des Erstveröffentlichers |3 Volltext |
912 | |a ZDB-35-WIC | ||
940 | 1 | |q UBG_PDA_WIC | |
999 | |a oai:aleph.bib-bvb.de:BVB01-029246279 | ||
966 | e | |u https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 |l FHI01 |p ZDB-35-WIC |x Verlag |3 Volltext | |
966 | e | |u https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 |l FRO01 |p ZDB-35-WIC |q FRO_PDA_WIC |x Verlag |3 Volltext | |
966 | e | |u https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 |l UBG01 |p ZDB-35-WIC |q UBG_PDA_WIC |x Verlag |3 Volltext |
Datensatz im Suchindex
_version_ | 1804176701654040576 |
---|---|
any_adam_object | |
author | Ganelin, Ilya |
author_GND | (DE-588)1106273389 (DE-588)1106273788 (DE-588)1106274229 (DE-588)1106275314 |
author_facet | Ganelin, Ilya |
author_role | aut |
author_sort | Ganelin, Ilya |
author_variant | i g ig |
building | Verbundindex |
bvnumber | BV043835594 |
classification_rvk | ST 250 |
collection | ZDB-35-WIC |
ctrlnum | (ZDB-35-WIC)ocn945137904 (OCoLC)951120543 (DE-599)BVBBV043835594 |
dewey-full | 006.3/12 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.3/12 |
dewey-search | 006.3/12 |
dewey-sort | 16.3 212 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>05092nmm a2200685zc 4500</leader><controlfield tag="001">BV043835594</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20171129 </controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">161020s2016 |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1119254043</subfield><subfield code="9">1-119-25404-3</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1119254051</subfield><subfield code="9">1-119-25405-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1119254809</subfield><subfield code="9">1-119-25480-9</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781119254041</subfield><subfield code="9">978-1-119-25404-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781119254058</subfield><subfield code="9">978-1-119-25405-8</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781119254805</subfield><subfield code="c">Online</subfield><subfield code="9">978-1-119-25480-5</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1002/9781119254805</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-35-WIC)ocn945137904</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)951120543</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV043835594</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-861</subfield><subfield code="a">DE-573</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.3/12</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 250</subfield><subfield code="0">(DE-625)143626:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Ganelin, Ilya</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1106273389</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Spark</subfield><subfield code="b">big data cluster computing in production</subfield><subfield code="c">Ilya Ganelin ... [et al.]</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Indianapolis, IN</subfield><subfield code="b">Wiley</subfield><subfield code="c">2016</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (219 Seiten)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Description based upon print version of record</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Spark™ Big Data Cluster Computing in Production; About the Authors; About the Technical Editors; Credits; Acknowledgments; Contents at a glance; Contents; Introduction; Chapter 1 Finishing Your Spark Job; Installation of the Necessary Components; Native Installation Using a Spark Standalone Cluster; The History of Distributed Computing That Led to Spark; Enter the Cloud; Understanding Resource Management; Using Various Formats for Storage; Text Files; Sequence Files; Avro Files; Parquet Files; Making Sense of Monitoring and Instrumentation; Spark UI; Spark Standalone UI; Metrics REST API</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Metrics SystemExternal Monitoring Tools; Summary; Chapter 2 Cluster Management; Background; Spark Components; Driver; Workers and Executors; Configuration; Spark Standalone; Architecture; Single-Node Setup Scenario; Multi-Node Setup; YARN; Architecture; Dynamic Resource Allocation; Scenario; Mesos; Setup; Architecture; Dynamic Resource Allocation; Basic Setup Scenario; Comparison; Summary; Chapter 3 Performance Tuning; Spark Execution Model; Partitioning; Controlling Parallelism; Partitioners; Shuffling Data; Shuffling and Data Partitioning; Operators and Shuffling</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Shuffling Is Not That Bad After AllSerialization; Kryo Registrators; Spark Cache; Spark SQL Cache; Memory Management; Garbage Collection; Shared Variables; Broadcast Variables; Accumulators; Data Locality; Summary; Chapter 4 Security; Architecture; Security Manager; Setup Configurations; ACL; Configuration; Job Submission; Web UI; Network Security; Encryption; Event logging; Kerberos; Apache Sentry; Summary; Chapter 5 Fault Tolerance or Job Execution; Lifecycle of a Spark Job; Spark Master; Spark Driver; Spark Worker; Job Lifecycle; Job Scheduling; Scheduling within an Application</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Scheduling with External UtilitiesFault Tolerance; Internal and External Fault Tolerance; Service Level Agreements (SLAs); Resilient Distributed Datasets (RDDs); Batch versus Streaming; Testing Strategies; Recommended Configurations; Summary; Chapter 6 Beyond Spark; Data Warehousing; Spark SQL CLI; Thrift JDBC/ODBC Server; Hive on Spark; Machine Learning; DataFrame; MLlib and ML; Mahout on Spark; Hivemall on Spark; External Frameworks; Spark Package; XGBoost; spark-jobserver; Future Works; Integration with the Parameter Server; Deep Learning; Enterprise Usage</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Collecting User Activity Log with Spark and KafkaReal-Time Recommendation with Spark; Real-Time Categorization of Twitter Bots; Summary; Index; EULA</subfield></datafield><datafield tag="630" ind1="0" ind2="4"><subfield code="a">SPARK (Electronic resource)</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">SPARK (Electronic resource)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">COMPUTERS / General</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Big data</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Big data</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">SPARK 2.0</subfield><subfield code="0">(DE-588)4338029-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">SPARK 2.0</subfield><subfield code="0">(DE-588)4338029-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Orhian, Ema</subfield><subfield code="e">Sonstige</subfield><subfield code="0">(DE-588)1106273788</subfield><subfield code="4">oth</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sasaki, Kai</subfield><subfield code="e">Sonstige</subfield><subfield code="0">(DE-588)1106274229</subfield><subfield code="4">oth</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">York, Brennon</subfield><subfield code="e">Sonstige</subfield><subfield code="0">(DE-588)1106275314</subfield><subfield code="4">oth</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">9781119254010</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druckausgabe</subfield><subfield code="z">978-1-119-25480-5</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805</subfield><subfield code="x">Verlag</subfield><subfield code="z">URL des Erstveröffentlichers</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-35-WIC</subfield></datafield><datafield tag="940" ind1="1" ind2=" "><subfield code="q">UBG_PDA_WIC</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-029246279</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805</subfield><subfield code="l">FHI01</subfield><subfield code="p">ZDB-35-WIC</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805</subfield><subfield code="l">FRO01</subfield><subfield code="p">ZDB-35-WIC</subfield><subfield code="q">FRO_PDA_WIC</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805</subfield><subfield code="l">UBG01</subfield><subfield code="p">ZDB-35-WIC</subfield><subfield code="q">UBG_PDA_WIC</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield></record></collection> |
id | DE-604.BV043835594 |
illustrated | Not Illustrated |
indexdate | 2024-07-10T07:36:22Z |
institution | BVB |
isbn | 1119254043 1119254051 1119254809 9781119254041 9781119254058 9781119254805 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-029246279 |
oclc_num | 945137904 951120543 |
open_access_boolean | |
owner | DE-861 DE-573 |
owner_facet | DE-861 DE-573 |
physical | 1 Online-Ressource (219 Seiten) |
psigel | ZDB-35-WIC UBG_PDA_WIC ZDB-35-WIC FRO_PDA_WIC ZDB-35-WIC UBG_PDA_WIC |
publishDate | 2016 |
publishDateSearch | 2016 |
publishDateSort | 2016 |
publisher | Wiley |
record_format | marc |
spelling | Ganelin, Ilya Verfasser (DE-588)1106273389 aut Spark big data cluster computing in production Ilya Ganelin ... [et al.] Indianapolis, IN Wiley 2016 1 Online-Ressource (219 Seiten) txt rdacontent c rdamedia cr rdacarrier Description based upon print version of record Spark™ Big Data Cluster Computing in Production; About the Authors; About the Technical Editors; Credits; Acknowledgments; Contents at a glance; Contents; Introduction; Chapter 1 Finishing Your Spark Job; Installation of the Necessary Components; Native Installation Using a Spark Standalone Cluster; The History of Distributed Computing That Led to Spark; Enter the Cloud; Understanding Resource Management; Using Various Formats for Storage; Text Files; Sequence Files; Avro Files; Parquet Files; Making Sense of Monitoring and Instrumentation; Spark UI; Spark Standalone UI; Metrics REST API Metrics SystemExternal Monitoring Tools; Summary; Chapter 2 Cluster Management; Background; Spark Components; Driver; Workers and Executors; Configuration; Spark Standalone; Architecture; Single-Node Setup Scenario; Multi-Node Setup; YARN; Architecture; Dynamic Resource Allocation; Scenario; Mesos; Setup; Architecture; Dynamic Resource Allocation; Basic Setup Scenario; Comparison; Summary; Chapter 3 Performance Tuning; Spark Execution Model; Partitioning; Controlling Parallelism; Partitioners; Shuffling Data; Shuffling and Data Partitioning; Operators and Shuffling Shuffling Is Not That Bad After AllSerialization; Kryo Registrators; Spark Cache; Spark SQL Cache; Memory Management; Garbage Collection; Shared Variables; Broadcast Variables; Accumulators; Data Locality; Summary; Chapter 4 Security; Architecture; Security Manager; Setup Configurations; ACL; Configuration; Job Submission; Web UI; Network Security; Encryption; Event logging; Kerberos; Apache Sentry; Summary; Chapter 5 Fault Tolerance or Job Execution; Lifecycle of a Spark Job; Spark Master; Spark Driver; Spark Worker; Job Lifecycle; Job Scheduling; Scheduling within an Application Scheduling with External UtilitiesFault Tolerance; Internal and External Fault Tolerance; Service Level Agreements (SLAs); Resilient Distributed Datasets (RDDs); Batch versus Streaming; Testing Strategies; Recommended Configurations; Summary; Chapter 6 Beyond Spark; Data Warehousing; Spark SQL CLI; Thrift JDBC/ODBC Server; Hive on Spark; Machine Learning; DataFrame; MLlib and ML; Mahout on Spark; Hivemall on Spark; External Frameworks; Spark Package; XGBoost; spark-jobserver; Future Works; Integration with the Parameter Server; Deep Learning; Enterprise Usage Collecting User Activity Log with Spark and KafkaReal-Time Recommendation with Spark; Real-Time Categorization of Twitter Bots; Summary; Index; EULA SPARK (Electronic resource) SPARK (Electronic resource) fast COMPUTERS / General bisacsh Big data fast Big data Big Data (DE-588)4802620-7 gnd rswk-swf SPARK 2.0 (DE-588)4338029-3 gnd rswk-swf SPARK 2.0 (DE-588)4338029-3 s Big Data (DE-588)4802620-7 s DE-604 Orhian, Ema Sonstige (DE-588)1106273788 oth Sasaki, Kai Sonstige (DE-588)1106274229 oth York, Brennon Sonstige (DE-588)1106275314 oth Erscheint auch als Druck-Ausgabe 9781119254010 Erscheint auch als Druckausgabe 978-1-119-25480-5 https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 Verlag URL des Erstveröffentlichers Volltext |
spellingShingle | Ganelin, Ilya Spark big data cluster computing in production SPARK (Electronic resource) SPARK (Electronic resource) fast COMPUTERS / General bisacsh Big data fast Big data Big Data (DE-588)4802620-7 gnd SPARK 2.0 (DE-588)4338029-3 gnd |
subject_GND | (DE-588)4802620-7 (DE-588)4338029-3 |
title | Spark big data cluster computing in production |
title_auth | Spark big data cluster computing in production |
title_exact_search | Spark big data cluster computing in production |
title_full | Spark big data cluster computing in production Ilya Ganelin ... [et al.] |
title_fullStr | Spark big data cluster computing in production Ilya Ganelin ... [et al.] |
title_full_unstemmed | Spark big data cluster computing in production Ilya Ganelin ... [et al.] |
title_short | Spark |
title_sort | spark big data cluster computing in production |
title_sub | big data cluster computing in production |
topic | SPARK (Electronic resource) SPARK (Electronic resource) fast COMPUTERS / General bisacsh Big data fast Big data Big Data (DE-588)4802620-7 gnd SPARK 2.0 (DE-588)4338029-3 gnd |
topic_facet | SPARK (Electronic resource) COMPUTERS / General Big data Big Data SPARK 2.0 |
url | https://onlinelibrary.wiley.com/doi/book/10.1002/9781119254805 |
work_keys_str_mv | AT ganelinilya sparkbigdataclustercomputinginproduction AT orhianema sparkbigdataclustercomputinginproduction AT sasakikai sparkbigdataclustercomputinginproduction AT yorkbrennon sparkbigdataclustercomputinginproduction |