The enterprise big data lake: delivering the promise of big data and data science
Main Author: | Gorelik, Alex |
---|---|
Format: | Electronic eBook |
Language: | English |
Published: | Beijing ; Boston ; Farnham ; Sebastopol ; Tokyo : O'Reilly, 2019 |
Edition: | First edition |
Subjects: | Big data ; Computer science |
Online Access: | UBY01 |
Summary: | Intro -- Copyright -- Table of Contents -- Preface -- Who Should Read This Book? -- Conventions Used in This Book -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Introduction to Data Lakes -- Data Lake Maturity -- Data Puddles -- Data Ponds -- Creating a Successful Data Lake -- The Right Platform -- The Right Data -- The Right Interface -- The Data Swamp -- Roadmap to Data Lake Success -- Standing Up a Data Lake -- Organizing the Data Lake -- Setting Up the Data Lake for Self-Service -- Data Lake Architectures -- Data Lakes in the Public Cloud -- Logical Data Lakes -- Conclusion -- Chapter 2. Historical Perspective -- The Drive for Self-Service Data-The Birth of Databases -- The Analytics Imperative-The Birth of Data Warehousing -- The Data Warehouse Ecosystem -- Storing and Querying the Data -- Loading the Data-Data Integration Tools -- Organizing and Managing the Data -- Consuming the Data -- Conclusion -- Chapter 3. Introduction to Big Data and Data Science -- Hadoop Leads the Historic Shift to Big Data -- The Hadoop File System -- How Processing and Storage Interact in a MapReduce Job -- Schema on Read -- Hadoop Projects -- Data Science -- What Should Your Analytics Organization Focus On? -- Machine Learning -- Explainability -- Change Management -- Conclusion -- Chapter 4. Starting a Data Lake -- The What and Why of Hadoop -- Preventing Proliferation of Data Puddles -- Taking Advantage of Big Data -- Leading with Data Science -- Strategy 1: Offload Existing Functionality -- Strategy 2: Data Lakes for New Projects -- Strategy 3: Establish a Central Point of Governance -- Which Way Is Right for You? -- Conclusion -- Chapter 5. From Data Ponds/Big Data Warehouses to Data Lakes -- Essential Functions of a Data Warehouse -- Dimensional Modeling for Analytics -- Integrating Data from Disparate Sources |
Description: | 1 online resource (xii, 205 pages) : illustrations |
ISBN: | 9781491931523 |
Internal format
MARC
LEADER | 00000nmm a2200000 c 4500 | ||
---|---|---|---|
001 | BV047338357 | ||
003 | DE-604 | ||
005 | 20210702 | ||
007 | cr|uuu---uuuuu | ||
008 | 210622s2019 cc |||| o||u| ||||||eng d | ||
020 | |a 9781491931523 |c Online |9 978-1-4919-3152-3 | ||
035 | |a (ZDB-30-PQE)5718882 | ||
035 | |a (OCoLC)1257808653 | ||
035 | |a (DE-599)KXP166313779X | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
044 | |a cc |c XB-CN |a xxu |c XD-US |a xxk |c XA-GB |a ru |c XA-RU |a ja |c XB-JP | ||
049 | |a DE-706 | ||
100 | 1 | |a Gorelik, Alex |e Verfasser |0 (DE-588)1182582699 |4 aut | |
245 | 1 | 0 | |a The enterprise big data lake |b delivering the promise of big data and data science |c Alex Gorelik |
250 | |a First edition | ||
264 | 1 | |a Beijing ; Boston ; Farnham ; Sebastopol ; Tokyo |b O'Reilly |c 2019 | |
300 | |a 1 Online-Ressource (xii, 205 Seiten) |b Illustrationen | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
520 | 3 | |a Intro -- Copyright -- Table of Contents -- Preface -- Who Should Read This Book? -- Conventions Used in This Book -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Introduction to Data Lakes -- Data Lake Maturity -- Data Puddles -- Data Ponds -- Creating a Successful Data Lake -- The Right Platform -- The Right Data -- The Right Interface -- The Data Swamp -- Roadmap to Data Lake Success -- Standing Up a Data Lake -- Organizing the Data Lake -- Setting Up the Data Lake for Self-Service -- Data Lake Architectures -- Data Lakes in the Public Cloud -- Logical Data Lakes -- Conclusion -- Chapter 2. Historical Perspective -- The Drive for Self-Service Data-The Birth of Databases -- The Analytics Imperative-The Birth of Data Warehousing -- The Data Warehouse Ecosystem -- Storing and Querying the Data -- Loading the Data-Data Integration Tools -- Organizing and Managing the Data -- Consuming the Data -- Conclusion -- Chapter 3. Introduction to Big Data and Data Science -- Hadoop Leads the Historic Shift to Big Data -- The Hadoop File System -- How Processing and Storage Interact in a MapReduce Job -- Schema on Read -- Hadoop Projects -- Data Science -- What Should Your Analytics Organization Focus On? -- Machine Learning -- Explainability -- Change Management -- Conclusion -- Chapter 4. Starting a Data Lake -- The What and Why of Hadoop -- Preventing Proliferation of Data Puddles -- Taking Advantage of Big Data -- Leading with Data Science -- Strategy 1: Offload Existing Functionality -- Strategy 2: Data Lakes for New Projects -- Strategy 3: Establish a Central Point of Governance -- Which Way Is Right for You? -- Conclusion -- Chapter 5. From Data Ponds/Big Data Warehouses to Data Lakes -- Essential Functions of a Data Warehouse -- Dimensional Modeling for Analytics -- Integrating Data from Disparate Sources | |
653 | 0 | |a Big data | |
653 | 0 | |a Computer science | |
912 | |a ZDB-30-PQE | ||
999 | |a oai:aleph.bib-bvb.de:BVB01-032740808 | ||
966 | e | |u https://ebookcentral.proquest.com/lib/unibwm/detail.action?docID=5718882 |l UBY01 |p ZDB-30-PQE |q UBY01_Einzelkauf21 |x Aggregator |3 Volltext |
Record in the search index
_version_ | 1804182553048907776 |
---|---|
adam_txt | |
any_adam_object | |
any_adam_object_boolean | |
author | Gorelik, Alex |
author_GND | (DE-588)1182582699 |
author_facet | Gorelik, Alex |
author_role | aut |
author_sort | Gorelik, Alex |
author_variant | a g ag |
building | Verbundindex |
bvnumber | BV047338357 |
collection | ZDB-30-PQE |
ctrlnum | (ZDB-30-PQE)5718882 (OCoLC)1257808653 (DE-599)KXP166313779X |
edition | First edition |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03113nmm a2200349 c 4500</leader><controlfield tag="001">BV047338357</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20210702 </controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">210622s2019 cc |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781491931523</subfield><subfield code="c">Online</subfield><subfield code="9">978-1-4919-3152-3</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-30-PQE)5718882</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1257808653</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KXP166313779X</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">cc</subfield><subfield code="c">XB-CN</subfield><subfield code="a">xxu</subfield><subfield code="c">XD-US</subfield><subfield code="a">xxk</subfield><subfield code="c">XA-GB</subfield><subfield code="a">ru</subfield><subfield code="c">XA-RU</subfield><subfield code="a">ja</subfield><subfield code="c">XB-JP</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-706</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Gorelik, Alex</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1182582699</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">The enterprise big data lake</subfield><subfield code="b">delivering the promise of big data and data 
science</subfield><subfield code="c">Alex Gorelik</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">First edition</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Beijing ; Boston ; Farnham ; Sebastopol ; Tokyo</subfield><subfield code="b">O'Reilly</subfield><subfield code="c">2019</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (xii, 205 Seiten)</subfield><subfield code="b">Illustrationen</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">Intro -- Copyright -- Table of Contents -- Preface -- Who Should Read This Book? -- Conventions Used in This Book -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Introduction to Data Lakes -- Data Lake Maturity -- Data Puddles -- Data Ponds -- Creating a Successful Data Lake -- The Right Platform -- The Right Data -- The Right Interface -- The Data Swamp -- Roadmap to Data Lake Success -- Standing Up a Data Lake -- Organizing the Data Lake -- Setting Up the Data Lake for Self-Service -- Data Lake Architectures -- Data Lakes in the Public Cloud -- Logical Data Lakes -- Conclusion -- Chapter 2. Historical Perspective -- The Drive for Self-Service Data-The Birth of Databases -- The Analytics Imperative-The Birth of Data Warehousing -- The Data Warehouse Ecosystem -- Storing and Querying the Data -- Loading the Data-Data Integration Tools -- Organizing and Managing the Data -- Consuming the Data -- Conclusion -- Chapter 3. 
Introduction to Big Data and Data Science -- Hadoop Leads the Historic Shift to Big Data -- The Hadoop File System -- How Processing and Storage Interact in a MapReduce Job -- Schema on Read -- Hadoop Projects -- Data Science -- What Should Your Analytics Organization Focus On? -- Machine Learning -- Explainability -- Change Management -- Conclusion -- Chapter 4. Starting a Data Lake -- The What and Why of Hadoop -- Preventing Proliferation of Data Puddles -- Taking Advantage of Big Data -- Leading with Data Science -- Strategy 1: Offload Existing Functionality -- Strategy 2: Data Lakes for New Projects -- Strategy 3: Establish a Central Point of Governance -- Which Way Is Right for You? -- Conclusion -- Chapter 5. From Data Ponds/Big Data Warehouses to Data Lakes -- Essential Functions of a Data Warehouse -- Dimensional Modeling for Analytics -- Integrating Data from Disparate Sources</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">Big data</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">Computer science</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-PQE</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-032740808</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://ebookcentral.proquest.com/lib/unibwm/detail.action?docID=5718882</subfield><subfield code="l">UBY01</subfield><subfield code="p">ZDB-30-PQE</subfield><subfield code="q">UBY01_Einzelkauf21</subfield><subfield code="x">Aggregator</subfield><subfield code="3">Volltext</subfield></datafield></record></collection> |
id | DE-604.BV047338357 |
illustrated | Not Illustrated |
index_date | 2024-07-03T17:33:28Z |
indexdate | 2024-07-10T09:09:22Z |
institution | BVB |
isbn | 9781491931523 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-032740808 |
oclc_num | 1257808653 |
open_access_boolean | |
owner | DE-706 |
owner_facet | DE-706 |
physical | 1 Online-Ressource (xii, 205 Seiten) Illustrationen |
psigel | ZDB-30-PQE ZDB-30-PQE UBY01_Einzelkauf21 |
publishDate | 2019 |
publishDateSearch | 2019 |
publishDateSort | 2019 |
publisher | O'Reilly |
record_format | marc |
spelling | Gorelik, Alex Verfasser (DE-588)1182582699 aut The enterprise big data lake delivering the promise of big data and data science Alex Gorelik First edition Beijing ; Boston ; Farnham ; Sebastopol ; Tokyo O'Reilly 2019 1 Online-Ressource (xii, 205 Seiten) Illustrationen txt rdacontent c rdamedia cr rdacarrier Intro -- Copyright -- Table of Contents -- Preface -- Who Should Read This Book? -- Conventions Used in This Book -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Introduction to Data Lakes -- Data Lake Maturity -- Data Puddles -- Data Ponds -- Creating a Successful Data Lake -- The Right Platform -- The Right Data -- The Right Interface -- The Data Swamp -- Roadmap to Data Lake Success -- Standing Up a Data Lake -- Organizing the Data Lake -- Setting Up the Data Lake for Self-Service -- Data Lake Architectures -- Data Lakes in the Public Cloud -- Logical Data Lakes -- Conclusion -- Chapter 2. Historical Perspective -- The Drive for Self-Service Data-The Birth of Databases -- The Analytics Imperative-The Birth of Data Warehousing -- The Data Warehouse Ecosystem -- Storing and Querying the Data -- Loading the Data-Data Integration Tools -- Organizing and Managing the Data -- Consuming the Data -- Conclusion -- Chapter 3. Introduction to Big Data and Data Science -- Hadoop Leads the Historic Shift to Big Data -- The Hadoop File System -- How Processing and Storage Interact in a MapReduce Job -- Schema on Read -- Hadoop Projects -- Data Science -- What Should Your Analytics Organization Focus On? -- Machine Learning -- Explainability -- Change Management -- Conclusion -- Chapter 4. Starting a Data Lake -- The What and Why of Hadoop -- Preventing Proliferation of Data Puddles -- Taking Advantage of Big Data -- Leading with Data Science -- Strategy 1: Offload Existing Functionality -- Strategy 2: Data Lakes for New Projects -- Strategy 3: Establish a Central Point of Governance -- Which Way Is Right for You? 
-- Conclusion -- Chapter 5. From Data Ponds/Big Data Warehouses to Data Lakes -- Essential Functions of a Data Warehouse -- Dimensional Modeling for Analytics -- Integrating Data from Disparate Sources Big data Computer science |
spellingShingle | Gorelik, Alex The enterprise big data lake delivering the promise of big data and data science |
title | The enterprise big data lake delivering the promise of big data and data science |
title_auth | The enterprise big data lake delivering the promise of big data and data science |
title_exact_search | The enterprise big data lake delivering the promise of big data and data science |
title_exact_search_txtP | The enterprise big data lake delivering the promise of big data and data science |
title_full | The enterprise big data lake delivering the promise of big data and data science Alex Gorelik |
title_fullStr | The enterprise big data lake delivering the promise of big data and data science Alex Gorelik |
title_full_unstemmed | The enterprise big data lake delivering the promise of big data and data science Alex Gorelik |
title_short | The enterprise big data lake |
title_sort | the enterprise big data lake delivering the promise of big data and data science |
title_sub | delivering the promise of big data and data science |
work_keys_str_mv | AT gorelikalex theenterprisebigdatalakedeliveringthepromiseofbigdataanddatascience |