Real-time analytics: techniques to analyze and visualize streaming data
Main author: | Ellis, Byron |
---|---|
Format: | Book |
Language: | English |
Published: | Indianapolis, IN: Wiley, 2014 |
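Subjects: | Real-time data processing; Data flow computing; Data mining; Echtzeitverarbeitung; Visualisierung; Datenanalyse |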
Online access: | Table of contents |
Description: | XIX, 412 pp., illustrations |
ISBN: | 9781118837917 |
Internal format
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV042022684 | ||
003 | DE-604 | ||
005 | 20170413 | ||
007 | t | ||
008 | 140813s2014 d||| |||| 00||| eng d | ||
020 | |a 9781118837917 |c pbk |9 978-1-118-83791-7 | ||
035 | |a (OCoLC)889722289 | ||
035 | |a (DE-599)BVBBV042022684 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-91G |a DE-11 |a DE-188 |a DE-703 | ||
084 | |a ST 234 |0 (DE-625)143621: |2 rvk | ||
084 | |a DAT 260f |2 stub | ||
084 | |a DAT 600f |2 stub | ||
100 | 1 | |a Ellis, Byron |e Verfasser |4 aut | |
245 | 1 | 0 | |a Real-time analytics |b techniques to analyze and visualize streaming data |c Byron Ellis |
264 | 1 | |a Indianapolis, IN |b Wiley |c 2014 | |
300 | |a XIX, 412 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 4 | |a Real-time data processing | |
650 | 4 | |a Data flow computing | |
650 | 4 | |a Data mining | |
650 | 0 | 7 | |a Echtzeitverarbeitung |0 (DE-588)4151002-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Visualisierung |0 (DE-588)4188417-6 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenanalyse |0 (DE-588)4123037-1 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Datenanalyse |0 (DE-588)4123037-1 |D s |
689 | 0 | 1 | |a Visualisierung |0 (DE-588)4188417-6 |D s |
689 | 0 | 2 | |a Echtzeitverarbeitung |0 (DE-588)4151002-1 |D s |
689 | 0 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-1-118-83793-1 |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-1-118-83802-0 |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027464319&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-027464319 |
Record in the search index
_version_ | 1804152441388662784 |
---|---|
adam_text | Title: Real-time analytics
Author: Ellis, Byron
Year: 2014
Contents
Introduction
Chapter 1: Introduction to Streaming Data
Sources of Streaming Data; Operational Monitoring; Web Analytics; Online Advertising; Social Media; Mobile Data and the Internet of Things; Why Streaming Data Is Different; Always On, Always Flowing; Loosely Structured; High-Cardinality Storage; Infrastructures and Algorithms; Conclusion
Part I: Streaming Analytics Architecture
Chapter 2: Designing Real-Time Streaming Architectures
Real-Time Architecture Components; Collection; Data Flow; Processing; Storage; Delivery; Features of a Real-Time Architecture; High Availability; Low Latency; Horizontal Scalability; Languages for Real-Time Programming; Java; Scala and Clojure; JavaScript; The Go Language; A Real-Time Architecture Checklist; Collection; Data Flow; Processing; Storage; Delivery; Conclusion
Chapter 3: Service Configuration and Coordination
Motivation for Configuration and Coordination Systems; Maintaining Distributed State; Unreliable Network Connections; Clock Synchronization; Consensus in an Unreliable World; Apache ZooKeeper; The znode; Watches and Notifications; Maintaining Consistency; Creating a ZooKeeper Cluster; ZooKeeper's Native Java Client; The Curator Client; Curator Recipes; Conclusion
Chapter 4: Data-Flow Management in Streaming Analysis
Distributed Data Flows; At Least Once Delivery; The n+1 Problem; Apache Kafka: High-Throughput Distributed Messaging; Design and Implementation; Configuring a Kafka Environment; Interacting with Kafka Brokers; Apache Flume: Distributed Log Collection; The Flume Agent; Configuring the Agent; The Flume Data Model; Channel Selectors; Flume Sources; Flume Sinks; Sink Processors; Flume Channels; Flume Interceptors; Integrating Custom Flume Components; Running Flume Agents; Conclusion
Chapter 5: Processing Streaming Data
Distributed Streaming Data Processing; Coordination; Partitions and Merges; Transactions; Processing Data with Storm; Components of a Storm Cluster; Configuring a Storm Cluster; Distributed Clusters; Local Clusters; Storm Topologies; Implementing Bolts; Implementing and Using Spouts; Distributed Remote Procedure Calls; Trident: The Storm DSL; Processing Data with Samza; Apache YARN; Getting Started with YARN and Samza; Integrating Samza into the Data Flow; Samza Jobs; Conclusion
Chapter 6: Storing Streaming Data
Consistent Hashing; NoSQL Storage Systems; Redis; MongoDB; Cassandra; Other Storage Technologies; Relational Databases; Distributed In-Memory Data Grids; Choosing a Technology; Key-Value Stores; Document Stores; Distributed Hash Table Stores; In-Memory Grids; Relational Databases; Warehousing; Hadoop as ETL and Warehouse; Lambda Architectures; Conclusion
Part II: Analysis and Visualization
Chapter 7: Delivering Streaming Metrics
Streaming Web Applications; Working with Node; Managing a Node Project with NPM; Developing Node Web Applications; A Basic Streaming Dashboard; Adding Streaming to Web Applications; Visualizing Data; HTML5 Canvas and Inline SVG; Data-Driven Documents: D3.js; High-Level Tools; Mobile Streaming Applications; Conclusion
Chapter 8: Exact Aggregation and Delivery
Timed Counting and Summation; Counting in Bolts; Counting with Trident; Counting in Samza; Multi-Resolution Time-Series Aggregation; Quantization Framework; Stochastic Optimization; Delivering Time-Series Data; Strip Charts with D3.js; High-Speed Canvas Charts; Horizon Charts; Conclusion
Chapter 9: Statistical Approximation of Streaming Data
Numerical Libraries; Probabilities and Distributions; Expectation and Variance; Statistical Distributions; Discrete Distributions; Continuous Distributions; Joint Distributions; Working with Distributions; Inferring Parameters; The Delta Method; Distribution Inequalities; Random Number Generation; Generating Specific Distributions; Sampling Procedures; Sampling from a Fixed Population; Sampling from a Streaming Population; Biased Streaming Sampling; Conclusion
Chapter 10: Approximating Streaming Data with Sketching
Registers and Hash Functions; Registers; Hash Functions; Working with Sets; The Bloom Filter; The Algorithm; Choosing a Filter Size; Unions and Intersections; Cardinality Estimation; Interesting Variations; Distinct Value Sketches; The Min-Count Algorithm; The HyperLogLog Algorithm; The Count-Min Sketch; Point Queries; Count-Min Sketch Implementation; Top-K and Heavy Hitters; Range and Quantile Queries; Other Applications; Conclusion
Chapter 11: Beyond Aggregation
Models for Real-Time Data; Simple Time-Series Models; Linear Models; Logistic Regression; Neural Network Models; Forecasting with Models; Exponential Smoothing Methods; Regression Methods; Neural Network Methods; Monitoring; Outlier Detection; Change Detection; Real-Time Optimization; Conclusion
Index
|
any_adam_object | 1 |
author | Ellis, Byron |
author_facet | Ellis, Byron |
author_role | aut |
author_sort | Ellis, Byron |
author_variant | b e be |
building | Verbundindex |
bvnumber | BV042022684 |
classification_rvk | ST 234 |
classification_tum | DAT 260f DAT 600f |
ctrlnum | (OCoLC)889722289 (DE-599)BVBBV042022684 |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01754nam a2200445 c 4500</leader><controlfield tag="001">BV042022684</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20170413 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">140813s2014 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781118837917</subfield><subfield code="c">pbk</subfield><subfield code="9">978-1-118-83791-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)889722289</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV042022684</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91G</subfield><subfield code="a">DE-11</subfield><subfield code="a">DE-188</subfield><subfield code="a">DE-703</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 234</subfield><subfield code="0">(DE-625)143621:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 260f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 600f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Ellis, Byron</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Real-time analytics</subfield><subfield code="b">techniques to analyze and visualize streaming data</subfield><subfield code="c">Byron Ellis</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Indianapolis, IN</subfield><subfield code="b">Wiley</subfield><subfield code="c">2014</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIX, 412 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Real-time data processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data flow computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Echtzeitverarbeitung</subfield><subfield code="0">(DE-588)4151002-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Visualisierung</subfield><subfield code="0">(DE-588)4188417-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Visualisierung</subfield><subfield code="0">(DE-588)4188417-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Echtzeitverarbeitung</subfield><subfield code="0">(DE-588)4151002-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-118-83793-1</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-118-83802-0</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027464319&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-027464319</subfield></datafield></record></collection> |
id | DE-604.BV042022684 |
illustrated | Illustrated |
indexdate | 2024-07-10T01:10:46Z |
institution | BVB |
isbn | 9781118837917 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-027464319 |
oclc_num | 889722289 |
open_access_boolean | |
owner | DE-91G DE-BY-TUM DE-11 DE-188 DE-703 |
owner_facet | DE-91G DE-BY-TUM DE-11 DE-188 DE-703 |
physical | XIX, 412 S. graph. Darst. |
publishDate | 2014 |
publishDateSearch | 2014 |
publishDateSort | 2014 |
publisher | Wiley |
record_format | marc |
spelling | Ellis, Byron Verfasser aut Real-time analytics techniques to analyze and visualize streaming data Byron Ellis Indianapolis, IN Wiley 2014 XIX, 412 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Real-time data processing Data flow computing Data mining Echtzeitverarbeitung (DE-588)4151002-1 gnd rswk-swf Visualisierung (DE-588)4188417-6 gnd rswk-swf Datenanalyse (DE-588)4123037-1 gnd rswk-swf Datenanalyse (DE-588)4123037-1 s Visualisierung (DE-588)4188417-6 s Echtzeitverarbeitung (DE-588)4151002-1 s DE-604 Erscheint auch als Online-Ausgabe 978-1-118-83793-1 Erscheint auch als Online-Ausgabe 978-1-118-83802-0 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027464319&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Ellis, Byron Real-time analytics techniques to analyze and visualize streaming data Real-time data processing Data flow computing Data mining Echtzeitverarbeitung (DE-588)4151002-1 gnd Visualisierung (DE-588)4188417-6 gnd Datenanalyse (DE-588)4123037-1 gnd |
subject_GND | (DE-588)4151002-1 (DE-588)4188417-6 (DE-588)4123037-1 |
title | Real-time analytics techniques to analyze and visualize streaming data |
title_auth | Real-time analytics techniques to analyze and visualize streaming data |
title_exact_search | Real-time analytics techniques to analyze and visualize streaming data |
title_full | Real-time analytics techniques to analyze and visualize streaming data Byron Ellis |
title_fullStr | Real-time analytics techniques to analyze and visualize streaming data Byron Ellis |
title_full_unstemmed | Real-time analytics techniques to analyze and visualize streaming data Byron Ellis |
title_short | Real-time analytics |
title_sort | real time analytics techniques to analyze and visualize streaming data |
title_sub | techniques to analyze and visualize streaming data |
topic | Real-time data processing Data flow computing Data mining Echtzeitverarbeitung (DE-588)4151002-1 gnd Visualisierung (DE-588)4188417-6 gnd Datenanalyse (DE-588)4123037-1 gnd |
topic_facet | Real-time data processing Data flow computing Data mining Echtzeitverarbeitung Visualisierung Datenanalyse |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027464319&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT ellisbyron realtimeanalyticstechniquestoanalyzeandvisualizestreamingdata |