Getting started with Impala: [interactive SQL for apache hadoop]
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Beijing
O'Reilly
2014
|
Ausgabe: | 1. ed. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | X, 96 S. graph. Darst. |
ISBN: | 9781491905777 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV042212595 | ||
003 | DE-604 | ||
005 | 20141202 | ||
007 | t | ||
008 | 141127s2014 d||| |||| 00||| eng d | ||
020 | |a 9781491905777 |9 978-1-491-90577-7 | ||
035 | |a (OCoLC)915521223 | ||
035 | |a (DE-599)BVBBV042212595 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-11 | ||
084 | |a ST 201 |0 (DE-625)143612: |2 rvk | ||
100 | 1 | |a Russell, John |e Verfasser |4 aut | |
245 | 1 | 0 | |a Getting started with Impala |b [interactive SQL for apache hadoop] |c John Russell |
250 | |a 1. ed. | ||
264 | 1 | |a Beijing |b O'Reilly |c 2014 | |
300 | |a X, 96 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 0 | 7 | |a Relationales Datenbanksystem |0 (DE-588)4130535-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Abfrageverarbeitung |0 (DE-588)4378490-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a SQL |0 (DE-588)4134010-3 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Relationales Datenbanksystem |0 (DE-588)4130535-8 |D s |
689 | 0 | 1 | |a SQL |0 (DE-588)4134010-3 |D s |
689 | 0 | 2 | |a Abfrageverarbeitung |0 (DE-588)4378490-2 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027651247&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-027651247 |
Datensatz im Suchindex
_version_ | 1804152735468093440 |
---|---|
adam_text | Titel: Getting started with Impala
Autor: Russell, John
Jahr: 2014
Table of Contents Introduction.................................................................. vii 1. Whylmpala?................................................................1 Impalas Place in the Big Data Ecosystem 1 Flexibility for Your Big Data Workflow 2 High-Performance Analytics 3 Exploratory Business Intelligence 3 2. Getting Up and Running with Impala...........................................5 Installation 5 Connecting to Impala 6 Your First Impala Queries 7 3 . Impala for the Database Developer........................................... 11 The SQL Language 12 Standard SQL 12 Limited DML 12 No Transactions 13 Numbers 13 Recent Additions 14 Big Data Considerations 15 Billions and Billions of Rows 15 HDFS Block Size 16 Parquet Files: The Biggest Blocks of All 16 How Impala Is Like a Data Warehouse 17 Physical and Logical Data Layouts 18 The HDFS Storage Model 18 Distributed Queries 19 Normalized and Denormalized Data 21 iii
File Formats 21 Text File Format 22 Parquet File Format 23 Getting File Format Information 25 Switching File Formats 25 Aggregation 26 4. Common Developer Tasks for Impala.......................................... 27 Getting Data into an Impala Table 27 INSERT Statement 28 LOAD DATA Statement 28 External Tables 29 Figuring Out Where Impala Data Resides 29 Manually Loading Data Files into HDFS 30 Hive 30 Sqoop 31 Kite 31 Porting SQL Code to Impala 32 Using Impala from a JDBC or ODBC Application 32 JDBC 33 ODBC 33 Using Impala with a Scripting Language 34 Running Impala SQL Statements from Scripts 34 Variable Substitution 34 Saving Query Results 35 The impyla Package for Python Scripting 35 Optimizing Impala Performance 36 Optimizing Query Performance 37 Optimizing Memory Usage 37 Working with Partitioned Tables 39 Finding the Ideal Granularity 40 Inserting into Partitioned Tables 40 Adding and Loading New Partitions 41 Writing User-Defined Functions 42 Collaborating with Your Administrators 43 Designing for Security 43 Understanding Resource Management 44 Helping to Plan for Performance (Stats, HDFS Caching) 44 Understanding Cluster Topology 45 Always Close Your Queries 45 iv | Table of Contents
5. Tutorials and Deep Dives.................................................... 47 Tutorial: From Unix Data File to Impala Table 47 Tutorial: Queries Without a Table 49 Tutorial: The Journey of a Billion Rows 51 Generating a Billion Rows of CSV Data 51 Normalizing the Original Data 57 Converting to Parquet Format 61 Making a Partitioned Table 64 Next Steps 69 Deep Dive: Joins and the Role of Statistics 69 Creating a Million-Row Table to Join With 69 Loading Data and Computing Stats 70 Reviewing the EXPLAIN Plan 71 Trying a Real Query 74 The Story So Far 78 Final Join Query with IB x 1M Rows 79 Anti-Pattern: A Million Little Pieces 79 Tutorial: Across the Fourth Dimension 81 TIMESTAMP Data Type 81 Format Strings for Dates and Times 81 Working with Individual Date and Time Fields 82 Date and Time Arithmetic 83 Lets Solve the Y2K Problem 84 More Fun with Dates 87 Tutorial: Verbose and Quiet impala-shell Output 88 Tutorial: When Schemas Evolve 89 Numbers Versus Strings 91 Dealing with Out-of-Range Integers 92 Tutorial: Levels of Abstraction 95 String Formatting 95 Temperature Conversion 96 Table of Contents | v
|
any_adam_object | 1 |
author | Russell, John |
author_facet | Russell, John |
author_role | aut |
author_sort | Russell, John |
author_variant | j r jr |
building | Verbundindex |
bvnumber | BV042212595 |
classification_rvk | ST 201 |
ctrlnum | (OCoLC)915521223 (DE-599)BVBBV042212595 |
discipline | Informatik |
edition | 1. ed. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01434nam a2200373 c 4500</leader><controlfield tag="001">BV042212595</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20141202 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">141127s2014 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781491905777</subfield><subfield code="9">978-1-491-90577-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)915521223</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV042212595</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-11</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 201</subfield><subfield code="0">(DE-625)143612:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Russell, John</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Getting started with Impala</subfield><subfield code="b">[interactive SQL for apache hadoop]</subfield><subfield code="c">John Russell</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. ed.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Beijing</subfield><subfield code="b">O'Reilly</subfield><subfield code="c">2014</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 96 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Relationales Datenbanksystem</subfield><subfield code="0">(DE-588)4130535-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Abfrageverarbeitung</subfield><subfield code="0">(DE-588)4378490-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">SQL</subfield><subfield code="0">(DE-588)4134010-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Relationales Datenbanksystem</subfield><subfield code="0">(DE-588)4130535-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">SQL</subfield><subfield code="0">(DE-588)4134010-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Abfrageverarbeitung</subfield><subfield code="0">(DE-588)4378490-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027651247&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-027651247</subfield></datafield></record></collection> |
id | DE-604.BV042212595 |
illustrated | Illustrated |
indexdate | 2024-07-10T01:15:26Z |
institution | BVB |
isbn | 9781491905777 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-027651247 |
oclc_num | 915521223 |
open_access_boolean | |
owner | DE-11 |
owner_facet | DE-11 |
physical | X, 96 S. graph. Darst. |
publishDate | 2014 |
publishDateSearch | 2014 |
publishDateSort | 2014 |
publisher | O'Reilly |
record_format | marc |
spelling | Russell, John Verfasser aut Getting started with Impala [interactive SQL for apache hadoop] John Russell 1. ed. Beijing O'Reilly 2014 X, 96 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Relationales Datenbanksystem (DE-588)4130535-8 gnd rswk-swf Abfrageverarbeitung (DE-588)4378490-2 gnd rswk-swf SQL (DE-588)4134010-3 gnd rswk-swf Relationales Datenbanksystem (DE-588)4130535-8 s SQL (DE-588)4134010-3 s Abfrageverarbeitung (DE-588)4378490-2 s DE-604 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027651247&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Russell, John Getting started with Impala [interactive SQL for apache hadoop] Relationales Datenbanksystem (DE-588)4130535-8 gnd Abfrageverarbeitung (DE-588)4378490-2 gnd SQL (DE-588)4134010-3 gnd |
subject_GND | (DE-588)4130535-8 (DE-588)4378490-2 (DE-588)4134010-3 |
title | Getting started with Impala [interactive SQL for apache hadoop] |
title_auth | Getting started with Impala [interactive SQL for apache hadoop] |
title_exact_search | Getting started with Impala [interactive SQL for apache hadoop] |
title_full | Getting started with Impala [interactive SQL for apache hadoop] John Russell |
title_fullStr | Getting started with Impala [interactive SQL for apache hadoop] John Russell |
title_full_unstemmed | Getting started with Impala [interactive SQL for apache hadoop] John Russell |
title_short | Getting started with Impala |
title_sort | getting started with impala interactive sql for apache hadoop |
title_sub | [interactive SQL for apache hadoop] |
topic | Relationales Datenbanksystem (DE-588)4130535-8 gnd Abfrageverarbeitung (DE-588)4378490-2 gnd SQL (DE-588)4134010-3 gnd |
topic_facet | Relationales Datenbanksystem Abfrageverarbeitung SQL |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027651247&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT russelljohn gettingstartedwithimpalainteractivesqlforapachehadoop |