Apache hive essentials :: immerse yourself on a fantastic journey to discover the attributes of big data by using hive /
Annotation
Gespeichert in:
1. Verfasser: | |
---|---|
Weitere Verfasser: | , |
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Birmingham, England ; Mumbai [India] :
Packt Publishing,
2015.
|
Schriftenreihe: | Community experience distilled.
|
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | Annotation |
Beschreibung: | Includes index. |
Beschreibung: | 1 online resource (208 pages) : color illustrations |
ISBN: | 9781782175056 1782175059 1783558571 9781783558575 |
Internformat
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-ocn905919222 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr cn||||||||| | ||
008 | 150313t20152015enka o 001 0 eng d | ||
040 | |a E7B |b eng |e rda |e pn |c E7B |d OCLCO |d N$T |d COO |d REB |d OCLCF |d OCLCQ |d OCLCO |d CNNOR |d D6H |d OCLCQ |d NLE |d STF |d UKMGB |d OCLCQ |d OCLCO |d G3B |d S9I |d UKAHL |d VT2 |d RDF |d OCLCO |d OCLCQ |d QGK |d OCLCO |d OCLCL |d OCLCQ | ||
016 | 7 | |a 018005605 |2 Uk | |
019 | |a 1259202797 | ||
020 | |a 9781782175056 |q (electronic bk.) | ||
020 | |a 1782175059 |q (electronic bk.) | ||
020 | |a 1783558571 | ||
020 | |a 9781783558575 | ||
020 | |z 9781783558575 | ||
035 | |a (OCoLC)905919222 |z (OCoLC)1259202797 | ||
037 | |a 9781782175056 |b Packt Publishing Pvt. Ltd | ||
050 | 4 | |a TK5105.8885.A63 |b .D8 2015eb | |
072 | 7 | |a COM |x 060160 |2 bisacsh | |
082 | 7 | |a 005.713769 |2 23 | |
049 | |a MAIN | ||
100 | 1 | |a Du, Dayong, |e author. | |
245 | 1 | 0 | |a Apache hive essentials : |b immerse yourself on a fantastic journey to discover the attributes of big data by using hive / |c Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian. |
264 | 1 | |a Birmingham, England ; |a Mumbai [India] : |b Packt Publishing, |c 2015. | |
264 | 4 | |c ©2015 | |
300 | |a 1 online resource (208 pages) : |b color illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
347 | |a data file | ||
490 | 1 | |a Community Experience Distilled | |
500 | |a Includes index. | ||
588 | 0 | |a Online resource; title from PDF title page (ebrary, viewed March 13, 2015). | |
520 | 8 | |a Annotation |b If you are a data analyst, developer, or simply someone who wants to use Hive to explore and analyze data in Hadoop, this is the book for you. Whether you are new to big data or an expert, with this book, you will be able to master both the basic and the advanced features of Hive. Since Hive is an SQL-like language, some previous experience with the SQL language and databases is useful to have a better understanding of this book. | |
505 | 0 | |a Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; Relational and NoSQL database versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendor packages; Starting Hive in the cloud; Using the Hive command line and Beeline; The Hive-integrated development environment; Summary | |
505 | 8 | |a Chapter 3: Data Definition and DescriptionUnderstanding Hive data types; Data type conversions; Hive Data Definition Language; Hive database; Hive internal and external tables; Hive partitions; Hive buckets; Hive views; Summary; Chapter 4: Data Selection and Scope; The SELECT statement; The INNER JOIN statement; The OUTER JOIN and CROSS JOIN statements; Special JOIN -- MAPJOIN; Set operation -- UNION ALL; Summary; Chapter 5: Data Manipulation; Data exchange -- LOAD; Data exchange -- INSERT; Data exchange -- EXPORT and IMPORT; ORDER and SORT; Operators and functions; Transactions; Summary | |
505 | 8 | |a Chapter 6: Data Aggregation and SamplingBasic aggregation -- GROUP BY; Advanced aggregation -- GROUPING SETS; Advanced aggregation -- ROLLUP and CUBE; Aggregation condition -- HAVING; Analytic functions; Sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; The EXPLAIN statement; The ANALYZE statement; Design optimization; Partition tables; Bucket tables; Index; Data file optimization; File format; Compression; Storage optimization; Job and query optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join | |
505 | 8 | |a Sort merge bucket (SMB) joinSort merge bucket map (SMBM) join; Skew join; Summary; Chapter 8: Extensibility Considerations; User-defined functions; The UDF code template; The UDAF code template; The UDTF code template; Development and deployment; Streaming; SerDe; Summary; Chapter 9: Security Considerations; Authentication; Metastore server authentication; HiveServer2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Encryption; Summary; Chapter 10: Working with Other Tools; JDBC/ODBC connector; HBase; Hue; HCatalog; ZooKeeper; Oozie; Hive roadmap | |
546 | |a English. | ||
630 | 0 | 0 | |a Apache (Computer file : Apache Group) |0 http://id.loc.gov/authorities/names/no97023874 |
630 | 0 | 0 | |a Apache Hadoop. |0 http://id.loc.gov/authorities/names/n2013024279 |
630 | 0 | 7 | |a Apache (Computer file : Apache Group) |2 fast |
630 | 0 | 7 | |a Apache Hadoop |2 fast |
650 | 0 | |a Database management. |0 http://id.loc.gov/authorities/subjects/sh85035848 | |
650 | 0 | |a SQL (Computer program language) |0 http://id.loc.gov/authorities/subjects/sh86006628 | |
650 | 6 | |a Bases de données |x Gestion. | |
650 | 6 | |a SQL (Langage de programmation) | |
650 | 7 | |a COMPUTERS |x Web |x Web Programming. |2 bisacsh | |
650 | 7 | |a Database management |2 fast | |
650 | 7 | |a SQL (Computer program language) |2 fast | |
700 | 1 | |a Siddiqui, Sameen, |e editor. | |
700 | 1 | |a Subramanian, Laxmi, |e editor. | |
776 | 0 | 8 | |i Print version: |a Du, Dayong. |t Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive. |d Birmingham, England ; Mumbai, [India] : Packt Publishing, ©2015 |h iii, 189 pages |k Community experience distilled. |z 9781783558575 |
830 | 0 | |a Community experience distilled. |0 http://id.loc.gov/authorities/names/no2011030603 | |
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=959971 |3 Volltext |
936 | |a BATCHLOAD | ||
938 | |a Askews and Holts Library Services |b ASKH |n AH28295372 | ||
938 | |a ebrary |b EBRY |n ebr11025934 | ||
938 | |a EBSCOhost |b EBSC |n 959971 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Datensatz im Suchindex
DE-BY-FWS_katkey | ZDB-4-EBA-ocn905919222 |
---|---|
_version_ | 1816882308560453632 |
adam_text | |
any_adam_object | |
author | Du, Dayong |
author2 | Siddiqui, Sameen Subramanian, Laxmi |
author2_role | edt edt |
author2_variant | s s ss l s ls |
author_facet | Du, Dayong Siddiqui, Sameen Subramanian, Laxmi |
author_role | aut |
author_sort | Du, Dayong |
author_variant | d d dd |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | T - Technology |
callnumber-label | TK5105 |
callnumber-raw | TK5105.8885.A63 .D8 2015eb |
callnumber-search | TK5105.8885.A63 .D8 2015eb |
callnumber-sort | TK 45105.8885 A63 D8 42015EB |
callnumber-subject | TK - Electrical and Nuclear Engineering |
collection | ZDB-4-EBA |
contents | Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; Relational and NoSQL database versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendor packages; Starting Hive in the cloud; Using the Hive command line and Beeline; The Hive-integrated development environment; Summary Chapter 3: Data Definition and DescriptionUnderstanding Hive data types; Data type conversions; Hive Data Definition Language; Hive database; Hive internal and external tables; Hive partitions; Hive buckets; Hive views; Summary; Chapter 4: Data Selection and Scope; The SELECT statement; The INNER JOIN statement; The OUTER JOIN and CROSS JOIN statements; Special JOIN -- MAPJOIN; Set operation -- UNION ALL; Summary; Chapter 5: Data Manipulation; Data exchange -- LOAD; Data exchange -- INSERT; Data exchange -- EXPORT and IMPORT; ORDER and SORT; Operators and functions; Transactions; Summary Chapter 6: Data Aggregation and SamplingBasic aggregation -- GROUP BY; Advanced aggregation -- GROUPING SETS; Advanced aggregation -- ROLLUP and CUBE; Aggregation condition -- HAVING; Analytic functions; Sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; The EXPLAIN statement; The ANALYZE statement; Design optimization; Partition tables; Bucket tables; Index; Data file optimization; File format; Compression; Storage optimization; Job and query optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join Sort merge bucket (SMB) joinSort merge bucket map (SMBM) join; Skew join; Summary; Chapter 8: Extensibility Considerations; User-defined functions; The UDF code template; The UDAF code template; The UDTF code template; Development and deployment; Streaming; SerDe; Summary; Chapter 9: Security Considerations; Authentication; Metastore server authentication; HiveServer2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Encryption; Summary; Chapter 10: Working with Other Tools; JDBC/ODBC connector; HBase; Hue; HCatalog; ZooKeeper; Oozie; Hive roadmap |
ctrlnum | (OCoLC)905919222 |
dewey-full | 005.713769 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 005 - Computer programming, programs, data, security |
dewey-raw | 005.713769 |
dewey-search | 005.713769 |
dewey-sort | 15.713769 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>06011cam a2200733 i 4500</leader><controlfield tag="001">ZDB-4-EBA-ocn905919222</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr cn|||||||||</controlfield><controlfield tag="008">150313t20152015enka o 001 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">E7B</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">E7B</subfield><subfield code="d">OCLCO</subfield><subfield code="d">N$T</subfield><subfield code="d">COO</subfield><subfield code="d">REB</subfield><subfield code="d">OCLCF</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">CNNOR</subfield><subfield code="d">D6H</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">NLE</subfield><subfield code="d">STF</subfield><subfield code="d">UKMGB</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">G3B</subfield><subfield code="d">S9I</subfield><subfield code="d">UKAHL</subfield><subfield code="d">VT2</subfield><subfield code="d">RDF</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">QGK</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield><subfield code="d">OCLCQ</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">018005605</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">1259202797</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781782175056</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1782175059</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1783558571</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781783558575</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781783558575</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)905919222</subfield><subfield code="z">(OCoLC)1259202797</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">9781782175056</subfield><subfield code="b">Packt Publishing Pvt. Ltd</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">TK5105.8885.A63</subfield><subfield code="b">.D8 2015eb</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">COM</subfield><subfield code="x">060160</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">005.713769</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Du, Dayong,</subfield><subfield code="e">author.</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Apache hive essentials :</subfield><subfield code="b">immerse yourself on a fantastic journey to discover the attributes of big data by using hive /</subfield><subfield code="c">Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham, England ;</subfield><subfield code="a">Mumbai [India] :</subfield><subfield code="b">Packt Publishing,</subfield><subfield code="c">2015.</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">©2015</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (208 pages) :</subfield><subfield code="b">color illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="347" ind1=" " ind2=" "><subfield code="a">data file</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Community Experience Distilled</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes index.</subfield></datafield><datafield tag="588" ind1="0" ind2=" "><subfield code="a">Online resource; title from PDF title page (ebrary, viewed March 13, 2015).</subfield></datafield><datafield tag="520" ind1="8" ind2=" "><subfield code="a">Annotation</subfield><subfield code="b">If you are a data analyst, developer, or simply someone who wants to use Hive to explore and analyze data in Hadoop, this is the book for you. Whether you are new to big data or an expert, with this book, you will be able to master both the basic and the advanced features of Hive. Since Hive is an SQL-like language, some previous experience with the SQL language and databases is useful to have a better understanding of this book.</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; Relational and NoSQL database versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendor packages; Starting Hive in the cloud; Using the Hive command line and Beeline; The Hive-integrated development environment; Summary</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Chapter 3: Data Definition and DescriptionUnderstanding Hive data types; Data type conversions; Hive Data Definition Language; Hive database; Hive internal and external tables; Hive partitions; Hive buckets; Hive views; Summary; Chapter 4: Data Selection and Scope; The SELECT statement; The INNER JOIN statement; The OUTER JOIN and CROSS JOIN statements; Special JOIN -- MAPJOIN; Set operation -- UNION ALL; Summary; Chapter 5: Data Manipulation; Data exchange -- LOAD; Data exchange -- INSERT; Data exchange -- EXPORT and IMPORT; ORDER and SORT; Operators and functions; Transactions; Summary</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Chapter 6: Data Aggregation and SamplingBasic aggregation -- GROUP BY; Advanced aggregation -- GROUPING SETS; Advanced aggregation -- ROLLUP and CUBE; Aggregation condition -- HAVING; Analytic functions; Sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; The EXPLAIN statement; The ANALYZE statement; Design optimization; Partition tables; Bucket tables; Index; Data file optimization; File format; Compression; Storage optimization; Job and query optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Sort merge bucket (SMB) joinSort merge bucket map (SMBM) join; Skew join; Summary; Chapter 8: Extensibility Considerations; User-defined functions; The UDF code template; The UDAF code template; The UDTF code template; Development and deployment; Streaming; SerDe; Summary; Chapter 9: Security Considerations; Authentication; Metastore server authentication; HiveServer2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Encryption; Summary; Chapter 10: Working with Other Tools; JDBC/ODBC connector; HBase; Hue; HCatalog; ZooKeeper; Oozie; Hive roadmap</subfield></datafield><datafield tag="546" ind1=" " ind2=" "><subfield code="a">English.</subfield></datafield><datafield tag="630" ind1="0" ind2="0"><subfield code="a">Apache (Computer file : Apache Group)</subfield><subfield code="0">http://id.loc.gov/authorities/names/no97023874</subfield></datafield><datafield tag="630" ind1="0" ind2="0"><subfield code="a">Apache Hadoop.</subfield><subfield code="0">http://id.loc.gov/authorities/names/n2013024279</subfield></datafield><datafield tag="630" ind1="0" ind2="7"><subfield code="a">Apache (Computer file : Apache Group)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="630" ind1="0" ind2="7"><subfield code="a">Apache Hadoop</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Database management.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85035848</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">SQL (Computer program language)</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh86006628</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Bases de données</subfield><subfield code="x">Gestion.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">SQL (Langage de programmation)</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">COMPUTERS</subfield><subfield code="x">Web</subfield><subfield code="x">Web Programming.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Database management</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">SQL (Computer program language)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Siddiqui, Sameen,</subfield><subfield code="e">editor.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Subramanian, Laxmi,</subfield><subfield code="e">editor.</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Du, Dayong.</subfield><subfield code="t">Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive.</subfield><subfield code="d">Birmingham, England ; Mumbai, [India] : Packt Publishing, ©2015</subfield><subfield code="h">iii, 189 pages</subfield><subfield code="k">Community experience distilled.</subfield><subfield code="z">9781783558575</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Community experience distilled.</subfield><subfield code="0">http://id.loc.gov/authorities/names/no2011030603</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=959971</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="936" ind1=" " ind2=" "><subfield code="a">BATCHLOAD</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">Askews and Holts Library Services</subfield><subfield code="b">ASKH</subfield><subfield code="n">AH28295372</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ebrary</subfield><subfield code="b">EBRY</subfield><subfield code="n">ebr11025934</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">959971</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection> |
id | ZDB-4-EBA-ocn905919222 |
illustrated | Illustrated |
indexdate | 2024-11-27T13:26:33Z |
institution | BVB |
isbn | 9781782175056 1782175059 1783558571 9781783558575 |
language | English |
oclc_num | 905919222 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (208 pages) : color illustrations |
psigel | ZDB-4-EBA |
publishDate | 2015 |
publishDateSearch | 2015 |
publishDateSort | 2015 |
publisher | Packt Publishing, |
record_format | marc |
series | Community experience distilled. |
series2 | Community Experience Distilled |
spelling | Du, Dayong, author. Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian. Birmingham, England ; Mumbai [India] : Packt Publishing, 2015. ©2015 1 online resource (208 pages) : color illustrations text txt rdacontent computer c rdamedia online resource cr rdacarrier data file Community Experience Distilled Includes index. Online resource; title from PDF title page (ebrary, viewed March 13, 2015). Annotation If you are a data analyst, developer, or simply someone who wants to use Hive to explore and analyze data in Hadoop, this is the book for you. Whether you are new to big data or an expert, with this book, you will be able to master both the basic and the advanced features of Hive. Since Hive is an SQL-like language, some previous experience with the SQL language and databases is useful to have a better understanding of this book. Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; Relational and NoSQL database versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendor packages; Starting Hive in the cloud; Using the Hive command line and Beeline; The Hive-integrated development environment; Summary Chapter 3: Data Definition and DescriptionUnderstanding Hive data types; Data type conversions; Hive Data Definition Language; Hive database; Hive internal and external tables; Hive partitions; Hive buckets; Hive views; Summary; Chapter 4: Data Selection and Scope; The SELECT statement; The INNER JOIN statement; The OUTER JOIN and CROSS JOIN statements; Special JOIN -- MAPJOIN; Set operation -- UNION ALL; Summary; Chapter 5: Data Manipulation; Data exchange -- LOAD; Data exchange -- INSERT; Data exchange -- EXPORT and IMPORT; ORDER and SORT; Operators and functions; Transactions; Summary Chapter 6: Data Aggregation and SamplingBasic aggregation -- GROUP BY; Advanced aggregation -- GROUPING SETS; Advanced aggregation -- ROLLUP and CUBE; Aggregation condition -- HAVING; Analytic functions; Sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; The EXPLAIN statement; The ANALYZE statement; Design optimization; Partition tables; Bucket tables; Index; Data file optimization; File format; Compression; Storage optimization; Job and query optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join Sort merge bucket (SMB) joinSort merge bucket map (SMBM) join; Skew join; Summary; Chapter 8: Extensibility Considerations; User-defined functions; The UDF code template; The UDAF code template; The UDTF code template; Development and deployment; Streaming; SerDe; Summary; Chapter 9: Security Considerations; Authentication; Metastore server authentication; HiveServer2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Encryption; Summary; Chapter 10: Working with Other Tools; JDBC/ODBC connector; HBase; Hue; HCatalog; ZooKeeper; Oozie; Hive roadmap English. Apache (Computer file : Apache Group) http://id.loc.gov/authorities/names/no97023874 Apache Hadoop. http://id.loc.gov/authorities/names/n2013024279 Apache (Computer file : Apache Group) fast Apache Hadoop fast Database management. http://id.loc.gov/authorities/subjects/sh85035848 SQL (Computer program language) http://id.loc.gov/authorities/subjects/sh86006628 Bases de données Gestion. SQL (Langage de programmation) COMPUTERS Web Web Programming. bisacsh Database management fast SQL (Computer program language) fast Siddiqui, Sameen, editor. Subramanian, Laxmi, editor. Print version: Du, Dayong. Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive. Birmingham, England ; Mumbai, [India] : Packt Publishing, ©2015 iii, 189 pages Community experience distilled. 9781783558575 Community experience distilled. http://id.loc.gov/authorities/names/no2011030603 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=959971 Volltext |
spellingShingle | Du, Dayong Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / Community experience distilled. Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Overview of Big Data and Hive; A short history; Introducing big data; Relational and NoSQL database versus Hadoop; Batch, real-time, and stream processing; Overview of the Hadoop ecosystem; Hive overview; Summary; Chapter 2: Setting Up the Hive Environment; Installing Hive from Apache; Installing Hive from vendor packages; Starting Hive in the cloud; Using the Hive command line and Beeline; The Hive-integrated development environment; Summary Chapter 3: Data Definition and DescriptionUnderstanding Hive data types; Data type conversions; Hive Data Definition Language; Hive database; Hive internal and external tables; Hive partitions; Hive buckets; Hive views; Summary; Chapter 4: Data Selection and Scope; The SELECT statement; The INNER JOIN statement; The OUTER JOIN and CROSS JOIN statements; Special JOIN -- MAPJOIN; Set operation -- UNION ALL; Summary; Chapter 5: Data Manipulation; Data exchange -- LOAD; Data exchange -- INSERT; Data exchange -- EXPORT and IMPORT; ORDER and SORT; Operators and functions; Transactions; Summary Chapter 6: Data Aggregation and SamplingBasic aggregation -- GROUP BY; Advanced aggregation -- GROUPING SETS; Advanced aggregation -- ROLLUP and CUBE; Aggregation condition -- HAVING; Analytic functions; Sampling; Summary; Chapter 7: Performance Considerations; Performance utilities; The EXPLAIN statement; The ANALYZE statement; Design optimization; Partition tables; Bucket tables; Index; Data file optimization; File format; Compression; Storage optimization; Job and query optimization; Local mode; JVM reuse; Parallel execution; Join optimization; Common join; Map join; Bucket map join Sort merge bucket (SMB) joinSort merge bucket map (SMBM) join; Skew join; Summary; Chapter 8: Extensibility Considerations; User-defined functions; The UDF code template; The UDAF code template; The UDTF code template; Development and deployment; Streaming; SerDe; Summary; Chapter 9: Security Considerations; Authentication; Metastore server authentication; HiveServer2 authentication; Authorization; Legacy mode; Storage-based mode; SQL standard-based mode; Encryption; Summary; Chapter 10: Working with Other Tools; JDBC/ODBC connector; HBase; Hue; HCatalog; ZooKeeper; Oozie; Hive roadmap Apache (Computer file : Apache Group) http://id.loc.gov/authorities/names/no97023874 Apache Hadoop. http://id.loc.gov/authorities/names/n2013024279 Apache (Computer file : Apache Group) fast Apache Hadoop fast Database management. http://id.loc.gov/authorities/subjects/sh85035848 SQL (Computer program language) http://id.loc.gov/authorities/subjects/sh86006628 Bases de données Gestion. SQL (Langage de programmation) COMPUTERS Web Web Programming. bisacsh Database management fast SQL (Computer program language) fast |
subject_GND | http://id.loc.gov/authorities/names/no97023874 http://id.loc.gov/authorities/names/n2013024279 http://id.loc.gov/authorities/subjects/sh85035848 http://id.loc.gov/authorities/subjects/sh86006628 |
title | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / |
title_auth | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / |
title_exact_search | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / |
title_full | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian. |
title_fullStr | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian. |
title_full_unstemmed | Apache hive essentials : immerse yourself on a fantastic journey to discover the attributes of big data by using hive / Dayong Du ; copy editors Sameen Siddiqui, Laxmi Subramanian. |
title_short | Apache hive essentials : |
title_sort | apache hive essentials immerse yourself on a fantastic journey to discover the attributes of big data by using hive |
title_sub | immerse yourself on a fantastic journey to discover the attributes of big data by using hive / |
topic | Apache (Computer file : Apache Group) http://id.loc.gov/authorities/names/no97023874 Apache Hadoop. http://id.loc.gov/authorities/names/n2013024279 Apache (Computer file : Apache Group) fast Apache Hadoop fast Database management. http://id.loc.gov/authorities/subjects/sh85035848 SQL (Computer program language) http://id.loc.gov/authorities/subjects/sh86006628 Bases de données Gestion. SQL (Langage de programmation) COMPUTERS Web Web Programming. bisacsh Database management fast SQL (Computer program language) fast |
topic_facet | Apache (Computer file : Apache Group) Apache Hadoop. Apache Hadoop Database management. SQL (Computer program language) Bases de données Gestion. SQL (Langage de programmation) COMPUTERS Web Web Programming. Database management |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=959971 |
work_keys_str_mv | AT dudayong apachehiveessentialsimmerseyourselfonafantasticjourneytodiscovertheattributesofbigdatabyusinghive AT siddiquisameen apachehiveessentialsimmerseyourselfonafantasticjourneytodiscovertheattributesofbigdatabyusinghive AT subramanianlaxmi apachehiveessentialsimmerseyourselfonafantasticjourneytodiscovertheattributesofbigdatabyusinghive |