Mining of massive datasets:
Gespeichert in:
Späterer Titel: | Leskovec, Jure Mining of massive datasets |
---|---|
Hauptverfasser: | , |
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cambridge [u.a.]
Cambridge Univ. Press
2012
|
Ausgabe: | 1. publ. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | 2. Aufl. u.d.T.: Leskovec, Jure: Mining of massive datasets |
Beschreibung: | X, 315 S. Ill., graph. Darst. |
ISBN: | 9781107015357 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV039744649 | ||
003 | DE-604 | ||
005 | 20201124 | ||
007 | t| | ||
008 | 111206s2012 xx ad|| |||| 00||| eng d | ||
020 | |a 9781107015357 |9 978-1-107-01535-7 | ||
035 | |a (OCoLC)775417063 | ||
035 | |a (DE-599)BVBBV039744649 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-858 |a DE-92 |a DE-91G |a DE-473 |a DE-898 |a DE-824 |a DE-859 |a DE-634 |a DE-20 | ||
082 | 0 | |a 006.312 |2 23 | |
084 | |a SK 850 |0 (DE-625)143263: |2 rvk | ||
084 | |a ST 270 |0 (DE-625)143638: |2 rvk | ||
084 | |a ST 530 |0 (DE-625)143679: |2 rvk | ||
084 | |a DAT 450f |2 stub | ||
084 | |a DAT 616f |2 stub | ||
100 | 1 | |a Rajaraman, Anand |e Verfasser |4 aut | |
245 | 1 | 0 | |a Mining of massive datasets |c Anand Rajaraman ; Jeffrey David Ullman |
250 | |a 1. publ. | ||
264 | 1 | |a Cambridge [u.a.] |b Cambridge Univ. Press |c 2012 | |
300 | |a X, 315 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a 2. Aufl. u.d.T.: Leskovec, Jure: Mining of massive datasets | ||
650 | 0 | 7 | |a Big Data |0 (DE-588)4802620-7 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Data Mining |0 (DE-588)4428654-5 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Big Data |0 (DE-588)4802620-7 |D s |
689 | 0 | 1 | |a Data Mining |0 (DE-588)4428654-5 |D s |
689 | 0 | |8 1\p |5 DE-604 | |
700 | 1 | |a Ullman, Jeffrey D. |d 1942- |e Verfasser |0 (DE-588)123598230 |4 aut | |
785 | 0 | 0 | |i Ersetzt durch |a Leskovec, Jure |t Mining of massive datasets |b Second edition |d 2014 |z 978-1-107-07723-2 |w (DE-604)BV042002309 |
856 | 4 | 2 | |m Digitalisierung UB Bamberg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024592236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk | |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-024592236 |
Datensatz im Suchindex
_version_ | 1815702943897223168 |
---|---|
adam_text |
Contents
e
page
ix
Data Mining
ι
1.1
What is Data Mining?
1
1.2
Statistical Limits on Data Mining
4
1.3
Things Useful to Know
7
1.4
Outline of the Book
15
1.5
Summary of Chapter
1 16
1.6
References for Chapter
1 17
Large-Scale File Systems and Map-Reduce
18
2.1
Distributed File Systems
18
2.2
Map-Reduce
21
2.3
Algorithms Using Map-Reduce
26
2.4
Extensions to Map-Reduce
37
2.5
Efficiency of Cluster-Computing Algorithms
42
2.6
Summary of Chapter
2 49
2.7
References for Chapter
2 51
Finding Similar Items
53
3.1
Applications of Near-Neighbor Search
53
3.2
Shingling of Documents
57
3.3
Similarity-Preserving Summaries of Sets
60
3.4
Locality-Sensitive Hashing for Documents
67
3.5
Distance Measures
71
3.6
The Theory of Locality-Sensitive Functions
77
3.7
LSH Families for Other Distance Measures
83
3.8
Applications of Locality-Sensitive Hashing
88
3.9
Methods for High Degrees of Similarity
96
3.10
Summary of Chapter
3 104
3.11
References for Chapter
3 106
Mining Data Streams
108
4.1
The Stream Data Model
108
Contents
4.2
Sampling Data in a Stream
112
4.3
Filtering Streams
115
4.4
Counting Distinct Elements in a Stream
118
4.5
Estimating Moments
122
4.6
Counting Ones in a Window
127
4.7
Decaying Windows
133
4.8
Summary of Chapter
4 136
4.9
References for Chapter
4 137
Link Analysis
139
5.1
PageRank
139
5.2
Efficient Computation of PageRank
153
5.3
Topic-Sensitive PageRank
159
5.4
Link Spam
163
5.5
Hubs and Authorities
167
5.6
Summary of Chapter
5 172
5.7
References for Chapter
5 175
Frequent itemsets
176
6.1
The Market-Basket Model
176
6.2
Market. Baskets and the
А
-Priori
Algorithm
183
6.3
Handling Larger
Datasets in
Main Memory
192
6.4
Limited-Pass Algorithms
199
6.5
Counting Frequent Items in a Stream
205
6.6
Summary of Chapter
6 209
6.7
References for Chapter
6 211
Clustering
213
7.1
Introduction to Clustering Techniques
213
7.2
Hierarchical Clustering
217
7.3
K-means Algorithms
226
7.4
The CURE Algorithm
234
7.5
Clustering in Non-Euclidean Spaces
237
7.6
Clustering for Streams and Parallelism
241
7.7
Summary of Chapter
7 247
7.8
References for Chapter
7 250
Advertising on the Web
252
8.1
Issues in On-Line Advertising
252
8.2
On-Line Algorithms
255
8.3
The Matching Problem
258
8.4
The Adwords Problem
261
8.5
Âdwords
Implementation
270
8.6
Summary of Chapter
8 273
Contents
8.7
References for Chapter
8 275
Recommendation Systems
277
9.1
A Model for Recommendation Systems
277
9.2
Content-Based Recommendations
281
9.3
Collaborative Filtering
291
9.4
Dimensionality Reduction
297
9.5
The NetFlix Challenge
305
9.6
Summary of Chapter
9 306
9.7
References for Chapter
9 308
Index
310 |
any_adam_object | 1 |
author | Rajaraman, Anand Ullman, Jeffrey D. 1942- |
author_GND | (DE-588)123598230 |
author_facet | Rajaraman, Anand Ullman, Jeffrey D. 1942- |
author_role | aut aut |
author_sort | Rajaraman, Anand |
author_variant | a r ar j d u jd jdu |
building | Verbundindex |
bvnumber | BV039744649 |
classification_rvk | SK 850 ST 270 ST 530 |
classification_tum | DAT 450f DAT 616f |
ctrlnum | (OCoLC)775417063 (DE-599)BVBBV039744649 |
dewey-full | 006.312 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.312 |
dewey-search | 006.312 |
dewey-sort | 16.312 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik Mathematik |
edition | 1. publ. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV039744649</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20201124</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">111206s2012 xx ad|| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781107015357</subfield><subfield code="9">978-1-107-01535-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)775417063</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV039744649</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-858</subfield><subfield code="a">DE-92</subfield><subfield code="a">DE-91G</subfield><subfield code="a">DE-473</subfield><subfield code="a">DE-898</subfield><subfield code="a">DE-824</subfield><subfield code="a">DE-859</subfield><subfield code="a">DE-634</subfield><subfield code="a">DE-20</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.312</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 850</subfield><subfield code="0">(DE-625)143263:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 270</subfield><subfield code="0">(DE-625)143638:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 530</subfield><subfield code="0">(DE-625)143679:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 450f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 616f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Rajaraman, Anand</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Mining of massive datasets</subfield><subfield code="c">Anand Rajaraman ; Jeffrey David Ullman</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. publ.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge [u.a.]</subfield><subfield code="b">Cambridge Univ. Press</subfield><subfield code="c">2012</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 315 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">2. Aufl. u.d.T.: Leskovec, Jure: Mining of massive datasets</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Big Data</subfield><subfield code="0">(DE-588)4802620-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Ullman, Jeffrey D.</subfield><subfield code="d">1942-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)123598230</subfield><subfield code="4">aut</subfield></datafield><datafield tag="785" ind1="0" ind2="0"><subfield code="i">Ersetzt durch</subfield><subfield code="a">Leskovec, Jure</subfield><subfield code="t">Mining of massive datasets</subfield><subfield code="b">Second edition</subfield><subfield code="d">2014</subfield><subfield code="z">978-1-107-07723-2</subfield><subfield code="w">(DE-604)BV042002309</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Bamberg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024592236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-024592236</subfield></datafield></record></collection> |
id | DE-604.BV039744649 |
illustrated | Illustrated |
indexdate | 2024-11-14T13:01:03Z |
institution | BVB |
isbn | 9781107015357 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-024592236 |
oclc_num | 775417063 |
open_access_boolean | |
owner | DE-858 DE-92 DE-91G DE-BY-TUM DE-473 DE-BY-UBG DE-898 DE-BY-UBR DE-824 DE-859 DE-634 DE-20 |
owner_facet | DE-858 DE-92 DE-91G DE-BY-TUM DE-473 DE-BY-UBG DE-898 DE-BY-UBR DE-824 DE-859 DE-634 DE-20 |
physical | X, 315 S. Ill., graph. Darst. |
publishDate | 2012 |
publishDateSearch | 2012 |
publishDateSort | 2012 |
publisher | Cambridge Univ. Press |
record_format | marc |
spelling | Rajaraman, Anand Verfasser aut Mining of massive datasets Anand Rajaraman ; Jeffrey David Ullman 1. publ. Cambridge [u.a.] Cambridge Univ. Press 2012 X, 315 S. Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier 2. Aufl. u.d.T.: Leskovec, Jure: Mining of massive datasets Big Data (DE-588)4802620-7 gnd rswk-swf Data Mining (DE-588)4428654-5 gnd rswk-swf Big Data (DE-588)4802620-7 s Data Mining (DE-588)4428654-5 s 1\p DE-604 Ullman, Jeffrey D. 1942- Verfasser (DE-588)123598230 aut Ersetzt durch Leskovec, Jure Mining of massive datasets Second edition 2014 978-1-107-07723-2 (DE-604)BV042002309 Digitalisierung UB Bamberg application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024592236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk |
spellingShingle | Rajaraman, Anand Ullman, Jeffrey D. 1942- Mining of massive datasets Big Data (DE-588)4802620-7 gnd Data Mining (DE-588)4428654-5 gnd |
subject_GND | (DE-588)4802620-7 (DE-588)4428654-5 |
title | Mining of massive datasets |
title_auth | Mining of massive datasets |
title_exact_search | Mining of massive datasets |
title_full | Mining of massive datasets Anand Rajaraman ; Jeffrey David Ullman |
title_fullStr | Mining of massive datasets Anand Rajaraman ; Jeffrey David Ullman |
title_full_unstemmed | Mining of massive datasets Anand Rajaraman ; Jeffrey David Ullman |
title_new | Leskovec, Jure Mining of massive datasets |
title_short | Mining of massive datasets |
title_sort | mining of massive datasets |
topic | Big Data (DE-588)4802620-7 gnd Data Mining (DE-588)4428654-5 gnd |
topic_facet | Big Data Data Mining |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024592236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT rajaramananand miningofmassivedatasets AT ullmanjeffreyd miningofmassivedatasets |