Data science and analytics with Python:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Boca Raton ; London ; New York
CRC Press, Taylor & Francis Group
[2017]
|
Schriftenreihe: | Chapman & Hall/CRC data mining and knowledge discovery series
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Includes bibliographical references and index |
Beschreibung: | xxxv, 376 Seiten Illustrationen 24 cm |
ISBN: | 9781498742092 9781138043176 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV044927682 | ||
003 | DE-604 | ||
005 | 20181005 | ||
007 | t | ||
008 | 180503s2017 xxua||| |||| 00||| eng d | ||
010 | |a 016057021 | ||
020 | |a 9781498742092 |9 978-1-4987-4209-2 | ||
020 | |a 9781138043176 |9 978-1-138-04317-6 | ||
035 | |a (OCoLC)1002916422 | ||
035 | |a (DE-599)BVBBV044927682 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
044 | |a xxu |c US | ||
049 | |a DE-703 |a DE-19 |a DE-739 | ||
050 | 0 | |a QA76.9.D343 | |
082 | 0 | |a 006.3/12 |2 23 | |
084 | |a ST 530 |0 (DE-625)143679: |2 rvk | ||
100 | 1 | |a Rogel-Salazar, Jesús |e Verfasser |0 (DE-588)1139793233 |4 aut | |
245 | 1 | 0 | |a Data science and analytics with Python |c Jesús Rogel-Salazar |
264 | 1 | |a Boca Raton ; London ; New York |b CRC Press, Taylor & Francis Group |c [2017] | |
300 | |a xxxv, 376 Seiten |b Illustrationen |c 24 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Chapman & Hall/CRC data mining and knowledge discovery series | |
500 | |a Includes bibliographical references and index | ||
650 | 4 | |a Data mining | |
650 | 4 | |a Python (Computer program language) | |
650 | 4 | |a Databases | |
650 | 0 | 7 | |a Python |g Programmiersprache |0 (DE-588)4434275-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenbank |0 (DE-588)4011119-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Data Mining |0 (DE-588)4428654-5 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Data Mining |0 (DE-588)4428654-5 |D s |
689 | 0 | 1 | |a Python |g Programmiersprache |0 (DE-588)4434275-5 |D s |
689 | 0 | 2 | |a Datenbank |0 (DE-588)4011119-2 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m Digitalisierung UB Passau - ADAM Catalogue Enrichment |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=030320823&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-030320823 |
Datensatz im Suchindex
_version_ | 1804178501278892032 |
---|---|
adam_text | Contents
Trials and Tribulations of a Data Scientist i
1.1 Data? Science? Data Science! 2
1.1.1 So, What Is Data Science? 3
1.2 The Data Scientist: A Modern Jackalope 7
1.2.1 Characteristics of a Data Scientist and a Data Science Team
1.3 Data Science Tools 17
2.3.1 Open Source Tools 20
1.4 From Data to Insight: the Data Science Workflow 22
1.4.1 Identify the Question 24
7.4.2 Acquire Data 23
1.4.3 DataMunging 25
2.4.4 Modelling and Evaluation 26
1.4.5 Representation and Interaction 26
1.4.6 Data Science: an Iterative Process 27
2.5 Summary 28
X J. ROGEL-SALAZAR
Python: For Something Completely Different
2.1
2.1.1
2.1.2
2.2
2.2.1
2.2.2
2.2.3
2.2.4
2.2.5
2.2.6
2.2.7
2.3
2.3.1
2.3.2
2.3.3
2.3.4
2.3.5
Why Python? Why not?! 33
To Shell or not To Shell 36
iPython/Jupyter Notebook 39
FzVsis Slithers with Python 40
Basic Types 40
Numbers 41
Strings 41
Complex Numbers 43
Lz sfs 44
Tup/es 49
Dictionaries 52
Control Flow 54
if... elif... else 55
while 56
for 57
try... except 58
Functions 61
2.3.6 Scripts and Modules 65
2.4 Computation and Data Manipulation 68
2.4.1 Matrix Manipulations and Linear Algebra
2.4.2 NumPy Arrays and Matrices 71
2.4.3 Indexing and Slicing 74
DATA SCIENCE AND ANALYTICS WITH PYTHON xi
2.5 Pandas to the Rescue 76
2.6 Plotting and Visualising: Matplotlib 81
2.7 Summary 83
The Machine that Goes Ping : Machine Learning and Pattern •
Recognition 87
3.1 Recognising Patterns 87
3.2 Artificial Intelligence and Machine Learning 90
3.3 Data is Good, but other Things are also Needed 92
3.4 Learning, Predicting and Classifying 94
3.5 Machine Learning and Data Science 98
3.6 Feature Selection 100
3.7 Bias, Variance and Régularisation: A Balancing Act 102
3.8 SottiE Useful Measures: Distance and Similarity 105
3.9 Beware the Curse of Dimensionality 110
3.10 Scikit-Learn is our Friend 116
3.11 Training and Testing 119
3.12 Cross-Validation 124
3.32.3 k-fold Cross-Validation 125
3.13 Summary 128
XU J. ROGEL-SALAZAR
The Relationship Conundrum: Regression 131
4.1 Relationships between Variables: Regression 131
4.2 Multivariate Linear Regression 136
4-3 Ordinary Least Squares 138
4.3.1 The Maths Way 139
4.4 Brain and Body: Regression with One Variable 144
4.4.1 Regression with Scikit-learn 133
4-5 Logarithmic Transformation 155
4.6 Making the Task Easier: Standardisation and Scaling
4.6.1 Normalisation or Unit Scaling 161
4.6.2 z-Score Scaling 162
4-7 Polynomial Regression 164
4.7.1 Multivariate Regression 169
4.8 Variance-Bias Trade-Off 170
4.9 Shrinkage: LASSO and Ridge 172
4.10 Summary 179
5 Jackalopes and Hares: Clustering 181
5.2 Clustering 182
5.2 Clustering with k-means 283
5.2.1
5.2.2
Cluster Validation 186
k-means in Action 189
DATA SCIENCE AND ANALYTICS WITH PYTHON Xlll
5.3 Summary 193
Unicorns and Horses: Classification 195
6.1 Classification 196
6.1.1 Confusion Matrices 198
6.1.2 ROCandAUC 202
6.2 Classification with KNN 205
6.2.1 KNN in Action 206
6.3 Classification with Logistic Regression 211
6.3.1 Logistic Regression Interpretation 216
6.3.2 Logistic Regression in Action 218
6.4 Classification with Naïve Bayes 226
6.4.1 Naïve Bayes Classifier 232
6.4.2 Naïve Bayes in Action 233
6.5 Summary 238
Decisions, Decisions: Hierarchical Clustering, Decision Trees and
Ensemble Techniques 241
7.1 Hierarchical Clustering 242
7.1.1 Hierarchical Clustering in Action 245
7.2 Decision Trees 249
7.2.1 Decision Trees in Action 256
XIV J. ROGEL-SALAZAR
7.3 Ensemble Techniques 265
7.3.1 Bagging 271
7.3.2 Boosting 272
7.3.3 Random Forests 274
7.3.4 Stacking and Blending 276
7.4 Ensemble Techniques in Action 277
7.5 Summary 282
8 Less is More: Dimensionality Reduction
8.1 Dimensionality Reduction 286
8.2 Principal Component Analysis 291
8.2.1 PCA in Action 295
8.2.2 PCA in the Iris Dataset 300
8.3 Singular Value Decomposition 304
8.3.1 SVD in Action 306
8.4 Recommendation Systems 310
8.4.1 Content-Based Filtering in Action 312
8.4.2 Collaborative Filtering in Action 316
8.5 Summary 323
9 Kerne/ Tricks up the Sleeve: Support Vector Machines
9.1 Support Vector Machines and Kernel Methods 328
DATA SCIENCE AND ANALYTICS WITH PYTHON XV
9.1.1 Support Vector Machines 331
9.1.2 The Kernel Trick 340
9.1.3 SVM in Action: Regression 343
9.1.4 SVM in Action: Classification 347
9.2 Summary 353
Pipelines in Scikit-Learn 355
Bibliography 361
Index 369
|
any_adam_object | 1 |
author | Rogel-Salazar, Jesús |
author_GND | (DE-588)1139793233 |
author_facet | Rogel-Salazar, Jesús |
author_role | aut |
author_sort | Rogel-Salazar, Jesús |
author_variant | j r s jrs |
building | Verbundindex |
bvnumber | BV044927682 |
callnumber-first | Q - Science |
callnumber-label | QA76 |
callnumber-raw | QA76.9.D343 |
callnumber-search | QA76.9.D343 |
callnumber-sort | QA 276.9 D343 |
callnumber-subject | QA - Mathematics |
classification_rvk | ST 530 |
ctrlnum | (OCoLC)1002916422 (DE-599)BVBBV044927682 |
dewey-full | 006.3/12 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.3/12 |
dewey-search | 006.3/12 |
dewey-sort | 16.3 212 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01930nam a2200481 c 4500</leader><controlfield tag="001">BV044927682</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20181005 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">180503s2017 xxua||| |||| 00||| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">016057021</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781498742092</subfield><subfield code="9">978-1-4987-4209-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781138043176</subfield><subfield code="9">978-1-138-04317-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1002916422</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV044927682</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxu</subfield><subfield code="c">US</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-703</subfield><subfield code="a">DE-19</subfield><subfield code="a">DE-739</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QA76.9.D343</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.3/12</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 530</subfield><subfield code="0">(DE-625)143679:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Rogel-Salazar, Jesús</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1139793233</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Data science and analytics with Python</subfield><subfield code="c">Jesús Rogel-Salazar</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Boca Raton ; London ; New York</subfield><subfield code="b">CRC Press, Taylor & Francis Group</subfield><subfield code="c">[2017]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xxxv, 376 Seiten</subfield><subfield code="b">Illustrationen</subfield><subfield code="c">24 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Chapman & Hall/CRC data mining and knowledge discovery series</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references and index</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Python (Computer program language)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Databases</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Python</subfield><subfield code="g">Programmiersprache</subfield><subfield code="0">(DE-588)4434275-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenbank</subfield><subfield code="0">(DE-588)4011119-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Python</subfield><subfield code="g">Programmiersprache</subfield><subfield code="0">(DE-588)4434275-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Datenbank</subfield><subfield code="0">(DE-588)4011119-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Passau - ADAM Catalogue Enrichment</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=030320823&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-030320823</subfield></datafield></record></collection> |
id | DE-604.BV044927682 |
illustrated | Illustrated |
indexdate | 2024-07-10T08:04:58Z |
institution | BVB |
isbn | 9781498742092 9781138043176 |
language | English |
lccn | 016057021 |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-030320823 |
oclc_num | 1002916422 |
open_access_boolean | |
owner | DE-703 DE-19 DE-BY-UBM DE-739 |
owner_facet | DE-703 DE-19 DE-BY-UBM DE-739 |
physical | xxxv, 376 Seiten Illustrationen 24 cm |
publishDate | 2017 |
publishDateSearch | 2017 |
publishDateSort | 2017 |
publisher | CRC Press, Taylor & Francis Group |
record_format | marc |
series2 | Chapman & Hall/CRC data mining and knowledge discovery series |
spelling | Rogel-Salazar, Jesús Verfasser (DE-588)1139793233 aut Data science and analytics with Python Jesús Rogel-Salazar Boca Raton ; London ; New York CRC Press, Taylor & Francis Group [2017] xxxv, 376 Seiten Illustrationen 24 cm txt rdacontent n rdamedia nc rdacarrier Chapman & Hall/CRC data mining and knowledge discovery series Includes bibliographical references and index Data mining Python (Computer program language) Databases Python Programmiersprache (DE-588)4434275-5 gnd rswk-swf Datenbank (DE-588)4011119-2 gnd rswk-swf Data Mining (DE-588)4428654-5 gnd rswk-swf Data Mining (DE-588)4428654-5 s Python Programmiersprache (DE-588)4434275-5 s Datenbank (DE-588)4011119-2 s DE-604 Digitalisierung UB Passau - ADAM Catalogue Enrichment application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=030320823&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Rogel-Salazar, Jesús Data science and analytics with Python Data mining Python (Computer program language) Databases Python Programmiersprache (DE-588)4434275-5 gnd Datenbank (DE-588)4011119-2 gnd Data Mining (DE-588)4428654-5 gnd |
subject_GND | (DE-588)4434275-5 (DE-588)4011119-2 (DE-588)4428654-5 |
title | Data science and analytics with Python |
title_auth | Data science and analytics with Python |
title_exact_search | Data science and analytics with Python |
title_full | Data science and analytics with Python Jesús Rogel-Salazar |
title_fullStr | Data science and analytics with Python Jesús Rogel-Salazar |
title_full_unstemmed | Data science and analytics with Python Jesús Rogel-Salazar |
title_short | Data science and analytics with Python |
title_sort | data science and analytics with python |
topic | Data mining Python (Computer program language) Databases Python Programmiersprache (DE-588)4434275-5 gnd Datenbank (DE-588)4011119-2 gnd Data Mining (DE-588)4428654-5 gnd |
topic_facet | Data mining Python (Computer program language) Databases Python Programmiersprache Datenbank Data Mining |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=030320823&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT rogelsalazarjesus datascienceandanalyticswithpython |