Making sense of data: 2 A practical guide to data visualization, advanced data mining methods, and applications
Gespeichert in:
Format: | Buch |
---|---|
Sprache: | English |
Veröffentlicht: |
Hoboken, NJ
Wiley
2009
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | XIII, 291 S. Ill., graph. Darst. |
ISBN: | 9780470222805 |
Internformat
MARC
LEADER | 00000nam a2200000 cc4500 | ||
---|---|---|---|
001 | BV037438304 | ||
003 | DE-604 | ||
005 | 20110628 | ||
007 | t | ||
008 | 110606s2009 ad|| |||| 00||| eng d | ||
020 | |a 9780470222805 |9 978-0-470-22280-5 | ||
035 | |a (OCoLC)636157775 | ||
035 | |a (DE-599)BVBBV037438304 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-824 |a DE-11 | ||
082 | 0 | |a 006.312 | |
084 | |a ST 530 |0 (DE-625)143679: |2 rvk | ||
245 | 1 | 0 | |a Making sense of data |n 2 |p A practical guide to data visualization, advanced data mining methods, and applications |c Glenn J. Myatt ; Wayne P. Johnson |
264 | 1 | |a Hoboken, NJ |b Wiley |c 2009 | |
300 | |a XIII, 291 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 0 | 7 | |a Datenanalyse |0 (DE-588)4123037-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Data Mining |0 (DE-588)4428654-5 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Data Mining |0 (DE-588)4428654-5 |D s |
689 | 0 | 1 | |a Datenanalyse |0 (DE-588)4123037-1 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Myatt, Glenn J. |e Sonstige |4 oth | |
700 | 1 | |a Johnson, Wayne P. |e Sonstige |4 oth | |
773 | 0 | 8 | |w (DE-604)BV037438294 |g 2 |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022590287&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-022590287 |
Datensatz im Suchindex
_version_ | 1804145748424523776 |
---|---|
adam_text | Titel: Bd. 2. Making sense of data. A practical guide to data visualization, advanced data mining methods,
Autor:
Jahr: 2009
CONTENTS
PREFACE
1 INTRODUCTION
1.1 Overview 1
1.2 Definition 1
1.3 Preparation 2
1.3.1 Overview 2
1.3.2 Accessing Tabular Data 3
1.3.3 Accessing Unstructured Data 3
1.3.4 Understanding the Variables and Observations 3
1.3.5 Data Cleaning 6
1.3.6 Transformation 7
1.3.7 Variable Reduction 9
1.3.8 Segmentation 10
1.3.9 Preparing Data to Apply 10
1.4 Analysis 11
1.4.1 Data Mining Tasks 11
1.4.2 Optimization 12
1.4.3 Evaluation 12
1.4.4 Model Forensics 13
1.5 Deployment 13
1.6 Outline of Book 14
1.6.1 Overview 14
1.6.2 Data Visualization 14
1.6.3 Clustering 15
1.6.4 Predictive Analytics 15
1.6.5 Applications 16
1.6.6 Software 16
1.7 Summary 16
1.8 Further Reading 17
! DATA VISUALIZATION________________________________________________/9
2.1 Overview 19
2.2 Visualization Design Principles 20
2.2.1 General Principles 20
2.2.2 Graphics Design 23
2.2.3 Anatomy of a Graph 28
VI CONTENTS
2.3 Tables 32
2.3.1 Simple Tables 32
2.3.2 Summary Tables 33
2.3.3 Two-Way Contingency Tables 34
2.3.4 Supertables 34
2.4 Univariate Data Visualization 36
2.4.1 Bar Chart 36
2.4.2 Histograms 37
2.4.3 Frequency Polygram 41
2.4.4 Box Plots 41
2.4.5 Dot Plot 43
2.4.6 Stem-and-Leaf Plot 44
2.4.7 Quantile Plot 46
2.4.8 Quantile-Quantile Plot 48
2.5 Bivariate Data Visualization 49
2.5.1 Scatterplot 49
2.6 Multivariate Data Visualization 50
2.6.1 Histogram Matrix 52
2.6.2 Scatterplot Matrix 54
2.6.3 Multiple Box Plot 56
2.6.4 Trellis Plot 56
2.7 Visualizing Groups 59
2.7.1 Dendrograms 59
2.7.2 Decision Trees 60
2.7.3 Cluster Image Maps 60
2.8 Dynamic Techniques 63
2.8.1 Overview 63
2.8.2 Data Brushing 64
2.8.3 Nearness Selection 65
2.8.4 Sorting and Rearranging 65
2.8.5 Searching and Filtering 65
2.9 Summary 65
2.10 Further Reading 66
3 CLUSTERING ________________ 67
3.1 Overview 67
3.2 Distance Measures 75
3.2.1 Overview 75
3.2.2 Numeric Distance Measures 77
3.2.3 Binary Distance Measures 79
3.2.4 Mixed Variables 84
3.2.5 Other Measures 86
3.3 Agglomerative Hierarchical Clustering 87
3.3.1 Overview 87
3.3.2 Single Linkage 88
3.3.3 Complete Linkage 92
3.3.4 Average Linkage 93
3.3.5 Other Methods 96
3.3.6 Selecting Groups 96
CONTENTS VII
3.4 Partitioned-Based Clustering 98
3.4.1 Overview 98
3.4.2 A-Means 98
3.4.3 Worked Example 100
3.4.4 Miscellaneous Partitioned-Based Clustering 101
3.5 Fuzzy Clustering 103
3.5.1 Overview 103
3.5.2 Fuzzy £-Means 103
3.5.3 Worked Examples 104
3.6 Summary 109
3.7 Further Reading 110
* PREDICTIVE ANALYTICS 111
4.1 Overview 111
4.1.1 Predictive Modeling 111
4.1.2 Testing Model Accuracy 116
4.1.3 Evaluating Regression Models Predictive Accuracy 117
4.1.4 Evaluating Classification Models Predictive Accuracy 119
4.1.5 Evaluating Binary Models Predictive Accuracy 120
4.1.6 ROC Charts 122
4.1.7 Lift Chart 124
4.2 Principal Component Analysis 126
4.2.1 Overview 126
4.2.2 Principal Components 126
4.2.3 Generating Principal Components 127
4.2.4 Interpretation of Principal Components 128
4.3 Multiple Linear Regression 130
4.3.1 Overview 130
4.3.2 Generating Models 133
4.3.3 Prediction 136
4.3.4 Analysis of Residuais 136
4.3.5 Standard Error 139
4.3.6 Coefficient of Multiple Determination 140
4.3.7 Testing the Model Significance 142
4.3.8 Selecting and Transforming Variables 143
4.4 Discriminant Analysis 145
4.4.1 Overview 145
4.4.2 Discriminant Function 146
4.4.3 Discriminant Analysis Example 146
4.5 Logistic Regression 151
4.5.1 Overview 151
4.5.2 Logistic Regression Formula 151
4.5.3 Estimating Coefficients 153
4.5.4 Assessing and Optimizing Results 156
4.6 Naive Bayes Classifiers 157
4.6.1 Overview 157
4.6.2 Bayes Theorem and the Independence Assumption 158
4.6.3 Independence Assumption 158
4.6.4 Classification Process 159
VIII CONTENTS
4.7 Summary 161
4.8 Further Reading 163
5 APPLICATIONS_____________________________________________________161
165
166
169
169
171
172
173
174
175
176
177
179
181
181
181
183
192
192
192
199
203
203
203
203
210
213
215
215
215
216
217
217
_______219
B.l Software Overview 219
B.l.l Software Objectives 219
B.l.2, Access and Installation 221
B.l.3 User Interface Overview 221
B.2 Data Preparation 223
B.2.1 Overview 223
B.2.2 Reading in Data 224
B.2.3 Searching the Data 225
5.1 Overview
5.2 Sales and Marketing
5.3 Industry-Specific Data Mining
5.3.1 Finance
5.3.2 Insurance
5.3.3 Retail
5.3.4 Telecommunications
5.3.5 Manufacturing
5.3.6 Entertainment
5.3.7 Government
5.3.8 Pharmaceuticals
5.3.9 Healthcare
5.4 microRNA Data Analysis Case Study
5.4.1 Defining the Problem
5.4.2 Preparing the Data
5.4.3 Analysis
5.5 Credit Scoring Case Study
5.5.1 Defining the Problem
5.5.2 Preparing the Data
5.5.3 Analysis
5.5.4 Deployment
5.6 Data Mining Nontabular Data
5.6.1 Overview
5.6.2 Data Mining Chemical Data
5.6.3 Data Mining Text
5.7 Further Reading
APPENDIX A MATRICES
A.l Overview of Matrices
A.2 Matrix Addition
A.3 Matrix Multiplication
A.4 Transpose of a Matrix
A.5 Inverse of a Matrix
APPENDIX B SOFTWARE
CONTENTS IX
227
228
228
230
235
236
238
238
239
240
242
246
246
246
248
248
249
250
251
251
253
253
254
257
258
261
261
263
265
266
267
269
269
270
271
273
INDEX 279
B.2.4 Variable Characterization
B.2.5 Removing Observations and Variables
B.2.6 Cleaning the Data
B.2.7 Transforming the Data
B.2.8 Segmentation
B.2.9 Principal Component Analysis
B.3 Tables and Graphs
B.3.1 Overview
B.3.2 Contingency Tables
B.3.3 Summary Tables
B.3.4 Graphs
B.3.5 Graph Matrices
B.4 Statistics
B.4.1 Overview
B.4.2 Descriptive Statistics
B.4.3 Confidence Intervals
B.4.4 Hypothesis Tests
B.4.5 Chi-Square Test
B.4.6 ANOVA
B.4.7 Comparative Statistics
B.5 Groupi ing
B.5.1 Overview
B.5.2 Clustering
B.5.3 Associative Rules
B.5.4 Decision Trees
B.6 Prediction
B.6.1 Overview
B.6.2 Linear Regression
B.6.3 Discriminant Analysis
B.6.4 Logistic Regression
B.6.5 Naive Bayes
B.6.6 *NN
B.6.7 CART
B.6.8 Neural Networks
B.6.9 Apply Model
|
any_adam_object | 1 |
building | Verbundindex |
bvnumber | BV037438304 |
classification_rvk | ST 530 |
ctrlnum | (OCoLC)636157775 (DE-599)BVBBV037438304 |
dewey-full | 006.312 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.312 |
dewey-search | 006.312 |
dewey-sort | 16.312 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01472nam a2200373 cc4500</leader><controlfield tag="001">BV037438304</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20110628 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">110606s2009 ad|| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780470222805</subfield><subfield code="9">978-0-470-22280-5</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)636157775</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV037438304</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-824</subfield><subfield code="a">DE-11</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.312</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 530</subfield><subfield code="0">(DE-625)143679:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Making sense of data</subfield><subfield code="n">2</subfield><subfield code="p">A practical guide to data visualization, advanced data mining methods, and applications</subfield><subfield code="c">Glenn J. Myatt ; Wayne P. Johnson</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, NJ</subfield><subfield code="b">Wiley</subfield><subfield code="c">2009</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIII, 291 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Myatt, Glenn J.</subfield><subfield code="e">Sonstige</subfield><subfield code="4">oth</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Johnson, Wayne P.</subfield><subfield code="e">Sonstige</subfield><subfield code="4">oth</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="w">(DE-604)BV037438294</subfield><subfield code="g">2</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022590287&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-022590287</subfield></datafield></record></collection> |
id | DE-604.BV037438304 |
illustrated | Illustrated |
indexdate | 2024-07-09T23:24:23Z |
institution | BVB |
isbn | 9780470222805 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-022590287 |
oclc_num | 636157775 |
open_access_boolean | |
owner | DE-824 DE-11 |
owner_facet | DE-824 DE-11 |
physical | XIII, 291 S. Ill., graph. Darst. |
publishDate | 2009 |
publishDateSearch | 2009 |
publishDateSort | 2009 |
publisher | Wiley |
record_format | marc |
spelling | Making sense of data 2 A practical guide to data visualization, advanced data mining methods, and applications Glenn J. Myatt ; Wayne P. Johnson Hoboken, NJ Wiley 2009 XIII, 291 S. Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier Datenanalyse (DE-588)4123037-1 gnd rswk-swf Data Mining (DE-588)4428654-5 gnd rswk-swf Data Mining (DE-588)4428654-5 s Datenanalyse (DE-588)4123037-1 s DE-604 Myatt, Glenn J. Sonstige oth Johnson, Wayne P. Sonstige oth (DE-604)BV037438294 2 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022590287&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Making sense of data Datenanalyse (DE-588)4123037-1 gnd Data Mining (DE-588)4428654-5 gnd |
subject_GND | (DE-588)4123037-1 (DE-588)4428654-5 |
title | Making sense of data |
title_auth | Making sense of data |
title_exact_search | Making sense of data |
title_full | Making sense of data 2 A practical guide to data visualization, advanced data mining methods, and applications Glenn J. Myatt ; Wayne P. Johnson |
title_fullStr | Making sense of data 2 A practical guide to data visualization, advanced data mining methods, and applications Glenn J. Myatt ; Wayne P. Johnson |
title_full_unstemmed | Making sense of data 2 A practical guide to data visualization, advanced data mining methods, and applications Glenn J. Myatt ; Wayne P. Johnson |
title_short | Making sense of data |
title_sort | making sense of data a practical guide to data visualization advanced data mining methods and applications |
topic | Datenanalyse (DE-588)4123037-1 gnd Data Mining (DE-588)4428654-5 gnd |
topic_facet | Datenanalyse Data Mining |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022590287&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV037438294 |
work_keys_str_mv | AT myattglennj makingsenseofdata2 AT johnsonwaynep makingsenseofdata2 |