Data mining: concepts, models, methods, and algorithms
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Hoboken, NJ
Wiley
2011
|
Ausgabe: | 2. ed. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Includes bibliographical references and index. |
Beschreibung: | XVII, 534 S. Ill., graph. Darst. |
ISBN: | 9780470890455 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV036806384 | ||
003 | DE-604 | ||
005 | 20211026 | ||
007 | t| | ||
008 | 101130s2011 xx ad|| |||| 00||| eng d | ||
015 | |a GBB098510 |2 dnb | ||
020 | |a 9780470890455 |9 978-0-470-89045-5 | ||
035 | |a (OCoLC)706018027 | ||
035 | |a (DE-599)BVBBV036806384 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-473 |a DE-20 |a DE-863 |a DE-634 |a DE-355 |a DE-91G |a DE-824 |a DE-B768 |a DE-1043 | ||
084 | |a ST 270 |0 (DE-625)143638: |2 rvk | ||
084 | |a ST 530 |0 (DE-625)143679: |2 rvk | ||
084 | |a DAT 450f |2 stub | ||
100 | 1 | |a Kantardzic, Mehmed |d 1947- |e Verfasser |0 (DE-588)1244240834 |4 aut | |
245 | 1 | 0 | |a Data mining |b concepts, models, methods, and algorithms |c Mehmed Kantardzic |
250 | |a 2. ed. | ||
264 | 1 | |a Hoboken, NJ |b Wiley |c 2011 | |
300 | |a XVII, 534 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Includes bibliographical references and index. | ||
650 | 4 | |a Data mining | |
650 | 0 | 7 | |a Datenanalyse |0 (DE-588)4123037-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Data Mining |0 (DE-588)4428654-5 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4148875-1 |a Datensammlung |2 gnd-content | |
689 | 0 | 0 | |a Data Mining |0 (DE-588)4428654-5 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Datenanalyse |0 (DE-588)4123037-1 |D s |
689 | 1 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-1-118-02914-5 |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe, PDF |z 978-1-118-02912-1 |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe, EPUB |z 978-1-118-02913-8 |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020722459&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-020722459 |
Datensatz im Suchindex
DE-BY-863_location | 1340 |
---|---|
DE-BY-FWS_call_number | 1340/ST 530 K16(2) |
DE-BY-FWS_katkey | 413155 |
DE-BY-FWS_media_number | 083101159296 |
_version_ | 1821282190641594368 |
adam_text |
Titel: Data mining
Autor: Kantardzic, Mehmed
Jahr: 2011
CONTENTS
Preface to the Second Edition xiii
Preface to the First Edition xv
1 DATA-MINING CONCEPTS 1
1.1 Introduction 1
1.2 Data-Mining Roots 4
1.3 Data-Mining Process 6
1.4 Large Data Sets 9
1.5 Data Warehouses for Data Mining 14
1.6 Business Aspects of Data Mining: Why a Data-Mining Project Fails 17
1.7 Organization of This Book 21
1.8 Review Questions and Problems 23
1.9 References for Further Study 24
2 PREPARING THE DATA 26
2.1 Representation of Raw Data 26
2.2 Characteristics of Raw Data 31
2.3 Transformation of Raw Data 33
2.4 Missing Data 36
2.5 Time-Dependent Data 37
2.6 Outlier Analysis 41
2.7 Review Questions and Problems 48
2.8 References for Further Study 51
3 DATA REDUCTION 53
3.1 Dimensions of Large Data Sets 54
3.2 Feature Reduction 56
3.3 Relief Algorithm 66
vii
Viii CONTENTS
3.4 Entropy Measure for Ranking Features 68
3.5 PCA 70
3.6 Value Reduction 73
3.7 Feature Discretization: ChiMerge Technique 77
3.8 Case Reduction 80
3.9 Review Questions and Problems 83
3.10 References for Further Study 85
4 LEARNING FROM DATA 87
4.1 Learning Machine 89
4.2 SLT 93
4.3 Types of Learning Methods 99
4.4 Common Learning Tasks 101
4.5 SVMs 105
4.6 kNN: Nearest Neighbor Classifier 118
4.7 Model Selection versus Generalization 122
4.8 Model Estimation 126
4.9 90% Accuracy: Now What? 132
4.10 Review Questions and Problems 136
4.11 References for Further Study 138
5 STATISTICAL METHODS 140
5.1 Statistical Inference 141
5.2 Assessing Differences in Data Sets 143
5.3 Bayesian Inference 146
5.4 Predictive Regression 149
5.5 ANOVA 155
5.6 Logistic Regression 157
5.7 Log-Linear Models 158
5.8 LDA 162
5.9 Review Questions and Problems 164
5.10 References for Further Study 167
6 DECISION TREES AND DECISION RULES 169
6.1 Decision Trees 171
6.2 C4.5 Algorithm: Generating a Decision Tree 173
6.3 Unknown Attribute Values 180
CONTENTS ix
6.4 Pruning Decision Trees 184
6.5 C4.5 Algorithm: Generating Decision Rules 185
6.6 CART Algorithm Gini Index 189
6.7 Limitations of Decision Trees and Decision Rules 192
6.8 Review Questions and Problems 194
6.9 References for Further Study 198
7 ARTIFICIAL NEURAL NETWORKS 199
7.1 Model of an Artificial Neuron 201
7.2 Architectures of ANNs 205
7.3 Learning Process 207
7.4 Learning Tasks Using ANNs 210
7.5 Multilayer Perceptrons (MLPs) 213
7.6 Competitive Networks and Competitive Learning 221
7.7 SOMs 225
7.8 Review Questions and Problems 231
7.9 References for Further Study 233
8 ENSEMBLE LEARNING 235
8.1 Ensemble-Learning Methodologies 236
8.2 Combination Schemes for Multiple Learners 240
8.3 Bagging and Boosting 241
8.4 AdaBoost 243
8.5 Review Questions and Problems 245
8.6 References for Further Study 247
9 CLUSTER ANALYSIS 249
9.1 Clustering Concepts 250
9.2 Similarity Measures 253
9.3 Agglomerative Hierarchical Clustering 259
9.4 Partitional Clustering 263
9.5 Incremental Clustering 266
9.6 DBSCAN Algorithm 270
9.7 BIRCH Algorithm 272
9.8 Clustering Validation 275
9.9 Review Questions and Problems 275
9.10 References for Further Study 279
X CONTENTS
10 ASSOCIATION RULES 280
10.1 Market-Basket Analysis 281
10.2 Algorithm Apriori 283
10.3 From Frequent Itemsets to Association Rules 285
10.4 Improving the Efficiency of the Apriori Algorithm 286
10.5 FP Growth Method 288
10.6 Associative-Classification Method 290
10.7 Multidimensional Association-Rules Mining 293
10.8 Review Questions and Problems 295
10.9 References for Further Study 298
11 WEB MINING AND TEXT MINING 300
11.1 Web Mining 300
11.2 Web Content, Structure, and Usage Mining 302
11.3 HITS and LOGSOM Algorithms 305
11.4 Mining Path-Traversal Patterns 310
11.5 PageRank Algorithm 313
11.6 Text Mining 316
11.7 Latent Semantic Analysis (LSA) 320
11.8 Review Questions and Problems 324
11.9 References for Further Study 326
12 ADVANCES IN DATA MINING 328
12.1 Graph Mining 329
12.2 Temporal Data Mining 343
12.3 Spatial Data Mining (SDM) 357
12.4 Distributed Data Mining (DDM) 360
12.5 Correlation Does Not Imply Causality 369
12.6 Privacy, Security, and Legal Aspects of Data Mining 376
12.7 Review Questions and Problems 381
12.8 References for Further Study 382
13 GENETIC ALGORITHMS 385
13.1 Fundamentals of GAs 386
13.2 Optimization Using GAs 388
13.3 A Simple Illustration of a GA 394
13.4 Schemata 399
13.5 TSP 402
CONTENTS X¡
13.6 Machine Learning Using GAs 404
13.7 GAs for Clustering 409
13.8 Review Questions and Problems 411
13.9 References for Further Study 413
14 FUZZY SETS AND FUZZY LOGIC 414
14.1 Fuzzy Sets 415
14.2 Fuzzy-Set Operations 420
14.3 Extension Principle and Fuzzy Relations 425
14.4 Fuzzy Logic and Fuzzy Inference Systems 429
14.5 Multifactorial Evaluation 433
14.6 Extracting Fuzzy Models from Data 436
14.7 Data Mining and Fuzzy Sets 441
14.8 Review Questions and Problems 443
14.9 References for Further Study 445
15 VISUALIZATION METHODS 447
15.1 Perception and Visualization 448
15.2 Scientific Visualization and
Information Visualization 449
15.3 Parallel Coordinates 455
15.4 Radial Visualization 458
15.5 Visualization Using Self-Organizing Maps (SOMs) 460
15.6 Visualization Systems for Data Mining 462
15.7 Review Questions and Problems 467
15.8 References for Further Study 468
Appendix A 470
A. 1 Data-Mining Journals 470
A.2 Data-Mining Conferences 473
A.3 Data-Mining Forums/Blogs 477
A.4 Data Sets 478
A.5 Comercially and Publicly Available Tools 480
A.6 Web Site Links 489
Appendix B: Data-Mining Applications 496
B.I Data Mining for Financial Data Analysis 496
B.2 Data Mining for the Telecomunications Industry 499
XÜ CONTENTS
B.3 Data Mining for the Retail Industry 501
B.4 Data Mining in Health Care and Biomedicai Research 503
B.5 Data Mining in Science and Engineering 506
B.6 Pitfalls of Data Mining 509
Bibliography 510
Index 529 |
any_adam_object | 1 |
author | Kantardzic, Mehmed 1947- |
author_GND | (DE-588)1244240834 |
author_facet | Kantardzic, Mehmed 1947- |
author_role | aut |
author_sort | Kantardzic, Mehmed 1947- |
author_variant | m k mk |
building | Verbundindex |
bvnumber | BV036806384 |
classification_rvk | ST 270 ST 530 |
classification_tum | DAT 450f |
ctrlnum | (OCoLC)706018027 (DE-599)BVBBV036806384 |
discipline | Informatik |
edition | 2. ed. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV036806384</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20211026</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">101130s2011 xx ad|| |||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBB098510</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780470890455</subfield><subfield code="9">978-0-470-89045-5</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)706018027</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV036806384</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-473</subfield><subfield code="a">DE-20</subfield><subfield code="a">DE-863</subfield><subfield code="a">DE-634</subfield><subfield code="a">DE-355</subfield><subfield code="a">DE-91G</subfield><subfield code="a">DE-824</subfield><subfield code="a">DE-B768</subfield><subfield code="a">DE-1043</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 270</subfield><subfield code="0">(DE-625)143638:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 530</subfield><subfield code="0">(DE-625)143679:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 450f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kantardzic, Mehmed</subfield><subfield code="d">1947-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1244240834</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Data mining</subfield><subfield code="b">concepts, models, methods, and algorithms</subfield><subfield code="c">Mehmed Kantardzic</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">2. ed.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, NJ</subfield><subfield code="b">Wiley</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XVII, 534 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references and index.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4148875-1</subfield><subfield code="a">Datensammlung</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Datenanalyse</subfield><subfield code="0">(DE-588)4123037-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-118-02914-5</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe, PDF</subfield><subfield code="z">978-1-118-02912-1</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe, EPUB</subfield><subfield code="z">978-1-118-02913-8</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020722459&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020722459</subfield></datafield></record></collection> |
genre | (DE-588)4148875-1 Datensammlung gnd-content |
genre_facet | Datensammlung |
id | DE-604.BV036806384 |
illustrated | Illustrated |
indexdate | 2025-01-15T04:00:47Z |
institution | BVB |
isbn | 9780470890455 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-020722459 |
oclc_num | 706018027 |
open_access_boolean | |
owner | DE-473 DE-BY-UBG DE-20 DE-863 DE-BY-FWS DE-634 DE-355 DE-BY-UBR DE-91G DE-BY-TUM DE-824 DE-B768 DE-1043 |
owner_facet | DE-473 DE-BY-UBG DE-20 DE-863 DE-BY-FWS DE-634 DE-355 DE-BY-UBR DE-91G DE-BY-TUM DE-824 DE-B768 DE-1043 |
physical | XVII, 534 S. Ill., graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Wiley |
record_format | marc |
spellingShingle | Kantardzic, Mehmed 1947- Data mining concepts, models, methods, and algorithms Data mining Datenanalyse (DE-588)4123037-1 gnd Data Mining (DE-588)4428654-5 gnd |
subject_GND | (DE-588)4123037-1 (DE-588)4428654-5 (DE-588)4148875-1 |
title | Data mining concepts, models, methods, and algorithms |
title_auth | Data mining concepts, models, methods, and algorithms |
title_exact_search | Data mining concepts, models, methods, and algorithms |
title_full | Data mining concepts, models, methods, and algorithms Mehmed Kantardzic |
title_fullStr | Data mining concepts, models, methods, and algorithms Mehmed Kantardzic |
title_full_unstemmed | Data mining concepts, models, methods, and algorithms Mehmed Kantardzic |
title_short | Data mining |
title_sort | data mining concepts models methods and algorithms |
title_sub | concepts, models, methods, and algorithms |
topic | Data mining Datenanalyse (DE-588)4123037-1 gnd Data Mining (DE-588)4428654-5 gnd |
topic_facet | Data mining Datenanalyse Data Mining Datensammlung |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020722459&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT kantardzicmehmed dataminingconceptsmodelsmethodsandalgorithms |
Inhaltsverzeichnis
THWS Würzburg Teilbibliothek SHL, Raum I.2.11
Signatur: |
1340 ST 530 K16(2) |
---|---|
Exemplar 1 | nicht ausleihbar Verfügbar Bestellen |