Cluster and classification techniques for the biosciences:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cambridge [u.a.]
Cambridge Univ. Press
2007
|
Ausgabe: | 1. publ. |
Schlagworte: | |
Online-Zugang: | Publisher description Table of contents only Inhaltsverzeichnis |
Beschreibung: | Includes bibliographical references (p. 224-239) and index |
Beschreibung: | XII, 246 S. Ill., graph. Darst. 26 cm |
ISBN: | 0521852811 0521618002 9780521852814 9780521618007 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV022420921 | ||
003 | DE-604 | ||
005 | 20080130 | ||
007 | t | ||
008 | 070510s2007 xxkad|| |||| 00||| eng d | ||
010 | |a 2006027983 | ||
015 | |a GBA678576 |2 dnb | ||
016 | 7 | |a LO 2006027983 |2 DE-101 | |
020 | |a 0521852811 |c hardback |9 0-521-85281-1 | ||
020 | |a 0521618002 |c pbk. |9 0-521-61800-2 | ||
020 | |a 9780521852814 |9 978-0-521-85281-4 | ||
020 | |a 9780521618007 |9 978-0-521-61800-7 | ||
035 | |a (OCoLC)71139443 | ||
035 | |a (DE-599)DNB 2006027983 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
044 | |a xxk |c GB | ||
049 | |a DE-19 |a DE-355 | ||
050 | 0 | |a QH324.2 | |
082 | 0 | |a 570.285 | |
084 | |a WC 7000 |0 (DE-625)148142: |2 rvk | ||
100 | 1 | |a Fielding, Alan H. |e Verfasser |4 aut | |
245 | 1 | 0 | |a Cluster and classification techniques for the biosciences |c Alan H. Fielding |
250 | |a 1. publ. | ||
264 | 1 | |a Cambridge [u.a.] |b Cambridge Univ. Press |c 2007 | |
300 | |a XII, 246 S. |b Ill., graph. Darst. |c 26 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Includes bibliographical references (p. 224-239) and index | ||
650 | 4 | |a Biologie - Classification | |
650 | 4 | |a Biologie - Informatique | |
650 | 4 | |a Classification automatique (Statistique) | |
650 | 4 | |a Datenverarbeitung | |
650 | 4 | |a Biology |x Data processing | |
650 | 4 | |a Biology |v Classification | |
650 | 4 | |a Cluster analysis | |
650 | 4 | |a Cluster Analysis | |
650 | 4 | |a Biometry |x methods | |
650 | 4 | |a Classification |x methods | |
650 | 4 | |a Data Interpretation, Statistical | |
650 | 4 | |a Multivariate Analysis | |
650 | 0 | 7 | |a Biologie |0 (DE-588)4006851-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Cluster-Analyse |0 (DE-588)4070044-6 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Cluster-Analyse |0 (DE-588)4070044-6 |D s |
689 | 0 | 1 | |a Biologie |0 (DE-588)4006851-1 |D s |
689 | 0 | |C b |5 DE-604 | |
856 | 4 | |u http://www.loc.gov/catdir/enhancements/fy0665/2006027983-d.html |3 Publisher description | |
856 | 4 | |u http://www.loc.gov/catdir/enhancements/fy0665/2006027983-t.html |3 Table of contents only | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=015629240&sequence=000004&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-015629240 |
Datensatz im Suchindex
_version_ | 1804136489958768640 |
---|---|
adam_text | Contents
Preface page xi
1 Introduction 1
1.1 Background 1
1.2 Book structure 2
1.3 Classification 2
1.4 Clustering 3
1.5 Structures in data 3
1.6 Glossary 5
1.7 Recommended reading and other resources 10
2 Exploratory data analysis 12
2.1 Background 12
2.2 Dimensionality 13
2.3 Goodness of fit testing 14
2.4 Graphical methods 15
2.5 Variance based data projections 16
2.6 Distance based data projections 29
2.7 Other projection methods 32
2.8 Other methods 36
2.9 Data dredging 38
2.10 Example EDA analysis 38
3 Cluster analysis 46
3.1 Background 46
3.2 Distance and similarity measures 48
3.3 Partitioning methods 55
3.4 Agglomerative hierarchical methods 58
3.5 How many groups are there? 62
3.6 Divisive hierarchical methods 65
vii
vüi Contents
3.7 Two way clustering and gene shaving 66
3.8 Recommended reading 67
3.9 Example analyses 68
4 Introduction to classification 78
4.1 Background 78
4.2 Black box classifiers 81
4.3 Nature of a classifier 82
4.4 No free lunch 85
4.5 Bias and variance 86
4.6 Variable (feature) selection 87
4.7 Multiple classifiers 92
4.8 Why do classifiers fail? 94
4.9 Generalisation 95
4.10 Types of classifier 96
5 Classification algorithms 1 97
5.1 Background 97
5.2 Naive Bayes 99
5.3 Discriminant analysis 100
5.4 Logistic regression 117
5.5 Discriminant analysis or logistic regression? 128
5.6 Generalised additive modeis 130
5.7 Summary 136
6 Other classification methods 137
6.1 Background 137
6.2 Decision trees 137
6.3 Support vector machines 154
6.4 Artificial neural networks 156
6.5 Genetic algorithms 170
6.6 Others 175
6.7 Where next? 177
7 Classification accuracy 179
7.1 Background 179
7.2 Appropriate metrics 180
7.3 Binary accuracy measures 180
7.4 Appropriate testing data 183
7 5 Decision thresholds 186
7.6 Example 187
7.7 ROC plots 190
7.8 Incorporating costs 194
7.9 Comparing classifiers 196
7.10 Recommended reading 199
Contents ix
Appendix A 200
Appendix B 203
Appendix C 207
Appendix D 208
Appendix E 210
Appendix F 217
Appendix G 220
References 224
Index 241
|
adam_txt |
Contents
Preface page xi
1 Introduction 1
1.1 Background 1
1.2 Book structure 2
1.3 Classification 2
1.4 Clustering 3
1.5 Structures in data 3
1.6 Glossary 5
1.7 Recommended reading and other resources 10
2 Exploratory data analysis 12
2.1 Background 12
2.2 Dimensionality 13
2.3 Goodness of fit testing 14
2.4 Graphical methods 15
2.5 Variance based data projections 16
2.6 Distance based data projections 29
2.7 Other projection methods 32
2.8 Other methods 36
2.9 Data dredging 38
2.10 Example EDA analysis 38
3 Cluster analysis 46
3.1 Background 46
3.2 Distance and similarity measures 48
3.3 Partitioning methods 55
3.4 Agglomerative hierarchical methods 58
3.5 How many groups are there? 62
3.6 Divisive hierarchical methods 65
vii
vüi Contents
3.7 Two way clustering and gene shaving 66
3.8 Recommended reading 67
3.9 Example analyses 68
4 Introduction to classification 78
4.1 Background 78
4.2 Black box classifiers 81
4.3 Nature of a classifier 82
4.4 No free lunch 85
4.5 Bias and variance 86
4.6 Variable (feature) selection 87
4.7 Multiple classifiers 92
4.8 Why do classifiers fail? 94
4.9 Generalisation 95
4.10 Types of classifier 96
5 Classification algorithms 1 97
5.1 Background 97
5.2 Naive Bayes 99
5.3 Discriminant analysis 100
5.4 Logistic regression 117
5.5 Discriminant analysis or logistic regression? 128
5.6 Generalised additive modeis 130
5.7 Summary 136
6 Other classification methods 137
6.1 Background 137
6.2 Decision trees 137
6.3 Support vector machines 154
6.4 Artificial neural networks 156
6.5 Genetic algorithms 170
6.6 Others 175
6.7 Where next? 177
7 Classification accuracy 179
7.1 Background 179
7.2 Appropriate metrics 180
7.3 Binary accuracy measures 180
7.4 Appropriate testing data 183
7 5 Decision thresholds 186
7.6 Example 187
7.7 ROC plots 190
7.8 Incorporating costs 194
7.9 Comparing classifiers 196
7.10 Recommended reading 199
Contents ix
Appendix A 200
Appendix B 203
Appendix C 207
Appendix D 208
Appendix E 210
Appendix F 217
Appendix G 220
References 224
Index 241 |
any_adam_object | 1 |
any_adam_object_boolean | 1 |
author | Fielding, Alan H. |
author_facet | Fielding, Alan H. |
author_role | aut |
author_sort | Fielding, Alan H. |
author_variant | a h f ah ahf |
building | Verbundindex |
bvnumber | BV022420921 |
callnumber-first | Q - Science |
callnumber-label | QH324 |
callnumber-raw | QH324.2 |
callnumber-search | QH324.2 |
callnumber-sort | QH 3324.2 |
callnumber-subject | QH - Natural History and Biology |
classification_rvk | WC 7000 |
ctrlnum | (OCoLC)71139443 (DE-599)DNB 2006027983 |
dewey-full | 570.285 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 570 - Biology |
dewey-raw | 570.285 |
dewey-search | 570.285 |
dewey-sort | 3570.285 |
dewey-tens | 570 - Biology |
discipline | Biologie |
discipline_str_mv | Biologie |
edition | 1. publ. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02421nam a2200637 c 4500</leader><controlfield tag="001">BV022420921</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20080130 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">070510s2007 xxkad|| |||| 00||| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">2006027983</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBA678576</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">LO 2006027983</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0521852811</subfield><subfield code="c">hardback</subfield><subfield code="9">0-521-85281-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0521618002</subfield><subfield code="c">pbk.</subfield><subfield code="9">0-521-61800-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780521852814</subfield><subfield code="9">978-0-521-85281-4</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780521618007</subfield><subfield code="9">978-0-521-61800-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)71139443</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB 2006027983</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxk</subfield><subfield code="c">GB</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-19</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QH324.2</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">570.285</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">WC 7000</subfield><subfield code="0">(DE-625)148142:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Fielding, Alan H.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Cluster and classification techniques for the biosciences</subfield><subfield code="c">Alan H. Fielding</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. publ.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge [u.a.]</subfield><subfield code="b">Cambridge Univ. Press</subfield><subfield code="c">2007</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 246 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield><subfield code="c">26 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references (p. 224-239) and index</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biologie - Classification</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biologie - Informatique</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Classification automatique (Statistique)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Datenverarbeitung</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biology</subfield><subfield code="x">Data processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biology</subfield><subfield code="v">Classification</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Cluster analysis</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Cluster Analysis</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biometry</subfield><subfield code="x">methods</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Classification</subfield><subfield code="x">methods</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data Interpretation, Statistical</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Multivariate Analysis</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Biologie</subfield><subfield code="0">(DE-588)4006851-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Cluster-Analyse</subfield><subfield code="0">(DE-588)4070044-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Cluster-Analyse</subfield><subfield code="0">(DE-588)4070044-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Biologie</subfield><subfield code="0">(DE-588)4006851-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="C">b</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://www.loc.gov/catdir/enhancements/fy0665/2006027983-d.html</subfield><subfield code="3">Publisher description</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://www.loc.gov/catdir/enhancements/fy0665/2006027983-t.html</subfield><subfield code="3">Table of contents only</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=015629240&sequence=000004&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-015629240</subfield></datafield></record></collection> |
id | DE-604.BV022420921 |
illustrated | Illustrated |
index_date | 2024-07-02T17:25:39Z |
indexdate | 2024-07-09T20:57:13Z |
institution | BVB |
isbn | 0521852811 0521618002 9780521852814 9780521618007 |
language | English |
lccn | 2006027983 |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-015629240 |
oclc_num | 71139443 |
open_access_boolean | |
owner | DE-19 DE-BY-UBM DE-355 DE-BY-UBR |
owner_facet | DE-19 DE-BY-UBM DE-355 DE-BY-UBR |
physical | XII, 246 S. Ill., graph. Darst. 26 cm |
publishDate | 2007 |
publishDateSearch | 2007 |
publishDateSort | 2007 |
publisher | Cambridge Univ. Press |
record_format | marc |
spelling | Fielding, Alan H. Verfasser aut Cluster and classification techniques for the biosciences Alan H. Fielding 1. publ. Cambridge [u.a.] Cambridge Univ. Press 2007 XII, 246 S. Ill., graph. Darst. 26 cm txt rdacontent n rdamedia nc rdacarrier Includes bibliographical references (p. 224-239) and index Biologie - Classification Biologie - Informatique Classification automatique (Statistique) Datenverarbeitung Biology Data processing Biology Classification Cluster analysis Cluster Analysis Biometry methods Classification methods Data Interpretation, Statistical Multivariate Analysis Biologie (DE-588)4006851-1 gnd rswk-swf Cluster-Analyse (DE-588)4070044-6 gnd rswk-swf Cluster-Analyse (DE-588)4070044-6 s Biologie (DE-588)4006851-1 s b DE-604 http://www.loc.gov/catdir/enhancements/fy0665/2006027983-d.html Publisher description http://www.loc.gov/catdir/enhancements/fy0665/2006027983-t.html Table of contents only HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=015629240&sequence=000004&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Fielding, Alan H. Cluster and classification techniques for the biosciences Biologie - Classification Biologie - Informatique Classification automatique (Statistique) Datenverarbeitung Biology Data processing Biology Classification Cluster analysis Cluster Analysis Biometry methods Classification methods Data Interpretation, Statistical Multivariate Analysis Biologie (DE-588)4006851-1 gnd Cluster-Analyse (DE-588)4070044-6 gnd |
subject_GND | (DE-588)4006851-1 (DE-588)4070044-6 |
title | Cluster and classification techniques for the biosciences |
title_auth | Cluster and classification techniques for the biosciences |
title_exact_search | Cluster and classification techniques for the biosciences |
title_exact_search_txtP | Cluster and classification techniques for the biosciences |
title_full | Cluster and classification techniques for the biosciences Alan H. Fielding |
title_fullStr | Cluster and classification techniques for the biosciences Alan H. Fielding |
title_full_unstemmed | Cluster and classification techniques for the biosciences Alan H. Fielding |
title_short | Cluster and classification techniques for the biosciences |
title_sort | cluster and classification techniques for the biosciences |
topic | Biologie - Classification Biologie - Informatique Classification automatique (Statistique) Datenverarbeitung Biology Data processing Biology Classification Cluster analysis Cluster Analysis Biometry methods Classification methods Data Interpretation, Statistical Multivariate Analysis Biologie (DE-588)4006851-1 gnd Cluster-Analyse (DE-588)4070044-6 gnd |
topic_facet | Biologie - Classification Biologie - Informatique Classification automatique (Statistique) Datenverarbeitung Biology Data processing Biology Classification Cluster analysis Cluster Analysis Biometry methods Classification methods Data Interpretation, Statistical Multivariate Analysis Biologie Cluster-Analyse |
url | http://www.loc.gov/catdir/enhancements/fy0665/2006027983-d.html http://www.loc.gov/catdir/enhancements/fy0665/2006027983-t.html http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=015629240&sequence=000004&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT fieldingalanh clusterandclassificationtechniquesforthebiosciences |