Statistical learning for biomedical data:
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cambridge [u.a.]
Cambridge Univ. Press
2011
|
Schriftenreihe: | Practical guides to biostatistics and epidemiology
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | XII, 285 S. Ill., graph. Darst. |
ISBN: | 9780521699099 0521699096 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV039109575 | ||
003 | DE-604 | ||
005 | 20220106 | ||
007 | t | ||
008 | 110630s2011 ad|| |||| 00||| eng d | ||
020 | |a 9780521699099 |c (pbk.) £28.99 |9 978-0-521-69909-9 | ||
020 | |a 0521699096 |c (pbk.) £28.99 |9 0-521-69909-6 | ||
035 | |a (OCoLC)732255567 | ||
035 | |a (DE-599)HBZHT016766794 | ||
040 | |a DE-604 |b ger | ||
041 | 0 | |a eng | |
049 | |a DE-M49 |a DE-526 |a DE-83 | ||
082 | 0 | |a 614.285 | |
084 | |a XF 3400 |0 (DE-625)152765: |2 rvk | ||
084 | |a MAT 620f |2 stub | ||
084 | |a BIO 107f |2 stub | ||
084 | |a MED 730f |2 stub | ||
084 | |a 92C50 |2 msc | ||
084 | |a 62P10 |2 msc | ||
100 | 1 | |a Malley, James D. |e Verfasser |4 aut | |
245 | 1 | 0 | |a Statistical learning for biomedical data |c James D. Malley ; Karen G. Malley ; Sinisa Pajevic |
264 | 1 | |a Cambridge [u.a.] |b Cambridge Univ. Press |c 2011 | |
300 | |a XII, 285 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Practical guides to biostatistics and epidemiology | |
650 | 0 | 7 | |a Biostatistik |0 (DE-588)4729990-3 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Medizinische Statistik |0 (DE-588)4127563-9 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Epidemiologie |0 (DE-588)4015016-1 |2 gnd |9 rswk-swf |
653 | |a Medical statistics--Data processing. | ||
653 | |a Biometry--Data processing. | ||
689 | 0 | 0 | |a Biostatistik |0 (DE-588)4729990-3 |D s |
689 | 0 | 1 | |a Medizinische Statistik |0 (DE-588)4127563-9 |D s |
689 | 0 | 2 | |a Epidemiologie |0 (DE-588)4015016-1 |D s |
689 | 0 | |C b |5 DE-604 | |
700 | 1 | |a Malley, Karen G. |e Verfasser |4 aut | |
700 | 1 | |a Pajevic, Sinisa |e Verfasser |4 aut | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022653322&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-022653322 |
Datensatz im Suchindex
_version_ | 1804145822588207104 |
---|---|
adam_text | Titel: Statistical learning for biomedical data
Autor: Malley, James D.
Jahr: 2011
Preface page xi
Acknowledgments xii
Part I Introduction 1
1 Prologue 3
1.1 Machines that learn - some recent history 3
1.2 Twenty canonical questions 7
1.3 Outline of the book 9
1.4 A comment about example datasets 11
1.5 Software 12
Note 13
2 The landscape of learning machines 14
2.1 Introduction 14
2.2 Types of data for learning machines 15
2.3 Will that be supervised or unsupervised? 17
2.4 An unsupervised example 18
2.5 More lack of supervision - where are the parents? 20
2.6 Engines, complex and primitive 20
2.7 Model richness means what, exactly? 22
2.8 Membership or probability of membership? 25
2.9 A taxonomy of machines? 27
2.10 A note of caution - one of many 30
2.11 Highlights from the theory 30
Notes 36
3 A mangle of machines 41
3.1 Introduction 41
3.2 Linear regression 41
3.3 Logistic regression 42
3.4 Linear discriminant 43
3.5 Bayes classifiers - regular and naïve 45
3.6 Logic regression 47
3.7 Ac-Nearest neighbors 48
3.8 Support vector machines 50
3.9 Neural networks 53
3.10 Boosting 54
3.11 Evolutionary and genetic algorithms 55
Notes 56
4 Three examples and several machines 57
4.1 Introduction 57
4.2 Simulated cholesterol data 58
4.3 Lupus data 61
4.4 Stroke data 62
4.5 Biomedical means unbalanced 63
4.6 Measures of machine performance 64
4.7 Linear analysis of cholesterol data 66
4.8 Nonlinear analysis of cholesterol data 67
4.9 Analysis of the lupus data 70
4.10 Analysis of the stroke data 75
4.11 Further analysis of the lupus and stroke data 79
Notes 87
Part II A machine toolkit 89
5 Logistic regression 91
5.1 Introduction 91
5.2 Inside and around the model 92
5.3 Interpreting the coefficients 93
5.4 Using logistic regression as a decision rule 94
5.5 Logistic regression applied to the cholesterol data 94
5.6 A cautionary note 98
5.7 Another cautionary note 101
5.8 Probability estimates and decision rules 102
5.9 Evaluating the goodness-of-fit of a logistic regression model 103
5.10 Calibrating a logistic regression 106
5.11 Beyond calibration 111
5.12 Logistic regression and reference models 113
Notes 115
6 A single decision tree 118
6.1 Introduction 118
6.2 Dropping down trees 118
6.3 Growing a tree 120
6.4 Selecting features, making splits 120
6.5 Good split, bad split 121
6.6 Finding good features for making splits 124
6.7 Misreading trees 125
6.8 Stopping and pruning rules 127
6.9 Using functions of the features 128
6.10 Unstable trees? 129
6.11 Variable importance - growing on trees? 132
6.12 Permuting for importance 134
6.13 The continuing mystery of trees 135
7 Random Forests - trees everywhere 137
7.1 Random Forests in less than five minutes 137
7.2 Random treks through the data 138
7.3 Random treks through the features 139
7.4 Walking through the forest 140
7.5 Weighted and unweighted voting 140
7.6 Finding subsets in the data using proximities 142
7.7 Applying Random Forests to the Stroke data 144
7.8 Random Forests in the universe of machines 151
Notes 153
Part III Analysis fundamentals 155
8 Merely two variables 157
8.1 Introduction 157
8.2 Understanding correlations 158
8.3 Hazards of correlations 159
8.4 Correlations big and small 163
Notes 168
9 More than two variables 171
9.1 Introduction 171
9.2 Tiny problems, large consequences 172
9.3 Mathematics to the rescue? 174
9.4 Good models need not be unique 176
9.5 Contexts and coefficients 179
9.6 Interpreting and testing coefficients in models 181
9.7 Merging models, pooling lists, ranking features 186
Notes 190
10 Resampling methods 198
10.1 Introduction 198
10.2 The bootstrap 198
10.3 When the bootstrap works 201
10.4 When the bootstrap doesn t work 202
10.5 Resampling from a single group in different ways 203
10.6 Resampling from groups with unequal sizes 204
10.7 Resampling from small datasets 206
10.8 Permutation methods 207
10.9 Still more on permutation methods 210
Note 214
11 Error analysis and model validation 215
11.1 Introduction 215
11.2 Errors? What errors? 217
11.3 Unbalanced data, unbalanced errors 218
11.4 Error analysis for a single machine 219
11.5 Cross-validation error estimation 222
11.6 Cross-validation or cross-training? 224
11.7 The leave-one-out method 226
11.8 The out-of-bag method 227
11.9 Intervals for error estimates for a single machine 228
11.10 Tossing random coins into the abyss 230
11.11 Error estimates for unbalanced data 232
11.12 Confidence intervals for comparing error values 233
11.13 Other measures of machine accuracy 236
11.14 Benchmarking and winning the lottery 238
11.15 Error analysis for predicting continuous outcomes 239
Notes 240
Machine strategies 245
Ensemble methods - let s take a vote 247
12.1 Pools of machines 247
12.2 Weak correlation with outcome can be good enough 247
12.3 Model averaging 250
Notes 254
13 Summary and conclusions 255
13.1 Where have we been? 255
13.2 So many machines 257
13.3 Binary decision or probability estimate? 259
13.4 Survival machines? Risk machines? 259
13.5 And where are we going? 260
Appendix 263
References 271
Index 281
The color plate is situated between pages 244 and 245.
|
any_adam_object | 1 |
author | Malley, James D. Malley, Karen G. Pajevic, Sinisa |
author_facet | Malley, James D. Malley, Karen G. Pajevic, Sinisa |
author_role | aut aut aut |
author_sort | Malley, James D. |
author_variant | j d m jd jdm k g m kg kgm s p sp |
building | Verbundindex |
bvnumber | BV039109575 |
classification_rvk | XF 3400 |
classification_tum | MAT 620f BIO 107f MED 730f |
ctrlnum | (OCoLC)732255567 (DE-599)HBZHT016766794 |
dewey-full | 614.285 |
dewey-hundreds | 600 - Technology (Applied sciences) |
dewey-ones | 614 - Forensic medicine; incidence of disease |
dewey-raw | 614.285 |
dewey-search | 614.285 |
dewey-sort | 3614.285 |
dewey-tens | 610 - Medicine and health |
discipline | Biologie Mathematik Medizin |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01971nam a2200505 c 4500</leader><controlfield tag="001">BV039109575</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220106 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">110630s2011 ad|| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780521699099</subfield><subfield code="c">(pbk.) £28.99</subfield><subfield code="9">978-0-521-69909-9</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0521699096</subfield><subfield code="c">(pbk.) £28.99</subfield><subfield code="9">0-521-69909-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)732255567</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)HBZHT016766794</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-M49</subfield><subfield code="a">DE-526</subfield><subfield code="a">DE-83</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">614.285</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">XF 3400</subfield><subfield code="0">(DE-625)152765:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MAT 620f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">BIO 107f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MED 730f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">92C50</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">62P10</subfield><subfield code="2">msc</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Malley, James D.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Statistical learning for biomedical data</subfield><subfield code="c">James D. Malley ; Karen G. Malley ; Sinisa Pajevic</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge [u.a.]</subfield><subfield code="b">Cambridge Univ. Press</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 285 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Practical guides to biostatistics and epidemiology</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Biostatistik</subfield><subfield code="0">(DE-588)4729990-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Medizinische Statistik</subfield><subfield code="0">(DE-588)4127563-9</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Epidemiologie</subfield><subfield code="0">(DE-588)4015016-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Medical statistics--Data processing.</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Biometry--Data processing.</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Biostatistik</subfield><subfield code="0">(DE-588)4729990-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Medizinische Statistik</subfield><subfield code="0">(DE-588)4127563-9</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Epidemiologie</subfield><subfield code="0">(DE-588)4015016-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="C">b</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Malley, Karen G.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Pajevic, Sinisa</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022653322&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-022653322</subfield></datafield></record></collection> |
id | DE-604.BV039109575 |
illustrated | Illustrated |
indexdate | 2024-07-09T23:25:34Z |
institution | BVB |
isbn | 9780521699099 0521699096 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-022653322 |
oclc_num | 732255567 |
open_access_boolean | |
owner | DE-M49 DE-BY-TUM DE-526 DE-83 |
owner_facet | DE-M49 DE-BY-TUM DE-526 DE-83 |
physical | XII, 285 S. Ill., graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Cambridge Univ. Press |
record_format | marc |
series2 | Practical guides to biostatistics and epidemiology |
spelling | Malley, James D. Verfasser aut Statistical learning for biomedical data James D. Malley ; Karen G. Malley ; Sinisa Pajevic Cambridge [u.a.] Cambridge Univ. Press 2011 XII, 285 S. Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier Practical guides to biostatistics and epidemiology Biostatistik (DE-588)4729990-3 gnd rswk-swf Medizinische Statistik (DE-588)4127563-9 gnd rswk-swf Epidemiologie (DE-588)4015016-1 gnd rswk-swf Medical statistics--Data processing. Biometry--Data processing. Biostatistik (DE-588)4729990-3 s Medizinische Statistik (DE-588)4127563-9 s Epidemiologie (DE-588)4015016-1 s b DE-604 Malley, Karen G. Verfasser aut Pajevic, Sinisa Verfasser aut HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022653322&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Malley, James D. Malley, Karen G. Pajevic, Sinisa Statistical learning for biomedical data Biostatistik (DE-588)4729990-3 gnd Medizinische Statistik (DE-588)4127563-9 gnd Epidemiologie (DE-588)4015016-1 gnd |
subject_GND | (DE-588)4729990-3 (DE-588)4127563-9 (DE-588)4015016-1 |
title | Statistical learning for biomedical data |
title_auth | Statistical learning for biomedical data |
title_exact_search | Statistical learning for biomedical data |
title_full | Statistical learning for biomedical data James D. Malley ; Karen G. Malley ; Sinisa Pajevic |
title_fullStr | Statistical learning for biomedical data James D. Malley ; Karen G. Malley ; Sinisa Pajevic |
title_full_unstemmed | Statistical learning for biomedical data James D. Malley ; Karen G. Malley ; Sinisa Pajevic |
title_short | Statistical learning for biomedical data |
title_sort | statistical learning for biomedical data |
topic | Biostatistik (DE-588)4729990-3 gnd Medizinische Statistik (DE-588)4127563-9 gnd Epidemiologie (DE-588)4015016-1 gnd |
topic_facet | Biostatistik Medizinische Statistik Epidemiologie |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=022653322&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT malleyjamesd statisticallearningforbiomedicaldata AT malleykareng statisticallearningforbiomedicaldata AT pajevicsinisa statisticallearningforbiomedicaldata |