Exploratory model comparison: interactive model ensemble selection and management
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Abschlussarbeit Buch |
Sprache: | English |
Veröffentlicht: |
Berlin
Logos-Verl.
2011
|
Schriftenreihe: | Augsburger Schriften zur Mathematik, Physik und Informatik
17 |
Schlagworte: | |
Online-Zugang: | Inhaltstext Inhaltsverzeichnis |
Beschreibung: | V, 189 S. Ill., graph. Darst., Kt. |
ISBN: | 9783832529277 3832529276 |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV039740111 | ||
003 | DE-604 | ||
005 | 20111215 | ||
007 | t| | ||
008 | 111205s2011 xx abd| m||| 00||| eng d | ||
015 | |a 11,N33 |2 dnb | ||
016 | 7 | |a 101425129X |2 DE-101 | |
020 | |a 9783832529277 |c kart. : EUR 42.00 (DE), EUR 43.20 (AT), sfr 74.80 (freier Pr.) |9 978-3-8325-2927-7 | ||
020 | |a 3832529276 |9 3-8325-2927-6 | ||
024 | 3 | |a 9783832529277 | |
035 | |a (OCoLC)772870598 | ||
035 | |a (DE-599)DNB101425129X | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-384 |a DE-188 | ||
082 | 0 | |a 519.5 |2 22/ger | |
084 | |a SK 840 |0 (DE-625)143261: |2 rvk | ||
084 | |a 004 |2 sdnb | ||
084 | |a 510 |2 sdnb | ||
100 | 1 | |a Seger, Ralf |e Verfasser |4 aut | |
245 | 1 | 0 | |a Exploratory model comparison |b interactive model ensemble selection and management |c Ralf Seger |
264 | 1 | |a Berlin |b Logos-Verl. |c 2011 | |
300 | |a V, 189 S. |b Ill., graph. Darst., Kt. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Augsburger Schriften zur Mathematik, Physik und Informatik |v 17 | |
502 | |a Zugl.: Augsburg, Univ., Diss., 2011 | ||
650 | 0 | 7 | |a Modellwahl |0 (DE-588)4304786-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Computerunterstütztes Verfahren |0 (DE-588)4139030-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Statistisches Modell |0 (DE-588)4121722-6 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Statistisches Modell |0 (DE-588)4121722-6 |D s |
689 | 0 | 1 | |a Modellwahl |0 (DE-588)4304786-5 |D s |
689 | 0 | 2 | |a Computerunterstütztes Verfahren |0 (DE-588)4139030-1 |D s |
689 | 0 | |5 DE-604 | |
830 | 0 | |a Augsburger Schriften zur Mathematik, Physik und Informatik |v 17 |w (DE-604)BV017601953 |9 17 | |
856 | 4 | 2 | |m X:MVB |q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=3866553&prov=M&dok_var=1&dok_ext=htm |3 Inhaltstext |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024587812&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-024587812 |
Datensatz im Suchindex
_version_ | 1817681162390208512 |
---|---|
adam_text |
IMAGE 1
CONTENTS
PREFACE V
1 INTRODUCTION 1
1.1 WHAT IS A MODEL ? 1
1.2 STATISTICAL MODELS 3
1.3 EXPLORATORY MODEL ANALYSIS 4
1.3.1 EXAMPLE: SCOTTISH HILL RACES 5
1.3.2 GLOBAL STATISTICS 8
1.3.3 MORE SPECIFIC STATISTICS 8
1.4 MODEL ARCHITECTURE 9
1.4.1 MODEL SELECTION 10
1.4.1.1 MODEL SELECTION BIAS 10
1.4.1.2 MODEL SELECTION UNCERTAINTY 11
1.5 SUMMARY AND OUTLOOK 12
2 MODEL COMPARISON 13
2.1 INTRODUCTION 13
2.2 A DISTANCE BETWEEN MODELS 14
2.2.1 FINDING THE BEST MODEL 14
2.2.1.1 ORDINARY LEAST SQUARES 15
2.2.1.2 MAXIMUM LIKELIHOOD 15
2.2.2 INFORMATION CRITERIA 10
3 MODEL SELECTION STRATEGIES 23
3.1 MODEL VALIDATION 23
3.1.1 CROSS-VALIDATION 24
3.1.2 RESAMPLING TECHNIQUES 24
3.2 SINGLE MODEL SELECTION 25
3.3 MODEL ENSEMBLES 26
3.3.1 HOW TO GAIN INFERENCE FROM MORE THAN ONE MODEL 27
3.3.2 BAGGING 28
3.3.3 BOOSTING 28
3.3.3.1 EXAMPLE: BOOSTING A REGRESSION MODEL 30
3.3.4 BUMPING 35
3.3.5 RANDOM SUBSPACE METHOD (RSM) 36
3.3.6 BAYESIAN MODEL AVERAGING 36
3.3.7 STACKED GENERALIZATION 37
3.4 SUMMARY 37
4 INTERACTIVE MODEL ANALYSIS 39
BIBLIOGRAFISCHE INFORMATIONEN HTTP://D-NB.INFO/101425129X
DIGITALISIERT DURCH
IMAGE 2
4.1 INTERACTION 41
4.1.1 AN EXAMPLE 44
4.1.2 SUMMARY 49
THE SOFTWARE MORET 50
5.1 GENERAL THOUGHTS 50
5.2 EXAMPLE: ELECTIONS04 52
5.2.1 ABOUT THE DATA 52
5.2.2 MODELS 60
5.2.3 INTERMEDIATE SUMMARY 61
5.3 FROM REQUIREMENTS TO DESIGN 62
5.3.1 REPRODUCIBILITY 62
5.3.2 ACCESS TO MODEL STATISTICS 63
5.3.3 EASY DATA ACCESSIBILITY 65
5.3.4 DATA ACCESSIBILITY FOR OTHER SOFTWARE 65
5.3.5 ADDITIONAL DATA 65
5.3.6 DATA ORGANIZATION 66
5.3.7 SUMMARY 66
5.3.7.1 FEATURE OVERVIEW 67
5.4 THE DEVELOPMENT OF MORET 68
5.4.1 THE BEGINNING: TRIAL AND ERROR 68
5.4.2 THE FIRST WORKING VERSION 69
5.4.2.1 ANALYZING MODELS 71
5.4.3 ADVANCED FEATURES 79
5.4.3.1 A SELF-CONFIGURING DATABASE 80
5.4.3.2 GENERIC DATABASE STRUCTURE 81
5.4.3.3 MODEL CONFIGURATION 82
5.4.4 MISCELLANEOUS FEATURES 84
5.4.4.1 FINDING MODELS BY VALUES 84
5.4.5 SUMMARY 86
WORKING WITH MORET 87
6.1 OVERVIEW 87
6.1.1 DATA SET HANDLING 87
6.1.2 THE IMPORTANCE OF DATA INTEGRITY 88
6.1.3 A TEXTBOOK EXAMPLE 89
6.2 CUSTOM MODEL CONFIGURATION 92
6.2.1 EXAMPLE CONFIGURATION FOR AN RPART TREE MODEL 92
6.3 MODELS IN MORET 94
6.3.1 HOW TO CREATE A CANDIDATE SET 95
6.3.2 CONSTRAINED MODEL CREATION 99
6.3.3 SUMMARY 102
6.3.4 TYPICAL STEPS 103
6.3.5 FEATURE QUERIES - AN ALTERNATIVE MODEL TABLE SELECTION APPROACH
110 6.4 ORGANIZING DATA 115
6.4.1 FILTERS 115
6.4.2 GROUPS 116
6.5 EXAMPLE: INTERACTIVE SEARCH 118
IMAGE 3
6.5.1 FLORIDA ELECTION DATA 118
6.5.2 BODYFAT DATA 122
6.5.2.1 USING TWO CONCURRENT DATA SETS 123
6.5.2.2 AN ALTERNATIVE APPROACH TO MODEL SELECTION 130
6.5.2.3 AVERAGED MODEL PROPERTIES 135
6.6 SUMMARY 141
7 CONCLUSION 142
7.1 SUMMARY 142
7.2 ACHIEVEMENTS 143
7.3 NEXT STEPS 143
A APPENDIX 145
A.I R MODEL PACKAGES 145
A.I.I EXAMPLES: BAYESIAN MODEL ANALYSIS 145
B TECHNICAL DETAILS 148
B.I EXTRACTING DATA FROM R 148
B.I.I WHEN TO EXTRACT DATA 150
B.1.2 MAPPING DATA FROM R 152
B.1.3 UML SEQUENCE DIAGRAM: INTERCEPTING USER COMMANDS 153
B.2 R SOURCE CODE 155
B.2.1 CREATE BOOTSTRAP DATA.FRAME FROM DATA.FRAME 155
B.2.2 EXTRACT R 2^ FOR LINEAR MODEL 155
B.2.3 CREATE RANK MATRIX FOR LINEAR MODELS 156
B.3 REGULAR EXPRESSIONS 157
B.3.1 QUANTIFIERS 158
B.3.2 CHARACTER CLASSES 158
B.3.3 SPECIAL CHARACTERS 159
B.3.4 USING REGEX FOR MORET TABLE FILTERING 160
B.3.5 SUMMARY ICI
B.4 UML 162
B.4.1 NOMENCLATURE 162
B.4.1.1 CLASS 162
B.4.1.2 RELATIONS 163
B.4.1.3 INTERFACES AND ABSTRACT CLASSES 164
B.4.2 SEQUENCE DIAGRAMS 165
B.4.3 COMPOSITE STATE DIAGRAMS 166
B.4.4 SUMMARY 167
B.5 DEVELOPMENT COMMENTS 167
B.5.1 RECOMMENDATIONS 168
B.5.2 UNIFICATION OF COMMANDS 168
B.5.3 BACKWARD COMPATIBILITY 168
B.5.4 INTEGRATION 169
BIBLIOGRAPHY 170
LIST OF FIGURES 182
IMAGE 4
LIST OF TABLES 186
LIST OF ALGORITHMS 187
INDEX 188
ABOUT THE AUTHOR 191 |
any_adam_object | 1 |
author | Seger, Ralf |
author_facet | Seger, Ralf |
author_role | aut |
author_sort | Seger, Ralf |
author_variant | r s rs |
building | Verbundindex |
bvnumber | BV039740111 |
classification_rvk | SK 840 |
ctrlnum | (OCoLC)772870598 (DE-599)DNB101425129X |
dewey-full | 519.5 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.5 |
dewey-search | 519.5 |
dewey-sort | 3519.5 |
dewey-tens | 510 - Mathematics |
discipline | Informatik Mathematik |
format | Thesis Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 cb4500</leader><controlfield tag="001">BV039740111</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20111215</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">111205s2011 xx abd| m||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">11,N33</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">101425129X</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783832529277</subfield><subfield code="c">kart. : EUR 42.00 (DE), EUR 43.20 (AT), sfr 74.80 (freier Pr.)</subfield><subfield code="9">978-3-8325-2927-7</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">3832529276</subfield><subfield code="9">3-8325-2927-6</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783832529277</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)772870598</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB101425129X</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-384</subfield><subfield code="a">DE-188</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">519.5</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 840</subfield><subfield code="0">(DE-625)143261:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">510</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Seger, Ralf</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Exploratory model comparison</subfield><subfield code="b">interactive model ensemble selection and management</subfield><subfield code="c">Ralf Seger</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin</subfield><subfield code="b">Logos-Verl.</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">V, 189 S.</subfield><subfield code="b">Ill., graph. Darst., Kt.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Augsburger Schriften zur Mathematik, Physik und Informatik</subfield><subfield code="v">17</subfield></datafield><datafield tag="502" ind1=" " ind2=" "><subfield code="a">Zugl.: Augsburg, Univ., Diss., 2011</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Modellwahl</subfield><subfield code="0">(DE-588)4304786-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Computerunterstütztes Verfahren</subfield><subfield code="0">(DE-588)4139030-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Statistisches Modell</subfield><subfield code="0">(DE-588)4121722-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Statistisches Modell</subfield><subfield code="0">(DE-588)4121722-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Modellwahl</subfield><subfield code="0">(DE-588)4304786-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Computerunterstütztes Verfahren</subfield><subfield code="0">(DE-588)4139030-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Augsburger Schriften zur Mathematik, Physik und Informatik</subfield><subfield code="v">17</subfield><subfield code="w">(DE-604)BV017601953</subfield><subfield code="9">17</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">X:MVB</subfield><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=3866553&prov=M&dok_var=1&dok_ext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024587812&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-024587812</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV039740111 |
illustrated | Illustrated |
indexdate | 2024-12-06T09:03:58Z |
institution | BVB |
isbn | 9783832529277 3832529276 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-024587812 |
oclc_num | 772870598 |
open_access_boolean | |
owner | DE-384 DE-188 |
owner_facet | DE-384 DE-188 |
physical | V, 189 S. Ill., graph. Darst., Kt. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Logos-Verl. |
record_format | marc |
series | Augsburger Schriften zur Mathematik, Physik und Informatik |
series2 | Augsburger Schriften zur Mathematik, Physik und Informatik |
spelling | Seger, Ralf Verfasser aut Exploratory model comparison interactive model ensemble selection and management Ralf Seger Berlin Logos-Verl. 2011 V, 189 S. Ill., graph. Darst., Kt. txt rdacontent n rdamedia nc rdacarrier Augsburger Schriften zur Mathematik, Physik und Informatik 17 Zugl.: Augsburg, Univ., Diss., 2011 Modellwahl (DE-588)4304786-5 gnd rswk-swf Computerunterstütztes Verfahren (DE-588)4139030-1 gnd rswk-swf Statistisches Modell (DE-588)4121722-6 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Statistisches Modell (DE-588)4121722-6 s Modellwahl (DE-588)4304786-5 s Computerunterstütztes Verfahren (DE-588)4139030-1 s DE-604 Augsburger Schriften zur Mathematik, Physik und Informatik 17 (DE-604)BV017601953 17 X:MVB text/html http://deposit.dnb.de/cgi-bin/dokserv?id=3866553&prov=M&dok_var=1&dok_ext=htm Inhaltstext DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024587812&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Seger, Ralf Exploratory model comparison interactive model ensemble selection and management Augsburger Schriften zur Mathematik, Physik und Informatik Modellwahl (DE-588)4304786-5 gnd Computerunterstütztes Verfahren (DE-588)4139030-1 gnd Statistisches Modell (DE-588)4121722-6 gnd |
subject_GND | (DE-588)4304786-5 (DE-588)4139030-1 (DE-588)4121722-6 (DE-588)4113937-9 |
title | Exploratory model comparison interactive model ensemble selection and management |
title_auth | Exploratory model comparison interactive model ensemble selection and management |
title_exact_search | Exploratory model comparison interactive model ensemble selection and management |
title_full | Exploratory model comparison interactive model ensemble selection and management Ralf Seger |
title_fullStr | Exploratory model comparison interactive model ensemble selection and management Ralf Seger |
title_full_unstemmed | Exploratory model comparison interactive model ensemble selection and management Ralf Seger |
title_short | Exploratory model comparison |
title_sort | exploratory model comparison interactive model ensemble selection and management |
title_sub | interactive model ensemble selection and management |
topic | Modellwahl (DE-588)4304786-5 gnd Computerunterstütztes Verfahren (DE-588)4139030-1 gnd Statistisches Modell (DE-588)4121722-6 gnd |
topic_facet | Modellwahl Computerunterstütztes Verfahren Statistisches Modell Hochschulschrift |
url | http://deposit.dnb.de/cgi-bin/dokserv?id=3866553&prov=M&dok_var=1&dok_ext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024587812&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV017601953 |
work_keys_str_mv | AT segerralf exploratorymodelcomparisoninteractivemodelensembleselectionandmanagement |