Statistical learning in genetics: an introduction using R
This book provides an introduction to computer-based methods for the analysis of genomic data. Breakthroughs in molecular and computational biology have contributed to the emergence of vast data sets, where millions of genetic markers for each individual are coupled with medical records, generating...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cham, Switzerland
Springer
[2023]
|
Schriftenreihe: | Statistics for biology and health
|
Schlagworte: | |
Zusammenfassung: | This book provides an introduction to computer-based methods for the analysis of genomic data. Breakthroughs in molecular and computational biology have contributed to the emergence of vast data sets, where millions of genetic markers for each individual are coupled with medical records, generating an unparalleled resource for linking human genetic variation to human biology and disease. Similar developments have taken place in animal and plant breeding, where genetic marker information is combined with production traits. An important task for the statistical geneticist is to adapt, construct and implement models that can extract information from these large-scale data. An initial step is to understand the methodology that underlies the probability models and to learn the modern computer-intensive methods required for fitting these models. The objective of this book, suitable for readers who wish to develop analytic skills to perform genomic research, is to provide guidance to take this first step.This book is addressed to numerate biologists who typically lack the formal mathematical background of the professional statistician. For this reason, considerably more detail in explanations and derivations is offered. It is written in a concise style and examples are used profusely. A large proportion of the examples involve programming with the open-source package R. The R code needed to solve the exercises is provided. The MarkDown interface allows the students to implement the code on their own computer, contributing to a better understanding of the underlying theory.Part I presents methods of inference based on likelihood and Bayesian methods, including computational techniques for fitting likelihood and Bayesian models. Part II discusses prediction for continuous and binary data using both frequentist and Bayesian approaches. Some of the models used for prediction are also used for gene discovery. The challenge is to find promising genes without incurring a large proportion of false positive results. Therefore, Part II includes a detour on False Discovery Rate assuming frequentist and Bayesian perspectives. The last chapter of Part II provides an overview of a selected number of non-parametric methods. Part III consists of exercises and their solutions.Daniel Sorensen holds PhD and DSc degrees from the University of Edinburgh and is an elected Fellow of the American Statistical Association. He was professor of Statistical Genetics at Aarhus University where, at present, he is professor emeritus |
Beschreibung: | 1. Overview. - Part I Fitting Likelihood and Bayesian Models. - 2. Likelihood. - 3. Computing the Likelihood. - 4. Bayesian Methods. - 5. McMC in Practice. - Part II Prediction. - 6. Fundamentals of Prediction. - 7. Shrinkage Methods. - 8. Digression on Multiple Testing: False Discovery Rates. - 9. Binary Data. - 10. Bayesian Prediction and Model Checking. - 11. Nonparametric Methods: A Selected Overview. - Part III Exercises and Solutions. - 12. Exercises. - 13. Solution to Exercises |
Beschreibung: | xvi, 693 Seiten Illustrationen, Diagramme |
ISBN: | 9783031358500 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV049390707 | ||
003 | DE-604 | ||
005 | 20240118 | ||
007 | t | ||
008 | 231031s2023 a||| |||| 00||| eng d | ||
020 | |a 9783031358500 |c hbk |9 978-3-031-35850-0 | ||
024 | 3 | |a 9783031358500 | |
035 | |a (OCoLC)1416408947 | ||
035 | |a (DE-599)BVBBV049390707 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-29T | ||
100 | 1 | |a Sörensen, Daniel |d 1976- |e Verfasser |0 (DE-588)132202263 |4 aut | |
245 | 1 | 0 | |a Statistical learning in genetics |b an introduction using R |c Daniel Sorensen |
264 | 1 | |a Cham, Switzerland |b Springer |c [2023] | |
300 | |a xvi, 693 Seiten |b Illustrationen, Diagramme | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Statistics for biology and health | |
500 | |a 1. Overview. - Part I Fitting Likelihood and Bayesian Models. - 2. Likelihood. - 3. Computing the Likelihood. - 4. Bayesian Methods. - 5. McMC in Practice. - Part II Prediction. - 6. Fundamentals of Prediction. - 7. Shrinkage Methods. - 8. Digression on Multiple Testing: False Discovery Rates. - 9. Binary Data. - 10. Bayesian Prediction and Model Checking. - 11. Nonparametric Methods: A Selected Overview. - Part III Exercises and Solutions. - 12. Exercises. - 13. Solution to Exercises | ||
520 | |a This book provides an introduction to computer-based methods for the analysis of genomic data. Breakthroughs in molecular and computational biology have contributed to the emergence of vast data sets, where millions of genetic markers for each individual are coupled with medical records, generating an unparalleled resource for linking human genetic variation to human biology and disease. Similar developments have taken place in animal and plant breeding, where genetic marker information is combined with production traits. An important task for the statistical geneticist is to adapt, construct and implement models that can extract information from these large-scale data. An initial step is to understand the methodology that underlies the probability models and to learn the modern computer-intensive methods required for fitting these models. | ||
520 | |a The objective of this book, suitable for readers who wish to develop analytic skills to perform genomic research, is to provide guidance to take this first step.This book is addressed to numerate biologists who typically lack the formal mathematical background of the professional statistician. For this reason, considerably more detail in explanations and derivations is offered. It is written in a concise style and examples are used profusely. A large proportion of the examples involve programming with the open-source package R. The R code needed to solve the exercises is provided. The MarkDown interface allows the students to implement the code on their own computer, contributing to a better understanding of the underlying theory.Part I presents methods of inference based on likelihood and Bayesian methods, including computational techniques for fitting likelihood and Bayesian models. | ||
520 | |a Part II discusses prediction for continuous and binary data using both frequentist and Bayesian approaches. Some of the models used for prediction are also used for gene discovery. The challenge is to find promising genes without incurring a large proportion of false positive results. Therefore, Part II includes a detour on False Discovery Rate assuming frequentist and Bayesian perspectives. The last chapter of Part II provides an overview of a selected number of non-parametric methods. Part III consists of exercises and their solutions.Daniel Sorensen holds PhD and DSc degrees from the University of Edinburgh and is an elected Fellow of the American Statistical Association. He was professor of Statistical Genetics at Aarhus University where, at present, he is professor emeritus | ||
650 | 4 | |a Quantitative research | |
650 | 4 | |a Biometry | |
650 | 4 | |a Genetics | |
650 | 4 | |a Statistics | |
650 | 0 | 7 | |a Statistik |0 (DE-588)4056995-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a R |g Programm |0 (DE-588)4705956-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Genetik |0 (DE-588)4071711-2 |2 gnd |9 rswk-swf |
653 | |a Hardcover, Softcover / Mathematik/Wahrscheinlichkeitstheorie, Stochastik, Mathematische Statistik | ||
689 | 0 | 0 | |a Genetik |0 (DE-588)4071711-2 |D s |
689 | 0 | 1 | |a Statistik |0 (DE-588)4056995-0 |D s |
689 | 0 | 2 | |a R |g Programm |0 (DE-588)4705956-4 |D s |
689 | 0 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-3-031-35851-7 |
999 | |a oai:aleph.bib-bvb.de:BVB01-034718133 |
Datensatz im Suchindex
_version_ | 1804186100033388544 |
---|---|
adam_txt | |
any_adam_object | |
any_adam_object_boolean | |
author | Sörensen, Daniel 1976- |
author_GND | (DE-588)132202263 |
author_facet | Sörensen, Daniel 1976- |
author_role | aut |
author_sort | Sörensen, Daniel 1976- |
author_variant | d s ds |
building | Verbundindex |
bvnumber | BV049390707 |
ctrlnum | (OCoLC)1416408947 (DE-599)BVBBV049390707 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>04614nam a2200481 c 4500</leader><controlfield tag="001">BV049390707</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20240118 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">231031s2023 a||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783031358500</subfield><subfield code="c">hbk</subfield><subfield code="9">978-3-031-35850-0</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783031358500</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1416408947</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV049390707</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Sörensen, Daniel</subfield><subfield code="d">1976-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)132202263</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Statistical learning in genetics</subfield><subfield code="b">an introduction using R</subfield><subfield code="c">Daniel Sorensen</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cham, Switzerland</subfield><subfield code="b">Springer</subfield><subfield code="c">[2023]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xvi, 693 Seiten</subfield><subfield code="b">Illustrationen, Diagramme</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Statistics for biology and health</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">1. Overview. - Part I Fitting Likelihood and Bayesian Models. - 2. Likelihood. - 3. Computing the Likelihood. - 4. Bayesian Methods. - 5. McMC in Practice. - Part II Prediction. - 6. Fundamentals of Prediction. - 7. Shrinkage Methods. - 8. Digression on Multiple Testing: False Discovery Rates. - 9. Binary Data. - 10. Bayesian Prediction and Model Checking. - 11. Nonparametric Methods: A Selected Overview. - Part III Exercises and Solutions. - 12. Exercises. - 13. Solution to Exercises</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">This book provides an introduction to computer-based methods for the analysis of genomic data. Breakthroughs in molecular and computational biology have contributed to the emergence of vast data sets, where millions of genetic markers for each individual are coupled with medical records, generating an unparalleled resource for linking human genetic variation to human biology and disease. Similar developments have taken place in animal and plant breeding, where genetic marker information is combined with production traits. An important task for the statistical geneticist is to adapt, construct and implement models that can extract information from these large-scale data. An initial step is to understand the methodology that underlies the probability models and to learn the modern computer-intensive methods required for fitting these models. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">The objective of this book, suitable for readers who wish to develop analytic skills to perform genomic research, is to provide guidance to take this first step.This book is addressed to numerate biologists who typically lack the formal mathematical background of the professional statistician. For this reason, considerably more detail in explanations and derivations is offered. It is written in a concise style and examples are used profusely. A large proportion of the examples involve programming with the open-source package R. The R code needed to solve the exercises is provided. The MarkDown interface allows the students to implement the code on their own computer, contributing to a better understanding of the underlying theory.Part I presents methods of inference based on likelihood and Bayesian methods, including computational techniques for fitting likelihood and Bayesian models. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Part II discusses prediction for continuous and binary data using both frequentist and Bayesian approaches. Some of the models used for prediction are also used for gene discovery. The challenge is to find promising genes without incurring a large proportion of false positive results. Therefore, Part II includes a detour on False Discovery Rate assuming frequentist and Bayesian perspectives. The last chapter of Part II provides an overview of a selected number of non-parametric methods. Part III consists of exercises and their solutions.Daniel Sorensen holds PhD and DSc degrees from the University of Edinburgh and is an elected Fellow of the American Statistical Association. He was professor of Statistical Genetics at Aarhus University where, at present, he is professor emeritus</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Quantitative research</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Biometry</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Genetics</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Statistics </subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Statistik</subfield><subfield code="0">(DE-588)4056995-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">R</subfield><subfield code="g">Programm</subfield><subfield code="0">(DE-588)4705956-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Genetik</subfield><subfield code="0">(DE-588)4071711-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Hardcover, Softcover / Mathematik/Wahrscheinlichkeitstheorie, Stochastik, Mathematische Statistik</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Genetik</subfield><subfield code="0">(DE-588)4071711-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Statistik</subfield><subfield code="0">(DE-588)4056995-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">R</subfield><subfield code="g">Programm</subfield><subfield code="0">(DE-588)4705956-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-3-031-35851-7</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-034718133</subfield></datafield></record></collection> |
id | DE-604.BV049390707 |
illustrated | Illustrated |
index_date | 2024-07-03T23:01:12Z |
indexdate | 2024-07-10T10:05:45Z |
institution | BVB |
isbn | 9783031358500 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-034718133 |
oclc_num | 1416408947 |
open_access_boolean | |
owner | DE-29T |
owner_facet | DE-29T |
physical | xvi, 693 Seiten Illustrationen, Diagramme |
publishDate | 2023 |
publishDateSearch | 2023 |
publishDateSort | 2023 |
publisher | Springer |
record_format | marc |
series2 | Statistics for biology and health |
spelling | Sörensen, Daniel 1976- Verfasser (DE-588)132202263 aut Statistical learning in genetics an introduction using R Daniel Sorensen Cham, Switzerland Springer [2023] xvi, 693 Seiten Illustrationen, Diagramme txt rdacontent n rdamedia nc rdacarrier Statistics for biology and health 1. Overview. - Part I Fitting Likelihood and Bayesian Models. - 2. Likelihood. - 3. Computing the Likelihood. - 4. Bayesian Methods. - 5. McMC in Practice. - Part II Prediction. - 6. Fundamentals of Prediction. - 7. Shrinkage Methods. - 8. Digression on Multiple Testing: False Discovery Rates. - 9. Binary Data. - 10. Bayesian Prediction and Model Checking. - 11. Nonparametric Methods: A Selected Overview. - Part III Exercises and Solutions. - 12. Exercises. - 13. Solution to Exercises This book provides an introduction to computer-based methods for the analysis of genomic data. Breakthroughs in molecular and computational biology have contributed to the emergence of vast data sets, where millions of genetic markers for each individual are coupled with medical records, generating an unparalleled resource for linking human genetic variation to human biology and disease. Similar developments have taken place in animal and plant breeding, where genetic marker information is combined with production traits. An important task for the statistical geneticist is to adapt, construct and implement models that can extract information from these large-scale data. An initial step is to understand the methodology that underlies the probability models and to learn the modern computer-intensive methods required for fitting these models. The objective of this book, suitable for readers who wish to develop analytic skills to perform genomic research, is to provide guidance to take this first step.This book is addressed to numerate biologists who typically lack the formal mathematical background of the professional statistician. For this reason, considerably more detail in explanations and derivations is offered. It is written in a concise style and examples are used profusely. A large proportion of the examples involve programming with the open-source package R. The R code needed to solve the exercises is provided. The MarkDown interface allows the students to implement the code on their own computer, contributing to a better understanding of the underlying theory.Part I presents methods of inference based on likelihood and Bayesian methods, including computational techniques for fitting likelihood and Bayesian models. Part II discusses prediction for continuous and binary data using both frequentist and Bayesian approaches. Some of the models used for prediction are also used for gene discovery. The challenge is to find promising genes without incurring a large proportion of false positive results. Therefore, Part II includes a detour on False Discovery Rate assuming frequentist and Bayesian perspectives. The last chapter of Part II provides an overview of a selected number of non-parametric methods. Part III consists of exercises and their solutions.Daniel Sorensen holds PhD and DSc degrees from the University of Edinburgh and is an elected Fellow of the American Statistical Association. He was professor of Statistical Genetics at Aarhus University where, at present, he is professor emeritus Quantitative research Biometry Genetics Statistics Statistik (DE-588)4056995-0 gnd rswk-swf R Programm (DE-588)4705956-4 gnd rswk-swf Genetik (DE-588)4071711-2 gnd rswk-swf Hardcover, Softcover / Mathematik/Wahrscheinlichkeitstheorie, Stochastik, Mathematische Statistik Genetik (DE-588)4071711-2 s Statistik (DE-588)4056995-0 s R Programm (DE-588)4705956-4 s DE-604 Erscheint auch als Online-Ausgabe 978-3-031-35851-7 |
spellingShingle | Sörensen, Daniel 1976- Statistical learning in genetics an introduction using R Quantitative research Biometry Genetics Statistics Statistik (DE-588)4056995-0 gnd R Programm (DE-588)4705956-4 gnd Genetik (DE-588)4071711-2 gnd |
subject_GND | (DE-588)4056995-0 (DE-588)4705956-4 (DE-588)4071711-2 |
title | Statistical learning in genetics an introduction using R |
title_auth | Statistical learning in genetics an introduction using R |
title_exact_search | Statistical learning in genetics an introduction using R |
title_exact_search_txtP | Statistical learning in genetics an introduction using R |
title_full | Statistical learning in genetics an introduction using R Daniel Sorensen |
title_fullStr | Statistical learning in genetics an introduction using R Daniel Sorensen |
title_full_unstemmed | Statistical learning in genetics an introduction using R Daniel Sorensen |
title_short | Statistical learning in genetics |
title_sort | statistical learning in genetics an introduction using r |
title_sub | an introduction using R |
topic | Quantitative research Biometry Genetics Statistics Statistik (DE-588)4056995-0 gnd R Programm (DE-588)4705956-4 gnd Genetik (DE-588)4071711-2 gnd |
topic_facet | Quantitative research Biometry Genetics Statistics Statistik R Programm Genetik |
work_keys_str_mv | AT sorensendaniel statisticallearningingeneticsanintroductionusingr |