Population genomics with R:
Population Genomics With R presents a multidisciplinary approach to the analysis of population genomics. The methods treated cover a large number of topics from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and int...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Boca Raton
CRC Press, Taylor & Francis Group
[2020]
|
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | Population Genomics With R presents a multidisciplinary approach to the analysis of population genomics. The methods treated cover a large number of topics from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and integrated to provide a coherent software environment with a wide range of computational, statistical, and graphical tools. Small examples are used to illustrate the basics and published data are used as case studies. Readers are expected to have a basic knowledge of biology, genetics, and statistical inference methods. Graduate students and post-doctorate researchers will find resources to analyze their population genetic and genomic data as well as help them design new studies. The first four chapters review the basics of population genomics, data acquisition, and the use of R to store and manipulate genomic data. Chapter 5 treats the exploration of genomic data, an important issue when analysing large data sets. The other five chapters cover linkage disequilibrium, population genomic structure, geographical structure, past demographic events, and natural selection. These chapters include supervised and unsupervised methods, admixture analysis, an in-depth treatment of multivariate methods, and advice on how to handle GIS data. The analysis of natural selection, a traditional issue in evolutionary biology, has known a revival with modern population genomic data. All chapters include exercises. Supplemental materials are available on-line (http://ape-package.ird.fr/PGR.html) |
Beschreibung: | 1. Introduction Heredity, Genetics, and Genomics Principles of Population Genomics Units Genome Structures Mutations Drift and Selection R Packages and Conventions Required Knowledge and Other Readings 2. Data Acquisition Samples and Sampling Designs How Much DNA in a Sample? Degraded Samples Sampling Designs Low-Throughput Technologies Genotypes From Phenotypes DNA Cleavage Methods Repeat Length Polymorphism Sanger and Shotgun Sequencing DNA Methylation and Bisulfite Sequencing High-Throughput Technologies DNA Microarrays High-Throughput Sequencing Restriction Site Associated DNA RNA Sequencing Exome Sequencing Sequencing of Pooled Individuals Designing a Study With HTS The Future of DNA Sequencing File Formats Data Files Archiving and Compression Bioinformatics and Genomics Processing Sanger Sequencing Data With sangerseqR Read Mapping With Rsubread Managing Read Alignments With Rsamtools Simulation of High-Throughput Sequencing Data Exercises 3. - Genomic Data in R What is an R Data Object? Data Classes for Genomic Data The Class "loci" (pegas) The Class "genind" (adegenet) The Classes "SNPbin" and "genlight" (adegenet) The Class "SnpMatrix" (snpStats) The Class "DNAbin" (ape) The Classes "XString" and "XStringSet" (Biostrings) The Package SNPRelate Data Input and Output Reading Text Files Reading Spreadsheet Files Reading VCF Files Reading PED and BED Files Reading Sequence Files Reading Annotation Files Writing Files Internet Databases Managing Files and Projects Exercises 4. Data Manipulation Basic Data Manipulation in R Subsetting, Replacement, and Deletion Commonly Used Functions Recycling and Coercion Logical Vectors Memory Management Conversions Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 5. - Data Exploration and Summaries Genotype and Allele Frequencies Allelic Richness Missing Data Haplotype and Nucleotide Diversity The Class "haplotype" Haplotype and Nucleotide Diversity From DNA Sequences Genetic and Genomic Distances Theoretical Background Hamming Distance Distances From DNA Sequences Distances From Allele Sharing Distances From Microsatellites Summary by Groups Sliding Windows DNA Sequences Summaries With Genomic Positions Package SNPRelate Multivariate Methods Matrix Decomposition Eigendecomposition Singular Value Decomposition Power Method and Random Matrices Principal Component Analysis adegenet SNPRelate flashpcaR Multidimensional Scaling Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 6. - Linkage Disequilibrium and Haplotype Structure Why Linkage Disequilibrium is Important? Linkage Disequilibrium: Two Loci Phased Genotypes Theoretical Background Implementation in pegas Unphased Genotypes More Than Two Loci Haplotypes From Unphased Genotypes The Expectation-Maximization Algorithm Implementation in haplostats Locus-Specific Imputation Maps of Linkage Disequilibrium Phased Genotypes With pegas SNPRelate snpStats Case Studies Complete Genomes of the Fruit Fly Human Genomes Jaguar Microsatellites Exercises 7. - Population Genetic Structure Hardy-Weinberg Equilibrium F-Statistics Theoretical Background Implementations in pegas and in mmod Implementations in snpStats and in SNPRelate Trees and Networks Minimum Spanning Trees and Networks Statistical Parsimony Median Networks Phylogenetic Trees Multivariate Methods Principles of Discriminant Analysis Discriminant Analysis of Principal Components Clustering Maximum Likelihood Methods Bayesian Clustering Admixture Likelihood Method Principal Component Analysis of Coancestry A Second Look at F-Statistics Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Jaguar Microsatellites Exercises 8. - Geographical Structure Geographical Data in R Packages and Classes Calculating Geographical Distances A Third Look at F-Statistics Hierarchical Components of Genetic Diversity Analysis of Molecular Variance Moran I and Spatial Autocorrelation Spatial Principal Component Analysis Finding Boundaries Between Populations Spatial Ancestry (tessr) Bayesian Methods (Geneland) Case Studies Complete Genomes of the Fruit Fly Human Genomes Exercises 9. - Past Demographic Events The Coalescent The Standard Coalescent The Sequential Markovian Coalescent Simulation of Coalescent Data Estimation of _ Heterozygosity Number of Alleles Segregating Sites Microsatellites Trees Coalescent-Based Inference Maximum Likelihood Methods Analysis of Markov Chain Monte Carlo Outputs Skyline Plots Bayesian Methods Heterochronous Samples Site Frequency Spectrum Methods The Stairway Method CubSFS Popsicle Whole-Genome Methods (psmcr) Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Bacterial Whole Genome Sequences Exercises 10. - Natural Selection Testing Neutrality Simple Tests Selection in Protein-Coding Sequences Selection Scans A Fourth Look at F-Statistics Association Studies (LEA) Principal Component Analysis (pcadapt) Scans for Selection With Extended Haplotypes FST Outliers Time-Series of Allele Frequencies Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Exercises A Installing R Packages B Compressing Large Sequence Files C Sampling of Alleles in a Population |
Beschreibung: | 1 online resource (xvi, 378 pages) illustrations |
ISBN: | 9780429466700 0429466706 9780429882432 0429882432 9780429882425 0429882424 9780429882418 0429882416 |
Internformat
MARC
LEADER | 00000nmm a2200000zc 4500 | ||
---|---|---|---|
001 | BV046806498 | ||
003 | DE-604 | ||
005 | 20220322 | ||
007 | cr|uuu---uuuuu | ||
008 | 200714s2020 |||| o||u| ||||||eng d | ||
020 | |a 9780429466700 |9 978-0-429-46670-0 | ||
020 | |a 0429466706 |9 0-429-46670-6 | ||
020 | |a 9780429882432 |9 978-0-429-88243-2 | ||
020 | |a 0429882432 |9 0-429-88243-2 | ||
020 | |a 9780429882425 |9 978-0-429-88242-5 | ||
020 | |a 0429882424 |9 0-429-88242-4 | ||
020 | |a 9780429882418 |9 978-0-429-88241-8 | ||
020 | |a 0429882416 |9 0-429-88241-6 | ||
035 | |a (ZDB-7-TFC)9780429466700 | ||
035 | |a (DE-599)BVBBV046806498 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
082 | 0 | |a 576.5/8 | |
100 | 1 | |a Paradis, Emmanuel |4 aut | |
245 | 1 | 0 | |a Population genomics with R |c by Emmanuel Paradis |
264 | 1 | |a Boca Raton |b CRC Press, Taylor & Francis Group |c [2020] | |
300 | |a 1 online resource (xvi, 378 pages) |b illustrations | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
500 | |a 1. Introduction Heredity, Genetics, and Genomics Principles of Population Genomics Units Genome Structures Mutations Drift and Selection R Packages and Conventions Required Knowledge and Other Readings 2. Data Acquisition Samples and Sampling Designs How Much DNA in a Sample? Degraded Samples Sampling Designs Low-Throughput Technologies Genotypes From Phenotypes DNA Cleavage Methods Repeat Length Polymorphism Sanger and Shotgun Sequencing DNA Methylation and Bisulfite Sequencing High-Throughput Technologies DNA Microarrays High-Throughput Sequencing Restriction Site Associated DNA RNA Sequencing Exome Sequencing Sequencing of Pooled Individuals Designing a Study With HTS The Future of DNA Sequencing File Formats Data Files Archiving and Compression Bioinformatics and Genomics Processing Sanger Sequencing Data With sangerseqR Read Mapping With Rsubread Managing Read Alignments With Rsamtools Simulation of High-Throughput Sequencing Data Exercises 3. | ||
500 | |a - Genomic Data in R What is an R Data Object? Data Classes for Genomic Data The Class "loci" (pegas) The Class "genind" (adegenet) The Classes "SNPbin" and "genlight" (adegenet) The Class "SnpMatrix" (snpStats) The Class "DNAbin" (ape) The Classes "XString" and "XStringSet" (Biostrings) The Package SNPRelate Data Input and Output Reading Text Files Reading Spreadsheet Files Reading VCF Files Reading PED and BED Files Reading Sequence Files Reading Annotation Files Writing Files Internet Databases Managing Files and Projects Exercises 4. Data Manipulation Basic Data Manipulation in R Subsetting, Replacement, and Deletion Commonly Used Functions Recycling and Coercion Logical Vectors Memory Management Conversions Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 5. | ||
500 | |a - Data Exploration and Summaries Genotype and Allele Frequencies Allelic Richness Missing Data Haplotype and Nucleotide Diversity The Class "haplotype" Haplotype and Nucleotide Diversity From DNA Sequences Genetic and Genomic Distances Theoretical Background Hamming Distance Distances From DNA Sequences Distances From Allele Sharing Distances From Microsatellites Summary by Groups Sliding Windows DNA Sequences Summaries With Genomic Positions Package SNPRelate Multivariate Methods Matrix Decomposition Eigendecomposition Singular Value Decomposition Power Method and Random Matrices Principal Component Analysis adegenet SNPRelate flashpcaR Multidimensional Scaling Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 6. | ||
500 | |a - Linkage Disequilibrium and Haplotype Structure Why Linkage Disequilibrium is Important? Linkage Disequilibrium: Two Loci Phased Genotypes Theoretical Background Implementation in pegas Unphased Genotypes More Than Two Loci Haplotypes From Unphased Genotypes The Expectation-Maximization Algorithm Implementation in haplostats Locus-Specific Imputation Maps of Linkage Disequilibrium Phased Genotypes With pegas SNPRelate snpStats Case Studies Complete Genomes of the Fruit Fly Human Genomes Jaguar Microsatellites Exercises 7. | ||
500 | |a - Population Genetic Structure Hardy-Weinberg Equilibrium F-Statistics Theoretical Background Implementations in pegas and in mmod Implementations in snpStats and in SNPRelate Trees and Networks Minimum Spanning Trees and Networks Statistical Parsimony Median Networks Phylogenetic Trees Multivariate Methods Principles of Discriminant Analysis Discriminant Analysis of Principal Components Clustering Maximum Likelihood Methods Bayesian Clustering Admixture Likelihood Method Principal Component Analysis of Coancestry A Second Look at F-Statistics Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Jaguar Microsatellites Exercises 8. | ||
500 | |a - Geographical Structure Geographical Data in R Packages and Classes Calculating Geographical Distances A Third Look at F-Statistics Hierarchical Components of Genetic Diversity Analysis of Molecular Variance Moran I and Spatial Autocorrelation Spatial Principal Component Analysis Finding Boundaries Between Populations Spatial Ancestry (tessr) Bayesian Methods (Geneland) Case Studies Complete Genomes of the Fruit Fly Human Genomes Exercises 9. | ||
500 | |a - Past Demographic Events The Coalescent The Standard Coalescent The Sequential Markovian Coalescent Simulation of Coalescent Data Estimation of _ Heterozygosity Number of Alleles Segregating Sites Microsatellites Trees Coalescent-Based Inference Maximum Likelihood Methods Analysis of Markov Chain Monte Carlo Outputs Skyline Plots Bayesian Methods Heterochronous Samples Site Frequency Spectrum Methods The Stairway Method CubSFS Popsicle Whole-Genome Methods (psmcr) Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Bacterial Whole Genome Sequences Exercises 10. | ||
500 | |a - Natural Selection Testing Neutrality Simple Tests Selection in Protein-Coding Sequences Selection Scans A Fourth Look at F-Statistics Association Studies (LEA) Principal Component Analysis (pcadapt) Scans for Selection With Extended Haplotypes FST Outliers Time-Series of Allele Frequencies Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Exercises A Installing R Packages B Compressing Large Sequence Files C Sampling of Alleles in a Population | ||
520 | |a Population Genomics With R presents a multidisciplinary approach to the analysis of population genomics. The methods treated cover a large number of topics from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and integrated to provide a coherent software environment with a wide range of computational, statistical, and graphical tools. Small examples are used to illustrate the basics and published data are used as case studies. Readers are expected to have a basic knowledge of biology, genetics, and statistical inference methods. Graduate students and post-doctorate researchers will find resources to analyze their population genetic and genomic data as well as help them design new studies. The first four chapters review the basics of population genomics, data acquisition, and the use of R to store and manipulate genomic data. Chapter 5 treats the exploration of genomic data, an important issue when analysing large data sets. The other five chapters cover linkage disequilibrium, population genomic structure, geographical structure, past demographic events, and natural selection. These chapters include supervised and unsupervised methods, admixture analysis, an in-depth treatment of multivariate methods, and advice on how to handle GIS data. The analysis of natural selection, a traditional issue in evolutionary biology, has known a revival with modern population genomic data. All chapters include exercises. Supplemental materials are available on-line (http://ape-package.ird.fr/PGR.html) | ||
650 | 4 | |a MATHEMATICS / Probability & Statistics / General / bisacsh | |
650 | 4 | |a SCIENCE / Biotechnology / bisacsh | |
650 | 4 | |a SCIENCE / Life Sciences / Genetics & Genomics / bisacsh | |
650 | 4 | |a Population genetics / Mathematical models | |
650 | 4 | |a Genomics / Mathematical models | |
650 | 4 | |a R (Computer program language) | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe |z 9781138608184 |
856 | 4 | 0 | |u https://www.taylorfrancis.com/books/9780429466700 |x Verlag |z URL des Erstveröffentlichers |3 Volltext |
912 | |a ZDB-7-TFC | ||
999 | |a oai:aleph.bib-bvb.de:BVB01-032215137 |
Datensatz im Suchindex
_version_ | 1804181608362672128 |
---|---|
adam_txt | |
any_adam_object | |
any_adam_object_boolean | |
author | Paradis, Emmanuel |
author_facet | Paradis, Emmanuel |
author_role | aut |
author_sort | Paradis, Emmanuel |
author_variant | e p ep |
building | Verbundindex |
bvnumber | BV046806498 |
collection | ZDB-7-TFC |
ctrlnum | (ZDB-7-TFC)9780429466700 (DE-599)BVBBV046806498 |
dewey-full | 576.5/8 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 576 - Genetics and evolution |
dewey-raw | 576.5/8 |
dewey-search | 576.5/8 |
dewey-sort | 3576.5 18 |
dewey-tens | 570 - Biology |
discipline | Biologie |
discipline_str_mv | Biologie |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>09091nmm a2200553zc 4500</leader><controlfield tag="001">BV046806498</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220322 </controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">200714s2020 |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780429466700</subfield><subfield code="9">978-0-429-46670-0</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0429466706</subfield><subfield code="9">0-429-46670-6</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780429882432</subfield><subfield code="9">978-0-429-88243-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0429882432</subfield><subfield code="9">0-429-88243-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780429882425</subfield><subfield code="9">978-0-429-88242-5</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0429882424</subfield><subfield code="9">0-429-88242-4</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780429882418</subfield><subfield code="9">978-0-429-88241-8</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0429882416</subfield><subfield code="9">0-429-88241-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-7-TFC)9780429466700</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV046806498</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">576.5/8</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Paradis, Emmanuel</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Population genomics with R</subfield><subfield code="c">by Emmanuel Paradis</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Boca Raton</subfield><subfield code="b">CRC Press, Taylor & Francis Group</subfield><subfield code="c">[2020]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (xvi, 378 pages)</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">1. Introduction Heredity, Genetics, and Genomics Principles of Population Genomics Units Genome Structures Mutations Drift and Selection R Packages and Conventions Required Knowledge and Other Readings 2. Data Acquisition Samples and Sampling Designs How Much DNA in a Sample? Degraded Samples Sampling Designs Low-Throughput Technologies Genotypes From Phenotypes DNA Cleavage Methods Repeat Length Polymorphism Sanger and Shotgun Sequencing DNA Methylation and Bisulfite Sequencing High-Throughput Technologies DNA Microarrays High-Throughput Sequencing Restriction Site Associated DNA RNA Sequencing Exome Sequencing Sequencing of Pooled Individuals Designing a Study With HTS The Future of DNA Sequencing File Formats Data Files Archiving and Compression Bioinformatics and Genomics Processing Sanger Sequencing Data With sangerseqR Read Mapping With Rsubread Managing Read Alignments With Rsamtools Simulation of High-Throughput Sequencing Data Exercises 3. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Genomic Data in R What is an R Data Object? Data Classes for Genomic Data The Class "loci" (pegas) The Class "genind" (adegenet) The Classes "SNPbin" and "genlight" (adegenet) The Class "SnpMatrix" (snpStats) The Class "DNAbin" (ape) The Classes "XString" and "XStringSet" (Biostrings) The Package SNPRelate Data Input and Output Reading Text Files Reading Spreadsheet Files Reading VCF Files Reading PED and BED Files Reading Sequence Files Reading Annotation Files Writing Files Internet Databases Managing Files and Projects Exercises 4. Data Manipulation Basic Data Manipulation in R Subsetting, Replacement, and Deletion Commonly Used Functions Recycling and Coercion Logical Vectors Memory Management Conversions Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 5. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Data Exploration and Summaries Genotype and Allele Frequencies Allelic Richness Missing Data Haplotype and Nucleotide Diversity The Class "haplotype" Haplotype and Nucleotide Diversity From DNA Sequences Genetic and Genomic Distances Theoretical Background Hamming Distance Distances From DNA Sequences Distances From Allele Sharing Distances From Microsatellites Summary by Groups Sliding Windows DNA Sequences Summaries With Genomic Positions Package SNPRelate Multivariate Methods Matrix Decomposition Eigendecomposition Singular Value Decomposition Power Method and Random Matrices Principal Component Analysis adegenet SNPRelate flashpcaR Multidimensional Scaling Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 6. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Linkage Disequilibrium and Haplotype Structure Why Linkage Disequilibrium is Important? Linkage Disequilibrium: Two Loci Phased Genotypes Theoretical Background Implementation in pegas Unphased Genotypes More Than Two Loci Haplotypes From Unphased Genotypes The Expectation-Maximization Algorithm Implementation in haplostats Locus-Specific Imputation Maps of Linkage Disequilibrium Phased Genotypes With pegas SNPRelate snpStats Case Studies Complete Genomes of the Fruit Fly Human Genomes Jaguar Microsatellites Exercises 7. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Population Genetic Structure Hardy-Weinberg Equilibrium F-Statistics Theoretical Background Implementations in pegas and in mmod Implementations in snpStats and in SNPRelate Trees and Networks Minimum Spanning Trees and Networks Statistical Parsimony Median Networks Phylogenetic Trees Multivariate Methods Principles of Discriminant Analysis Discriminant Analysis of Principal Components Clustering Maximum Likelihood Methods Bayesian Clustering Admixture Likelihood Method Principal Component Analysis of Coancestry A Second Look at F-Statistics Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Jaguar Microsatellites Exercises 8. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Geographical Structure Geographical Data in R Packages and Classes Calculating Geographical Distances A Third Look at F-Statistics Hierarchical Components of Genetic Diversity Analysis of Molecular Variance Moran I and Spatial Autocorrelation Spatial Principal Component Analysis Finding Boundaries Between Populations Spatial Ancestry (tessr) Bayesian Methods (Geneland) Case Studies Complete Genomes of the Fruit Fly Human Genomes Exercises 9. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Past Demographic Events The Coalescent The Standard Coalescent The Sequential Markovian Coalescent Simulation of Coalescent Data Estimation of _ Heterozygosity Number of Alleles Segregating Sites Microsatellites Trees Coalescent-Based Inference Maximum Likelihood Methods Analysis of Markov Chain Monte Carlo Outputs Skyline Plots Bayesian Methods Heterochronous Samples Site Frequency Spectrum Methods The Stairway Method CubSFS Popsicle Whole-Genome Methods (psmcr) Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Bacterial Whole Genome Sequences Exercises 10. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Natural Selection Testing Neutrality Simple Tests Selection in Protein-Coding Sequences Selection Scans A Fourth Look at F-Statistics Association Studies (LEA) Principal Component Analysis (pcadapt) Scans for Selection With Extended Haplotypes FST Outliers Time-Series of Allele Frequencies Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Exercises A Installing R Packages B Compressing Large Sequence Files C Sampling of Alleles in a Population</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Population Genomics With R presents a multidisciplinary approach to the analysis of population genomics. The methods treated cover a large number of topics from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and integrated to provide a coherent software environment with a wide range of computational, statistical, and graphical tools. Small examples are used to illustrate the basics and published data are used as case studies. Readers are expected to have a basic knowledge of biology, genetics, and statistical inference methods. Graduate students and post-doctorate researchers will find resources to analyze their population genetic and genomic data as well as help them design new studies. The first four chapters review the basics of population genomics, data acquisition, and the use of R to store and manipulate genomic data. Chapter 5 treats the exploration of genomic data, an important issue when analysing large data sets. The other five chapters cover linkage disequilibrium, population genomic structure, geographical structure, past demographic events, and natural selection. These chapters include supervised and unsupervised methods, admixture analysis, an in-depth treatment of multivariate methods, and advice on how to handle GIS data. The analysis of natural selection, a traditional issue in evolutionary biology, has known a revival with modern population genomic data. All chapters include exercises. Supplemental materials are available on-line (http://ape-package.ird.fr/PGR.html)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">MATHEMATICS / Probability & Statistics / General / bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">SCIENCE / Biotechnology / bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">SCIENCE / Life Sciences / Genetics & Genomics / bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Population genetics / Mathematical models</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Genomics / Mathematical models</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">R (Computer program language)</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">9781138608184</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://www.taylorfrancis.com/books/9780429466700</subfield><subfield code="x">Verlag</subfield><subfield code="z">URL des Erstveröffentlichers</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-7-TFC</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-032215137</subfield></datafield></record></collection> |
id | DE-604.BV046806498 |
illustrated | Illustrated |
index_date | 2024-07-03T14:57:40Z |
indexdate | 2024-07-10T08:54:21Z |
institution | BVB |
isbn | 9780429466700 0429466706 9780429882432 0429882432 9780429882425 0429882424 9780429882418 0429882416 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-032215137 |
open_access_boolean | |
physical | 1 online resource (xvi, 378 pages) illustrations |
psigel | ZDB-7-TFC |
publishDate | 2020 |
publishDateSearch | 2020 |
publishDateSort | 2020 |
publisher | CRC Press, Taylor & Francis Group |
record_format | marc |
spelling | Paradis, Emmanuel aut Population genomics with R by Emmanuel Paradis Boca Raton CRC Press, Taylor & Francis Group [2020] 1 online resource (xvi, 378 pages) illustrations txt rdacontent c rdamedia cr rdacarrier 1. Introduction Heredity, Genetics, and Genomics Principles of Population Genomics Units Genome Structures Mutations Drift and Selection R Packages and Conventions Required Knowledge and Other Readings 2. Data Acquisition Samples and Sampling Designs How Much DNA in a Sample? Degraded Samples Sampling Designs Low-Throughput Technologies Genotypes From Phenotypes DNA Cleavage Methods Repeat Length Polymorphism Sanger and Shotgun Sequencing DNA Methylation and Bisulfite Sequencing High-Throughput Technologies DNA Microarrays High-Throughput Sequencing Restriction Site Associated DNA RNA Sequencing Exome Sequencing Sequencing of Pooled Individuals Designing a Study With HTS The Future of DNA Sequencing File Formats Data Files Archiving and Compression Bioinformatics and Genomics Processing Sanger Sequencing Data With sangerseqR Read Mapping With Rsubread Managing Read Alignments With Rsamtools Simulation of High-Throughput Sequencing Data Exercises 3. - Genomic Data in R What is an R Data Object? Data Classes for Genomic Data The Class "loci" (pegas) The Class "genind" (adegenet) The Classes "SNPbin" and "genlight" (adegenet) The Class "SnpMatrix" (snpStats) The Class "DNAbin" (ape) The Classes "XString" and "XStringSet" (Biostrings) The Package SNPRelate Data Input and Output Reading Text Files Reading Spreadsheet Files Reading VCF Files Reading PED and BED Files Reading Sequence Files Reading Annotation Files Writing Files Internet Databases Managing Files and Projects Exercises 4. Data Manipulation Basic Data Manipulation in R Subsetting, Replacement, and Deletion Commonly Used Functions Recycling and Coercion Logical Vectors Memory Management Conversions Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 5. - Data Exploration and Summaries Genotype and Allele Frequencies Allelic Richness Missing Data Haplotype and Nucleotide Diversity The Class "haplotype" Haplotype and Nucleotide Diversity From DNA Sequences Genetic and Genomic Distances Theoretical Background Hamming Distance Distances From DNA Sequences Distances From Allele Sharing Distances From Microsatellites Summary by Groups Sliding Windows DNA Sequences Summaries With Genomic Positions Package SNPRelate Multivariate Methods Matrix Decomposition Eigendecomposition Singular Value Decomposition Power Method and Random Matrices Principal Component Analysis adegenet SNPRelate flashpcaR Multidimensional Scaling Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Human Genomes Influenza HN Virus Sequences Jaguar Microsatellites Bacterial Whole Genome Sequences Metabarcoding of Fish Communities Exercises 6. - Linkage Disequilibrium and Haplotype Structure Why Linkage Disequilibrium is Important? Linkage Disequilibrium: Two Loci Phased Genotypes Theoretical Background Implementation in pegas Unphased Genotypes More Than Two Loci Haplotypes From Unphased Genotypes The Expectation-Maximization Algorithm Implementation in haplostats Locus-Specific Imputation Maps of Linkage Disequilibrium Phased Genotypes With pegas SNPRelate snpStats Case Studies Complete Genomes of the Fruit Fly Human Genomes Jaguar Microsatellites Exercises 7. - Population Genetic Structure Hardy-Weinberg Equilibrium F-Statistics Theoretical Background Implementations in pegas and in mmod Implementations in snpStats and in SNPRelate Trees and Networks Minimum Spanning Trees and Networks Statistical Parsimony Median Networks Phylogenetic Trees Multivariate Methods Principles of Discriminant Analysis Discriminant Analysis of Principal Components Clustering Maximum Likelihood Methods Bayesian Clustering Admixture Likelihood Method Principal Component Analysis of Coancestry A Second Look at F-Statistics Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Jaguar Microsatellites Exercises 8. - Geographical Structure Geographical Data in R Packages and Classes Calculating Geographical Distances A Third Look at F-Statistics Hierarchical Components of Genetic Diversity Analysis of Molecular Variance Moran I and Spatial Autocorrelation Spatial Principal Component Analysis Finding Boundaries Between Populations Spatial Ancestry (tessr) Bayesian Methods (Geneland) Case Studies Complete Genomes of the Fruit Fly Human Genomes Exercises 9. - Past Demographic Events The Coalescent The Standard Coalescent The Sequential Markovian Coalescent Simulation of Coalescent Data Estimation of _ Heterozygosity Number of Alleles Segregating Sites Microsatellites Trees Coalescent-Based Inference Maximum Likelihood Methods Analysis of Markov Chain Monte Carlo Outputs Skyline Plots Bayesian Methods Heterochronous Samples Site Frequency Spectrum Methods The Stairway Method CubSFS Popsicle Whole-Genome Methods (psmcr) Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Bacterial Whole Genome Sequences Exercises 10. - Natural Selection Testing Neutrality Simple Tests Selection in Protein-Coding Sequences Selection Scans A Fourth Look at F-Statistics Association Studies (LEA) Principal Component Analysis (pcadapt) Scans for Selection With Extended Haplotypes FST Outliers Time-Series of Allele Frequencies Case Studies Mitochondrial Genomes of the Asiatic Golden Cat Complete Genomes of the Fruit Fly Influenza HN Virus Sequences Exercises A Installing R Packages B Compressing Large Sequence Files C Sampling of Alleles in a Population Population Genomics With R presents a multidisciplinary approach to the analysis of population genomics. The methods treated cover a large number of topics from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and integrated to provide a coherent software environment with a wide range of computational, statistical, and graphical tools. Small examples are used to illustrate the basics and published data are used as case studies. Readers are expected to have a basic knowledge of biology, genetics, and statistical inference methods. Graduate students and post-doctorate researchers will find resources to analyze their population genetic and genomic data as well as help them design new studies. The first four chapters review the basics of population genomics, data acquisition, and the use of R to store and manipulate genomic data. Chapter 5 treats the exploration of genomic data, an important issue when analysing large data sets. The other five chapters cover linkage disequilibrium, population genomic structure, geographical structure, past demographic events, and natural selection. These chapters include supervised and unsupervised methods, admixture analysis, an in-depth treatment of multivariate methods, and advice on how to handle GIS data. The analysis of natural selection, a traditional issue in evolutionary biology, has known a revival with modern population genomic data. All chapters include exercises. Supplemental materials are available on-line (http://ape-package.ird.fr/PGR.html) MATHEMATICS / Probability & Statistics / General / bisacsh SCIENCE / Biotechnology / bisacsh SCIENCE / Life Sciences / Genetics & Genomics / bisacsh Population genetics / Mathematical models Genomics / Mathematical models R (Computer program language) Erscheint auch als Druck-Ausgabe 9781138608184 https://www.taylorfrancis.com/books/9780429466700 Verlag URL des Erstveröffentlichers Volltext |
spellingShingle | Paradis, Emmanuel Population genomics with R MATHEMATICS / Probability & Statistics / General / bisacsh SCIENCE / Biotechnology / bisacsh SCIENCE / Life Sciences / Genetics & Genomics / bisacsh Population genetics / Mathematical models Genomics / Mathematical models R (Computer program language) |
title | Population genomics with R |
title_auth | Population genomics with R |
title_exact_search | Population genomics with R |
title_exact_search_txtP | Population genomics with R |
title_full | Population genomics with R by Emmanuel Paradis |
title_fullStr | Population genomics with R by Emmanuel Paradis |
title_full_unstemmed | Population genomics with R by Emmanuel Paradis |
title_short | Population genomics with R |
title_sort | population genomics with r |
topic | MATHEMATICS / Probability & Statistics / General / bisacsh SCIENCE / Biotechnology / bisacsh SCIENCE / Life Sciences / Genetics & Genomics / bisacsh Population genetics / Mathematical models Genomics / Mathematical models R (Computer program language) |
topic_facet | MATHEMATICS / Probability & Statistics / General / bisacsh SCIENCE / Biotechnology / bisacsh SCIENCE / Life Sciences / Genetics & Genomics / bisacsh Population genetics / Mathematical models Genomics / Mathematical models R (Computer program language) |
url | https://www.taylorfrancis.com/books/9780429466700 |
work_keys_str_mv | AT paradisemmanuel populationgenomicswithr |