Analyzing linguistic data: a practical introduction to statistics using R
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cambridge [u.a.]
Cambridge Univ. Press
2009
|
Ausgabe: | 1. publ. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | XIII, 353 S. graph. Darst. |
ISBN: | 9780521709187 9780521882590 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV023195223 | ||
003 | DE-604 | ||
005 | 20160121 | ||
007 | t | ||
008 | 080304s2009 d||| |||| 00||| eng d | ||
020 | |a 9780521709187 |9 978-0-521-70918-7 | ||
020 | |a 9780521882590 |9 978-0-521-88259-0 | ||
035 | |a (OCoLC)255917222 | ||
035 | |a (DE-599)BVBBV023195223 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-19 |a DE-384 |a DE-355 |a DE-473 |a DE-29 |a DE-12 | ||
084 | |a ES 250 |0 (DE-625)27822: |2 rvk | ||
084 | |a SK 840 |0 (DE-625)143261: |2 rvk | ||
100 | 1 | |a Baayen, R. Harald |d 1958- |e Verfasser |0 (DE-588)132384914 |4 aut | |
245 | 1 | 0 | |a Analyzing linguistic data |b a practical introduction to statistics using R |c R. H. Baayen |
250 | |a 1. publ. | ||
264 | 1 | |a Cambridge [u.a.] |b Cambridge Univ. Press |c 2009 | |
300 | |a XIII, 353 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 0 | 7 | |a R |g Programm |0 (DE-588)4705956-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachstatistik |0 (DE-588)4182534-2 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Sprachstatistik |0 (DE-588)4182534-2 |D s |
689 | 0 | 1 | |a R |g Programm |0 (DE-588)4705956-4 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | 2 | |m Digitalisierung UB Bamberg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016381549&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-016381549 |
Datensatz im Suchindex
_version_ | 1804137468962799616 |
---|---|
adam_text | Contents
Preface x
1
An introduction
to R
1
1.1
R
as a calculator
2
1.2
Getting data into and out of
R
4
1.3
Accessing information in data frames
6
1.4
Operations on data frames
10
1.4.1
Sorting a data frame by one or more columns
i o
1.4.2
Changing information in a data frame
12
1.4.3
Extracting contingency tables from data frames
і З
1.4.4
Calculations on data frames
15
1.5
Session management
18
2
Graphical data exploration
20
2.1
Random variables
20
2.2
Visualizing single random variables
21
2.3
Visualizing two or more variables
32
2.4
Trellis graphics
37
3
Probability distributions
44
3.1
Distributions
44
3.2
Discrete distributions
44
3.3
Continuous distributions
57
3.3.1
The normal distribution
58
3.3.2
The
t, F,
and
χ2
distributions
63
4
Basic statistical methods
68
4.1
Tests for single vectors
7
1
4.1.1
Distribution tests
71
4.1.2
Tests for the mean
75
4.2
Tests for two independent vectors
77
4.2.1
Are the distributions the same?
78
4.2.2
Are the means the same?
79
4.2.3
Are the variances the same?
81
4.3
Paired vectors
82
4.3.1
Are the means or medians the same?
82
4.3.2
Functional relations: linear regression
84
CONTENTS
4.3.3
What does the joint density look like?
97
4.4
A numerical vector and a factor: analysis of variance
Ю1
4.4.1
Two numerical vectors and a factor: analysis
ofcovariance
108
4.5
Two vectors with counts
111
4.6
A note on statistical significance
114
Clustering and classification
118
5.1
Clustering
118
5.1.1
Tables with measurements: principal components analysis
Ц8
5.1.2
Tables with measurements: factor analysis
126
5.1.3
Tables with counts: correspondence analysis
128
5.1.4
Tables with distances: multidimensional scaling
136
5.1.5
Tables with distances: hierarchical cluster analysis
1
38
5.2
Classification
148
5.2.1
Classification trees
148
5.2.2
Discriminant analysis
154
5.2.3
Support vector machines
160
Regression modeling
165
6.1
Introduction
165
6.2
Ordinary least squares regression
169
6.2.1
Nonlinearities I74
6.2.2
Collinearity
181
6.2.3
Model criticism
188
6.2.4
Validation I93
6.3
Generalized linear models
195
6.3.1
Logistic regression I95
6.3.2
Ordinal logistic regression
208
6.4
Regression with breakpoints
214
6.5
Models for lexical richness
222
6.6
General considerations
236
Mixed models
241
7.1
Modeling data with fixed and random effects
242
7.2
A comparison with traditional analyses
259
7.2.1
Mixed-effects models and quasi-F
260
7.2.2
Mixed-effects models and Latin Square designs
266
7.2.3
Regression with subjects and items
269
7.3
Shrinkage in mixed-effects models
275
7.4
Generalized linear mixed models
278
7.5
Case studies
284
7.5.1
Primed lexical decision latencies for Dutch neologisms
284
7.5.2
Self-paced reading latencies for Dutch neologisms
287
7.5.3
Visual lexical decision latencies of Dutch
eight-year-olds
289
7.5.4
Mixed-effects models in corpus linguistics
295
Contents
Appendix
A
Solutions
to the exercises
303
Appendix
В
Overview of
R
functions
335
References
342
Index
347
Index of data sets
347
Index of
R
347
Index of topics
349
Index of authors
352
|
adam_txt |
Contents
Preface x
1
An introduction
to R
1
1.1
R
as a calculator
2
1.2
Getting data into and out of
R
4
1.3
Accessing information in data frames
6
1.4
Operations on data frames
10
1.4.1
Sorting a data frame by one or more columns
i o
1.4.2
Changing information in a data frame
12
1.4.3
Extracting contingency tables from data frames
і З
1.4.4
Calculations on data frames
15
1.5
Session management
18
2
Graphical data exploration
20
2.1
Random variables
20
2.2
Visualizing single random variables
21
2.3
Visualizing two or more variables
32
2.4
Trellis graphics
37
3
Probability distributions
44
3.1
Distributions
44
3.2
Discrete distributions
44
3.3
Continuous distributions
57
3.3.1
The normal distribution
58
3.3.2
The
t, F,
and
χ2
distributions
63
4
Basic statistical methods
68
4.1
Tests for single vectors
7
1
4.1.1
Distribution tests
71
4.1.2
Tests for the mean
75
4.2
Tests for two independent vectors
77
4.2.1
Are the distributions the same?
78
4.2.2
Are the means the same?
79
4.2.3
Are the variances the same?
81
4.3
Paired vectors
82
4.3.1
Are the means or medians the same?
82
4.3.2
Functional relations: linear regression
84
CONTENTS
4.3.3
What does the joint density look like?
97
4.4
A numerical vector and a factor: analysis of variance
Ю1
4.4.1
Two numerical vectors and a factor: analysis
ofcovariance
108
4.5
Two vectors with counts
111
4.6
A note on statistical significance
114
Clustering and classification
118
5.1
Clustering
118
5.1.1
Tables with measurements: principal components analysis
Ц8
5.1.2
Tables with measurements: factor analysis
126
5.1.3
Tables with counts: correspondence analysis
128
5.1.4
Tables with distances: multidimensional scaling
136
5.1.5
Tables with distances: hierarchical cluster analysis
1
38
5.2
Classification
148
5.2.1
Classification trees
148
5.2.2
Discriminant analysis
154
5.2.3
Support vector machines
160
Regression modeling
165
6.1
Introduction
165
6.2
Ordinary least squares regression
169
6.2.1
Nonlinearities I74
6.2.2
Collinearity
181
6.2.3
Model criticism
188
6.2.4
Validation I93
6.3
Generalized linear models
195
6.3.1
Logistic regression I95
6.3.2
Ordinal logistic regression
208
6.4
Regression with breakpoints
214
6.5
Models for lexical richness
222
6.6
General considerations
236
Mixed models
241
7.1
Modeling data with fixed and random effects
242
7.2
A comparison with traditional analyses
259
7.2.1
Mixed-effects models and quasi-F
260
7.2.2
Mixed-effects models and Latin Square designs
266
7.2.3
Regression with subjects and items
269
7.3
Shrinkage in mixed-effects models
275
7.4
Generalized linear mixed models
278
7.5
Case studies
284
7.5.1
Primed lexical decision latencies for Dutch neologisms
284
7.5.2
Self-paced reading latencies for Dutch neologisms
287
7.5.3
Visual lexical decision latencies of Dutch
eight-year-olds
289
7.5.4
Mixed-effects models in corpus linguistics
295
Contents
Appendix
A
Solutions
to the exercises
303
Appendix
В
Overview of
R
functions
335
References
342
Index
347
Index of data sets
347
Index of
R
347
Index of topics
349
Index of authors
352 |
any_adam_object | 1 |
any_adam_object_boolean | 1 |
author | Baayen, R. Harald 1958- |
author_GND | (DE-588)132384914 |
author_facet | Baayen, R. Harald 1958- |
author_role | aut |
author_sort | Baayen, R. Harald 1958- |
author_variant | r h b rh rhb |
building | Verbundindex |
bvnumber | BV023195223 |
classification_rvk | ES 250 SK 840 |
ctrlnum | (OCoLC)255917222 (DE-599)BVBBV023195223 |
discipline | Sprachwissenschaft Mathematik Literaturwissenschaft |
discipline_str_mv | Sprachwissenschaft Mathematik Literaturwissenschaft |
edition | 1. publ. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01502nam a2200373 c 4500</leader><controlfield tag="001">BV023195223</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20160121 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">080304s2009 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780521709187</subfield><subfield code="9">978-0-521-70918-7</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780521882590</subfield><subfield code="9">978-0-521-88259-0</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)255917222</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV023195223</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-19</subfield><subfield code="a">DE-384</subfield><subfield code="a">DE-355</subfield><subfield code="a">DE-473</subfield><subfield code="a">DE-29</subfield><subfield code="a">DE-12</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ES 250</subfield><subfield code="0">(DE-625)27822:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 840</subfield><subfield code="0">(DE-625)143261:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Baayen, R. Harald</subfield><subfield code="d">1958-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)132384914</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Analyzing linguistic data</subfield><subfield code="b">a practical introduction to statistics using R</subfield><subfield code="c">R. H. Baayen</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. publ.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge [u.a.]</subfield><subfield code="b">Cambridge Univ. Press</subfield><subfield code="c">2009</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIII, 353 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">R</subfield><subfield code="g">Programm</subfield><subfield code="0">(DE-588)4705956-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachstatistik</subfield><subfield code="0">(DE-588)4182534-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Sprachstatistik</subfield><subfield code="0">(DE-588)4182534-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">R</subfield><subfield code="g">Programm</subfield><subfield code="0">(DE-588)4705956-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Bamberg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016381549&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-016381549</subfield></datafield></record></collection> |
id | DE-604.BV023195223 |
illustrated | Illustrated |
index_date | 2024-07-02T20:06:09Z |
indexdate | 2024-07-09T21:12:47Z |
institution | BVB |
isbn | 9780521709187 9780521882590 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-016381549 |
oclc_num | 255917222 |
open_access_boolean | |
owner | DE-19 DE-BY-UBM DE-384 DE-355 DE-BY-UBR DE-473 DE-BY-UBG DE-29 DE-12 |
owner_facet | DE-19 DE-BY-UBM DE-384 DE-355 DE-BY-UBR DE-473 DE-BY-UBG DE-29 DE-12 |
physical | XIII, 353 S. graph. Darst. |
publishDate | 2009 |
publishDateSearch | 2009 |
publishDateSort | 2009 |
publisher | Cambridge Univ. Press |
record_format | marc |
spelling | Baayen, R. Harald 1958- Verfasser (DE-588)132384914 aut Analyzing linguistic data a practical introduction to statistics using R R. H. Baayen 1. publ. Cambridge [u.a.] Cambridge Univ. Press 2009 XIII, 353 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier R Programm (DE-588)4705956-4 gnd rswk-swf Sprachstatistik (DE-588)4182534-2 gnd rswk-swf Sprachstatistik (DE-588)4182534-2 s R Programm (DE-588)4705956-4 s DE-604 Digitalisierung UB Bamberg application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016381549&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Baayen, R. Harald 1958- Analyzing linguistic data a practical introduction to statistics using R R Programm (DE-588)4705956-4 gnd Sprachstatistik (DE-588)4182534-2 gnd |
subject_GND | (DE-588)4705956-4 (DE-588)4182534-2 |
title | Analyzing linguistic data a practical introduction to statistics using R |
title_auth | Analyzing linguistic data a practical introduction to statistics using R |
title_exact_search | Analyzing linguistic data a practical introduction to statistics using R |
title_exact_search_txtP | Analyzing linguistic data a practical introduction to statistics using R |
title_full | Analyzing linguistic data a practical introduction to statistics using R R. H. Baayen |
title_fullStr | Analyzing linguistic data a practical introduction to statistics using R R. H. Baayen |
title_full_unstemmed | Analyzing linguistic data a practical introduction to statistics using R R. H. Baayen |
title_short | Analyzing linguistic data |
title_sort | analyzing linguistic data a practical introduction to statistics using r |
title_sub | a practical introduction to statistics using R |
topic | R Programm (DE-588)4705956-4 gnd Sprachstatistik (DE-588)4182534-2 gnd |
topic_facet | R Programm Sprachstatistik |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016381549&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT baayenrharald analyzinglinguisticdataapracticalintroductiontostatisticsusingr |