Designing and evaluating language corpora: a practical framework for corpus representativeness
Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativenes...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Cambridge
Cambridge University Press
2022
|
Schlagworte: | |
Online-Zugang: | DE-12 DE-473 Volltext |
Zusammenfassung: | Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation |
Beschreibung: | 1 Online-Ressource (xiii, 284 Seiten) |
ISBN: | 9781316584880 |
DOI: | 10.1017/9781316584880 |
Internformat
MARC
LEADER | 00000nmm a2200000 c 4500 | ||
---|---|---|---|
001 | BV048292377 | ||
003 | DE-604 | ||
005 | 20240801 | ||
007 | cr|uuu---uuuuu | ||
008 | 220621s2022 |||| o||u| ||||||eng d | ||
020 | |a 9781316584880 |c Online |9 978-1-316-58488-0 | ||
024 | 7 | |a 10.1017/9781316584880 |2 doi | |
035 | |a (ZDB-20-CBO)CR9781316584880 | ||
035 | |a (OCoLC)1334029014 | ||
035 | |a (DE-599)BVBBV048292377 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-12 |a DE-473 |a DE-11 | ||
082 | 0 | |a 418.02 | |
084 | |a ES 900 |0 (DE-625)27926: |2 rvk | ||
084 | |a ES 965 |0 (DE-625)27939: |2 rvk | ||
100 | 1 | |a Egbert, Jesse |d 1985- |0 (DE-588)1107801001 |4 aut | |
245 | 1 | 0 | |a Designing and evaluating language corpora |b a practical framework for corpus representativeness |c Jesse Egbert, Douglas Biber, Bethany Gray |
264 | 1 | |a Cambridge |b Cambridge University Press |c 2022 | |
300 | |a 1 Online-Ressource (xiii, 284 Seiten) | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
520 | |a Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation | ||
650 | 4 | |a Corpora (Linguistics) / Design | |
650 | 0 | 7 | |a Korpus |g Linguistik |0 (DE-588)4165338-5 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Korpus |g Linguistik |0 (DE-588)4165338-5 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Biber, Douglas |d 1952- |0 (DE-588)137511272 |4 aut | |
700 | 1 | |a Gray, Bethany |d ca. 20./21. Jh. |0 (DE-588)1083812181 |4 aut | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe, Hardcover |z 978-1-107-15138-3 |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe, Paperback |z 978-1-316-60588-2 |
856 | 4 | 0 | |u https://doi.org/10.1017/9781316584880 |x Verlag |z URL des Erstveröffentlichers |3 Volltext |
912 | |a ZDB-20-CBO | ||
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-033672336 | |
966 | e | |u https://doi.org/10.1017/9781316584880 |l DE-12 |p ZDB-20-CBO |q BSB_PDA_CBO |x Verlag |3 Volltext | |
966 | e | |u https://doi.org/10.1017/9781316584880 |l DE-473 |p ZDB-20-CBO |q UBG_PDA_CBO_Kauf24 |x Verlag |3 Volltext |
Datensatz im Suchindex
_version_ | 1806233017766641664 |
---|---|
adam_text | |
adam_txt | |
any_adam_object | |
any_adam_object_boolean | |
author | Egbert, Jesse 1985- Biber, Douglas 1952- Gray, Bethany ca. 20./21. Jh |
author_GND | (DE-588)1107801001 (DE-588)137511272 (DE-588)1083812181 |
author_facet | Egbert, Jesse 1985- Biber, Douglas 1952- Gray, Bethany ca. 20./21. Jh |
author_role | aut aut aut |
author_sort | Egbert, Jesse 1985- |
author_variant | j e je d b db b g bg |
building | Verbundindex |
bvnumber | BV048292377 |
classification_rvk | ES 900 ES 965 |
collection | ZDB-20-CBO |
ctrlnum | (ZDB-20-CBO)CR9781316584880 (OCoLC)1334029014 (DE-599)BVBBV048292377 |
dewey-full | 418.02 |
dewey-hundreds | 400 - Language |
dewey-ones | 418 - Applied linguistics |
dewey-raw | 418.02 |
dewey-search | 418.02 |
dewey-sort | 3418.02 |
dewey-tens | 410 - Linguistics |
discipline | Sprachwissenschaft Literaturwissenschaft |
discipline_str_mv | Sprachwissenschaft Literaturwissenschaft |
doi_str_mv | 10.1017/9781316584880 |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nmm a2200000 c 4500</leader><controlfield tag="001">BV048292377</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20240801</controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">220621s2022 |||| o||u| ||||||eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781316584880</subfield><subfield code="c">Online</subfield><subfield code="9">978-1-316-58488-0</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1017/9781316584880</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-20-CBO)CR9781316584880</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1334029014</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV048292377</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield><subfield code="a">DE-473</subfield><subfield code="a">DE-11</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">418.02</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ES 900</subfield><subfield code="0">(DE-625)27926:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ES 965</subfield><subfield code="0">(DE-625)27939:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Egbert, Jesse</subfield><subfield code="d">1985-</subfield><subfield code="0">(DE-588)1107801001</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Designing and evaluating language corpora</subfield><subfield code="b">a practical framework for corpus representativeness</subfield><subfield code="c">Jesse Egbert, Douglas Biber, Bethany Gray</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge</subfield><subfield code="b">Cambridge University Press</subfield><subfield code="c">2022</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (xiii, 284 Seiten)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Corpora (Linguistics) / Design</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Korpus</subfield><subfield code="g">Linguistik</subfield><subfield code="0">(DE-588)4165338-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Korpus</subfield><subfield code="g">Linguistik</subfield><subfield code="0">(DE-588)4165338-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Biber, Douglas</subfield><subfield code="d">1952-</subfield><subfield code="0">(DE-588)137511272</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Gray, Bethany</subfield><subfield code="d">ca. 20./21. Jh.</subfield><subfield code="0">(DE-588)1083812181</subfield><subfield code="4">aut</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe, Hardcover</subfield><subfield code="z">978-1-107-15138-3</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe, Paperback</subfield><subfield code="z">978-1-316-60588-2</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://doi.org/10.1017/9781316584880</subfield><subfield code="x">Verlag</subfield><subfield code="z">URL des Erstveröffentlichers</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-20-CBO</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-033672336</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1017/9781316584880</subfield><subfield code="l">DE-12</subfield><subfield code="p">ZDB-20-CBO</subfield><subfield code="q">BSB_PDA_CBO</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1017/9781316584880</subfield><subfield code="l">DE-473</subfield><subfield code="p">ZDB-20-CBO</subfield><subfield code="q">UBG_PDA_CBO_Kauf24</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield></record></collection> |
id | DE-604.BV048292377 |
illustrated | Not Illustrated |
index_date | 2024-07-03T20:04:04Z |
indexdate | 2024-08-02T00:20:36Z |
institution | BVB |
isbn | 9781316584880 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-033672336 |
oclc_num | 1334029014 |
open_access_boolean | |
owner | DE-12 DE-473 DE-BY-UBG DE-11 |
owner_facet | DE-12 DE-473 DE-BY-UBG DE-11 |
physical | 1 Online-Ressource (xiii, 284 Seiten) |
psigel | ZDB-20-CBO ZDB-20-CBO BSB_PDA_CBO ZDB-20-CBO UBG_PDA_CBO_Kauf24 |
publishDate | 2022 |
publishDateSearch | 2022 |
publishDateSort | 2022 |
publisher | Cambridge University Press |
record_format | marc |
spelling | Egbert, Jesse 1985- (DE-588)1107801001 aut Designing and evaluating language corpora a practical framework for corpus representativeness Jesse Egbert, Douglas Biber, Bethany Gray Cambridge Cambridge University Press 2022 1 Online-Ressource (xiii, 284 Seiten) txt rdacontent c rdamedia cr rdacarrier Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation Corpora (Linguistics) / Design Korpus Linguistik (DE-588)4165338-5 gnd rswk-swf Korpus Linguistik (DE-588)4165338-5 s DE-604 Biber, Douglas 1952- (DE-588)137511272 aut Gray, Bethany ca. 20./21. Jh. (DE-588)1083812181 aut Erscheint auch als Druck-Ausgabe, Hardcover 978-1-107-15138-3 Erscheint auch als Druck-Ausgabe, Paperback 978-1-316-60588-2 https://doi.org/10.1017/9781316584880 Verlag URL des Erstveröffentlichers Volltext |
spellingShingle | Egbert, Jesse 1985- Biber, Douglas 1952- Gray, Bethany ca. 20./21. Jh Designing and evaluating language corpora a practical framework for corpus representativeness Corpora (Linguistics) / Design Korpus Linguistik (DE-588)4165338-5 gnd |
subject_GND | (DE-588)4165338-5 |
title | Designing and evaluating language corpora a practical framework for corpus representativeness |
title_auth | Designing and evaluating language corpora a practical framework for corpus representativeness |
title_exact_search | Designing and evaluating language corpora a practical framework for corpus representativeness |
title_exact_search_txtP | Designing and evaluating language corpora a practical framework for corpus representativeness |
title_full | Designing and evaluating language corpora a practical framework for corpus representativeness Jesse Egbert, Douglas Biber, Bethany Gray |
title_fullStr | Designing and evaluating language corpora a practical framework for corpus representativeness Jesse Egbert, Douglas Biber, Bethany Gray |
title_full_unstemmed | Designing and evaluating language corpora a practical framework for corpus representativeness Jesse Egbert, Douglas Biber, Bethany Gray |
title_short | Designing and evaluating language corpora |
title_sort | designing and evaluating language corpora a practical framework for corpus representativeness |
title_sub | a practical framework for corpus representativeness |
topic | Corpora (Linguistics) / Design Korpus Linguistik (DE-588)4165338-5 gnd |
topic_facet | Corpora (Linguistics) / Design Korpus Linguistik |
url | https://doi.org/10.1017/9781316584880 |
work_keys_str_mv | AT egbertjesse designingandevaluatinglanguagecorporaapracticalframeworkforcorpusrepresentativeness AT biberdouglas designingandevaluatinglanguagecorporaapracticalframeworkforcorpusrepresentativeness AT graybethany designingandevaluatinglanguagecorporaapracticalframeworkforcorpusrepresentativeness |