Text analytics for corpus linguistics and digital humanities: simple R scripts and tools
"Do you want to gain a deeper understanding of how big tech analyzes and exploits our text data, or investigate how political parties differ by analyzing textual styles in documents? This book explores how to apply state-of-the-art text analytics methods to detect and visualize phenomena in tex...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
London ; New York
Bloomsbury Academic
2024
|
Schriftenreihe: | Language, data science and digital humanities
Bloomsbury collections |
Schlagworte: | |
Online-Zugang: | DE-12 DE-188 DE-19 Volltext |
Zusammenfassung: | "Do you want to gain a deeper understanding of how big tech analyzes and exploits our text data, or investigate how political parties differ by analyzing textual styles in documents? This book explores how to apply state-of-the-art text analytics methods to detect and visualize phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports." |
Beschreibung: | 1 Online-Ressource (236 Seiten) Illustrationen |
ISBN: | 9781350370852 9781350370838 9781350370845 |
DOI: | 10.5040/9781350370852 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV049743429 | ||
003 | DE-604 | ||
005 | 20241204 | ||
007 | cr|uuu---uuuuu | ||
008 | 240616s2024 xx a||| o|||| 00||| eng d | ||
020 | |a 9781350370852 |c online |9 978-1-350-37085-2 | ||
020 | |a 9781350370838 |c epdf |9 978-1-350-37083-8 | ||
020 | |a 9781350370845 |c epub |9 978-1-350-37084-5 | ||
024 | 7 | |a 10.5040/9781350370852 |2 doi | |
035 | |a (OCoLC)1443581295 | ||
035 | |a (DE-599)BVBBV049743429 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-12 |a DE-188 |a DE-19 | ||
084 | |a HF 450 |0 (DE-625)48914: |2 rvk | ||
100 | 1 | |a Schneider, Gerold |e Verfasser |0 (DE-588)140606904 |4 aut | |
245 | 1 | 0 | |a Text analytics for corpus linguistics and digital humanities |b simple R scripts and tools |c Gerold Schneider |
264 | 1 | |a London ; New York |b Bloomsbury Academic |c 2024 | |
300 | |a 1 Online-Ressource (236 Seiten) |b Illustrationen | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
490 | 0 | |a Language, data science and digital humanities | |
490 | 0 | |a Bloomsbury collections | |
520 | 3 | |a "Do you want to gain a deeper understanding of how big tech analyzes and exploits our text data, or investigate how political parties differ by analyzing textual styles in documents? This book explores how to apply state-of-the-art text analytics methods to detect and visualize phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports." | |
650 | 0 | 7 | |a Digital Humanities |0 (DE-588)1038714850 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Data Mining |0 (DE-588)4428654-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Textanalyse |0 (DE-588)4194196-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Korpus |g Linguistik |0 (DE-588)4165338-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Computerlinguistik |0 (DE-588)4035843-4 |2 gnd |9 rswk-swf |
653 | 0 | |a Text data mining | |
653 | 0 | |a R (Computer program language) | |
653 | 0 | |a Corpora (Linguistics) / Data processing | |
653 | 0 | |a Digital humanities / Research / Methodology | |
689 | 0 | 0 | |a Digital Humanities |0 (DE-588)1038714850 |D s |
689 | 0 | 1 | |a Computerlinguistik |0 (DE-588)4035843-4 |D s |
689 | 0 | 2 | |a Data Mining |0 (DE-588)4428654-5 |D s |
689 | 0 | 3 | |a Korpus |g Linguistik |0 (DE-588)4165338-5 |D s |
689 | 0 | 4 | |a Textanalyse |0 (DE-588)4194196-2 |D s |
689 | 0 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe |z 978-1-350-37082-1 |
856 | 4 | 0 | |u https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections |x Verlag |z URL des Erstveröffentlichers |3 Volltext |
912 | |a ZDB-162-LIN | ||
912 | |a ZDB-162-BCC | ||
940 | 1 | |q ZDB-162-LIN24 | |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-035085313 | |
966 | e | |u https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections |l DE-12 |p ZDB-162-LIN |q ZDB-162-LIN24 |x Verlag |3 Volltext | |
966 | e | |u https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections |l DE-188 |p ZDB-162-BCC |x Verlag |3 Volltext | |
966 | e | |u https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections |l DE-19 |p ZDB-162-BCC |q UBM_Einzelkauf_2024 |x Verlag |3 Volltext |
Datensatz im Suchindex
_version_ | 1817529946791215104 |
---|---|
adam_text | |
any_adam_object | |
author | Schneider, Gerold |
author_GND | (DE-588)140606904 |
author_facet | Schneider, Gerold |
author_role | aut |
author_sort | Schneider, Gerold |
author_variant | g s gs |
building | Verbundindex |
bvnumber | BV049743429 |
classification_rvk | HF 450 |
collection | ZDB-162-LIN ZDB-162-BCC |
ctrlnum | (OCoLC)1443581295 (DE-599)BVBBV049743429 |
discipline | Anglistik / Amerikanistik |
doi_str_mv | 10.5040/9781350370852 |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV049743429</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20241204</controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">240616s2024 xx a||| o|||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781350370852</subfield><subfield code="c">online</subfield><subfield code="9">978-1-350-37085-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781350370838</subfield><subfield code="c">epdf</subfield><subfield code="9">978-1-350-37083-8</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781350370845</subfield><subfield code="c">epub</subfield><subfield code="9">978-1-350-37084-5</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.5040/9781350370852</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1443581295</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV049743429</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield><subfield code="a">DE-188</subfield><subfield code="a">DE-19</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">HF 450</subfield><subfield code="0">(DE-625)48914:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Schneider, Gerold</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)140606904</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Text analytics for corpus linguistics and digital humanities</subfield><subfield code="b">simple R scripts and tools</subfield><subfield code="c">Gerold Schneider</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">London ; New York</subfield><subfield code="b">Bloomsbury Academic</subfield><subfield code="c">2024</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (236 Seiten)</subfield><subfield code="b">Illustrationen</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Language, data science and digital humanities</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Bloomsbury collections</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">"Do you want to gain a deeper understanding of how big tech analyzes and exploits our text data, or investigate how political parties differ by analyzing textual styles in documents? This book explores how to apply state-of-the-art text analytics methods to detect and visualize phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports."</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Digital Humanities</subfield><subfield code="0">(DE-588)1038714850</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Textanalyse</subfield><subfield code="0">(DE-588)4194196-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Korpus</subfield><subfield code="g">Linguistik</subfield><subfield code="0">(DE-588)4165338-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">Text data mining</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">R (Computer program language)</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">Corpora (Linguistics) / Data processing</subfield></datafield><datafield tag="653" ind1=" " ind2="0"><subfield code="a">Digital humanities / Research / Methodology</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Digital Humanities</subfield><subfield code="0">(DE-588)1038714850</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Data Mining</subfield><subfield code="0">(DE-588)4428654-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Korpus</subfield><subfield code="g">Linguistik</subfield><subfield code="0">(DE-588)4165338-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="4"><subfield code="a">Textanalyse</subfield><subfield code="0">(DE-588)4194196-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">978-1-350-37082-1</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections</subfield><subfield code="x">Verlag</subfield><subfield code="z">URL des Erstveröffentlichers</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-162-LIN</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-162-BCC</subfield></datafield><datafield tag="940" ind1="1" ind2=" "><subfield code="q">ZDB-162-LIN24</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-035085313</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections</subfield><subfield code="l">DE-12</subfield><subfield code="p">ZDB-162-LIN</subfield><subfield code="q">ZDB-162-LIN24</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections</subfield><subfield code="l">DE-188</subfield><subfield code="p">ZDB-162-BCC</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections</subfield><subfield code="l">DE-19</subfield><subfield code="p">ZDB-162-BCC</subfield><subfield code="q">UBM_Einzelkauf_2024</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield></record></collection> |
id | DE-604.BV049743429 |
illustrated | Illustrated |
indexdate | 2024-12-04T17:00:29Z |
institution | BVB |
isbn | 9781350370852 9781350370838 9781350370845 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-035085313 |
oclc_num | 1443581295 |
open_access_boolean | |
owner | DE-12 DE-188 DE-19 DE-BY-UBM |
owner_facet | DE-12 DE-188 DE-19 DE-BY-UBM |
physical | 1 Online-Ressource (236 Seiten) Illustrationen |
psigel | ZDB-162-LIN ZDB-162-BCC ZDB-162-LIN24 ZDB-162-LIN ZDB-162-LIN24 ZDB-162-BCC UBM_Einzelkauf_2024 |
publishDate | 2024 |
publishDateSearch | 2024 |
publishDateSort | 2024 |
publisher | Bloomsbury Academic |
record_format | marc |
series2 | Language, data science and digital humanities Bloomsbury collections |
spelling | Schneider, Gerold Verfasser (DE-588)140606904 aut Text analytics for corpus linguistics and digital humanities simple R scripts and tools Gerold Schneider London ; New York Bloomsbury Academic 2024 1 Online-Ressource (236 Seiten) Illustrationen txt rdacontent c rdamedia cr rdacarrier Language, data science and digital humanities Bloomsbury collections "Do you want to gain a deeper understanding of how big tech analyzes and exploits our text data, or investigate how political parties differ by analyzing textual styles in documents? This book explores how to apply state-of-the-art text analytics methods to detect and visualize phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports." Digital Humanities (DE-588)1038714850 gnd rswk-swf Data Mining (DE-588)4428654-5 gnd rswk-swf Textanalyse (DE-588)4194196-2 gnd rswk-swf Korpus Linguistik (DE-588)4165338-5 gnd rswk-swf Computerlinguistik (DE-588)4035843-4 gnd rswk-swf Text data mining R (Computer program language) Corpora (Linguistics) / Data processing Digital humanities / Research / Methodology Digital Humanities (DE-588)1038714850 s Computerlinguistik (DE-588)4035843-4 s Data Mining (DE-588)4428654-5 s Korpus Linguistik (DE-588)4165338-5 s Textanalyse (DE-588)4194196-2 s DE-604 Erscheint auch als Druck-Ausgabe 978-1-350-37082-1 https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections Verlag URL des Erstveröffentlichers Volltext |
spellingShingle | Schneider, Gerold Text analytics for corpus linguistics and digital humanities simple R scripts and tools Digital Humanities (DE-588)1038714850 gnd Data Mining (DE-588)4428654-5 gnd Textanalyse (DE-588)4194196-2 gnd Korpus Linguistik (DE-588)4165338-5 gnd Computerlinguistik (DE-588)4035843-4 gnd |
subject_GND | (DE-588)1038714850 (DE-588)4428654-5 (DE-588)4194196-2 (DE-588)4165338-5 (DE-588)4035843-4 |
title | Text analytics for corpus linguistics and digital humanities simple R scripts and tools |
title_auth | Text analytics for corpus linguistics and digital humanities simple R scripts and tools |
title_exact_search | Text analytics for corpus linguistics and digital humanities simple R scripts and tools |
title_full | Text analytics for corpus linguistics and digital humanities simple R scripts and tools Gerold Schneider |
title_fullStr | Text analytics for corpus linguistics and digital humanities simple R scripts and tools Gerold Schneider |
title_full_unstemmed | Text analytics for corpus linguistics and digital humanities simple R scripts and tools Gerold Schneider |
title_short | Text analytics for corpus linguistics and digital humanities |
title_sort | text analytics for corpus linguistics and digital humanities simple r scripts and tools |
title_sub | simple R scripts and tools |
topic | Digital Humanities (DE-588)1038714850 gnd Data Mining (DE-588)4428654-5 gnd Textanalyse (DE-588)4194196-2 gnd Korpus Linguistik (DE-588)4165338-5 gnd Computerlinguistik (DE-588)4035843-4 gnd |
topic_facet | Digital Humanities Data Mining Textanalyse Korpus Linguistik Computerlinguistik |
url | https://doi.org/10.5040/9781350370852?locatt=label:secondary_bloomsburyCollections |
work_keys_str_mv | AT schneidergerold textanalyticsforcorpuslinguisticsanddigitalhumanitiessimplerscriptsandtools |