Natural language processing and computational linguistics: 1 Speech, morphology and syntax
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
London
ISTE
2016
|
Schriftenreihe: | Cognitive science series
Cognitive science series |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis Klappentext |
Beschreibung: | xiv, 277 Seiten Illustrationen |
ISBN: | 9781848218482 |
Internformat
MARC
LEADER | 00000nam a2200000 cc4500 | ||
---|---|---|---|
001 | BV043997661 | ||
003 | DE-604 | ||
005 | 20170313 | ||
007 | t | ||
008 | 170113s2016 a||| |||| 00||| eng d | ||
020 | |a 9781848218482 |9 978-1-84821-848-2 | ||
035 | |a (OCoLC)976413226 | ||
035 | |a (DE-599)BVBBV043997661 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-355 | ||
100 | 1 | |a Kurdi, Mohamed Zakaria |d 1973- |e Verfasser |0 (DE-588)1118418484 |4 aut | |
245 | 1 | 0 | |a Natural language processing and computational linguistics |n 1 |p Speech, morphology and syntax |c Mohamed Zakaria Kurdi |
264 | 1 | |a London |b ISTE |c 2016 | |
300 | |a xiv, 277 Seiten |b Illustrationen | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Cognitive science series | |
490 | 0 | |a Cognitive science series | |
650 | 0 | 7 | |a Computerlinguistik |0 (DE-588)4035843-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Natürliche Sprache |0 (DE-588)4041354-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Natürliche Sprache |0 (DE-588)4041354-8 |D s |
689 | 0 | 1 | |a Sprachverarbeitung |0 (DE-588)4116579-2 |D s |
689 | 0 | 2 | |a Computerlinguistik |0 (DE-588)4035843-4 |D s |
689 | 0 | |5 DE-604 | |
773 | 0 | 8 | |w (DE-604)BV043997653 |g 1 |
856 | 4 | 2 | |m Digitalisierung UB Regensburg - ADAM Catalogue Enrichment |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000003&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
856 | 4 | 2 | |m Digitalisierung UB Regensburg - ADAM Catalogue Enrichment |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000004&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA |3 Klappentext |
999 | |a oai:aleph.bib-bvb.de:BVB01-029405692 |
Datensatz im Suchindex
_version_ | 1804176976577036288 |
---|---|
adam_text | Contents
Introduction................................................................... ix
Chapter 1. Linguistic Resources for NLP................................... 1
1.1. The concept of a corpus.......................................... 1
1.2. Corpus taxonomy....................................................... 4
1.2.1. Written versus spoken........................................... 4
1.2.2. The historical point of view............................ 5
1.2.3. The language of corpora........................................... 5
1.2.4. Thematic representativity......................................... 7
1.2.5. Age range of speakers ....................................... 8
1.3. Who collects and distributes corpora?................................. 8
1.3.1. The Gutenberg project............................................. 9
1.3.2. The linguistic data consortium.................................... 9
1.3.3. European language resource agency................................. 9
1.3.4. Open language archives community................................. 10
1.3.5. Miscellaneous.................................................... 10
1.4. The lifecycle of a corpus............................................ 10
1.4.1. Needs analysis................................................... 12
1.4.2. Design of scenarios to collect data
for the corpus........................................................... 12
1.4.3. Collection of the corpus ........................................ 12
1.4.4. Transcription.................................................... 16
1.4.5. Corpus annotation................................................ 18
1.4.6. Corpus documentation............................................. 22
1.4.7. Statistical analysis of data..................................... 22
1.4.8. The use of corpora in NLP........................................ 23
vi Natural Language Processing and Computational Linguistics 1
1.5. Examples of existing corpora................................... 23
1.5.1. American National Corpus................................... 23
1.5.2. Oxford English Corpus.......................................... 23
1.5.3. The Grenoble Tourism Office Corpus......................... 24
Chapter 2. The Sphere of Speech......................................... 25
2.1. Linguistic studies of speech................................... 25
2.1.1. Phonetics..................................................... 25
2.1.2. Phonology................................................... 46
2.2. Speech processing. ................................................ 61
2.2.1. Automatic speech recognition.................................. 62
2.2.2. Speech synthesis.............................................. 80
Chapter 3, Morphology Sphere............................................... 89
3.1. Elements of morphology............................................ 89
3.1.1. Morphological typology........................................ 90
3.1.2. Morphology of English......................................... 91
3.1.3. Parts of speech............................................... 95
3.1.4. Terms, collocations and colligations.......................... 99
3.2. Automatic morphological analysis................................. 100
3.2.1. Stemming..................................................... 101
3.2.2. Regular expressions for morphological analysis............. 104
3.2.3. Informal introduction to finite-state machines............. 108
3.2.4. Two-level morphology and FST............................... 112
3.2.5. Part-of-speech tagging....................................... 117
Chapter 4. Syntax Sphere.................................................. 127
4.1. Basic syntactic concepts......................................... 127
4.1.1. Delimitation of the field of syntax........................ 127
4.1.2. The concept of grammaticality................................ 128
4.1.3. Syntactic constituents....................................... 129
4.1.4. Syntactic typology of topology
and agreement....................................................... 139
4.1.5. Syntactic ambiguity.......................................... 140
4.1.6. Syntactic specificities of
spontaneous oral language......................................... 141
4.2. Elements of formal syntax........................................ 145
4.2.1. Syntax trees and rewrite rules............................... 145
4.2.2. Languages and formal grammars................................ 152
4.2.3. Hierarchy of languages
(Chomsky-Schutzenberger).......................................... 154
Contents vii
4.2.4. Feature structures and unification............................. 162
4.2.5. Definite clause grammar........................................ 169
4.3. Syntactic formalisms............................................... 171
4.3.1. X-bar.......................................................... 171
4.3.2. Head-driven phrase structure grammar........................... 178
4.3.3. Lexicalized tree-adjoining grammar............................. 193
4.4. Automatic parsing................................................. 201
4.4.1. Finite-state automata.......................................... 202
4.4.2. Recursive transition networks.................................. 203
4.4.3. Top-down approach.............................................. 207
4.4.4. Bottom-up approach........................................... 212
4.4.5. Mixed approach: left-comer..................................... 215
4.4.6. Tabular parsing (chart)........................................ 221
4.4.7. Probabilistic parsing.......................................... 225
4.4.8. Neural networks................................................ 233
4.4.9. parsing algorithms for
unification-based grammars............................................ 237
4.4.10. Robust parsing approaches..................................... 238
4.4.11. Generation algorithms......................................... 242
Bibliography............................................................... 245
Index...................................................................... 275
Natural language processing (NLP) is a scientific discipline which
is found at the interface of computer science, artificial intelligence
and cognitive psychology.
Providing an overview of international work in this
interdisciplinary field, this book gives the reader a panoramic
view of both early and current research in NLP. Carefully chosen
multilingual examples present the state of the art of a mature field
which is in a constant state of evolution.
In four chapters, this book presents the fundamental concepts of
phonetics and phonology and the two most important
applications in the field of speech processing: recognition and
synthesis. Also presented are the fundamental concepts of corpus
linguistics and the basic concepts of morphology and its NLP
applications such as stemming and part of speech tagging. The
fundamental notions and the most important syntactic theories
are presented, as well as the different approaches to syntactic
parsing with reference to cognitive models, algorithms and
computer applications.
Mohamed Zakaria Kurdi is Assistant Professor at the CS
Department of Lynchburg College in Virginia, USA. His research
interests include natural language processing, robust parsing,
text mining and intelligent computer-assisted language learning.
|
any_adam_object | 1 |
author | Kurdi, Mohamed Zakaria 1973- |
author_GND | (DE-588)1118418484 |
author_facet | Kurdi, Mohamed Zakaria 1973- |
author_role | aut |
author_sort | Kurdi, Mohamed Zakaria 1973- |
author_variant | m z k mz mzk |
building | Verbundindex |
bvnumber | BV043997661 |
ctrlnum | (OCoLC)976413226 (DE-599)BVBBV043997661 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01887nam a2200397 cc4500</leader><controlfield tag="001">BV043997661</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20170313 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">170113s2016 a||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781848218482</subfield><subfield code="9">978-1-84821-848-2</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)976413226</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV043997661</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-355</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kurdi, Mohamed Zakaria</subfield><subfield code="d">1973-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1118418484</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Natural language processing and computational linguistics</subfield><subfield code="n">1</subfield><subfield code="p">Speech, morphology and syntax</subfield><subfield code="c">Mohamed Zakaria Kurdi</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">London</subfield><subfield code="b">ISTE</subfield><subfield code="c">2016</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xiv, 277 Seiten</subfield><subfield code="b">Illustrationen</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Cognitive science series</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Cognitive science series</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Natürliche Sprache</subfield><subfield code="0">(DE-588)4041354-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Natürliche Sprache</subfield><subfield code="0">(DE-588)4041354-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Sprachverarbeitung</subfield><subfield code="0">(DE-588)4116579-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="w">(DE-604)BV043997653</subfield><subfield code="g">1</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Regensburg - ADAM Catalogue Enrichment</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000003&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Regensburg - ADAM Catalogue Enrichment</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000004&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Klappentext</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-029405692</subfield></datafield></record></collection> |
id | DE-604.BV043997661 |
illustrated | Illustrated |
indexdate | 2024-07-10T07:40:44Z |
institution | BVB |
isbn | 9781848218482 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-029405692 |
oclc_num | 976413226 |
open_access_boolean | |
owner | DE-355 DE-BY-UBR |
owner_facet | DE-355 DE-BY-UBR |
physical | xiv, 277 Seiten Illustrationen |
publishDate | 2016 |
publishDateSearch | 2016 |
publishDateSort | 2016 |
publisher | ISTE |
record_format | marc |
series2 | Cognitive science series |
spelling | Kurdi, Mohamed Zakaria 1973- Verfasser (DE-588)1118418484 aut Natural language processing and computational linguistics 1 Speech, morphology and syntax Mohamed Zakaria Kurdi London ISTE 2016 xiv, 277 Seiten Illustrationen txt rdacontent n rdamedia nc rdacarrier Cognitive science series Computerlinguistik (DE-588)4035843-4 gnd rswk-swf Natürliche Sprache (DE-588)4041354-8 gnd rswk-swf Sprachverarbeitung (DE-588)4116579-2 gnd rswk-swf Natürliche Sprache (DE-588)4041354-8 s Sprachverarbeitung (DE-588)4116579-2 s Computerlinguistik (DE-588)4035843-4 s DE-604 (DE-604)BV043997653 1 Digitalisierung UB Regensburg - ADAM Catalogue Enrichment application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000003&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis Digitalisierung UB Regensburg - ADAM Catalogue Enrichment application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000004&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA Klappentext |
spellingShingle | Kurdi, Mohamed Zakaria 1973- Natural language processing and computational linguistics Computerlinguistik (DE-588)4035843-4 gnd Natürliche Sprache (DE-588)4041354-8 gnd Sprachverarbeitung (DE-588)4116579-2 gnd |
subject_GND | (DE-588)4035843-4 (DE-588)4041354-8 (DE-588)4116579-2 |
title | Natural language processing and computational linguistics |
title_auth | Natural language processing and computational linguistics |
title_exact_search | Natural language processing and computational linguistics |
title_full | Natural language processing and computational linguistics 1 Speech, morphology and syntax Mohamed Zakaria Kurdi |
title_fullStr | Natural language processing and computational linguistics 1 Speech, morphology and syntax Mohamed Zakaria Kurdi |
title_full_unstemmed | Natural language processing and computational linguistics 1 Speech, morphology and syntax Mohamed Zakaria Kurdi |
title_short | Natural language processing and computational linguistics |
title_sort | natural language processing and computational linguistics speech morphology and syntax |
topic | Computerlinguistik (DE-588)4035843-4 gnd Natürliche Sprache (DE-588)4041354-8 gnd Sprachverarbeitung (DE-588)4116579-2 gnd |
topic_facet | Computerlinguistik Natürliche Sprache Sprachverarbeitung |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000003&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=029405692&sequence=000004&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV043997653 |
work_keys_str_mv | AT kurdimohamedzakaria naturallanguageprocessingandcomputationallinguistics1 |