Memory-based parsing:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Amsterdam [u.a.]
Benjamins
2004
|
Schriftenreihe: | Natural language processing
7 |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Literaturverz. S. [268] - 283 |
Beschreibung: | VIII, 294 S. graph. Darst. |
ISBN: | 9027249911 1588115909 |
Internformat
MARC
LEADER | 00000nam a2200000zcb4500 | ||
---|---|---|---|
001 | BV019608897 | ||
003 | DE-604 | ||
005 | 20060426 | ||
007 | t | ||
008 | 041125s2004 ne d||| |||| 00||| eng d | ||
010 | |a 2004052954 | ||
020 | |a 9027249911 |9 90-272-4991-1 | ||
020 | |a 1588115909 |9 1-58811-590-9 | ||
035 | |a (OCoLC)56420387 | ||
035 | |a (DE-599)BVBBV019608897 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
044 | |a ne |c NL | ||
049 | |a DE-19 |a DE-473 |a DE-703 |a DE-29 | ||
050 | 0 | |a P98.5.P38 | |
082 | 0 | |a 410/.285 |2 22 | |
084 | |a ES 940 |0 (DE-625)27934: |2 rvk | ||
084 | |a ST 306 |0 (DE-625)143654: |2 rvk | ||
100 | 1 | |a Kübler, Sandra |e Verfasser |4 aut | |
245 | 1 | 0 | |a Memory-based parsing |c Sandra Kübler |
246 | 1 | 3 | |a Memory based parsing |
264 | 1 | |a Amsterdam [u.a.] |b Benjamins |c 2004 | |
300 | |a VIII, 294 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Natural language processing |v 7 | |
500 | |a Literaturverz. S. [268] - 283 | ||
650 | 4 | |a Analyse automatique (Linguistique) | |
650 | 4 | |a Linguistique informatique | |
650 | 7 | |a Lingüística computacional (gramática) |2 larpcal | |
650 | 7 | |a Parsing |2 gtt | |
650 | 4 | |a Computational linguistics | |
650 | 4 | |a Parsing (Computer grammar) | |
650 | 0 | 7 | |a Maschinelles Lernen |0 (DE-588)4193754-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Parser |0 (DE-588)4125056-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Natürliche Sprache |0 (DE-588)4041354-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Syntaktische Analyse |0 (DE-588)4058778-2 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Natürliche Sprache |0 (DE-588)4041354-8 |D s |
689 | 0 | 1 | |a Syntaktische Analyse |0 (DE-588)4058778-2 |D s |
689 | 0 | 2 | |a Maschinelles Lernen |0 (DE-588)4193754-5 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Parser |0 (DE-588)4125056-4 |D s |
689 | 1 | 1 | |a Syntaktische Analyse |0 (DE-588)4058778-2 |D s |
689 | 1 | |5 DE-604 | |
830 | 0 | |a Natural language processing |v 7 |w (DE-604)BV013516598 |9 7 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=012938652&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-012938652 |
Datensatz im Suchindex
_version_ | 1804132967560249344 |
---|---|
adam_text | CONTENTS
1 Introduction 1
1.1 Parsing 1
1.2 Machine Learning 4
1.3 Outline 7
2 Memory Based Learning 9
2.1 The Memory Based Learning Approach 9
2.1.1 Overlap Metric 13
2.1.2 Euclidean Distance 13
2.2 Extensions to the Basic Model 14
2.2.1 Reducing Retrieval Costs 14
2.2.2 Storage Reduction 16
2.2.3 Noise Tolerance 18
2.2.4 Nearest Hyperrectangle Approaches 19
2.3 Feature Weighting Metrics 21
2.3.1 Empirical Evaluation With Artificial Domains ... 22
2.3.2 Feature Weighting Dimensions 24
2.3.3 Feature Weighting as Search 30
2.4 Summary 33
3 Memory Based Approaches to Parsing 34
3.1 Noun Phrase Chunking 35
3.1.1 The approach of Veenstra (1998) 35
3.1.2 The approach of Tjong Kim Sang Veenstra (1999) 36
3.2 Shallow Parsing and Grammatical Relation Assignment . . 37
3.2.1 Memory Based Sequence Learning 37
3.2.2 Extended Memory Based Sequence Learning .... 40
3.2.3 The Approach of Buchholz (1998) 41
3.2.4 The Approach of Daelemans, Buchholz k Veenstra
(1999) 12
3.2.5 Memory Based Shallow Parsing Based on Words . . 15
3.2.6 The Approach of Buchholz (2002) 49
3.3 Memory Based Full Parsing 52
3.4 Summary 56
4 Data Oriented Parsing 57
4.1 A DOP Model for Phrase Structure Representations: D0P1 59
4.2 Parsing DOP1 62
4.2.1 Parsing 63
4.2.2 Monte Carlo Disambiguation 64
4.2.3 Empirical Evaluation of DOP1 on ATIS 67
4.2.4 Evaluating DOP1 on the Penn Treebank 70
4.3 Non Probabilistic DOP 72
4.4 The Treatment of Unknown Words: DOP3 75
4.5 Reducing DOP to a PCFG 77
4.5.1 The PCFG Representation of Subtrees 78
4.5.2 Maximum Constituent Parsing 80
4.6 Memory Based DOP 83
4.7 Summary 87
5 TUSBL: A Memory Based Parser 88
5.1 Parsing, Training Corpora, And Related Issues 90
5.1.1 The TiiBa D/S Treebank 91
5.1.2 A Short Comparison of TiiBa D/S and Penn Tree
bank 99
5.1.3 Going Beyond Pure Tree Structures 108
5.2 Parsing as Classification 118
5.3 Memory Based Learning as Part of a Hybrid Parsing Archi¬
tecture 132
5.3.1 The CASS Chunk Parser 135
5.3.2 CASS Output Structures 140
5.4 An Overview of the Architecture of the Memory Based Parser 155
5.5 The Learning Task 156
5.5.1 The Different Types of Information 161
5.5.2 Searching the Instance Base 169
5.6 A Sample Parse 180
5.7 Feature Weighting with a Flexible Number of Features . . 185
5.7.1 Weights for Omission in the Input Sentence .... 188
5.7.2 Weights for Omission in the Instance Base 193
5.8 The Backing Off Approach 199
5.9 Summary 206
6 Empirical Evaluation 209
6.1 Standard Evaluation Metrics 210
6.2 TiiSBL s Test Settings 212
6.3 TiiSBL s Time and Memory Requirements 214
6.4 Constituency Based Evaluation of TiiSBL 214
6.4.1 Evaluating TuSBL 214
6.4.2 Comparison to Other Parsers Evaluated on TiiBa
D/S 217
6.4.3 The Evaluation of the Word Based Search Module . 219
6.4.4 Evaluating the Word Based Search Including Skip¬
ping 220
6.4.5 Evaluating the Backing Off Module 221
6.4.6 Leave One Out Evaluation 222
6.5 Error Analysis 222
6.5.1 Unattached Constituents 223
6.5.2 Errors as Consequences of Unattached Constituents 224
6.5.3 Incorrect Grammatical Functions 226
6.5.4 Superfluous Nodes 229
6.5.5 Unrecoverable POS Tagging Errors 229
6.6 A Proposal for Dependency Based Evaluation 231
6.6.1 Deficiencies of Constituency Based Precision and Re¬
call 234
6.6.2 Converting TiiBa D/S into Dependencies 236
6.6.3 Dependency Based Parser Evaluation 240
6.7 Dependency Based Evaluation of TuSBL 242
6.7.1 Evaluating TuSBL 242
6.7.2 A Comparison of Constituency Based and Depend¬
ency Based Evaluation 243
6.8 Summary 247
7 A Comparison of Memory Based Approaches to TuSBL 251
7.1 Comparing TuSBL and DOP 252
7.2 Comparing TuSBL and MBSP 253
7.3 Comparing TiiSBL and OCTOPUS 255
7.4 Summary 259
8 Conclusion and Future Directions 260
A The Stuttgart Tubingen Tagset 263
B The TiiBa D/S Inventory of Syntactic Categories and
Grammatical Functions 266
References 268
Index of Subjects and Terms 284
|
any_adam_object | 1 |
author | Kübler, Sandra |
author_facet | Kübler, Sandra |
author_role | aut |
author_sort | Kübler, Sandra |
author_variant | s k sk |
building | Verbundindex |
bvnumber | BV019608897 |
callnumber-first | P - Language and Literature |
callnumber-label | P98 |
callnumber-raw | P98.5.P38 |
callnumber-search | P98.5.P38 |
callnumber-sort | P 298.5 P38 |
callnumber-subject | P - Philology and Linguistics |
classification_rvk | ES 940 ST 306 |
ctrlnum | (OCoLC)56420387 (DE-599)BVBBV019608897 |
dewey-full | 410/.285 |
dewey-hundreds | 400 - Language |
dewey-ones | 410 - Linguistics |
dewey-raw | 410/.285 |
dewey-search | 410/.285 |
dewey-sort | 3410 3285 |
dewey-tens | 410 - Linguistics |
discipline | Sprachwissenschaft Informatik Literaturwissenschaft |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02264nam a2200601zcb4500</leader><controlfield tag="001">BV019608897</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20060426 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">041125s2004 ne d||| |||| 00||| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">2004052954</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9027249911</subfield><subfield code="9">90-272-4991-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1588115909</subfield><subfield code="9">1-58811-590-9</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)56420387</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV019608897</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">ne</subfield><subfield code="c">NL</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-19</subfield><subfield code="a">DE-473</subfield><subfield code="a">DE-703</subfield><subfield code="a">DE-29</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">P98.5.P38</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">410/.285</subfield><subfield code="2">22</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ES 940</subfield><subfield code="0">(DE-625)27934:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 306</subfield><subfield code="0">(DE-625)143654:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kübler, Sandra</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Memory-based parsing</subfield><subfield code="c">Sandra Kübler</subfield></datafield><datafield tag="246" ind1="1" ind2="3"><subfield code="a">Memory based parsing</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Amsterdam [u.a.]</subfield><subfield code="b">Benjamins</subfield><subfield code="c">2004</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">VIII, 294 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Natural language processing</subfield><subfield code="v">7</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Literaturverz. S. [268] - 283</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Analyse automatique (Linguistique)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Linguistique informatique</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Lingüística computacional (gramática)</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Parsing</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Computational linguistics</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Parsing (Computer grammar)</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Maschinelles Lernen</subfield><subfield code="0">(DE-588)4193754-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Parser</subfield><subfield code="0">(DE-588)4125056-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Natürliche Sprache</subfield><subfield code="0">(DE-588)4041354-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Syntaktische Analyse</subfield><subfield code="0">(DE-588)4058778-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Natürliche Sprache</subfield><subfield code="0">(DE-588)4041354-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Syntaktische Analyse</subfield><subfield code="0">(DE-588)4058778-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Maschinelles Lernen</subfield><subfield code="0">(DE-588)4193754-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Parser</subfield><subfield code="0">(DE-588)4125056-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Syntaktische Analyse</subfield><subfield code="0">(DE-588)4058778-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Natural language processing</subfield><subfield code="v">7</subfield><subfield code="w">(DE-604)BV013516598</subfield><subfield code="9">7</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=012938652&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-012938652</subfield></datafield></record></collection> |
id | DE-604.BV019608897 |
illustrated | Illustrated |
indexdate | 2024-07-09T20:01:14Z |
institution | BVB |
isbn | 9027249911 1588115909 |
language | English |
lccn | 2004052954 |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-012938652 |
oclc_num | 56420387 |
open_access_boolean | |
owner | DE-19 DE-BY-UBM DE-473 DE-BY-UBG DE-703 DE-29 |
owner_facet | DE-19 DE-BY-UBM DE-473 DE-BY-UBG DE-703 DE-29 |
physical | VIII, 294 S. graph. Darst. |
publishDate | 2004 |
publishDateSearch | 2004 |
publishDateSort | 2004 |
publisher | Benjamins |
record_format | marc |
series | Natural language processing |
series2 | Natural language processing |
spelling | Kübler, Sandra Verfasser aut Memory-based parsing Sandra Kübler Memory based parsing Amsterdam [u.a.] Benjamins 2004 VIII, 294 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Natural language processing 7 Literaturverz. S. [268] - 283 Analyse automatique (Linguistique) Linguistique informatique Lingüística computacional (gramática) larpcal Parsing gtt Computational linguistics Parsing (Computer grammar) Maschinelles Lernen (DE-588)4193754-5 gnd rswk-swf Parser (DE-588)4125056-4 gnd rswk-swf Natürliche Sprache (DE-588)4041354-8 gnd rswk-swf Syntaktische Analyse (DE-588)4058778-2 gnd rswk-swf Natürliche Sprache (DE-588)4041354-8 s Syntaktische Analyse (DE-588)4058778-2 s Maschinelles Lernen (DE-588)4193754-5 s DE-604 Parser (DE-588)4125056-4 s Natural language processing 7 (DE-604)BV013516598 7 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=012938652&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Kübler, Sandra Memory-based parsing Natural language processing Analyse automatique (Linguistique) Linguistique informatique Lingüística computacional (gramática) larpcal Parsing gtt Computational linguistics Parsing (Computer grammar) Maschinelles Lernen (DE-588)4193754-5 gnd Parser (DE-588)4125056-4 gnd Natürliche Sprache (DE-588)4041354-8 gnd Syntaktische Analyse (DE-588)4058778-2 gnd |
subject_GND | (DE-588)4193754-5 (DE-588)4125056-4 (DE-588)4041354-8 (DE-588)4058778-2 |
title | Memory-based parsing |
title_alt | Memory based parsing |
title_auth | Memory-based parsing |
title_exact_search | Memory-based parsing |
title_full | Memory-based parsing Sandra Kübler |
title_fullStr | Memory-based parsing Sandra Kübler |
title_full_unstemmed | Memory-based parsing Sandra Kübler |
title_short | Memory-based parsing |
title_sort | memory based parsing |
topic | Analyse automatique (Linguistique) Linguistique informatique Lingüística computacional (gramática) larpcal Parsing gtt Computational linguistics Parsing (Computer grammar) Maschinelles Lernen (DE-588)4193754-5 gnd Parser (DE-588)4125056-4 gnd Natürliche Sprache (DE-588)4041354-8 gnd Syntaktische Analyse (DE-588)4058778-2 gnd |
topic_facet | Analyse automatique (Linguistique) Linguistique informatique Lingüística computacional (gramática) Parsing Computational linguistics Parsing (Computer grammar) Maschinelles Lernen Parser Natürliche Sprache Syntaktische Analyse |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=012938652&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV013516598 |
work_keys_str_mv | AT kublersandra memorybasedparsing |