Improving the feasibility of precision-oriented HPSG parsing:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Abschlussarbeit Buch |
Sprache: | English |
Veröffentlicht: |
Saarbrücken
German Research Center for Artifical Intelligence
2011
Saarbrücken Saarland Univ., Department of Computational Linguistics and Phonetics |
Schriftenreihe: | Saarbrücken dissertations in computational linguistics and language technology
35 |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | X, 169 S. graph. Darst. |
ISBN: | 9783933218346 |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV039780838 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | t | ||
008 | 111230s2011 d||| m||| 00||| eng d | ||
020 | |a 9783933218346 |9 978-3-933218-34-6 | ||
035 | |a (OCoLC)767954743 | ||
035 | |a (DE-599)BSZ35363929X | ||
040 | |a DE-604 |b ger | ||
041 | 0 | |a eng | |
049 | |a DE-83 | ||
100 | 1 | |a Cramer, Bart |d 1982- |e Verfasser |0 (DE-588)1017381178 |4 aut | |
245 | 1 | 0 | |a Improving the feasibility of precision-oriented HPSG parsing |c Bart Cramer |
264 | 1 | |a Saarbrücken |b German Research Center for Artifical Intelligence |c 2011 | |
264 | 1 | |a Saarbrücken |b Saarland Univ., Department of Computational Linguistics and Phonetics | |
300 | |a X, 169 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Saarbrücken dissertations in computational linguistics and language technology |v 35 | |
502 | |a Zugl.: Saarbrücken, Univ., Diss., 2011 | ||
650 | 0 | 7 | |a Deutsch |0 (DE-588)4113292-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Syntaktische Analyse |0 (DE-588)4058778-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Head-driven phrase structure grammar |0 (DE-588)4299529-2 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4113937-9 |a Hochschulschrift |2 gnd-content | |
689 | 0 | 0 | |a Head-driven phrase structure grammar |0 (DE-588)4299529-2 |D s |
689 | 0 | 1 | |a Deutsch |0 (DE-588)4113292-0 |D s |
689 | 0 | 2 | |a Syntaktische Analyse |0 (DE-588)4058778-2 |D s |
689 | 0 | |5 DE-604 | |
830 | 0 | |a Saarbrücken dissertations in computational linguistics and language technology |v 35 |w (DE-604)BV013075694 |9 35 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024641673&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-024641673 |
Datensatz im Suchindex
_version_ | 1804148701020553216 |
---|---|
adam_text | IMAGE 1
CONTENTS
CONTENTS
1 INTRODUCTION 1
1 .1 THESIS OUTLINE 2
2 BACKGROUND 5
2.1 PARSING AND GRAMMARS 5
2.1.1 DEEPER ANALYSIS OF LANGUAGE 7
2.1.2 HEAD-DRIVEN PHRASE STRUCTURE GRAMMAR 8
2.1.3 A BASIC PARSING ALGORITHM 13
2.1.4 A TAXONOMY OF PARSING RESEARCH . . 14
2.1.5 EVALUATION METRICS 17
2.2 HAND-CRAFTED DEEP GRAMMARS 18
2.2.1 THE ENGLISH RESOURCE GRAMMAR 19
2.2.2 THE PARGRAM PARSER FOR ENGLISH 21
2.2.3 THE RASP PARSER 22
2.2.4 THE ALPINO PARSER 22
2.2.5 OTHER HAND-WRITTEN GRAMMARS 23
2.3 TREEBANKS 24
2.3.1 STRATEGIES FOR EFFICIENT ANNOTATION 25
2.4 USING TREEBANKS TO CREATE GRAMMARS 27
2.4.1 DEEP GRAMMAR EXTRACTION FOR ENGLISH 28
2.4.2 DEEP GRAMMAR EXTRACTION FOR GERMAN 32
2.4.3 EVALUATION ISSUES 33
2.5 THE INTERPLAY BETWEEN GRAMMAR AND PARSER 34
2.5.1 ROBUSTNESS METHODS 34
2.5.2 SEARCH SPACE RESTRICTION 37
2.6 ANATOMY OF A DELPH-IN PARSER 39
2.6.1 THE GRAMMAR 39
2.6.2 THE PET PARSER 41
2.7 MOTIVATION 45
3 CORE GRAMMAR CONSTRUCTION 49
3.1 THE GERMAN LANGUAGE 49
3.2 HPSG ANALYSES OF GERMAN 54
3.2.1 THE HEAD-CLUSTER SCHEMA AND ARGUMENT ATTRACTION . 54 3.2.2 A
FRONTING ANALYSIS 55
3.2.3 COMPLEMENT EXTRAPOSITION 57
3.2.4 ADJUNCT EXTRAPOSITION 58
3.3 IMPLEMENTING A CORE GRAMMAR FOR GERMAN 60
BIBLIOGRAFISCHE INFORMATIONEN HTTP://D-NB.INFO/1018530517
DIGITALISIERT DURCH
IMAGE 2
VIII CONTENTS
3.3.1 BASIC BUILDING BLOCKS 60
3.3.2 LEXICAL TYPES 63
3.3.3 THE CORE LEXICON 64
3.3.4 SEMANTICS VS SYNTACTIC DEPENDENCIES 65
3.3.5 MORPHOLOGY 67
3.3.6 HPSG SCHEMATA AND TOPOLOGICAL FIELDS 67
3.3.7 COORDINATIONS 72
3.4 SUMMARY 73
4 CREATION OF A DEEP LEXICON 75
4.1 INTRODUCTION 75
4.2 THE TIGER TREEBANK 78
4.2.1 PREPROCESSING THE TREEBANK 79
4.3 ACQUISITION OF THE LEXICON 80
4.3.1 SYNTACTIC PROPERTIES 80
4.3.2 MORPHOLOGY 82
4.4 THE RESULTING LEXICON 83
4.5 SUMMARY 89
5 LEVERAGE OF THE GOLD STANDARD 90
5.1 COMPARING PARSING OUTPUT WITH THE GOLD STANDARD 90 5.1.1 EXTRACTING
THE DEPENDENCIES FROM THE TREEBANK 90 5.1.2 ROLE IDENTIFICATION IN THE
PREDICATE 93
5.2 UNIT TESTING 94
5.3 AUTOMATIC CREATION OF A DYNAMIC TREEBANK 96
5.3.1 METHODOLOGY 96
5.3.2 RESULTS 98
5.4 PARSING UNSEEN TEXT 101
5.4.1 OPTIMISING THE DISAMBIGUATION MODEL 102
5.4.2 EVALUATION ON THE TEST SET 104
5.5 SUMMARY 107
INTERLUDE 109
6 AGENDA-BASED TASK PRUNING 118
6.1 INTRODUCTION 118
6.2 TASK-BASED SEARCH SPACE RESTRICTION 121
6.2.1 PRIORITISING PARSER TASKS 121
6.2.2 TASK PRUNING STRATEGIES 124
6.3 EXPERIMENTS 126
6.3.1 FIND THE SAME SOLUTION FASTER 126
IMAGE 3
CONTENTS IX
6.3.2 SCOPE OF PRUNING AND COUNTING STRATEGIES 127
6.3.3 ADJUSTING GLOBAL PRIORITIES FOR SPAN LENGTH 128 6.3.4 CONDITIONING
ON TREE LEAVES 133
6.4 EVALUATION 136
6.4.1 WHAT IS PRUNED? 136
6.5 DIRECTIONS FOR FUTURE RESEARCH 137
6.6 SUMMARY 138
7 IMPROVING PARSER ROBUSTNESS 140
7.1 FRAGMENT PARSING 140
7.2 ROBUSTNESS RULES 142
7.2.1 MOTIVATION 143
7.2.2 RESTRICTING AND DISPREFERRING ROBUSTNESS RULES 144 7.2.3 DEFINING
ROBUSTNESS RULES 145
7.2.4 EXPERIMENTS 146
7.2.5 WHAT THE MODEL PREDICTS 150
7.3 SUMMARY 151
7.3.1 FUTURE WORK 151
8 CONCLUSION 153
8.1 DIRECTIONS FOR FUTURE WORK 155
A TEST SUITE 156
|
any_adam_object | 1 |
author | Cramer, Bart 1982- |
author_GND | (DE-588)1017381178 |
author_facet | Cramer, Bart 1982- |
author_role | aut |
author_sort | Cramer, Bart 1982- |
author_variant | b c bc |
building | Verbundindex |
bvnumber | BV039780838 |
ctrlnum | (OCoLC)767954743 (DE-599)BSZ35363929X |
format | Thesis Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01897nam a2200409 cb4500</leader><controlfield tag="001">BV039780838</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">111230s2011 d||| m||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783933218346</subfield><subfield code="9">978-3-933218-34-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)767954743</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BSZ35363929X</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-83</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Cramer, Bart</subfield><subfield code="d">1982-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1017381178</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Improving the feasibility of precision-oriented HPSG parsing</subfield><subfield code="c">Bart Cramer</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Saarbrücken</subfield><subfield code="b">German Research Center for Artifical Intelligence</subfield><subfield code="c">2011</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Saarbrücken</subfield><subfield code="b">Saarland Univ., Department of Computational Linguistics and Phonetics</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 169 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Saarbrücken dissertations in computational linguistics and language technology</subfield><subfield code="v">35</subfield></datafield><datafield tag="502" ind1=" " ind2=" "><subfield code="a">Zugl.: Saarbrücken, Univ., Diss., 2011</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Deutsch</subfield><subfield code="0">(DE-588)4113292-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Syntaktische Analyse</subfield><subfield code="0">(DE-588)4058778-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Head-driven phrase structure grammar</subfield><subfield code="0">(DE-588)4299529-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4113937-9</subfield><subfield code="a">Hochschulschrift</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Head-driven phrase structure grammar</subfield><subfield code="0">(DE-588)4299529-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Deutsch</subfield><subfield code="0">(DE-588)4113292-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Syntaktische Analyse</subfield><subfield code="0">(DE-588)4058778-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Saarbrücken dissertations in computational linguistics and language technology</subfield><subfield code="v">35</subfield><subfield code="w">(DE-604)BV013075694</subfield><subfield code="9">35</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024641673&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-024641673</subfield></datafield></record></collection> |
genre | (DE-588)4113937-9 Hochschulschrift gnd-content |
genre_facet | Hochschulschrift |
id | DE-604.BV039780838 |
illustrated | Illustrated |
indexdate | 2024-07-10T00:11:19Z |
institution | BVB |
isbn | 9783933218346 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-024641673 |
oclc_num | 767954743 |
open_access_boolean | |
owner | DE-83 |
owner_facet | DE-83 |
physical | X, 169 S. graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | German Research Center for Artifical Intelligence Saarland Univ., Department of Computational Linguistics and Phonetics |
record_format | marc |
series | Saarbrücken dissertations in computational linguistics and language technology |
series2 | Saarbrücken dissertations in computational linguistics and language technology |
spelling | Cramer, Bart 1982- Verfasser (DE-588)1017381178 aut Improving the feasibility of precision-oriented HPSG parsing Bart Cramer Saarbrücken German Research Center for Artifical Intelligence 2011 Saarbrücken Saarland Univ., Department of Computational Linguistics and Phonetics X, 169 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Saarbrücken dissertations in computational linguistics and language technology 35 Zugl.: Saarbrücken, Univ., Diss., 2011 Deutsch (DE-588)4113292-0 gnd rswk-swf Syntaktische Analyse (DE-588)4058778-2 gnd rswk-swf Head-driven phrase structure grammar (DE-588)4299529-2 gnd rswk-swf (DE-588)4113937-9 Hochschulschrift gnd-content Head-driven phrase structure grammar (DE-588)4299529-2 s Deutsch (DE-588)4113292-0 s Syntaktische Analyse (DE-588)4058778-2 s DE-604 Saarbrücken dissertations in computational linguistics and language technology 35 (DE-604)BV013075694 35 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024641673&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Cramer, Bart 1982- Improving the feasibility of precision-oriented HPSG parsing Saarbrücken dissertations in computational linguistics and language technology Deutsch (DE-588)4113292-0 gnd Syntaktische Analyse (DE-588)4058778-2 gnd Head-driven phrase structure grammar (DE-588)4299529-2 gnd |
subject_GND | (DE-588)4113292-0 (DE-588)4058778-2 (DE-588)4299529-2 (DE-588)4113937-9 |
title | Improving the feasibility of precision-oriented HPSG parsing |
title_auth | Improving the feasibility of precision-oriented HPSG parsing |
title_exact_search | Improving the feasibility of precision-oriented HPSG parsing |
title_full | Improving the feasibility of precision-oriented HPSG parsing Bart Cramer |
title_fullStr | Improving the feasibility of precision-oriented HPSG parsing Bart Cramer |
title_full_unstemmed | Improving the feasibility of precision-oriented HPSG parsing Bart Cramer |
title_short | Improving the feasibility of precision-oriented HPSG parsing |
title_sort | improving the feasibility of precision oriented hpsg parsing |
topic | Deutsch (DE-588)4113292-0 gnd Syntaktische Analyse (DE-588)4058778-2 gnd Head-driven phrase structure grammar (DE-588)4299529-2 gnd |
topic_facet | Deutsch Syntaktische Analyse Head-driven phrase structure grammar Hochschulschrift |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024641673&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV013075694 |
work_keys_str_mv | AT cramerbart improvingthefeasibilityofprecisionorientedhpsgparsing |