A wordnet from the ground up:
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Wrocław
Oficyna Wydawnicza Politechniki Wrocławskiej
2009
|
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | 220 S. graph. Darst. |
ISBN: | 9788374934763 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV036111640 | ||
003 | DE-604 | ||
005 | 20100430 | ||
007 | t | ||
008 | 100408s2009 d||| |||| 00||| eng d | ||
020 | |a 9788374934763 |9 978-83-7493-476-3 | ||
035 | |a (OCoLC)634369870 | ||
035 | |a (DE-599)BVBBV036111640 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-355 | ||
084 | |a KN 2650 |0 (DE-625)79661: |2 rvk | ||
100 | 1 | |a Piasecki, Maciej |e Verfasser |4 aut | |
245 | 1 | 0 | |a A wordnet from the ground up |c Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda |
264 | 1 | |a Wrocław |b Oficyna Wydawnicza Politechniki Wrocławskiej |c 2009 | |
300 | |a 220 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 0 | 7 | |a Lexikologie |0 (DE-588)4114409-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Computerlinguistik |0 (DE-588)4035843-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Polnisch |0 (DE-588)4120314-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Semantik |0 (DE-588)4054490-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Wortschatz |0 (DE-588)4126555-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenbank |0 (DE-588)4011119-2 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Polnisch |0 (DE-588)4120314-8 |D s |
689 | 0 | 1 | |a Wortschatz |0 (DE-588)4126555-5 |D s |
689 | 0 | 2 | |a Datenbank |0 (DE-588)4011119-2 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Polnisch |0 (DE-588)4120314-8 |D s |
689 | 1 | 1 | |a Computerlinguistik |0 (DE-588)4035843-4 |D s |
689 | 1 | 2 | |a Semantik |0 (DE-588)4054490-4 |D s |
689 | 1 | 3 | |a Lexikologie |0 (DE-588)4114409-0 |D s |
689 | 1 | |5 DE-604 | |
700 | 1 | |a Szpakowicz, Stanislaw |e Verfasser |4 aut | |
700 | 1 | |a Broda, Bernd |e Verfasser |4 aut | |
856 | 4 | 2 | |m Digitalisierung UB Regensburg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=019001809&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-019001809 |
Datensatz im Suchindex
_version_ | 1804141213388898304 |
---|---|
adam_text | Contents
Motivation, Goals,
Early Decisions
7
1.1. Motivation................................. 7
1.1.1.
What is a wordnet?
........................ 7
1.1.2.
Princeton WordNet
........................ 7
1.1.3.
The importance of wordnets for language processing
..... 11
1.1.4.
Wordnets out there
........................ 13
1.2.
The Goals of the plWordNet Project
................... 15
1.3.
Early Decisions
.............................. 18
1.3.1.
Models for wordnet development
................ 18
1.3.2.
Why we chose the merge approach
............... 19
Building a Wordnet Core
23
2.1.
The
Synset
................................ 23
2.2.
The Lexico-semantic Relations
...................... 26
2.2.1.
Antonymy
and conversion
.................... 27
2.2.2.
Hyponymy/hypernymy and troponymy
............. 28
2.2.3.
Meronymy/holonymy
....................... 31
2.2.4.
Relatedness, pertainymy and Polish derivation
......... 32
2.2.5.
Fuzzynymy
............................ 34
2.3.
Difficult Cases
.............................. 35
2.4.
The First
7000
Lexical Units
....................... 36
2.5.
The Final State of plWordNet Core
................... 44
Discovering Semantic Relatedness
47
3.1.
Expectations
................................ 47
3.2.
Basic Division: Patterns versus Statistical Mass
............ 48
3.3.
Evaluation
................................. 50
3.3.1.
Wordnet-based synonymy test for Polish
............ 53
3.3.2.
Enhanced WBST
......................... 57
3.4.
Measures of Semantic Relatedness
.................... 61
3.4.1.
The distributional hypothesis and its consequences
....... 61
3.4.2.
Context
and its description
.................... 62
3.4.3.
Preprocessing based on morphosyntactic constraints
...... 65
3.4.4.
Transformation based on rank weighting
............ 72
3.4.5.
Benefits for wordnet construction
................ 77
3.5.
Sense Discovery by Clustering
...................... 87
3.5.1.
Document clustering in sense discovery
............. 88
3.5.2.
Benefits of document clusters for constructing a wordnet
... 91
3.5.3.
Clustering by Committee as an example of word sense discovery
91
3.5.4.
Benefits of discovered senses for constructing a wordnet
... 95
4
Extracting Instances of Semantic Relations
101
4.1.
Lexico-Morphosyntactic Patterns
..................... 102
4.2.
Benefits of Handwritten Patterns for Wordnet Expansion
........ 105
4.3.
Generic Patterns Verified Statistically
.................. 108
4.4.
Benefits of Extracted Patterns for Wordnet Expansion
......... 119
4.5.
Hybrid Combinations: Patterns, Distributional Semantics and Classifiers
130
4.5.1.
Classifiers for lexical-semantic relations
............ 131
4.5.2.
Benefits of classifier-based filtering for wordnet expansion
. . 137
4.5.3.
Multicriteria voting in wordnet expansion
............ 143
4.5.4.
Benefits of weaving the expanded structure
........... 154
5
Polish WordNet Today and Tomorrow
165
5.1.
Weaving the Full-fledged Structure
................... 165
5.2.
plWordNet at Three
............................ 170
5.3.
Lessons Learned
............................. 176
5.4.
What Next?
................................ 181
A Tests for Lexico-semantic Relations
185
Bibliography
191
|
any_adam_object | 1 |
author | Piasecki, Maciej Szpakowicz, Stanislaw Broda, Bernd |
author_facet | Piasecki, Maciej Szpakowicz, Stanislaw Broda, Bernd |
author_role | aut aut aut |
author_sort | Piasecki, Maciej |
author_variant | m p mp s s ss b b bb |
building | Verbundindex |
bvnumber | BV036111640 |
classification_rvk | KN 2650 |
ctrlnum | (OCoLC)634369870 (DE-599)BVBBV036111640 |
discipline | Slavistik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01940nam a2200481 c 4500</leader><controlfield tag="001">BV036111640</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20100430 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">100408s2009 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9788374934763</subfield><subfield code="9">978-83-7493-476-3</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)634369870</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV036111640</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-355</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">KN 2650</subfield><subfield code="0">(DE-625)79661:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Piasecki, Maciej</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A wordnet from the ground up</subfield><subfield code="c">Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Wrocław</subfield><subfield code="b">Oficyna Wydawnicza Politechniki Wrocławskiej</subfield><subfield code="c">2009</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">220 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Lexikologie</subfield><subfield code="0">(DE-588)4114409-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Polnisch</subfield><subfield code="0">(DE-588)4120314-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Semantik</subfield><subfield code="0">(DE-588)4054490-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Wortschatz</subfield><subfield code="0">(DE-588)4126555-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenbank</subfield><subfield code="0">(DE-588)4011119-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Polnisch</subfield><subfield code="0">(DE-588)4120314-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Wortschatz</subfield><subfield code="0">(DE-588)4126555-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Datenbank</subfield><subfield code="0">(DE-588)4011119-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Polnisch</subfield><subfield code="0">(DE-588)4120314-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Computerlinguistik</subfield><subfield code="0">(DE-588)4035843-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="2"><subfield code="a">Semantik</subfield><subfield code="0">(DE-588)4054490-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="3"><subfield code="a">Lexikologie</subfield><subfield code="0">(DE-588)4114409-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Szpakowicz, Stanislaw</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Broda, Bernd</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Regensburg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=019001809&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-019001809</subfield></datafield></record></collection> |
id | DE-604.BV036111640 |
illustrated | Illustrated |
indexdate | 2024-07-09T22:12:18Z |
institution | BVB |
isbn | 9788374934763 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-019001809 |
oclc_num | 634369870 |
open_access_boolean | |
owner | DE-355 DE-BY-UBR |
owner_facet | DE-355 DE-BY-UBR |
physical | 220 S. graph. Darst. |
publishDate | 2009 |
publishDateSearch | 2009 |
publishDateSort | 2009 |
publisher | Oficyna Wydawnicza Politechniki Wrocławskiej |
record_format | marc |
spelling | Piasecki, Maciej Verfasser aut A wordnet from the ground up Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda Wrocław Oficyna Wydawnicza Politechniki Wrocławskiej 2009 220 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Lexikologie (DE-588)4114409-0 gnd rswk-swf Computerlinguistik (DE-588)4035843-4 gnd rswk-swf Polnisch (DE-588)4120314-8 gnd rswk-swf Semantik (DE-588)4054490-4 gnd rswk-swf Wortschatz (DE-588)4126555-5 gnd rswk-swf Datenbank (DE-588)4011119-2 gnd rswk-swf Polnisch (DE-588)4120314-8 s Wortschatz (DE-588)4126555-5 s Datenbank (DE-588)4011119-2 s DE-604 Computerlinguistik (DE-588)4035843-4 s Semantik (DE-588)4054490-4 s Lexikologie (DE-588)4114409-0 s Szpakowicz, Stanislaw Verfasser aut Broda, Bernd Verfasser aut Digitalisierung UB Regensburg application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=019001809&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Piasecki, Maciej Szpakowicz, Stanislaw Broda, Bernd A wordnet from the ground up Lexikologie (DE-588)4114409-0 gnd Computerlinguistik (DE-588)4035843-4 gnd Polnisch (DE-588)4120314-8 gnd Semantik (DE-588)4054490-4 gnd Wortschatz (DE-588)4126555-5 gnd Datenbank (DE-588)4011119-2 gnd |
subject_GND | (DE-588)4114409-0 (DE-588)4035843-4 (DE-588)4120314-8 (DE-588)4054490-4 (DE-588)4126555-5 (DE-588)4011119-2 |
title | A wordnet from the ground up |
title_auth | A wordnet from the ground up |
title_exact_search | A wordnet from the ground up |
title_full | A wordnet from the ground up Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda |
title_fullStr | A wordnet from the ground up Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda |
title_full_unstemmed | A wordnet from the ground up Maciej Piasecki ; Stanisław Szpakowicz ; Bartosz Broda |
title_short | A wordnet from the ground up |
title_sort | a wordnet from the ground up |
topic | Lexikologie (DE-588)4114409-0 gnd Computerlinguistik (DE-588)4035843-4 gnd Polnisch (DE-588)4120314-8 gnd Semantik (DE-588)4054490-4 gnd Wortschatz (DE-588)4126555-5 gnd Datenbank (DE-588)4011119-2 gnd |
topic_facet | Lexikologie Computerlinguistik Polnisch Semantik Wortschatz Datenbank |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=019001809&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT piaseckimaciej awordnetfromthegroundup AT szpakowiczstanislaw awordnetfromthegroundup AT brodabernd awordnetfromthegroundup |