Text retrieval and filtering: analytic models of performance
Main author: | Losee, Robert M. |
---|---|
Format: | Book |
Language: | English |
Published: | Boston [etc.]: Kluwer, 1998 |
Series: | The Kluwer international series on information retrieval, 3 |
Subjects: | |
Online access: | Table of contents Inhaltsverzeichnis |
Description: | X, 242 pages, diagrams |
ISBN: | 0792381777 |
Internal format
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV012151256 | ||
003 | DE-604 | ||
005 | 20150211 | ||
007 | t | ||
008 | 980914s1998 xxud||| |||| 00||| eng d | ||
020 | |a 0792381777 |9 0-7923-8177-7 | ||
035 | |a (OCoLC)38989903 | ||
035 | |a (DE-599)BVBBV012151256 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a xxu |c XD-US | ||
049 | |a DE-739 |a DE-384 |a DE-29 |a DE-521 |a DE-525 | ||
050 | 0 | |a QA76.9.T48L67 1998 | |
082 | 0 | |a 005 21 | |
082 | 0 | |a 005 |2 21 | |
084 | |a AN 95000 |0 (DE-625)6793: |2 rvk | ||
084 | |a ST 306 |0 (DE-625)143654: |2 rvk | ||
100 | 1 | |a Losee, Robert M. |e Verfasser |4 aut | |
245 | 1 | 0 | |a Text retrieval and filtering |b analytic models of performance |c by Robert M. Losee |
264 | 1 | |a Boston [u.a.] |b Kluwer |c 1998 | |
300 | |a X, 242 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a The Kluwer international series on information retrieval |v 3 | |
650 | 7 | |a Information storage and retrieval |2 gtt | |
650 | 4 | |a Traitement de texte | |
650 | 4 | |a Text processing (Computer science) | |
650 | 4 | |a Natural language processing (Computer science) | |
650 | 0 | 7 | |a Freitextsuche |0 (DE-588)4242783-6 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Freitextsuche |0 (DE-588)4242783-6 |D s |
689 | 0 | |5 DE-604 | |
830 | 0 | |a The Kluwer international series on information retrieval |v 3 |w (DE-604)BV011555884 |9 3 | |
856 | 4 | |u http://lcweb.loc.gov/catdir/toc/98-23431.html |3 Table of contents | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008230264&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-008230264 |
Record in the search index
_version_ | 1804126760645689344 |
---|---|
adam_text | Contents
Preface ix
Acknowledgments xiii
1. INTRODUCTION 1
1.1 Text Filtering and Retrieval 1
1.2 Systems and Experiments 3
1.3 Analytic Methods 12
1.4 Questions Needing Answers 16
1.5 The Structure of this Book 17
2. QUANTITATIVE REASONING 19
2.1 Introduction 19
2.2 Distributions of Random Variables 23
2.3 Inference and Conditional Probabilities 28
2.4 Estimating Parameters 32
2.5 Bayesian Methods 34
2.6 Other Quantitative Reasoning Systems 37
2.7 Summary 41
3. SIMILARITY AND RETRIEVAL DECISIONS 43
3.1 Characteristics of Similarity and Distance Measures 45
3.2 Distance 48
3.3 Similarity Measures for Nominal Data 50
3.4 Similarity Excluding Joint Absences 54
3.5 Boolean Retrieval 57
3.6 Angular Distance and Vector Retrieval 60
3.7 Probabilistic Retrieval 62
3.8 The Bayesian Learning Model 68
3.9 Related Weights 71
3.10 Upper Bounds 72
3.11 Browsing 73
3.12 Summary 75
4. MEASURING PERFORMANCE 77
4.1 Retrospective Performance Measures 79
4.2 Single Number Measures 85
4.3 Average Search Length (ASL) 89
4.4 Summary 92
5. THE QUALITY OF A RANKING METHOD 93
5.1 Introduction 93
5.2 Degree of Optimality for Specific Models 96
5.3 Relevance Feedback 101
5.4 Comparing Ranking under Different Levels of Knowledge 103
5.5 A General Model of Ranking Performance 105
5.6 Existence and Construction of Ranking Procedures 107
5.7 Summary 109
6. PERFORMANCE WITH ONE TERM 111
6.1 Introduction 111
6.2 Computing the ASL 114
6.3 A General Theory of Performance 117
6.4 Continuous Feature Distributions 119
6.5 Discrete Term Frequencies 120
6.6 A, ASL, and Traditional Performance Measures 121
6.7 The Effect of Parameter Values 125
6.8 Summary 127
7. MULTIVARIATE PROBABILITIES 129
7.1 Multivariate Binary Distributions 130
7.2 A Matrix Model of Multivariate Binary Data 131
7.3 A Binary Multivariate Expansion 135
7.4 Tree-Based Dependence 136
7.5 Bayesian Networks 138
7.6 Mutual Information 139
7.7 Logistic Models 140
7.8 Multivariate Normal Distribution 141
7.9 Term Sequences and Non-Stationary Processes 144
7.10 Empirical methods 146
7.11 Reducing Dimensionality 146
7.12 Discussion and Summary 149
8. PERFORMANCE WITH MULTIPLE TERMS 151
8.1 Introduction 151
8.2 ASL 153
8.3 Differing A values 155
8.4 Computing Q 156
8.5 Understanding Ranking with Term Dependencies 159
8.6 A Case Study 162
8.7 Performance Assuming Binary Independence 164
8.8 Different Dependence Assumptions 165
8.9 Query Length and Expansion 166
8.10 Browsing 169
8.11 Summary 169
9. LOGICS AND RULES 171
9.1 Introduction 171
9.2 Conditional Statements 174
9.3 Modal Logic 176
9.4 Temporal Logics 181
9.5 Logic and Probability 183
9.6 Filtering 187
9.7 Logic and Quantities 190
9.8 Relations and Fact Retrieval 191
9.9 Three-Valued Logics 193
9.10 Non-monotonic Reasoning 198
9.11 Summary 201
10. LINGUISTIC KNOWLEDGE 203
10.1 Introduction 203
10.2 Tagging and Suffix Stripping with One Term 205
10.3 Multiple Tags per Term Type 213
10.4 Controlled Vocabularies 214
10.5 Multiple Terms and Grammatical Structures 217
10.6 The Structure of Statements 219
10.7 Performance with Multiple Syntactic Tags: A Case Study 222
10.8 Evaluating Grammar Quality with Retrieval Performance 223
10.9 Discussion 226
Bibliography 227
Index 239
|
any_adam_object | 1 |
author | Losee, Robert M. |
author_facet | Losee, Robert M. |
author_role | aut |
author_sort | Losee, Robert M. |
author_variant | r m l rm rml |
building | Verbundindex |
bvnumber | BV012151256 |
callnumber-first | Q - Science |
callnumber-label | QA76 |
callnumber-raw | QA76.9.T48L67 1998 |
callnumber-search | QA76.9.T48L67 1998 |
callnumber-sort | QA 276.9 T48 L67 41998 |
callnumber-subject | QA - Mathematics |
classification_rvk | AN 95000 ST 306 |
ctrlnum | (OCoLC)38989903 (DE-599)BVBBV012151256 |
dewey-full | 00521 005 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 005 - Computer programming, programs, data, security |
dewey-raw | 005 21 005 |
dewey-search | 005 21 005 |
dewey-sort | 15 221 |
dewey-tens | 000 - Computer science, information, general works |
discipline | General, Computer science |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01816nam a2200457 cb4500</leader><controlfield tag="001">BV012151256</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20150211 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">980914s1998 xxud||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0792381777</subfield><subfield code="9">0-7923-8177-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)38989903</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV012151256</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxu</subfield><subfield code="c">XD-US</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-739</subfield><subfield code="a">DE-384</subfield><subfield code="a">DE-29</subfield><subfield code="a">DE-521</subfield><subfield code="a">DE-525</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QA76.9.T48L67 1998</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005 21</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005</subfield><subfield code="2">21</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">AN 95000</subfield><subfield code="0">(DE-625)6793:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 306</subfield><subfield code="0">(DE-625)143654:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" 
"><subfield code="a">Losee, Robert M.</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Text retrieval and filtering</subfield><subfield code="b">analytic models of performance</subfield><subfield code="c">by Robert M. Losee</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Boston [u.a.]</subfield><subfield code="b">Kluwer</subfield><subfield code="c">1998</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">X, 242 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">The Kluwer international series on information retrieval</subfield><subfield code="v">3</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Information storage and retrieval</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Traitement de texte</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Text processing (Computer science)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Natural language processing (Computer science)</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Freitextsuche</subfield><subfield code="0">(DE-588)4242783-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Freitextsuche</subfield><subfield 
code="0">(DE-588)4242783-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">The Kluwer international series on information retrieval</subfield><subfield code="v">3</subfield><subfield code="w">(DE-604)BV011555884</subfield><subfield code="9">3</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://lcweb.loc.gov/catdir/toc/98-23431.html</subfield><subfield code="3">Table of contents</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&amp;doc_library=BVB01&amp;local_base=BVB01&amp;doc_number=008230264&amp;sequence=000002&amp;line_number=0001&amp;func_code=DB_RECORDS&amp;service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-008230264</subfield></datafield></record></collection> |
id | DE-604.BV012151256 |
illustrated | Illustrated |
indexdate | 2024-07-09T18:22:35Z |
institution | BVB |
isbn | 0792381777 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-008230264 |
oclc_num | 38989903 |
open_access_boolean | |
owner | DE-739 DE-384 DE-29 DE-521 DE-525 |
owner_facet | DE-739 DE-384 DE-29 DE-521 DE-525 |
physical | X, 242 S. graph. Darst. |
publishDate | 1998 |
publishDateSearch | 1998 |
publishDateSort | 1998 |
publisher | Kluwer |
record_format | marc |
series | The Kluwer international series on information retrieval |
series2 | The Kluwer international series on information retrieval |
spelling | Losee, Robert M. Verfasser aut Text retrieval and filtering analytic models of performance by Robert M. Losee Boston [u.a.] Kluwer 1998 X, 242 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier The Kluwer international series on information retrieval 3 Information storage and retrieval gtt Traitement de texte Text processing (Computer science) Natural language processing (Computer science) Freitextsuche (DE-588)4242783-6 gnd rswk-swf Freitextsuche (DE-588)4242783-6 s DE-604 The Kluwer international series on information retrieval 3 (DE-604)BV011555884 3 http://lcweb.loc.gov/catdir/toc/98-23431.html Table of contents HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008230264&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Losee, Robert M. Text retrieval and filtering analytic models of performance The Kluwer international series on information retrieval Information storage and retrieval gtt Traitement de texte Text processing (Computer science) Natural language processing (Computer science) Freitextsuche (DE-588)4242783-6 gnd |
subject_GND | (DE-588)4242783-6 |
title | Text retrieval and filtering analytic models of performance |
title_auth | Text retrieval and filtering analytic models of performance |
title_exact_search | Text retrieval and filtering analytic models of performance |
title_full | Text retrieval and filtering analytic models of performance by Robert M. Losee |
title_fullStr | Text retrieval and filtering analytic models of performance by Robert M. Losee |
title_full_unstemmed | Text retrieval and filtering analytic models of performance by Robert M. Losee |
title_short | Text retrieval and filtering |
title_sort | text retrieval and filtering analytic models of performance |
title_sub | analytic models of performance |
topic | Information storage and retrieval gtt Traitement de texte Text processing (Computer science) Natural language processing (Computer science) Freitextsuche (DE-588)4242783-6 gnd |
topic_facet | Information storage and retrieval Traitement de texte Text processing (Computer science) Natural language processing (Computer science) Freitextsuche |
url | http://lcweb.loc.gov/catdir/toc/98-23431.html http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008230264&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV011555884 |
work_keys_str_mv | AT loseerobertm textretrievalandfilteringanalyticmodelsofperformance |