Finding out about: a cognitive perspective on search engine technology and the WWW
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
Cambridge [u.a.]
Cambridge Univ. Press
2008
|
Ausgabe: | 1. paperback ed. |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Originally published: 2000 Includes bibliographical references and index |
Beschreibung: | XXVII, 356 S. Ill. 24 cm |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV036500541 | ||
003 | DE-604 | ||
005 | 20120320 | ||
007 | t | ||
008 | 100614s2008 a||| |||| 00||| eng d | ||
015 | |a GBA870844 |2 dnb | ||
020 | |z 9780521734462 |9 978-0-521-73446-2 | ||
020 | |z 0521734460 |9 0-521-73446-0 | ||
035 | |a (OCoLC)705586829 | ||
035 | |a (DE-599)BVBBV036500541 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-12 |a DE-355 | ||
084 | |a ST 205 |0 (DE-625)143613: |2 rvk | ||
084 | |a 24,1 |2 ssgn | ||
100 | 1 | |a Belew, Richard K. |e Verfasser |0 (DE-588)141866314 |4 aut | |
245 | 1 | 0 | |a Finding out about |b a cognitive perspective on search engine technology and the WWW |c Richard K. Belew |
250 | |a 1. paperback ed. | ||
264 | 1 | |a Cambridge [u.a.] |b Cambridge Univ. Press |c 2008 | |
300 | |a XXVII, 356 S. |b Ill. |c 24 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
500 | |a Originally published: 2000 | ||
500 | |a Includes bibliographical references and index | ||
650 | 4 | |a Search engines / Programming | |
650 | 4 | |a Web search engines | |
650 | 4 | |a World Wide Web / Subject access | |
650 | 0 | 7 | |a World Wide Web |0 (DE-588)4363898-3 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Information Retrieval |0 (DE-588)4072803-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Suchmaschine |0 (DE-588)4423007-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Kognitive Orientierung |0 (DE-588)4134004-8 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a World Wide Web |0 (DE-588)4363898-3 |D s |
689 | 0 | 1 | |a Suchmaschine |0 (DE-588)4423007-2 |D s |
689 | 0 | 2 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Suchmaschine |0 (DE-588)4423007-2 |D s |
689 | 1 | 1 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 1 | 2 | |a Kognitive Orientierung |0 (DE-588)4134004-8 |D s |
689 | 1 | |8 1\p |5 DE-604 | |
856 | 4 | 2 | |m Digitalisierung BSBMuenchen |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020422970&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-020422970 | ||
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk |
Datensatz im Suchindex
_version_ | 1804143064960204800 |
---|---|
adam_text | Contents
Figures
page
xi
Foreword by
C. J.
van
Rijsbergen
XV
Preface
xix
1
Overview
1
1.1
Finding Out About
-
A Cognitive Activity
1
1.1.1
Working within the
IR
Tradition
8
1.2
Keywords
10
1.2.1
Elements of the Query Language
10
1.2.2
Topical Scope
11
1.2.3
Document Descriptors
12
1.3
Query Syntax
13
1.3.1
Query Sessions
14
1.4
Documents
16
1.4.1
Structured Aspects of Documents
19
1.4.2
Corpora
20
1.4.3
Document Proxies
20
1.4.4
Genre
21
1.4.5
Beyond Text
22
1.5
Indexing
26
1.5.1
Automatically Selecting Keywords
27
1.5.2
Computer-Assisted Indexing
28
vi
CONTENTS
1.6
FOA
versus
Database
Retrieval
29
1.7
How Well Are We Doing?
34
1.8
Summary
36
2
Extracting Lexical Features
39
2.1
Building Useful Tools
39
2.2
Interdocument
Parsing
40
2.3
Intradocument Parsing
42
2.3.1
Stemming and Other Morphological Processing
44
2.3.2
Noise Words
47
2.3.3
Summary
48
2.4
Example Corpora
48
2.5
Implementation
50
2.5.1
Basic Algorithm
51
2.5.2
Fine Points
54
2.5.3
Software Libraries
58
3
Weighting and Matching against Indices
60
3.1
Microscopic Semantics and the Statistics of
Communication
60
3.2
Remember Zipf
62
3.2.1
Looking for Meaning in All the Wrong Places
(At the Character Level)
63
3.2.2
Zipf s Own Explanation
64
3.2.3
Benoit
Mandelbrot s Explanation
66
3.2.4
Herbert Simon s Explanation
67
3.2.5
More Recent Zipfian Sightings
68
3.2.6
Summary
71
3.3
A Statistical Basis for Keyword Meaning
71
3.3.1
Lexical Consequences, Internal/External
Perspectives
71
3.3.2
Word Occurrence as
a Poisson
Process
73
3.3.3
Resolving Power
76
3.3.4
Language Distribution
78
3.3.5
Weighting the Index Relation
81
3.3.6
Informative Signals versus Noise Words
83
3.3.7
Inverse Document Frequency
84
3.4
Vector Space
86
CONTENTS
vii
3.4.1 Keyword
Discrimination
88
3.4.2
Vector Length Normalization
89
3.4.3
Summary: SMART Weighting Specification
92
3.5
Matching Queries against Documents
93
3.5.1
Measures of Association
94
3.5.2
Cosine Similarity
95
3.6
Calculating TF-IDF Weighting
96
3.7
Computing Partial Match Scores
97
3.8
Summary
101
4
Assessing the Retrieval
105
4.1
Personal Assessment of Relevance
106
4.1.1
Cognitive Assumptions
106
4.2
Extending the Dialog with RelFbk
109
4.2.1
Using RelFbk for Query Refinement 111
4.2.2
Using RelFbk to Adapt Documents Indices
115
4.2.3
Summary
116
4.3
Aggregated Assessment: Search Engine Performance
116
4.3.1
Underlying Assumptions
116
4.3.2
Consensual Relevance
118
4.3.3
Traditional Evaluation Methodologies
119
4.3.4
Basic Measures
122
4.3.5
Ordering the Retr Set
124
4.3.6
Normalized Recall and Precision
128
4.3.7
Multiple Retrievals across Varying Queries
129
4.3.8
One-Parameter Criteria
132
4.3.9
Test Corpora
135
4.3.10
Other Measures
137
4.4
RAVE: A Relevance Assessment VEhicle
141
4.4.1
RAVeUnlon
141
4.4.2
RAVePlan
142
4.4.3
Interactive RAVE
143
4.4.4
RAVeCompile
145
4.5
Summary
146
5
Mathematical Foundations
149
5.1
Derivation ofZipf s Law for Random Texts
149
5.1.1
Discussion
152
viii CONTENTS
5.2
Dimensionality Reduction
153
5.2.1
A Simple Example 153
5.2.2
Formal Notions of Similarity
155
5.2.3
Singular Value Decomposition
156
5.2.4
How Many Dimensions
к
to Reduce to?
157
5.2.5
Other Uses of Vector Space
158
5.2.6
Computational Considerations
159
5.2.7
Latent Semantic Claims
159
5.3
Preference Relations 161
5.3.1
Multidimensional Scaling
161
5.3.2
Information in RelFbk I63
5.3.3
Connections between MDS and LSI
165
5.4
Clustering I65
5.4.1
The Cluster Hypothesis I65
5.4.2
Clustering Algorithms i66
5.5
Probabilistic Retrieval l67
5.5.1
Probability Ranking Principle
168
5.5.2
Bayesian Inversion
169
5.5.3
Odds Calculation l70
5.5.4
Binary Independence Model
170
5.5.5
Linear Discriminators
173
5.5.6
Cost Analysis I75
5.5.7
Bayesian Networks
175
6
Inference beyond the Index
182
6.1
Citation:
Interdocument
Links
185
6.1.1
Bibliometric Analysis of Science
187
6.1.2
Time Scale
190
6.1.3
Legal Citation
191
6.1.4
Citations and Arguments
193
6.1.5
Analyzing WWW Adjacency
195
6.2
Hypertext, Intradocument Links
199
6.2.1
Footnotes, Hyperfootnotes, and
cf.
200
6.2.2
Hierarchic Containment 2°1
6.2.3
Argument Relations
205
6.2.4
Intra-
versus
Interdocument
Relations
207
6.2.5
Beyond Unary About (k) Predicates
209
6.3
Keyword Structures
210
CONTENTS
ix
6.3.1
Automatic Thesaurus Construction
211
6.3.2
Corpus-Based Linguistics and WordNet
213
6.3.3
Taxonomies
217
6.4
Social Relations among Authors
220
6.4.1
AI
Genealogy
221
6.4.2
An Empirical Foundation for a Philosophy
of Science
223
6.5
Modes of Inference
224
6.5.1
Theorem-Proving Models for Relevance
224
6.5.2
Spreading Activation Search
225
6.5.3
Discovering Latent Knowledge within a Corpus
234
6.6
Deep Interfaces
238
6.6.1
Geographical Hitlists
238
6.7
FOA
(The Law)
242
6.8
FOA
(Evolution)
245
6.9
Text-Based Intelligence
247
7
Adaptive Information Retrieval
252
7.1
Background
252
7.1.1
Training against Manual Indices
254
7.1.2
Alternative Tasks for Learning
255
7.1.3
Sources of Feedback
256
7.2
Building Hypotheses about Documents
258
7.2.1
Feature Selection
259
7.2.2
Hypothesis Spaces
262
7.3
Learning Which Documents to Route
263
7.3.1
Widrow-Hoff
265
7.3.2
User Drift and Event Tracking
266
7.4
Classification
267
7.4.1
Modeling Documents
269
7.4.2
Training a Classifier
271
7.4.3
Priors
272
7.5
Other Approaches to Classification
273
7.5.1
Nearest-Neighbor Matching
273
7.5.2
Boolean Predicates
273
7.5.3
When Irrelevant Attributes Abound
274
7.5.4
Combining Classifiers
275
7.5.5
Hierarchic Classification
277
CONTENTS
7.6
Information-Seeking Agents
279
7.6.1
Exploiting Linkage for Context
279
7.6.2
The InfoSpiders Algorithm
281
7.6.3
Adapting to Spatial Context
284
7.7
Other Learning Applications and Issues
286
7.7.1
Adaptive Lenses
286
7.7.2
Adapting to Fluid Language Use
288
7.8
Symbolic and Subsymbolic Learning
288
8
Conclusions and Future Directions
292
8.1
Things that Are Changing
292
8.1.1
WWW Crawling
294
8.2
Things that Stay the Same
298
8.2.1
The
FOA
Language Game
299
8.2.2 Sperber
and Wilson s Relevance
304
8.2.3
Argument Structures
305
8.2.4
User as Portal
306
8.3
Who Needs to
FOA
307
8.3.1
Authors
308
8.3.2
Scientists
309
8.3.3
The Changing Economics of Publishing
311
8.3.4
Teachers and Students
313
8.4
Summary
316
(Active) Colophon
318
Bibliography
321
Index
347
|
any_adam_object | 1 |
author | Belew, Richard K. |
author_GND | (DE-588)141866314 |
author_facet | Belew, Richard K. |
author_role | aut |
author_sort | Belew, Richard K. |
author_variant | r k b rk rkb |
building | Verbundindex |
bvnumber | BV036500541 |
classification_rvk | ST 205 |
ctrlnum | (OCoLC)705586829 (DE-599)BVBBV036500541 |
discipline | Informatik |
edition | 1. paperback ed. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02217nam a2200541 c 4500</leader><controlfield tag="001">BV036500541</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20120320 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">100614s2008 a||| |||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBA870844</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9780521734462</subfield><subfield code="9">978-0-521-73446-2</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">0521734460</subfield><subfield code="9">0-521-73446-0</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)705586829</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV036500541</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 205</subfield><subfield code="0">(DE-625)143613:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">24,1</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Belew, Richard K.</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)141866314</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Finding out about</subfield><subfield code="b">a cognitive perspective on search engine technology and the WWW</subfield><subfield code="c">Richard K. Belew</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1. paperback ed.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cambridge [u.a.]</subfield><subfield code="b">Cambridge Univ. Press</subfield><subfield code="c">2008</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XXVII, 356 S.</subfield><subfield code="b">Ill.</subfield><subfield code="c">24 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Originally published: 2000</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references and index</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Search engines / Programming</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Web search engines</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">World Wide Web / Subject access</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">World Wide Web</subfield><subfield code="0">(DE-588)4363898-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Suchmaschine</subfield><subfield code="0">(DE-588)4423007-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Kognitive Orientierung</subfield><subfield code="0">(DE-588)4134004-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">World Wide Web</subfield><subfield code="0">(DE-588)4363898-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Suchmaschine</subfield><subfield code="0">(DE-588)4423007-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Suchmaschine</subfield><subfield code="0">(DE-588)4423007-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="2"><subfield code="a">Kognitive Orientierung</subfield><subfield code="0">(DE-588)4134004-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung BSBMuenchen</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020422970&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020422970</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield></record></collection> |
id | DE-604.BV036500541 |
illustrated | Illustrated |
indexdate | 2024-07-09T22:41:44Z |
institution | BVB |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-020422970 |
oclc_num | 705586829 |
open_access_boolean | |
owner | DE-12 DE-355 DE-BY-UBR |
owner_facet | DE-12 DE-355 DE-BY-UBR |
physical | XXVII, 356 S. Ill. 24 cm |
publishDate | 2008 |
publishDateSearch | 2008 |
publishDateSort | 2008 |
publisher | Cambridge Univ. Press |
record_format | marc |
spelling | Belew, Richard K. Verfasser (DE-588)141866314 aut Finding out about a cognitive perspective on search engine technology and the WWW Richard K. Belew 1. paperback ed. Cambridge [u.a.] Cambridge Univ. Press 2008 XXVII, 356 S. Ill. 24 cm txt rdacontent n rdamedia nc rdacarrier Originally published: 2000 Includes bibliographical references and index Search engines / Programming Web search engines World Wide Web / Subject access World Wide Web (DE-588)4363898-3 gnd rswk-swf Information Retrieval (DE-588)4072803-1 gnd rswk-swf Suchmaschine (DE-588)4423007-2 gnd rswk-swf Kognitive Orientierung (DE-588)4134004-8 gnd rswk-swf World Wide Web (DE-588)4363898-3 s Suchmaschine (DE-588)4423007-2 s Information Retrieval (DE-588)4072803-1 s DE-604 Kognitive Orientierung (DE-588)4134004-8 s 1\p DE-604 Digitalisierung BSBMuenchen application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020422970&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk |
spellingShingle | Belew, Richard K. Finding out about a cognitive perspective on search engine technology and the WWW Search engines / Programming Web search engines World Wide Web / Subject access World Wide Web (DE-588)4363898-3 gnd Information Retrieval (DE-588)4072803-1 gnd Suchmaschine (DE-588)4423007-2 gnd Kognitive Orientierung (DE-588)4134004-8 gnd |
subject_GND | (DE-588)4363898-3 (DE-588)4072803-1 (DE-588)4423007-2 (DE-588)4134004-8 |
title | Finding out about a cognitive perspective on search engine technology and the WWW |
title_auth | Finding out about a cognitive perspective on search engine technology and the WWW |
title_exact_search | Finding out about a cognitive perspective on search engine technology and the WWW |
title_full | Finding out about a cognitive perspective on search engine technology and the WWW Richard K. Belew |
title_fullStr | Finding out about a cognitive perspective on search engine technology and the WWW Richard K. Belew |
title_full_unstemmed | Finding out about a cognitive perspective on search engine technology and the WWW Richard K. Belew |
title_short | Finding out about |
title_sort | finding out about a cognitive perspective on search engine technology and the www |
title_sub | a cognitive perspective on search engine technology and the WWW |
topic | Search engines / Programming Web search engines World Wide Web / Subject access World Wide Web (DE-588)4363898-3 gnd Information Retrieval (DE-588)4072803-1 gnd Suchmaschine (DE-588)4423007-2 gnd Kognitive Orientierung (DE-588)4134004-8 gnd |
topic_facet | Search engines / Programming Web search engines World Wide Web / Subject access World Wide Web Information Retrieval Suchmaschine Kognitive Orientierung |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020422970&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT belewrichardk findingoutaboutacognitiveperspectiveonsearchenginetechnologyandthewww |