Information retrieval architecture and algorithms:
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
New York [u.a.]
Springer
2011
|
Schriftenreihe: | Computer science
|
Schlagworte: | |
Online-Zugang: | Inhaltstext Inhaltsverzeichnis |
Beschreibung: | XII, 305 S. Ill., graph. Darst. |
ISBN: | 9781441977151 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV037253708 | ||
003 | DE-604 | ||
005 | 20120319 | ||
007 | t | ||
008 | 110228s2011 ad|| |||| 00||| eng d | ||
020 | |a 9781441977151 |c GB. : ca. EUR 71.64 (freier Pr.), ca. sfr 96.00 (freier Pr.) |9 978-1-4419-7715-1 | ||
024 | 3 | |a 9781441977151 | |
028 | 5 | 2 | |a 80022596 |
035 | |a (OCoLC)711835446 | ||
035 | |a (DE-599)HBZHT016580681 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
049 | |a DE-12 |a DE-355 | ||
084 | |a ST 270 |0 (DE-625)143638: |2 rvk | ||
084 | |a 24,1 |2 ssgn | ||
100 | 1 | |a Kowalski, Gerald |d 1945- |e Verfasser |0 (DE-588)143277235 |4 aut | |
245 | 1 | 0 | |a Information retrieval architecture and algorithms |c Gerald Kowalski |
264 | 1 | |a New York [u.a.] |b Springer |c 2011 | |
300 | |a XII, 305 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Computer science | |
650 | 0 | 7 | |a Suchverfahren |0 (DE-588)4132315-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Information Retrieval |0 (DE-588)4072803-1 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Informationsspeicherung |0 (DE-588)4332067-3 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 0 | 1 | |a Suchverfahren |0 (DE-588)4132315-4 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a Informationsspeicherung |0 (DE-588)4332067-3 |D s |
689 | 1 | 1 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 1 | |8 1\p |5 DE-604 | |
856 | 4 | 2 | |m X:MVB |q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=3520086&prov=M&dok_var=1&dok_ext=htm |3 Inhaltstext |
856 | 4 | 2 | |m Digitalisierung BSB Muenchen |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=021166969&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk | |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-021166969 |
Datensatz im Suchindex
_version_ | 1805095500047712256 |
---|---|
adam_text |
Contents
1 Information
Retrieval
System
Functions
. 1
1.1
Introduction
. 1
1.1.1
Primary
Information
Retrieval
Problems.
З
1.1.2
Objectives of
Information
Retrieval
System. 6
1.2
Functional Overview of Information Retrieval Systems
. 10
1.2.1
Selective Dissemination of Information
. 11
1.2.2
Alerts
. 12
1.2.3
Items and Item Index
. 13
1.2.4
Indexing and Mapping to a Taxonomy
. 13
1.3
Understanding Search Functions
. 14
1.3.1
Boolean Logic
. 15
1.3.2
Proximity
. 16
1.3.3
Contiguous Word Phrases
. 17
1.3.4
Fuzzy Searches
. 18
1.3.5
Term Masking
. 18
1.3.6
Numeric and Date Ranges
. 19
1.3.7
Vocabulary Browse
. 20
1.3.8
Multimedia Search
. 20
1.4
Relationship to Database Management Systems
. 20
1.5
Digital Libraries and Data Warehouses
. 22
1.6
Processing Subsystem Overview
. 24
1.7
Summary
. 25
1.8
Exercises
. 26
2
Data Structures and Mathematical Algorithms
. 27
2.1
Data Structures
. 27
2.1.1
Introduction to Data Structures
. 27
2.1.2
Inverted File Structure
. 29
2.1.3
N-Gram Data Structures
. 31
2.1.4
PAT Data Structure
. 34
2.1.5
Signature File Structure
. 38
x
Contents
2.1.6 Hypertext and XML Data
Structures .
40
2.1.7 XML. 43
2.2
Mathematical Algorithms.
44
2.2.1
Introduction
. 44
2.2.2
Bayesian Mathematics
. 45
2.2.3
Shannon's Theory of Information
. 47
2.2.4
Latent Semantic Indexing
. 48
2.2.5
Hidden Markov Models
. 53
2.2.6
Neural Networks
. 56
2.2.7
Support Vector Machines
. 58
2.3
Summary
. 59
2.4
Exercises
. 60
3
Ingest
. 63
3.1
Introduction to Ingest
. 63
3.2
Item Receipt
. 64
3.3
Duplicate Detection
. 67
3.4
Item Normalization
. 71
3.5
Zoning and Creation of Processing Tokens
. 72
3.6
Stemming
. 76
3.6.1
Introduction to the Stemming Process
. 77
3.6.2
Porter Stemming Algorithm
. 79
3.6.3
Dictionary Look-Up
Stemmers . 80
3.6.4
Successor
Stemmers. 81
3.6.5
Conclusions on Stemming
. 83
3.7
Entity Processing
. 84
3.7.1
Entity Identification
. 85
3.7.2
Entity Normalization
. 86
3.7.3
Entity Resolution
. 86
3.7.4
Information Extraction
. 87
3.8
Categorization
. 88
3.9
Citational Metadata
. 92
3.10
Summary
. 92
3.11
Exercises
. 93
4
Indexing
. 95
4.1
What is Indexing
. 95
4.1.1
History
. 96
4.1.2
Objectives
. 97
4.2
Manual Indexing Process
. 99
4.2.1
Scope of Indexing
. 99
4.2.2
Precoordination and Linkages
. 100
4.3
Automatic Indexing of Text
. 102
4.3.1
Statistical Indexing
. 105
4.3.2
Natural Language
. 120
4.3.3
Concept Indexing
. 125
Contents xj
4.4 Automatic
Indexing
of
Multimedia. 129
4.4.1
Introduction to
Mutlimedia
Indexing.
130
4.4.2 Audio
Indexing
. 131
4.4.3 Image
Indexing
. 134
4.4.4 Video
Indexing
. 136
4.5
Summary
. 137
4.6
Exercises
. 139
5
Search
. 141
5.1
Introduction
. 141
5.2
Similarity Measures and Ranking
. 142
5.2.1
Similarity Measures
. 144
5.3
Hidden Markov Models Techniques
. 152
5.4
Ranking Algorithms
. 153
5.5
Relevance Feedback
. 154
5.6
Selective Dissemination
ofinformation
Search
. 157
5.7
Weighted Searches of Boolean Systems
. 163
5.8
Multimedia Searching
. 167
5.9
Summary
. 168
5.10
Exercises
. 170
6
Document and Term Clustering
. 171
6.1
Introduction to Clustering
. 171
6.2
Thesaurus Generation
. 174
6.2.1
Manual Clustering
. 175
6.2.2
Automatic Term Clustering
. 176
6.3
Item Clustering
. 184
6.4
Hierarchy of Clusters
. 186
6.4.1
Automatic Hierarchical Cluster Algorithms
. 189
6.5
Measure of Tightness for Cluster
. 193
6.6
Issues with Use of Hierarchical Clusters for Search
. 194
6.7
Summary
. 195
6.8
Exercises
. 197
7
Information Presentation
. 199
7.1
Information Presentation Introduction
. 199
7.2
Presentation of the Hits
. 199
7.2.1
Sequential Listing of Hits
. 200
7.2.2
Cluster View
. 201
7.2.3
Network View
. 205
7.2.4
Timeline Presentation
. 208
7.3
Display of the Item
. 210
7.3.1
Indicating Search Terms in Display
. 210
7.3.2
Text Summarization
. 211
7.4
Collaborative Filtering
. 213
7.4.1
Page Ranking as Collaborative Filtering
. 215
xii Contents
7.5 Multimedia
Presentation .
216
7.5.1 Audio
Presentation
. 216
7.5.2 Image
Item Presentation
. 219
7.5.3 Video
Presentation
. 223
7.6 Human
Perception and Presentation
. 225
7.6.1
Introduction to
Information
Visualization.
226
7.6.2
Cognition and Perception
. 229
7.7
Summary
. 233
7.8
Exercises
. 234
8
Search Architecture
. 235
8.1
Index Search Optimization
. 235
8.1.1
Pruning the Index
. 236
8.1.2
Champion Lists
. 236
8.2
Text Search Optimization
. 237
8.2.1
Software Text Search Algorithms
. 239
8.2.2
Hardware Text Search Systems
. 244
8.3
GOOGLE Scalable Multiprocessor Architecture
. 249
8.4
Summary
. 251
8.5
Exercises
. 252
9
Information System Evaluation
. 253
9.1
Introduction to Information System Evaluation
. 253
9.2
Measures Used in System Evaluations
. 259
9.3
Multimedia Information Retrieval Evaluation
. 269
9.4
Measurement Example:
TREC
Evolution
. 271
9.5
Summary
. 279
9.6
Exercises
. 280
Bibliography
. 283
Index
. 301 |
any_adam_object | 1 |
author | Kowalski, Gerald 1945- |
author_GND | (DE-588)143277235 |
author_facet | Kowalski, Gerald 1945- |
author_role | aut |
author_sort | Kowalski, Gerald 1945- |
author_variant | g k gk |
building | Verbundindex |
bvnumber | BV037253708 |
classification_rvk | ST 270 |
ctrlnum | (OCoLC)711835446 (DE-599)HBZHT016580681 |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 c 4500</leader><controlfield tag="001">BV037253708</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20120319</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">110228s2011 ad|| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781441977151</subfield><subfield code="c">GB. : ca. EUR 71.64 (freier Pr.), ca. sfr 96.00 (freier Pr.)</subfield><subfield code="9">978-1-4419-7715-1</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9781441977151</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">80022596</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)711835446</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)HBZHT016580681</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 270</subfield><subfield code="0">(DE-625)143638:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">24,1</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kowalski, Gerald</subfield><subfield code="d">1945-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)143277235</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Information retrieval architecture and algorithms</subfield><subfield code="c">Gerald Kowalski</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">New York [u.a.]</subfield><subfield code="b">Springer</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 305 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Computer science</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Suchverfahren</subfield><subfield code="0">(DE-588)4132315-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Informationsspeicherung</subfield><subfield code="0">(DE-588)4332067-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Suchverfahren</subfield><subfield code="0">(DE-588)4132315-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Informationsspeicherung</subfield><subfield code="0">(DE-588)4332067-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">X:MVB</subfield><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=3520086&prov=M&dok_var=1&dok_ext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung BSB Muenchen</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=021166969&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-021166969</subfield></datafield></record></collection> |
id | DE-604.BV037253708 |
illustrated | Illustrated |
indexdate | 2024-07-20T11:00:16Z |
institution | BVB |
isbn | 9781441977151 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-021166969 |
oclc_num | 711835446 |
open_access_boolean | |
owner | DE-12 DE-355 DE-BY-UBR |
owner_facet | DE-12 DE-355 DE-BY-UBR |
physical | XII, 305 S. Ill., graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Springer |
record_format | marc |
series2 | Computer science |
spelling | Kowalski, Gerald 1945- Verfasser (DE-588)143277235 aut Information retrieval architecture and algorithms Gerald Kowalski New York [u.a.] Springer 2011 XII, 305 S. Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier Computer science Suchverfahren (DE-588)4132315-4 gnd rswk-swf Information Retrieval (DE-588)4072803-1 gnd rswk-swf Informationsspeicherung (DE-588)4332067-3 gnd rswk-swf Information Retrieval (DE-588)4072803-1 s Suchverfahren (DE-588)4132315-4 s DE-604 Informationsspeicherung (DE-588)4332067-3 s 1\p DE-604 X:MVB text/html http://deposit.dnb.de/cgi-bin/dokserv?id=3520086&prov=M&dok_var=1&dok_ext=htm Inhaltstext Digitalisierung BSB Muenchen application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=021166969&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk |
spellingShingle | Kowalski, Gerald 1945- Information retrieval architecture and algorithms Suchverfahren (DE-588)4132315-4 gnd Information Retrieval (DE-588)4072803-1 gnd Informationsspeicherung (DE-588)4332067-3 gnd |
subject_GND | (DE-588)4132315-4 (DE-588)4072803-1 (DE-588)4332067-3 |
title | Information retrieval architecture and algorithms |
title_auth | Information retrieval architecture and algorithms |
title_exact_search | Information retrieval architecture and algorithms |
title_full | Information retrieval architecture and algorithms Gerald Kowalski |
title_fullStr | Information retrieval architecture and algorithms Gerald Kowalski |
title_full_unstemmed | Information retrieval architecture and algorithms Gerald Kowalski |
title_short | Information retrieval architecture and algorithms |
title_sort | information retrieval architecture and algorithms |
topic | Suchverfahren (DE-588)4132315-4 gnd Information Retrieval (DE-588)4072803-1 gnd Informationsspeicherung (DE-588)4332067-3 gnd |
topic_facet | Suchverfahren Information Retrieval Informationsspeicherung |
url | http://deposit.dnb.de/cgi-bin/dokserv?id=3520086&prov=M&dok_var=1&dok_ext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=021166969&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT kowalskigerald informationretrievalarchitectureandalgorithms |