Link analysis: an information science approach
Main Author: | Thelwall, Mike |
---|---|
Format: | Book |
Language: | English |
Published: | Amsterdam [etc.]: Elsevier, 2004 |
Series: | Library and information science |
Subjects: | World Wide Web; Hyperlink; Search engine; Information retrieval |
Online Access: | Table of contents |
Description: | XII, 269 pages, diagrams |
ISBN: | 0120885530 |
Internal format
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV019739004 | ||
003 | DE-604 | ||
005 | 20120730 | ||
007 | t | ||
008 | 050316s2004 d||| |||| 00||| eng d | ||
020 | |a 0120885530 |9 0-12-088553-0 | ||
035 | |a (OCoLC)255628749 | ||
035 | |a (DE-599)BVBBV019739004 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-91G |a DE-M382 |a DE-19 |a DE-355 | ||
084 | |a ST 270 |0 (DE-625)143638: |2 rvk | ||
084 | |a DAT 825f |2 stub | ||
084 | |a 24,1 |2 ssgn | ||
084 | |a DAT 616f |2 stub | ||
084 | |a INF 191f |2 stub | ||
100 | 1 | |a Thelwall, Mike |e Verfasser |4 aut | |
245 | 1 | 0 | |a Link analysis |b an information science approach |c Mike Thelwall |
264 | 1 | |a Amsterdam [u.a.] |b Elsevier |c 2004 | |
300 | |a XII, 269 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Library and information science | |
650 | 0 | 7 | |a World Wide Web |0 (DE-588)4363898-3 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Hyperlink |0 (DE-588)4617682-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Suchmaschine |0 (DE-588)4423007-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Information Retrieval |0 (DE-588)4072803-1 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a World Wide Web |0 (DE-588)4363898-3 |D s |
689 | 0 | 1 | |a Suchmaschine |0 (DE-588)4423007-2 |D s |
689 | 0 | |5 DE-604 | |
689 | 1 | 0 | |a World Wide Web |0 (DE-588)4363898-3 |D s |
689 | 1 | 1 | |a Hyperlink |0 (DE-588)4617682-2 |D s |
689 | 1 | 2 | |a Information Retrieval |0 (DE-588)4072803-1 |D s |
689 | 1 | |5 DE-604 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=013065742&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-013065742 |
Record in the search index
_version_ | 1804133204368556032 |
---|---|
adam_text | Introduction v
Link Analysis: An Information Science Approach
Part I: Theory 1
1 1
Introduction 1
Objectives 1
Link analysis 1
Historical overview 2
What is the information science approach to link analysis? 3
Contents and structure 4
Key terminology 5
Summary 6
Further reading 6
References 7
2 9
Web crawlers and search engines 9
Objectives 9
Introduction 9
Web crawlers 9
Finding pages 11
Content crawling vs. URL crawling 11
Content crawling 14
Obscured links 14
Depth and other arbitrary limitations 15
Automatically generated pages 15
Ethical issues and robots.txt 17
The webpage 17
Web crawling summary 18
Search engines 18
Known biases 19
Search engine ranking 20
The Internet Archive 20
Summary 20
Further reading 21
References 21
3 23
The theoretical perspective for link counting 23
Objectives 23
Introduction 23
The theoretical perspective for link counting 23
Anomalies 24
Manual filtering and banned lists 26
Alternative Document Models 27
Web sites and web documents 27
ADMs and Standard ADM counting 29
ADM range counting models 30
Choosing link counting strategies 31
Summary 32
Further reading 32
References 33
4 35
Interpreting link counts: Random samples and correlations 35
Objectives 35
Introduction 35
Interpreting link counts 35
The pilot feasibility and validity study 37
Full scale random sampling 38
Confidence limits for categories 40
Correlation testing 41
Literature review 43
Summary 43
Further reading 43
References 44
Part II: Web structure 47
5 47
Link structures in the web graph 47
Objectives 47
Introduction 47
Power laws in the web 48
Models of web growth 50
Link topologies 52
Power laws and link topologies in academic webs 54
Summary 55
Further reading 56
References 56
6 59
The content structure of the web 59
Objectives 59
Introduction 59
The topic structure of the web 60
A link content web growth model 61
Link text 62
The subject structure of academic webs 62
Colinks 66
Summary 66
Further reading 67
References 67
Part III: Academic links 69
7 69
Universities: Link types 69
Objectives 69
Introduction 69
Citation analysis 69
The role of a university web site 70
National systems of university web sites 71
Page types 72
Link types 75
Summary 77
Further reading 78
References 78
8 81
Universities: Link models 81
Objectives 81
Introduction 81
The relationship between inlinks and research 81
Academic linking: Quality vs. quantity 84
Alternative logical linking models 86
Mathematical models 87
The influence of geography 88
Regional groupings 89
Summary 91
References 91
9 93
Universities: International links 93
Objectives 93
Introduction 93
National vs. international links 94
International linking comparisons 95
Linguistic influences 96
Summary 98
Further reading 99
References 99
10 101
Departments and subjects 101
Objectives 101
Introduction 101
Departmental web sites 102
Disciplinary differences in link types 103
Issues of scale and correlation tests 104
Country 105
Subject 105
Outcome 105
Geographic and international factors 106
Summary 106
Further reading 107
References 107
11 109
Journals and articles 109
Objectives 109
Introduction 109
Journal Impact Factors 109
Journal web sites 110
Journal web site inlinks: Issues 111
Journal web site inlinks: Case study 112
Types of links in journal articles 113
Digital library links 114
Combined link and log file analysis 114
Related research topics 115
Summary 116
Further reading 116
References 116
Part IV: Applications 119
12 119
Search engines and web design 119
Objectives 119
Introduction 119
Link structures and crawler coverage 119
Text in web sites and the Vector Space Model 120
The PageRank algorithm 121
Case study: PageRank calculations for a gateway site 124
HITS 127
HITS worked example 128
Summary: Web site design for PageRank and HITS 131
Further reading 132
Appendix: the Vector Space Model 133
References 134
13 137
A health check for Spanish universities 137
Objective 137
Introduction 137
Research questions 137
Methods 138
Results and discussion 138
Conclusion 144
References 144
14 145
Personal web pages linking to universities 145
Objectives 145
Introduction 145
Web publishing and personal home pages 146
Research questions 147
Methods 148
Data collection 148
Data analysis 149
Results 151
ISP bias test 151
ADM fitting 152
Correlations between links and research ratings 153
A comparison of university and home page link sources 154
Individual page categorizations 155
Conclusion 158
Meta conclusions 159
Acknowledgement 159
References 160
15 163
Academic networks 163
Objectives 163
Introduction 163
Methods 163
University Sitemaps 164
National academic web maps 168
Subject maps 170
Summary 171
Further reading 171
References 172
16 173
Business web sites 173
Objectives 173
Introduction 173
Site coverage checks 173
Site indexing and ranking checks 174
Competitive intelligence 174
Case study 175
Center Parcs 176
Hoseasons 176
Butlins 177
Pontins 178
Haven Holidays 178
General queries 179
Summary 179
Further reading 180
References 180
Part V: Tools and techniques 181
17 181
Using commercial search engines and the Internet Archive 181
Objectives 181
Introduction 181
Checking results 182
Dealing with variations in results 183
Using multiple search engines 184
Using the Internet Archive 184
Summary 185
Online resources 185
Further reading 186
References 186
18 189
Personal crawlers 189
Objectives 189
Introduction 189
Types of personal crawler 189
SocSciBot 190
Web page retrieved 190
Web page qualification 191
Web link extraction 192
URLs from HTTP 192
Obscured or unspecified URLs 193
Server generated pages 193
Dealing with errors 194
Human intervention during crawls 195
SocSciBot tools 195
Summary 196
Online resources 196
Further reading 196
References 197
19 199
Data cleansing 199
Objectives 199
Introduction 199
Overview of data cleansing techniques 199
Anomaly identification 200
TLD Spectral Analysis 201
Summary 201
Online resources 202
References 202
20 203
Online university link databases 203
Objective 203
Introduction 203
Overview of the link databases 203
Link structure files 204
The banned lists 205
Analyzing the data 206
Other link structure databases 206
Summary 206
Online resources 206
Further reading 206
Reference 208
21 209
Embedded link analysis methodologies 209
Objectives 209
Introduction 209
Web Sphere Analysis 210
Virtual ethnography 210
Summary 211
Further reading 212
References 212
22 213
Social Network Analysis 213
Objectives 213
Introduction 213
Some SNA metrics 214
Software 215
Summary 216
Further reading 216
References 216
23 219
Network visualizations 219
Objectives 219
Introduction 219
Network diagrams 219
Large network diagrams 221
MultiDimensional Scaling 221
Self Organizing Maps 222
Knowledge Domain Visualisation 223
Summary 223
Online resources 223
References 223
24 227
Academic link indicators 227
Objective 227
Introduction 227
Web indicators as process indicators 228
Issues of size and reliability 228
Benchmarking indicators 230
Link metrics 230
Relational indicators 232
Other metrics 232
Summary 233
Further reading 233
References 234
Part VI: Summary 237
25 237
Summary 237
Objectives 237
Introduction 237
Information science contributions to link analysis 238
Other link analysis approaches 239
Future directions 240
26 241
Glossary 241
References 243
Appendix 245
A SocSciBot tutorial 245
Tutorial 245
Step 1: Installing SocSciBot, SocSciBot Tools and Cyclist 245
Step 2: Installing Pajek 247
Step 3: Crawling a first site with SocSciBot 247
Step 4: Crawling two more sites with SocSciBot 252
Step 5: Viewing basic reports about the small test project with SocSciBot Tools 253
Step 6: Viewing a network diagram with Pajek 257
Step 7: Viewing site diagrams with Pajek 261
Step 8: Using Cyclist 263
Summary 264
Index 265
|
any_adam_object | 1 |
author | Thelwall, Mike |
author_facet | Thelwall, Mike |
author_role | aut |
author_sort | Thelwall, Mike |
author_variant | m t mt |
building | Verbundindex |
bvnumber | BV019739004 |
classification_rvk | ST 270 |
classification_tum | DAT 825f DAT 616f INF 191f |
ctrlnum | (OCoLC)255628749 (DE-599)BVBBV019739004 |
discipline | Computer science; Library and information science |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01771nam a2200469 c 4500</leader><controlfield tag="001">BV019739004</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20120730 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">050316s2004 d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0120885530</subfield><subfield code="9">0-12-088553-0</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)255628749</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV019739004</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91G</subfield><subfield code="a">DE-M382</subfield><subfield code="a">DE-19</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 270</subfield><subfield code="0">(DE-625)143638:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 825f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">24,1</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 616f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">INF 191f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Thelwall, Mike</subfield><subfield code="e">Verfasser</subfield><subfield 
code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Link analysis</subfield><subfield code="b">an information science approach</subfield><subfield code="c">Mike Thelwall</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Amsterdam [u.a.]</subfield><subfield code="b">Elsevier</subfield><subfield code="c">2004</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 269 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Library and information science</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">World Wide Web</subfield><subfield code="0">(DE-588)4363898-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Hyperlink</subfield><subfield code="0">(DE-588)4617682-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Suchmaschine</subfield><subfield code="0">(DE-588)4423007-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">World Wide Web</subfield><subfield 
code="0">(DE-588)4363898-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Suchmaschine</subfield><subfield code="0">(DE-588)4423007-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">World Wide Web</subfield><subfield code="0">(DE-588)4363898-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Hyperlink</subfield><subfield code="0">(DE-588)4617682-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="2"><subfield code="a">Information Retrieval</subfield><subfield code="0">(DE-588)4072803-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=013065742&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-013065742</subfield></datafield></record></collection> |
id | DE-604.BV019739004 |
illustrated | Illustrated |
indexdate | 2024-07-09T20:05:00Z |
institution | BVB |
isbn | 0120885530 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-013065742 |
oclc_num | 255628749 |
open_access_boolean | |
owner | DE-91G DE-BY-TUM DE-M382 DE-19 DE-BY-UBM DE-355 DE-BY-UBR |
owner_facet | DE-91G DE-BY-TUM DE-M382 DE-19 DE-BY-UBM DE-355 DE-BY-UBR |
physical | XII, 269 S. graph. Darst. |
publishDate | 2004 |
publishDateSearch | 2004 |
publishDateSort | 2004 |
publisher | Elsevier |
record_format | marc |
series2 | Library and information science |
spelling | Thelwall, Mike Verfasser aut Link analysis an information science approach Mike Thelwall Amsterdam [u.a.] Elsevier 2004 XII, 269 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Library and information science World Wide Web (DE-588)4363898-3 gnd rswk-swf Hyperlink (DE-588)4617682-2 gnd rswk-swf Suchmaschine (DE-588)4423007-2 gnd rswk-swf Information Retrieval (DE-588)4072803-1 gnd rswk-swf World Wide Web (DE-588)4363898-3 s Suchmaschine (DE-588)4423007-2 s DE-604 Hyperlink (DE-588)4617682-2 s Information Retrieval (DE-588)4072803-1 s HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=013065742&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Thelwall, Mike Link analysis an information science approach World Wide Web (DE-588)4363898-3 gnd Hyperlink (DE-588)4617682-2 gnd Suchmaschine (DE-588)4423007-2 gnd Information Retrieval (DE-588)4072803-1 gnd |
subject_GND | (DE-588)4363898-3 (DE-588)4617682-2 (DE-588)4423007-2 (DE-588)4072803-1 |
title | Link analysis an information science approach |
title_auth | Link analysis an information science approach |
title_exact_search | Link analysis an information science approach |
title_full | Link analysis an information science approach Mike Thelwall |
title_fullStr | Link analysis an information science approach Mike Thelwall |
title_full_unstemmed | Link analysis an information science approach Mike Thelwall |
title_short | Link analysis |
title_sort | link analysis an information science approach |
title_sub | an information science approach |
topic | World Wide Web (DE-588)4363898-3 gnd Hyperlink (DE-588)4617682-2 gnd Suchmaschine (DE-588)4423007-2 gnd Information Retrieval (DE-588)4072803-1 gnd |
topic_facet | World Wide Web Hyperlink Suchmaschine Information Retrieval |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=013065742&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT thelwallmike linkanalysisaninformationscienceapproach |