Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection
Main author: | Christen, Peter, 1968- |
---|---|
Format: | Book |
Language: | English |
Published: | Berlin [and others] : Springer, 2012 |
Series: | Data-centric systems and applications |
Subjects: | Informationsqualität; Datensatz; Datenverwaltung; Datenbanksystem; Matching |
Online access: | Description text; Table of contents |
Description: | XIX, 270 pp., illustrations, 24 cm |
ISBN: | 3642311636 9783642311635 |
Internal format
MARC
LEADER | 00000nam a22000002c 4500 | ||
---|---|---|---|
001 | BV040343924 | ||
003 | DE-604 | ||
005 | 20180301 | ||
007 | t | ||
008 | 120801s2012 gw d||| |||| 00||| eng d | ||
015 | |a 12,N22 |2 dnb | ||
016 | 7 | |a 1022534254 |2 DE-101 | |
020 | |a 3642311636 |9 3-642-31163-6 | ||
020 | |a 9783642311635 |c Gb. : ca. EUR 58.80 (DE) (freier Pr.), ca. EUR 60.50 (AT) (freier Pr.), ca. sfr 73.50 (freier Pr.) |9 978-3-642-31163-5 | ||
024 | 3 | |a 9783642311635 | |
028 | 5 | 2 | |a Best.-Nr.: 80052897 |
035 | |a (OCoLC)812205052 | ||
035 | |a (DE-599)DNB1022534254 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
044 | |a gw |c XA-DE-BE | ||
049 | |a DE-29T |a DE-739 |a DE-19 |a DE-91G | ||
082 | 0 | |a 005.741 |2 22/ger | |
084 | |a ST 274 |0 (DE-625)143641: |2 rvk | ||
084 | |a 004 |2 sdnb | ||
084 | |a DAT 655f |2 stub | ||
100 | 1 | |a Christen, Peter |d 1968- |e Verfasser |0 (DE-588)121161234 |4 aut | |
245 | 1 | 0 | |a Data matching |b concepts and techniques for record linkage, entity resolution, and duplicate detection |c Peter Christen |
264 | 1 | |a Berlin [u.a.] |b Springer |c 2012 | |
300 | |a XIX, 270 S. |b graph. Darst. |c 24 cm | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Data-centric systems and applications | |
650 | 0 | 7 | |a Informationsqualität |0 (DE-588)4793947-3 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datensatz |0 (DE-588)4011133-7 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenverwaltung |0 (DE-588)4011168-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Datenbanksystem |0 (DE-588)4113276-2 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Matching |0 (DE-588)4212483-9 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Datenbanksystem |0 (DE-588)4113276-2 |D s |
689 | 0 | 1 | |a Datenverwaltung |0 (DE-588)4011168-4 |D s |
689 | 0 | 2 | |a Datensatz |0 (DE-588)4011133-7 |D s |
689 | 0 | 3 | |a Matching |0 (DE-588)4212483-9 |D s |
689 | 0 | 4 | |a Informationsqualität |0 (DE-588)4793947-3 |D s |
689 | 0 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-3-642-31164-2 |
856 | 4 | 2 | |m X:MVB |q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=4043998&prov=M&dok_var=1&dok_ext=htm |3 Inhaltstext |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025198126&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-025198126 |
Record in the search index
_version_ | 1807953283395878912 |
---|---|
adam_text |
CONTENTS
PART I OVERVIEW
1 INTRODUCTION 3
1.1 AIMS AND CHALLENGES OF DATA MATCHING 3
1.1.1 LACK OF UNIQUE ENTITY IDENTIFIERS AND DATA QUALITY 5
1.1.2 COMPUTATION COMPLEXITY 5
1.1.3 LACK OF TRAINING DATA CONTAINING THE TRUE MATCH STATUS 6
1.1.4 PRIVACY AND CONFIDENTIALITY 6
1.2 DATA INTEGRATION AND LINK ANALYSIS 6
1.3 A SHORT HISTORY OF DATA MATCHING 9
1.4 EXAMPLE APPLICATION AREAS 11
1.4.1 NATIONAL CENSUS 11
1.4.2 THE HEALTH SECTOR 12
1.4.3 NATIONAL SECURITY 13
1.4.4 CRIME AND FRAUD DETECTION AND PREVENTION 14
1.4.5 BUSINESS MAILING LISTS 15
1.4.6 BIBLIOGRAPHIC DATABASES 17
1.4.7 ONLINE SHOPPING 18
1.4.8 SOCIAL SCIENCES AND GENEALOGY 19
1.5 FURTHER READING 20
2 THE DATA MATCHING PROCESS 23
2.1 OVERVIEW 23
2.1.1 A SMALL DATA MATCHING EXAMPLE 23
2.2 DATA PRE-PROCESSING 24
2.3 INDEXING 27
2.4 RECORD PAIR COMPARISON 29
2.5 RECORD PAIR CLASSIFICATION 32
2.6 EVALUATION OF MATCHING QUALITY AND COMPLEXITY 34
2.7 FURTHER READING 35
PART II STEPS OF THE DATA MATCHING PROCESS
3 DATA PRE-PROCESSING 39
3.1 DATA QUALITY ISSUES RELEVANT TO DATA MATCHING 39
3.2 ISSUES WITH NAMES AND OTHER PERSONAL INFORMATION 42
3.3 TYPES AND SOURCES OF VARIATIONS AND ERRORS IN NAMES 45
3.4 GENERAL DATA CLEANING TASKS 48
3.5 DATA PRE-PROCESSING FOR DATA MATCHING 51
3.5.1 REMOVING UNWANTED CHARACTERS AND TOKENS 51
3.5.2 STANDARDISATION AND TOKENISATION 53
3.5.3 SEGMENTATION INTO OUTPUT FIELDS 55
3.5.4 VERIFICATION 56
3.6 RULE-BASED SEGMENTATION APPROACHES 58
3.7 STATISTICAL SEGMENTATION APPROACHES 60
3.7.1 HIDDEN MARKOV MODEL BASED SEGMENTATION 62
3.8 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 65
3.9 FURTHER READING 66
4 INDEXING 69
4.1 WHY INDEXING? 69
4.2 DEFINING BLOCKING KEYS 70
4.3 (PHONETIC) ENCODING FUNCTIONS 74
4.3.1 SOUNDEX 74
4.3.2 PHONEX 75
4.3.3 PHONIX 76
4.3.4 NYSIIS 76
4.3.5 OXFORD NAME COMPRESSION ALGORITHM 77
4.3.6 DOUBLE-METAPHONE 78
4.3.7 FUZZY SOUNDEX 78
4.3.8 OTHER ENCODING FUNCTIONS 79
4.4 STANDARD BLOCKING 80
4.5 SORTED NEIGHBOURHOOD APPROACH 81
4.6 Q-GRAM BASED INDEXING 84
4.7 SUFFIX-ARRAY BASED INDEXING 86
4.8 CANOPY CLUSTERING 89
4.9 MAPPING BASED INDEXING 92
4.10 A COMPARISON OF INDEXING TECHNIQUES 93
4.11 OTHER INDEXING TECHNIQUES 94
4.12 LEARNING OPTIMAL BLOCKING KEYS 97
4.13 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 98
4.14 FURTHER READING 100
5 FIELD AND RECORD COMPARISON 101
5.1 OVERVIEW AND MOTIVATION 101
5.2 EXACT, TRUNCATE AND ENCODING COMPARISON 102
5.3 EDIT DISTANCE STRING COMPARISON 103
5.3.1 SMITH-WATERMAN EDIT DISTANCE STRING COMPARISON 105
5.4 Q-GRAM BASED STRING COMPARISON 106
5.5 JARO AND WINKLER STRING COMPARISON 109
5.6 MONGE-ELKAN STRING COMPARISON 111
5.7 EXTENDED JACCARD COMPARISON 112
5.8 SOFTTFIDF STRING COMPARISON 113
5.9 LONGEST COMMON SUBSTRING COMPARISON 114
5.10 OTHER APPROXIMATE STRING COMPARISON TECHNIQUES 116
5.10.1 BAG DISTANCE 116
5.10.2 COMPRESSION DISTANCE 116
5.10.3 EDITEX 117
5.10.4 SYLLABLE ALIGNMENT DISTANCE 118
5.11 STRING COMPARISON EXAMPLES 118
5.12 NUMERICAL COMPARISON 121
5.13 DATE, AGE AND TIME COMPARISON 122
5.14 GEOGRAPHICAL DISTANCE COMPARISON 124
5.15 COMPARING COMPLEX DATA 124
5.16 RECORD COMPARISON 125
5.17 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 126
5.18 FURTHER READING 127
6 CLASSIFICATION 129
6.1 OVERVIEW 129
6.2 THRESHOLD-BASED CLASSIFICATION 131
6.3 PROBABILISTIC CLASSIFICATION 133
6.4 COST-BASED CLASSIFICATION 137
6.5 RULE-BASED CLASSIFICATION 139
6.6 SUPERVISED CLASSIFICATION METHODS 142
6.7 ACTIVE LEARNING APPROACHES 147
6.8 MANAGING TRANSITIVE CLOSURE 149
6.9 CLUSTERING-BASED APPROACHES 150
6.10 COLLECTIVE CLASSIFICATION 154
6.11 MATCHING RESTRICTIONS AND GROUP LINKING 157
6.12 MERGING MATCHES 160
6.13 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 161
6.14 FURTHER READING 162
7 EVALUATION OF MATCHING QUALITY AND COMPLEXITY 163
7.1 OVERVIEW 163
7.2 MEASURING MATCHING QUALITY 165
7.3 MEASURING MATCHING COMPLEXITY 172
7.4 CLERICAL REVIEW 174
7.5 PUBLIC TEST DATA 176
7.6 SYNTHETIC TEST DATA 178
7.7 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 183
7.8 FURTHER READING 184
PART III FURTHER TOPICS
8 PRIVACY ASPECTS OF DATA MATCHING 187
8.1 PRIVACY AND CONFIDENTIALITY CHALLENGES FOR DATA MATCHING 187
8.1.1 REQUIRING ACCESS TO IDENTIFYING INFORMATION 188
8.1.2 SENSITIVE AND CONFIDENTIAL OUTCOMES FROM MATCHED DATA 189
8.2 DATA MATCHING SCENARIOS 190
8.3 PRIVACY-PRESERVING DATA MATCHING TECHNIQUES 193
8.3.1 EXACT PRIVACY-PRESERVING MATCHING TECHNIQUES 196
8.3.2 APPROXIMATE PRIVACY-PRESERVING MATCHING TECHNIQUES 199
8.3.3 SCALABLE PRIVACY-PRESERVING MATCHING TECHNIQUES 203
8.4 PRACTICAL CONSIDERATIONS AND RESEARCH ISSUES 205
8.5 FURTHER READING 207
9 FURTHER TOPICS AND RESEARCH DIRECTIONS 209
9.1 GEOCODE MATCHING 209
9.2 MATCHING UNSTRUCTURED AND COMPLEX DATA 211
9.3 REAL-TIME DATA MATCHING 213
9.4 MATCHING DYNAMIC DATABASES 215
9.5 PARALLEL AND DISTRIBUTED DATA MATCHING 217
9.6 RESEARCH CHALLENGES AND DIRECTIONS 222
10 DATA MATCHING SYSTEMS 229
10.1 COMMERCIAL SYSTEMS AND CHECKLIST 229
10.2 RESEARCH AND OPEN SOURCE SYSTEMS 231
10.2.1 BIGMATCH 231
10.2.2 D-DUPE 232
10.2.3 DUDE 232
10.2.4 FEBRL 234
10.2.5 FRIL 236
10.2.6 MERGE TOOLBOX 238
10.2.7 OYSTER 239
10.2.8 R RECORDLINKAGE 240
10.2.9 SECONDSTRING 240
10.2.10 SILK 240
10.2.11 SIMMETRICS 241
10.2.12 TAILOR 241
10.2.13 WHIRL 241
GLOSSARY 243
REFERENCES 251
INDEX 265 |
any_adam_object | 1 |
author | Christen, Peter 1968- |
author_GND | (DE-588)121161234 |
author_facet | Christen, Peter 1968- |
author_role | aut |
author_sort | Christen, Peter 1968- |
author_variant | p c pc |
building | Verbundindex |
bvnumber | BV040343924 |
classification_rvk | ST 274 |
classification_tum | DAT 655f |
ctrlnum | (OCoLC)812205052 (DE-599)DNB1022534254 |
dewey-full | 005.741 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 005 - Computer programming, programs, data, security |
dewey-raw | 005.741 |
dewey-search | 005.741 |
dewey-sort | 15.741 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a22000002c 4500</leader><controlfield tag="001">BV040343924</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20180301</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">120801s2012 gw d||| |||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">12,N22</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1022534254</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">3642311636</subfield><subfield code="9">3-642-31163-6</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642311635</subfield><subfield code="c">Gb. : ca. EUR 58.80 (DE) (freier Pr.), ca. EUR 60.50 (AT) (freier Pr.), ca. sfr 73.50 (freier Pr.)</subfield><subfield code="9">978-3-642-31163-5</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783642311635</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">Best.-Nr.: 80052897</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)812205052</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1022534254</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">XA-DE-BE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield><subfield code="a">DE-739</subfield><subfield code="a">DE-19</subfield><subfield 
code="a">DE-91G</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005.741</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 274</subfield><subfield code="0">(DE-625)143641:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">DAT 655f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Christen, Peter</subfield><subfield code="d">1968-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)121161234</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Data matching</subfield><subfield code="b">concepts and techniques for rcord linkage, entity resolution, and duplicate detection</subfield><subfield code="c">Peter Christen</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin [u.a.]</subfield><subfield code="b">Springer</subfield><subfield code="c">2012</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIX, 270 S.</subfield><subfield code="b">graph. 
Darst.</subfield><subfield code="c">24 cm</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Data-centric systems and applications</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Informationsqualität</subfield><subfield code="0">(DE-588)4793947-3</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datensatz</subfield><subfield code="0">(DE-588)4011133-7</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenverwaltung</subfield><subfield code="0">(DE-588)4011168-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Datenbanksystem</subfield><subfield code="0">(DE-588)4113276-2</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Matching</subfield><subfield code="0">(DE-588)4212483-9</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Datenbanksystem</subfield><subfield code="0">(DE-588)4113276-2</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Datenverwaltung</subfield><subfield code="0">(DE-588)4011168-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" 
ind2="2"><subfield code="a">Datensatz</subfield><subfield code="0">(DE-588)4011133-7</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Matching</subfield><subfield code="0">(DE-588)4212483-9</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="4"><subfield code="a">Informationsqualität</subfield><subfield code="0">(DE-588)4793947-3</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-3-642-31164-2</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">X:MVB</subfield><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=4043998&prov=M&dok_var=1&dok_ext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025198126&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-025198126</subfield></datafield></record></collection> |
id | DE-604.BV040343924 |
illustrated | Illustrated |
indexdate | 2024-08-21T00:03:31Z |
institution | BVB |
isbn | 3642311636 9783642311635 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-025198126 |
oclc_num | 812205052 |
open_access_boolean | |
owner | DE-29T DE-739 DE-19 DE-BY-UBM DE-91G DE-BY-TUM |
owner_facet | DE-29T DE-739 DE-19 DE-BY-UBM DE-91G DE-BY-TUM |
physical | XIX, 270 S. graph. Darst. 24 cm |
publishDate | 2012 |
publishDateSearch | 2012 |
publishDateSort | 2012 |
publisher | Springer |
record_format | marc |
series2 | Data-centric systems and applications |
spelling | Christen, Peter 1968- Verfasser (DE-588)121161234 aut Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection Peter Christen Berlin [u.a.] Springer 2012 XIX, 270 S. graph. Darst. 24 cm txt rdacontent n rdamedia nc rdacarrier Data-centric systems and applications Informationsqualität (DE-588)4793947-3 gnd rswk-swf Datensatz (DE-588)4011133-7 gnd rswk-swf Datenverwaltung (DE-588)4011168-4 gnd rswk-swf Datenbanksystem (DE-588)4113276-2 gnd rswk-swf Matching (DE-588)4212483-9 gnd rswk-swf Datenbanksystem (DE-588)4113276-2 s Datenverwaltung (DE-588)4011168-4 s Datensatz (DE-588)4011133-7 s Matching (DE-588)4212483-9 s Informationsqualität (DE-588)4793947-3 s DE-604 Erscheint auch als Online-Ausgabe 978-3-642-31164-2 X:MVB text/html http://deposit.dnb.de/cgi-bin/dokserv?id=4043998&prov=M&dok_var=1&dok_ext=htm Inhaltstext DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025198126&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Christen, Peter 1968- Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection Informationsqualität (DE-588)4793947-3 gnd Datensatz (DE-588)4011133-7 gnd Datenverwaltung (DE-588)4011168-4 gnd Datenbanksystem (DE-588)4113276-2 gnd Matching (DE-588)4212483-9 gnd |
subject_GND | (DE-588)4793947-3 (DE-588)4011133-7 (DE-588)4011168-4 (DE-588)4113276-2 (DE-588)4212483-9 |
title | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection |
title_auth | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection |
title_exact_search | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection |
title_full | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection Peter Christen |
title_fullStr | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection Peter Christen |
title_full_unstemmed | Data matching concepts and techniques for record linkage, entity resolution, and duplicate detection Peter Christen |
title_short | Data matching |
title_sort | data matching concepts and techniques for record linkage entity resolution and duplicate detection |
title_sub | concepts and techniques for record linkage, entity resolution, and duplicate detection |
topic | Informationsqualität (DE-588)4793947-3 gnd Datensatz (DE-588)4011133-7 gnd Datenverwaltung (DE-588)4011168-4 gnd Datenbanksystem (DE-588)4113276-2 gnd Matching (DE-588)4212483-9 gnd |
topic_facet | Informationsqualität Datensatz Datenverwaltung Datenbanksystem Matching |
url | http://deposit.dnb.de/cgi-bin/dokserv?id=4043998&prov=M&dok_var=1&dok_ext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025198126&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT christenpeter datamatchingconceptsandtechniquesforrecordlinkageentityresolutionandduplicatedetection |