Practical text mining with Perl
Main author: | Bilisoly, Roger |
---|---|
Format: | Book |
Language: | English |
Published: | Hoboken, NJ: Wiley, 2008 |
Series: | Wiley series on methods and applications in data mining |
Subjects: | Data mining; Text processing (Computer science); Perl (Computer program language); Text Mining |
Online access: | Publisher description; Table of contents only; Table of contents (digitized, UB Bayreuth) |
Summary: | "This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet. It covers mining ideas from several perspectives - statistics, data mining, linguistics, and information retrieval - and provides readers with the means to successfully complete text mining tasks on their own."--BOOK JACKET. |
Physical description: | XXIV, 295 pp., diagrams |
ISBN: | 9780470176436 |
Internal format
MARC
LEADER | 00000nam a2200000zc 4500 | ||
---|---|---|---|
001 | BV035073882 | ||
003 | DE-604 | ||
005 | 20081210 | ||
007 | t | ||
008 | 080929s2008 xxud||| |||| 00||| eng d | ||
010 | |a 2008008144 | ||
020 | |a 9780470176436 |c cloth |9 978-0-470-17643-6 | ||
035 | |a (OCoLC)212020725 | ||
035 | |a (DE-599)BVBBV035073882 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
044 | |a xxu |c US | ||
049 | |a DE-703 | ||
050 | 0 | |a QA76.9.D343 | |
082 | 0 | |a 005.74 | |
084 | |a ST 250 |0 (DE-625)143626: |2 rvk | ||
100 | 1 | |a Bilisoly, Roger |e Verfasser |4 aut | |
245 | 1 | 0 | |a Practical text mining with Perl |c Roger Bilisoly |
264 | 1 | |a Hoboken, NJ |b Wiley |c 2008 | |
300 | |a XXIV, 295 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Wiley series on methods and applications in data mining | |
520 | 1 | |a "This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet. It covers mining ideas from several perspectives - statistics, data mining, linguistics, and information retrieval - and provides readers with the means to successfully complete text mining tasks on their own."--BOOK JACKET. | |
650 | 7 | |a Mineração de dados |2 larpcal | |
650 | 7 | |a Perl |2 larpcal | |
650 | 7 | |a Processamento de texto |2 larpcal | |
650 | 4 | |a Data mining | |
650 | 4 | |a Text processing (Computer science) | |
650 | 4 | |a Perl (Computer program language) | |
650 | 0 | 7 | |a Perl |g Programmiersprache |0 (DE-588)4307836-9 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Text Mining |0 (DE-588)4728093-1 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Text Mining |0 (DE-588)4728093-1 |D s |
689 | 0 | 1 | |a Perl |g Programmiersprache |0 (DE-588)4307836-9 |D s |
689 | 0 | |5 DE-604 | |
856 | 4 | |u http://www.loc.gov/catdir/enhancements/fy0814/2008008144-d.html |3 Publisher description | |
856 | 4 | |u http://www.loc.gov/catdir/enhancements/fy0814/2008008144-t.html |3 Table of contents only | |
856 | 4 | 2 | |m Digitalisierung UB Bayreuth |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016742236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-016742236 |
Record in the search index
_version_ | 1804138023342833664 |
---|---|
adam_text | Contents
List of Figures
List of Tables
Preface
Acknowledgments
1 Introduction
1.1 Overview of this Book
1.2 Text Mining and Related Fields
1.2.1 Chapter 2: Pattern Matching
1.2.2 Chapter 3: Data Structures
1.2.3 Chapter 4: Probability
1.2.4 Chapter 5: Information Retrieval
1.2.5 Chapter 6: Corpus Linguistics
1.2.6 Chapter 7: Multivariate Statistics
1.2.7 Chapter 8: Clustering
1.2.8 Chapter 9: Three Additional Topics
1.3 Advice for Reading this Book
2 Text Patterns
2.1 Introduction
2.2 Regular Expressions
2.2.1 First Regex: Finding the Word Cat
2.2.2 Character Ranges and Finding Telephone Numbers
2.2.3 Testing Regexes with Perl
2.3 Finding Words in a Text
2.3.1 Regex Summary
2.3.2 Nineteenth-Century Literature
2.3.3 Perl Variables and the Function split
2.3.4 Match Variables
2.4 Decomposing Poe's "The Tell-Tale Heart" into Words
2.4.1 Dashes and String Substitutions
2.4.2 Hyphens
2.4.3 Apostrophes
2.5 A Simple Concordance
2.5.1 Command Line Arguments
2.5.2 Writing to Files
2.6 First Attempt at Extracting Sentences
2.6.1 Sentence Segmentation Preliminaries
2.6.2 Sentence Segmentation for A Christmas Carol
2.6.3 Leftmost Greediness and Sentence Segmentation
2.7 Regex Odds and Ends
2.7.1 Match Variables and Backreferences
2.7.2 Regular Expression Operators and Their Output
2.7.3 Lookaround
2.8 References
Problems
3 Quantitative Text Summaries
3.1 Introduction
3.2 Scalars, Interpolation, and Context in Perl
3.3 Arrays and Context in Perl
3.4 Word Lengths in Poe's "The Tell-Tale Heart"
3.5 Arrays and Functions
3.5.1 Adding and Removing Entries from Arrays
3.5.2 Selecting Subsets of an Array
3.5.3 Sorting an Array
3.6 Hashes
3.6.1 Using a Hash
3.7 Two Text Applications
3.7.1 Zipf's Law for A Christmas Carol
3.7.2 Perl for Word Games
3.7.2.1 An Aid to Crossword Puzzles
3.7.2.2 Word Anagrams
3.7.2.3 Finding Words in a Set of Letters
3.8 Complex Data Structures
3.8.1 References and Pointers
3.8.2 Arrays of Arrays and Beyond
3.8.3 Application: Comparing the Words in Two Poe Stories
3.9 References
3.10 First Transition
Problems
4 Probability and Text Sampling
4.1 Introduction
4.2 Probability
4.2.1 Probability and Coin Flipping
4.2.2 Probabilities and Texts
4.2.2.1 Estimating Letter Probabilities for Poe and Dickens
4.2.2.2 Estimating Letter Bigram Probabilities
4.3 Conditional Probability
4.3.1 Independence
4.4 Mean and Variance of Random Variables
4.4.1 Sampling and Error Estimates
4.5 The Bag-of-Words Model for Poe's "The Black Cat"
4.6 The Effect of Sample Size
4.6.1 Tokens vs. Types in Poe's "Hans Pfaall"
4.7 References
Problems
5 Applying Information Retrieval to Text Mining
5.1 Introduction
5.2 Counting Letters and Words
5.2.1 Counting Letters in Poe with Perl
5.2.2 Counting Pronouns Occurring in Poe
5.3 Text Counts and Vectors
5.3.1 Vectors and Angles for Two Poe Stories
5.3.2 Computing Angles between Vectors
5.3.2.1 Subroutines in Perl
5.3.2.2 Computing the Angle between Vectors
5.4 The Term-Document Matrix Applied to Poe
5.5 Matrix Multiplication
5.5.1 Matrix Multiplication Applied to Poe
5.6 Functions of Counts
5.7 Document Similarity
5.7.1 Inverse Document Frequency
5.7.2 Poe Story Angles Revisited
5.8 References
Problems
6 Concordance Lines and Corpus Linguistics
6.1 Introduction
6.2 Sampling
6.2.1 Statistical Survey Sampling
6.2.2 Text Sampling
6.3 Corpus as Baseline
6.3.1 Function vs. Content Words in Dickens, London, and Shelley
6.4 Concordancing
6.4.1 Sorting Concordance Lines
6.4.1.1 Code for Sorting Concordance Lines
6.4.2 Application: Word Usage Differences between London and Shelley
6.4.3 Application: Word Morphology of Adverbs
6.5 Collocations and Concordance Lines
6.5.1 More Ways to Sort Concordance Lines
6.5.2 Application: Phrasal Verbs in The Call of the Wild
6.5.3 Grouping Words: Colors in The Call of the Wild
6.6 Applications with References
6.7 Second Transition
Problems
7 Multivariate Techniques with Text
7.1 Introduction
7.2 Basic Statistics
7.2.1 z-Scores Applied to Poe
7.2.2 Word Correlations among Poe's Short Stories
7.2.3 Correlations and Cosines
7.2.4 Correlations and Covariances
7.3 Basic linear algebra
7.3.1 2 by 2 Correlation Matrices
7.4 Principal Components Analysis
7.4.1 Finding the Principal Components
7.4.2 PCA Applied to the 68 Poe Short Stories
7.4.3 Another PCA Example with Poe's Short Stories
7.4.4 Rotations
7.5 Text Applications
7.5.1 A Word on Factor Analysis
7.6 Applications and References
Problems
8 Text Clustering
8.1 Introduction
8.2 Clustering
8.2.1 Two-Variable Example of k-Means
8.2.2 k-Means with R
8.2.3 He versus She in Poe's Short Stories
8.2.4 Poe Clusters Using Eight Pronouns
8.2.5 Clustering Poe Using Principal Components
8.2.6 Hierarchical Clustering of Poe's Short Stories
8.3 A Note on Classification
8.3.1 Decision Trees and Overfitting
8.4 References
8.5 Last Transition
Problems
9 A Sample of Additional Topics
9.1 Introduction
9.2 Perl Modules
9.2.1 Modules for Number Words
9.2.2 The StopWords Module
9.2.3 The Sentence Segmentation Module
9.2.4 An Object-Oriented Module for Tagging
9.2.5 Miscellaneous Modules
9.3 Other Languages: Analyzing Goethe in German
9.4 Permutation Tests
9.4.1 Runs and Hypothesis Testing
9.4.2 Distribution of Character Names in Dickens and London
9.5 References
Appendix A: Overview of Perl for Text Mining
A.1 Basic Data Structures
A.1.1 Special Variables and Arrays
A.2 Operators
A.3 Branching and Looping
A.4 A Few Perl Functions
A.5 Introduction to Regular Expressions
Appendix B: Summary of R used in this Book
B.1 Basics of R
B.1.1 Data Entry
B.1.2 Basic Operators
B.1.3 Matrix Manipulation
B.2 This Book's R Code
References
Index |
any_adam_object | 1 |
any_adam_object_boolean | 1 |
author | Bilisoly, Roger |
author_facet | Bilisoly, Roger |
author_role | aut |
author_sort | Bilisoly, Roger |
author_variant | r b rb |
building | Verbundindex |
bvnumber | BV035073882 |
callnumber-first | Q - Science |
callnumber-label | QA76 |
callnumber-raw | QA76.9.D343 |
callnumber-search | QA76.9.D343 |
callnumber-sort | QA 276.9 D343 |
callnumber-subject | QA - Mathematics |
classification_rvk | ST 250 |
ctrlnum | (OCoLC)212020725 (DE-599)BVBBV035073882 |
dewey-full | 005.74 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 005 - Computer programming, programs, data, security |
dewey-raw | 005.74 |
dewey-search | 005.74 |
dewey-sort | 15.74 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
discipline_str_mv | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02301nam a2200505zc 4500</leader><controlfield tag="001">BV035073882</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20081210 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">080929s2008 xxud||| |||| 00||| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">2008008144</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9780470176436</subfield><subfield code="c">cloth</subfield><subfield code="9">978-0-470-17643-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)212020725</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV035073882</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxu</subfield><subfield code="c">US</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-703</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QA76.9.D343</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005.74</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 250</subfield><subfield code="0">(DE-625)143626:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Bilisoly, Roger</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Practical text mining with Perl</subfield><subfield code="c">Roger Bilisoly</subfield></datafield><datafield 
tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, NJ</subfield><subfield code="b">Wiley</subfield><subfield code="c">2008</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XXIV, 295 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Wiley series on methods and applications in data mining</subfield></datafield><datafield tag="520" ind1="1" ind2=" "><subfield code="a">"This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet. It covers mining ideas from several perspectives - statistics, data mining, linguistics, and information retrieval - and provides readers with the means to successfully complete text mining tasks on their own."--BOOK JACKET.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Mineração de dados</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Perl</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Processamento de texto</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Text processing (Computer science)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Perl (Computer program language)</subfield></datafield><datafield tag="650" ind1="0" 
ind2="7"><subfield code="a">Perl</subfield><subfield code="g">Programmiersprache</subfield><subfield code="0">(DE-588)4307836-9</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Text Mining</subfield><subfield code="0">(DE-588)4728093-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Text Mining</subfield><subfield code="0">(DE-588)4728093-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Perl</subfield><subfield code="g">Programmiersprache</subfield><subfield code="0">(DE-588)4307836-9</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://www.loc.gov/catdir/enhancements/fy0814/2008008144-d.html</subfield><subfield code="3">Publisher description</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://www.loc.gov/catdir/enhancements/fy0814/2008008144-t.html</subfield><subfield code="3">Table of contents only</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Bayreuth</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016742236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-016742236</subfield></datafield></record></collection> |
id | DE-604.BV035073882 |
illustrated | Illustrated |
index_date | 2024-07-02T22:04:55Z |
indexdate | 2024-07-09T21:21:36Z |
institution | BVB |
isbn | 9780470176436 |
language | English |
lccn | 2008008144 |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-016742236 |
oclc_num | 212020725 |
open_access_boolean | |
owner | DE-703 |
owner_facet | DE-703 |
physical | XXIV, 295 S. graph. Darst. |
publishDate | 2008 |
publishDateSearch | 2008 |
publishDateSort | 2008 |
publisher | Wiley |
record_format | marc |
series2 | Wiley series on methods and applications in data mining |
spelling | Bilisoly, Roger Verfasser aut Practical text mining with Perl Roger Bilisoly Hoboken, NJ Wiley 2008 XXIV, 295 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Wiley series on methods and applications in data mining "This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet. It covers mining ideas from several perspectives - statistics, data mining, linguistics, and information retrieval - and provides readers with the means to successfully complete text mining tasks on their own."--BOOK JACKET. Mineração de dados larpcal Perl larpcal Processamento de texto larpcal Data mining Text processing (Computer science) Perl (Computer program language) Perl Programmiersprache (DE-588)4307836-9 gnd rswk-swf Text Mining (DE-588)4728093-1 gnd rswk-swf Text Mining (DE-588)4728093-1 s Perl Programmiersprache (DE-588)4307836-9 s DE-604 http://www.loc.gov/catdir/enhancements/fy0814/2008008144-d.html Publisher description http://www.loc.gov/catdir/enhancements/fy0814/2008008144-t.html Table of contents only Digitalisierung UB Bayreuth application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016742236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Bilisoly, Roger Practical text mining with Perl Mineração de dados larpcal Perl larpcal Processamento de texto larpcal Data mining Text processing (Computer science) Perl (Computer program language) Perl Programmiersprache (DE-588)4307836-9 gnd Text Mining (DE-588)4728093-1 gnd |
subject_GND | (DE-588)4307836-9 (DE-588)4728093-1 |
title | Practical text mining with Perl |
title_auth | Practical text mining with Perl |
title_exact_search | Practical text mining with Perl |
title_exact_search_txtP | Practical text mining with Perl |
title_full | Practical text mining with Perl Roger Bilisoly |
title_fullStr | Practical text mining with Perl Roger Bilisoly |
title_full_unstemmed | Practical text mining with Perl Roger Bilisoly |
title_short | Practical text mining with Perl |
title_sort | practical text mining with perl |
topic | Mineração de dados larpcal Perl larpcal Processamento de texto larpcal Data mining Text processing (Computer science) Perl (Computer program language) Perl Programmiersprache (DE-588)4307836-9 gnd Text Mining (DE-588)4728093-1 gnd |
topic_facet | Mineração de dados Perl Processamento de texto Data mining Text processing (Computer science) Perl (Computer program language) Perl Programmiersprache Text Mining |
url | http://www.loc.gov/catdir/enhancements/fy0814/2008008144-d.html http://www.loc.gov/catdir/enhancements/fy0814/2008008144-t.html http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=016742236&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT bilisolyroger practicaltextminingwithperl |