Python 3 text processing with NLTK 3 cookbook :: over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 /
This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you ha...
Gespeichert in:
1. Verfasser: | |
---|---|
Weitere Verfasser: | |
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Birmingham, England :
Packt Publishing Ltd,
2014.
|
Ausgabe: | Second edition. |
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you have more text than you know what to do with, and need automated ways to analyze and structure that text. This Cookbook will show you how to train and use statistical language models to process text in ways that are practically impossible with standard programming tools. A basic knowledge of Python and the basi. |
Beschreibung: | "Quick answers to common problems"--Cover Includes index. |
Beschreibung: | 1 online resource (304 pages) : illustrations |
ISBN: | 9781782167860 1782167862 1782167854 9781782167853 |
Internformat
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-ocn891381366 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr cn||||||||| | ||
008 | 140902t20142014enka o 001 0 eng d | ||
040 | |a E7B |b eng |e rda |e pn |c E7B |d OCLCO |d COO |d EBLCP |d HEBIS |d IDEBK |d DEBSZ |d YDXCP |d CHVBK |d N$T |d OCLCQ |d AZK |d AGLDB |d OCLCQ |d ICA |d MERUC |d OCLCQ |d REB |d D6H |d OCLCF |d VTS |d OCLCQ |d WYU |d ZCU |d M8D |d VT2 |d OCLCQ |d K6U |d OCLCQ |d OCLCO |d QGK |d OCLCQ |d OCLCO |d OCLCL | ||
019 | |a 889674234 |a 907279783 |a 961486506 |a 1259247125 | ||
020 | |a 9781782167860 |q (e-book) | ||
020 | |a 1782167862 |q (e-book) | ||
020 | |a 1782167854 | ||
020 | |a 9781782167853 | ||
020 | |z 9781782167853 | ||
035 | |a (OCoLC)891381366 |z (OCoLC)889674234 |z (OCoLC)907279783 |z (OCoLC)961486506 |z (OCoLC)1259247125 | ||
050 | 4 | |a QA76.73.P98 |b .P47 2014eb | |
072 | 7 | |a COM |x 051360 |2 bisacsh | |
082 | 7 | |a 005.133 |2 23 | |
049 | |a MAIN | ||
100 | 1 | |a Perkins, Jacob, |e author. | |
245 | 1 | 0 | |a Python 3 text processing with NLTK 3 cookbook : |b over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / |c Jacob Perkins ; cover image by Faiz Fattohi. |
250 | |a Second edition. | ||
264 | 1 | |a Birmingham, England : |b Packt Publishing Ltd, |c 2014. | |
264 | 4 | |c ©2014 | |
300 | |a 1 online resource (304 pages) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
347 | |a text file | ||
500 | |a "Quick answers to common problems"--Cover | ||
500 | |a Includes index. | ||
588 | 0 | |a Online resource; title from PDF title page (ebrary, viewed September 2, 2014). | |
505 | 0 | |a Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words. | |
505 | 8 | |a Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking. | |
505 | 8 | |a Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions. | |
505 | 8 | |a Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations. | |
505 | 8 | |a Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets. | |
520 | |a This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you have more text than you know what to do with, and need automated ways to analyze and structure that text. This Cookbook will show you how to train and use statistical language models to process text in ways that are practically impossible with standard programming tools. A basic knowledge of Python and the basi. | ||
546 | |a English. | ||
650 | 0 | |a Python (Computer program language) |0 http://id.loc.gov/authorities/subjects/sh96008834 | |
650 | 0 | |a Natural language processing (Computer science) |x Research. | |
650 | 6 | |a Python (Langage de programmation) | |
650 | 6 | |a Traitement automatique des langues naturelles |x Recherche. | |
650 | 7 | |a COMPUTERS |x Programming Languages |x Python. |2 bisacsh | |
650 | 7 | |a Python (Computer program language) |2 fast | |
700 | 1 | |a Fattohi, Faiz, |e cover designer. | |
758 | |i has work: |a Python 3 text processing with NLTK 3 cookbook (Text) |1 https://id.oclc.org/worldcat/entity/E39PCGpQm36dPrKgJmyDm44xMq |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
776 | 0 | 8 | |i Print version: |a Perkins, Jacob. |t Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0. |b Second edition. |d Birmingham, England : Packt Publishing Ltd, ©2014 |h iii, 288 pages |z 9781782167853 |
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=836632 |3 Volltext |
936 | |a BATCHLOAD | ||
938 | |a ProQuest Ebook Central |b EBLB |n EBL1706433 | ||
938 | |a ebrary |b EBRY |n ebr10921234 | ||
938 | |a EBSCOhost |b EBSC |n 836632 | ||
938 | |a ProQuest MyiLibrary Digital eBook Collection |b IDEB |n cis29662597 | ||
938 | |a YBP Library Services |b YANK |n 12055178 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Datensatz im Suchindex
DE-BY-FWS_katkey | ZDB-4-EBA-ocn891381366 |
---|---|
_version_ | 1816882287632973824 |
adam_text | |
any_adam_object | |
author | Perkins, Jacob |
author2 | Fattohi, Faiz |
author2_role | cov |
author2_variant | f f ff |
author_facet | Perkins, Jacob Fattohi, Faiz |
author_role | aut |
author_sort | Perkins, Jacob |
author_variant | j p jp |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | Q - Science |
callnumber-label | QA76 |
callnumber-raw | QA76.73.P98 .P47 2014eb |
callnumber-search | QA76.73.P98 .P47 2014eb |
callnumber-sort | QA 276.73 P98 P47 42014EB |
callnumber-subject | QA - Mathematics |
collection | ZDB-4-EBA |
contents | Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words. Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking. Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions. Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations. Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets. |
ctrlnum | (OCoLC)891381366 |
dewey-full | 005.133 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 005 - Computer programming, programs, data, security |
dewey-raw | 005.133 |
dewey-search | 005.133 |
dewey-sort | 15.133 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
edition | Second edition. |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>06712cam a2200685 i 4500</leader><controlfield tag="001">ZDB-4-EBA-ocn891381366</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr cn|||||||||</controlfield><controlfield tag="008">140902t20142014enka o 001 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">E7B</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">E7B</subfield><subfield code="d">OCLCO</subfield><subfield code="d">COO</subfield><subfield code="d">EBLCP</subfield><subfield code="d">HEBIS</subfield><subfield code="d">IDEBK</subfield><subfield code="d">DEBSZ</subfield><subfield code="d">YDXCP</subfield><subfield code="d">CHVBK</subfield><subfield code="d">N$T</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">AZK</subfield><subfield code="d">AGLDB</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">ICA</subfield><subfield code="d">MERUC</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">REB</subfield><subfield code="d">D6H</subfield><subfield code="d">OCLCF</subfield><subfield code="d">VTS</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">WYU</subfield><subfield code="d">ZCU</subfield><subfield code="d">M8D</subfield><subfield code="d">VT2</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">K6U</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">QGK</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">889674234</subfield><subfield code="a">907279783</subfield><subfield code="a">961486506</subfield><subfield code="a">1259247125</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781782167860</subfield><subfield code="q">(e-book)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1782167862</subfield><subfield code="q">(e-book)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1782167854</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781782167853</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781782167853</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)891381366</subfield><subfield code="z">(OCoLC)889674234</subfield><subfield code="z">(OCoLC)907279783</subfield><subfield code="z">(OCoLC)961486506</subfield><subfield code="z">(OCoLC)1259247125</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">QA76.73.P98</subfield><subfield code="b">.P47 2014eb</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">COM</subfield><subfield code="x">051360</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">005.133</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Perkins, Jacob,</subfield><subfield code="e">author.</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Python 3 text processing with NLTK 3 cookbook :</subfield><subfield code="b">over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 /</subfield><subfield code="c">Jacob Perkins ; cover image by Faiz Fattohi.</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">Second edition.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham, England :</subfield><subfield code="b">Packt Publishing Ltd,</subfield><subfield code="c">2014.</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">©2014</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (304 pages) :</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="347" ind1=" " ind2=" "><subfield code="a">text file</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">"Quick answers to common problems"--Cover</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes index.</subfield></datafield><datafield tag="588" ind1="0" ind2=" "><subfield code="a">Online resource; title from PDF title page (ebrary, viewed September 2, 2014).</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you have more text than you know what to do with, and need automated ways to analyze and structure that text. This Cookbook will show you how to train and use statistical language models to process text in ways that are practically impossible with standard programming tools. A basic knowledge of Python and the basi.</subfield></datafield><datafield tag="546" ind1=" " ind2=" "><subfield code="a">English.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Python (Computer program language)</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh96008834</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Natural language processing (Computer science)</subfield><subfield code="x">Research.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Python (Langage de programmation)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Traitement automatique des langues naturelles</subfield><subfield code="x">Recherche.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">COMPUTERS</subfield><subfield code="x">Programming Languages</subfield><subfield code="x">Python.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Python (Computer program language)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Fattohi, Faiz,</subfield><subfield code="e">cover designer.</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">Python 3 text processing with NLTK 3 cookbook (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCGpQm36dPrKgJmyDm44xMq</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Perkins, Jacob.</subfield><subfield code="t">Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0.</subfield><subfield code="b">Second edition.</subfield><subfield code="d">Birmingham, England : Packt Publishing Ltd, ©2014</subfield><subfield code="h">iii, 288 pages</subfield><subfield code="z">9781782167853</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=836632</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="936" ind1=" " ind2=" "><subfield code="a">BATCHLOAD</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ProQuest Ebook Central</subfield><subfield code="b">EBLB</subfield><subfield code="n">EBL1706433</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ebrary</subfield><subfield code="b">EBRY</subfield><subfield code="n">ebr10921234</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">836632</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ProQuest MyiLibrary Digital eBook Collection</subfield><subfield code="b">IDEB</subfield><subfield code="n">cis29662597</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">YBP Library Services</subfield><subfield code="b">YANK</subfield><subfield code="n">12055178</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection> |
id | ZDB-4-EBA-ocn891381366 |
illustrated | Illustrated |
indexdate | 2024-11-27T13:26:13Z |
institution | BVB |
isbn | 9781782167860 1782167862 1782167854 9781782167853 |
language | English |
oclc_num | 891381366 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (304 pages) : illustrations |
psigel | ZDB-4-EBA |
publishDate | 2014 |
publishDateSearch | 2014 |
publishDateSort | 2014 |
publisher | Packt Publishing Ltd, |
record_format | marc |
spelling | Perkins, Jacob, author. Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi. Second edition. Birmingham, England : Packt Publishing Ltd, 2014. ©2014 1 online resource (304 pages) : illustrations text txt rdacontent computer c rdamedia online resource cr rdacarrier text file "Quick answers to common problems"--Cover Includes index. Online resource; title from PDF title page (ebrary, viewed September 2, 2014). Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words. Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking. Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions. Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations. Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets. This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you have more text than you know what to do with, and need automated ways to analyze and structure that text. This Cookbook will show you how to train and use statistical language models to process text in ways that are practically impossible with standard programming tools. A basic knowledge of Python and the basi. English. Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Natural language processing (Computer science) Research. Python (Langage de programmation) Traitement automatique des langues naturelles Recherche. COMPUTERS Programming Languages Python. bisacsh Python (Computer program language) fast Fattohi, Faiz, cover designer. has work: Python 3 text processing with NLTK 3 cookbook (Text) https://id.oclc.org/worldcat/entity/E39PCGpQm36dPrKgJmyDm44xMq https://id.oclc.org/worldcat/ontology/hasWork Print version: Perkins, Jacob. Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0. Second edition. Birmingham, England : Packt Publishing Ltd, ©2014 iii, 288 pages 9781782167853 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=836632 Volltext |
spellingShingle | Perkins, Jacob Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words. Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking. Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions. Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations. Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets. Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Natural language processing (Computer science) Research. Python (Langage de programmation) Traitement automatique des langues naturelles Recherche. COMPUTERS Programming Languages Python. bisacsh Python (Computer program language) fast |
subject_GND | http://id.loc.gov/authorities/subjects/sh96008834 |
title | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / |
title_auth | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / |
title_exact_search | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / |
title_full | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi. |
title_fullStr | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi. |
title_full_unstemmed | Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi. |
title_short | Python 3 text processing with NLTK 3 cookbook : |
title_sort | python 3 text processing with nltk 3 cookbook over 80 practical recipes on natural language processing techniques using python s nltk 3 0 |
title_sub | over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / |
topic | Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Natural language processing (Computer science) Research. Python (Langage de programmation) Traitement automatique des langues naturelles Recherche. COMPUTERS Programming Languages Python. bisacsh Python (Computer program language) fast |
topic_facet | Python (Computer program language) Natural language processing (Computer science) Research. Python (Langage de programmation) Traitement automatique des langues naturelles Recherche. COMPUTERS Programming Languages Python. |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=836632 |
work_keys_str_mv | AT perkinsjacob python3textprocessingwithnltk3cookbookover80practicalrecipesonnaturallanguageprocessingtechniquesusingpythonsnltk30 AT fattohifaiz python3textprocessingwithnltk3cookbookover80practicalrecipesonnaturallanguageprocessingtechniquesusingpythonsnltk30 |