Python natural language processing : explore NLP with machine learning and deep learning techniques /
Saved in:

| Field | Value |
|---|---|
| Main Author | Thanaki, Jalaj |
| Format | Electronic eBook |
| Language | English |
| Published | Birmingham, UK : Packt Publishing, 2017. |
| Subjects | Python (Computer program language); Natural language processing (Computer science); Machine learning |
| Online Access | Full text |
| Summary | Chapter 6: Advanced Feature Engineering and NLP Algorithms -- Recall word embedding -- Understanding the basics of word2vec -- Distributional semantics -- Defining word2vec -- Necessity of unsupervised distribution semantic model - word2vec -- Challenges -- Converting the word2vec model from black box to white box -- Distributional similarity based representation -- Understanding the components of the word2vec model -- Input of the word2vec -- Output of word2vec -- Construction components of the word2vec model -- Architectural component -- Understanding the logic of the word2vec model -- Vocabulary builder -- Context builder -- Neural network with two layers -- Structural details of a word2vec neural network -- Word2vec neural network layer's details -- Softmax function -- Main processing algorithms -- Continuous bag of words -- Skip-gram -- Understanding algorithmic techniques and the mathematics behind the word2vec model -- Understanding the basic mathematics for the word2vec algorithm -- Techniques used at the vocabulary building stage -- Lossy counting -- Using it at the stage of vocabulary building -- Applications -- Techniques used at the context building stage -- Dynamic window scaling -- Understanding dynamic context window techniques -- Subsampling -- Pruning -- Algorithms used by neural networks -- Structure of the neurons -- Basic neuron structure -- Training a simple neuron -- Define error function -- Understanding gradient descent in word2vec -- Single neuron application -- Multi-layer neural networks -- Backpropagation -- Mathematics behind the word2vec model -- Techniques used to generate final vectors and probability prediction stage -- Hierarchical softmax -- Negative sampling -- Some of the facts related to word2vec -- Applications of word2vec -- Implementation of simple examples -- Famous example (king - man + woman). |
| Description | 1 online resource (1 volume) : illustrations |
| ISBN | 9781787285521; 1787285529; 9781523112173; 1523112174 |
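The summary above closes with the famous `king - man + woman` analogy. As a rough illustration of that example, here is a minimal sketch assuming the gensim library and its downloadable pretrained GloVe vectors; the book's own implementation may differ (it builds up word2vec itself).

```python
# Minimal sketch of the "king - man + woman" analogy named in the summary.
# Assumes gensim is installed; the pretrained GloVe vectors are a stand-in
# for whatever model the book actually trains.
import gensim.downloader as api

vectors = api.load("glove-wiki-gigaword-100")  # ~130 MB download on first use

# most_similar computes vec(king) - vec(man) + vec(woman) and returns the
# nearest words by cosine similarity.
print(vectors.most_similar(positive=["king", "woman"], negative=["man"], topn=3))
# Expected top hit: ('queen', ...)
```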
Internal format
MARC
LEADER  00000cam a2200000 i 4500
001     ZDB-4-EBA-on1001347182
003     OCoLC
005     20241004212047.0
006     m o d
007     cr unu||||||||
008     170818s2017 enka o 000 0 eng d
040 __  |a UMI |b eng |e rda |e pn |c UMI |d TEFOD |d IDEBK |d OCLCF |d TOH |d STF |d COO |d KNOVL |d UOK |d CEF |d KSU |d NLE |d UKMGB |d N$T |d ZCU |d UAB |d UKAHL |d K6U |d OCLCQ |d UPM |d OCLCO |d OCLCQ |d OCLCO |d OCLCL |d DXU |d OCLCQ
015 __  |a GBB7H2177 |2 bnb
016 7_  |a 018470857 |2 Uk
020 __  |a 9781787285521 |q (electronic bk.)
020 __  |a 1787285529 |q (electronic bk.)
020 __  |a 9781523112173 |q (electronic bk.)
020 __  |a 1523112174 |q (electronic bk.)
020 __  |z 9781787121423
035 __  |a (OCoLC)1001347182
037 __  |a CL0500000885 |b Safari Books Online
037 __  |a 95F30604-2FBC-4BEC-A90C-AAEE9AA77F57 |b OverDrive, Inc. |n http://www.overdrive.com
050 _4  |a QA76.73.P98
072 _7  |a COM |x 042000 |2 bisacsh
072 _7  |a COM |x 018000 |2 bisacsh
072 _7  |a COM |x 051010 |2 bisacsh
082 7_  |a 005.133 |2 23
049 __  |a MAIN
100 1_  |a Thanaki, Jalaj, |e author.
245 10  |a Python natural language processing : |b explore NLP with machine learning and deep learning techniques / |c Jalaj Thanaki.
264 _1  |a Birmingham, UK : |b Packt Publishing, |c 2017.
300 __  |a 1 online resource (1 volume) : |b illustrations
336 __  |a text |b txt |2 rdacontent
337 __  |a computer |b c |2 rdamedia
338 __  |a online resource |b cr |2 rdacarrier
588 0_  |a Online resource; title from title page (Safari, viewed August 18, 2017).
505 0_  |a Cover -- Copyright -- Credits -- Foreword -- About the Author -- Acknowledgement -- About the Reviewers -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Introduction -- Understanding natural language processing -- Understanding basic applications -- Understanding advanced applications -- Advantages of togetherness -- NLP and Python -- Environment setup for NLTK -- Tips for readers -- Summary -- Chapter 2: Practical Understanding of a Corpus and Dataset -- What is a corpus? -- Why do we need a corpus? -- Understanding corpus analysis -- Exercise -- Understanding types of data attributes -- Categorical or qualitative data attributes -- Numeric or quantitative data attributes -- Exploring different file formats for corpora -- Resources for accessing free corpora -- Preparing a dataset for NLP applications -- Selecting data -- Preprocessing the dataset -- Formatting -- Cleaning -- Sampling -- Transforming data -- Web scraping -- Summary -- Chapter 3: Understanding the Structure of a Sentences -- Understanding components of NLP -- Natural language understanding -- Natural language generation -- Differences between NLU and NLG -- Branches of NLP -- Defining context-free grammar -- Exercise -- Morphological analysis -- What is morphology? -- What are morphemes? -- What is a stem? -- What is morphological analysis? -- What is a word? -- Classification of morphemes -- Free morphemes -- Bound morphemes -- Derivational morphemes -- Inflectional morphemes -- What is the difference between a stem and a root? -- Exercise -- Lexical analysis -- What is a token? -- What are part of speech tags? -- Process of deriving tokens -- Difference between stemming and lemmatization -- Applications -- Syntactic analysis -- What is syntactic analysis? -- Semantic analysis -- What is semantic analysis? -- Lexical semantics.
505 8_  |a Hyponymy and hyponyms -- Homonymy -- Polysemy -- What is the difference between polysemy and homonymy? -- Application of semantic analysis -- Handling ambiguity -- Lexical ambiguity -- Syntactic ambiguity -- Approach to handle syntactic ambiguity -- Semantic ambiguity -- Pragmatic ambiguity -- Discourse integration -- Applications -- Pragmatic analysis -- Summary -- Chapter 4: Preprocessing -- Handling corpus-raw text -- Getting raw text -- Lowercase conversion -- Sentence tokenization -- Challenges of sentence tokenization -- Stemming for raw text -- Challenges of stemming for raw text -- Lemmatization of raw text -- Challenges of lemmatization of raw text -- Stop word removal -- Exercise -- Handling corpus-raw sentences -- Word tokenization -- Challenges for word tokenization -- Word lemmatization -- Challenges for word lemmatization -- Basic preprocessing -- Regular expressions -- Basic level regular expression -- Basic flags -- Advanced level regular expression -- Positive lookahead -- Positive lookbehind -- Negative lookahead -- Negative lookbehind -- Practical and customized preprocessing -- Decide by yourself -- Is preprocessing required? -- What kind of preprocessing is required? -- Understanding case studies of preprocessing -- Grammar correction system -- Sentiment analysis -- Machine translation -- Spelling correction -- Approach -- Summary -- Chapter 5: Feature Engineering and NLP Algorithms -- Understanding feature engineering -- What is feature engineering? -- What is the purpose of feature engineering? -- Challenges -- Basic feature of NLP -- Parsers and parsing -- Understanding the basics of parsers -- Understanding the concept of parsing -- Developing a parser from scratch -- Types of grammar -- Context-free grammar -- Probabilistic context-free grammar -- Calculating the probability of a tree.
505 8_  |a Calculating the probability of a string -- Grammar transformation -- Developing a parser with the Cocke-Kasami-Younger Algorithm -- Developing parsers step-by-step -- Existing parser tools -- The Stanford parser -- The spaCy parser -- Extracting and understanding the features -- Customizing parser tools -- Challenges -- POS tagging and POS taggers -- Understanding the concept of POS tagging and POS taggers -- Developing POS taggers step-by-step -- Plug and play with existing POS taggers -- A Stanford POS tagger example -- Using polyglot to generate POS tagging -- Exercise -- Using POS tags as features -- Challenges -- Name entity recognition -- Classes of NER -- Plug and play with existing NER tools -- A Stanford NER example -- A Spacy NER example -- Extracting and understanding the features -- Challenges -- n-grams -- Understanding n-gram using a practice example -- Application -- Bag of words -- Understanding BOW -- Understanding BOW using a practical example -- Comparing n-grams and BOW -- Applications -- Semantic tools and resources -- Basic statistical features for NLP -- Basic mathematics -- Basic concepts of linear algebra for NLP -- Basic concepts of the probabilistic theory for NLP -- Probability -- Independent event and dependent event -- Conditional probability -- TF-IDF -- Understanding TF-IDF -- Understanding TF-IDF with a practical example -- Using textblob -- Using scikit-learn -- Application -- Vectorization -- Encoders and decoders -- One-hot encoding -- Understanding a practical example for one-hot encoding -- Application -- Normalization -- The linguistics aspect of normalization -- The statistical aspect of normalization -- Probabilistic models -- Understanding probabilistic language modeling -- Application of LM -- Indexing -- Application -- Ranking -- Advantages of features engineering -- Challenges of features engineering.
505 8_  |a Advantages of word2vec -- Challenges of word2vec -- How is word2vec used in real-life applications? -- When should you use word2vec? -- Developing something interesting -- Exercise -- Extension of the word2vec concept -- Para2Vec -- Doc2Vec -- Applications of Doc2vec -- GloVe -- Exercise -- Importance of vectorization in deep learning -- Summary -- Chapter 7: Rule-Based System for NLP -- Understanding of the rule-based system -- What does the RB system mean? -- Purpose of having the rule-based system -- Why do we need the rule-based system? -- Which kind of applications can use the RB approach over the other approaches? -- Exercise -- What kind of resources do you need if you want to develop a rule-based system? -- Architecture of the RB system -- General architecture of the rule-based system as an expert system -- Practical architecture of the rule-based system for NLP applications -- Custom architecture -- the RB system for NLP applications -- Exercise -- Apache UIMA -- the RB system for NLP applications -- Understanding the RB system development life cycle -- Applications -- NLP applications using the rule-based system -- Generalized AI applications using the rule-based system -- Developing NLP applications using the RB system -- Thinking process for making rules -- Start with simple rules -- Scraping the text data -- Defining the rule for our goal -- Coding our rule and generating a prototype and result -- Exercise -- Python for pattern-matching rules for a proofreading application -- Exercise -- Grammar correction -- Template-based chatbot application -- Flow of code -- Advantages of template-based chatbot -- Disadvantages of template-based chatbot -- Exercise -- Comparing the rule-based approach with other approaches -- Advantages of the rule-based system -- Disadvantages of the rule-based system -- Challenges for the rule-based system.
520 __  |a Chapter 6: Advanced Feature Engineering and NLP Algorithms -- Recall word embedding -- Understanding the basics of word2vec -- Distributional semantics -- Defining word2vec -- Necessity of unsupervised distribution semantic model - word2vec -- Challenges -- Converting the word2vec model from black box to white box -- Distributional similarity based representation -- Understanding the components of the word2vec model -- Input of the word2vec -- Output of word2vec -- Construction components of the word2vec model -- Architectural component -- Understanding the logic of the word2vec model -- Vocabulary builder -- Context builder -- Neural network with two layers -- Structural details of a word2vec neural network -- Word2vec neural network layer's details -- Softmax function -- Main processing algorithms -- Continuous bag of words -- Skip-gram -- Understanding algorithmic techniques and the mathematics behind the word2vec model -- Understanding the basic mathematics for the word2vec algorithm -- Techniques used at the vocabulary building stage -- Lossy counting -- Using it at the stage of vocabulary building -- Applications -- Techniques used at the context building stage -- Dynamic window scaling -- Understanding dynamic context window techniques -- Subsampling -- Pruning -- Algorithms used by neural networks -- Structure of the neurons -- Basic neuron structure -- Training a simple neuron -- Define error function -- Understanding gradient descent in word2vec -- Single neuron application -- Multi-layer neural networks -- Backpropagation -- Mathematics behind the word2vec model -- Techniques used to generate final vectors and probability prediction stage -- Hierarchical softmax -- Negative sampling -- Some of the facts related to word2vec -- Applications of word2vec -- Implementation of simple examples -- Famous example (king - man + woman).
650 _0  |a Python (Computer program language) |0 http://id.loc.gov/authorities/subjects/sh96008834
650 _0  |a Natural language processing (Computer science) |0 http://id.loc.gov/authorities/subjects/sh88002425
650 _0  |a Machine learning. |0 http://id.loc.gov/authorities/subjects/sh85079324
650 _2  |a Natural Language Processing |0 https://id.nlm.nih.gov/mesh/D009323
650 _2  |a Machine Learning |0 https://id.nlm.nih.gov/mesh/D000069550
650 _6  |a Python (Langage de programmation)
650 _6  |a Traitement automatique des langues naturelles.
650 _6  |a Apprentissage automatique.
650 _7  |a COMPUTERS |x Natural Language Processing. |2 bisacsh
650 _7  |a COMPUTERS |x Data Processing. |2 bisacsh
650 _7  |a COMPUTERS |x Programming Languages |x General. |2 bisacsh
650 _7  |a Machine learning |2 fast
650 _7  |a Natural language processing (Computer science) |2 fast
650 _7  |a Python (Computer program language) |2 fast
856 40  |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1566414 |3 Volltext
938 __  |a Askews and Holts Library Services |b ASKH |n AH33041183
938 __  |a EBSCOhost |b EBSC |n 1566414
938 __  |a ProQuest MyiLibrary Digital eBook Collection |b IDEB |n cis38148030
994 __  |a 92 |b GEBAY
912 __  |a ZDB-4-EBA
049 __  |a DE-863
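The contents notes above (field 505) mention computing TF-IDF "Using scikit-learn". As a minimal sketch of that technique, assuming scikit-learn is installed and using a toy corpus that is not taken from the book:

```python
# Minimal TF-IDF sketch with scikit-learn's TfidfVectorizer.
# The three-sentence corpus is illustrative only.
from sklearn.feature_extraction.text import TfidfVectorizer

corpus = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "cats and dogs are natural enemies",
]

vectorizer = TfidfVectorizer()
tfidf = vectorizer.fit_transform(corpus)  # sparse matrix, shape (3, n_terms)

print(vectorizer.get_feature_names_out())  # vocabulary learned from the corpus
print(tfidf.toarray().round(2))            # one TF-IDF row per document
```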
Record in search index
| Field | Value |
|---|---|
| DE-BY-FWS_katkey | ZDB-4-EBA-on1001347182 |
| _version_ | 1816882398017617920 |
| author | Thanaki, Jalaj |
| building | Verbundindex |
| bvnumber | localFWS |
| callnumber | QA76.73.P98 (Q - Science; QA - Mathematics) |
| collection | ZDB-4-EBA |
| ctrlnum | (OCoLC)1001347182 |
| dewey | 005.133 (005 - Computer programming, programs, data, security) |
| discipline | Informatik (computer science) |
| format | Electronic eBook |
| id | ZDB-4-EBA-on1001347182 |
| illustrated | Illustrated |
| indexdate | 2024-11-27T13:27:58Z |
| institution | BVB |
| isbn | 9781787285521; 1787285529; 9781523112173; 1523112174 |
| language | English |
| oclc_num | 1001347182 |
| owner | MAIN; DE-863; DE-BY-FWS |
| physical | 1 online resource (1 volume) : illustrations |
| psigel | ZDB-4-EBA |
| publishDate | 2017 |
| publisher | Packt Publishing |
| record_format | marc |
| title | Python natural language processing : explore NLP with machine learning and deep learning techniques / |
| url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1566414 |