Regression analysis with Python :: learn the art of regression analysis with Python /
Learn the art of regression analysis with Python About This Book Become competent at implementing regression analysis in Python Solve some of the complex data science problems related to predicting outcomes Get to grips with various types of regression for effective data analysis Who This Book Is Fo...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Birmingham, UK :
Packt Publishing,
2016.
|
Schriftenreihe: | Community experience distilled.
|
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | Learn the art of regression analysis with Python About This Book Become competent at implementing regression analysis in Python Solve some of the complex data science problems related to predicting outcomes Get to grips with various types of regression for effective data analysis Who This Book Is For The book targets Python developers, with a basic understanding of data science, statistics, and math, who want to learn how to do regression analysis on a dataset. It is beneficial if you have some knowledge of statistics and data science. What You Will Learn Format a dataset for regression and evaluate its performance Apply multiple linear regression to real-world problems Learn to classify training points Create an observation matrix, using different techniques of data analysis and cleaning Apply several techniques to decrease (and eventually fix) any overfitting problem Learn to scale linear models to a big dataset and deal with incremental data In Detail Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and take critical business decisions. Through the book, you will gain knowledge to use Python for building fast better linear models and to apply the results in Python or in any computer language you prefer. Style and approach This is a practical tutorial-based book. You will be given an example problem and then supplied with the relevant code and how to walk through it. The details are provided in a step by step manner, followed by a thorough explanation of the math underlying the solution. This approach will help you leverage your own data using the same techniques. |
Beschreibung: | Includes index. |
Beschreibung: | 1 online resource (1 volume) : illustrations |
ISBN: | 9781783980741 1783980745 |
Internformat
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-ocn945554879 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr unu|||||||| | ||
008 | 160324s2016 enka o 001 0 eng d | ||
040 | |a UMI |b eng |e rda |e pn |c UMI |d OCLCF |d KSU |d COO |d DEBBG |d DEBSZ |d YDX |d TEFOD |d REB |d UOK |d CEF |d NLE |d UKMGB |d WYU |d N$T |d AGLDB |d IGB |d UKAHL |d RDF |d QGK |d OCLCO |d OCLCQ |d OCLCO |d OCLCL |d OCLCQ | ||
015 | |a GBB742474 |2 bnb | ||
016 | 7 | |a 018006799 |2 Uk | |
019 | |a 1259219463 | ||
020 | |a 9781783980741 |q (electronic bk.) | ||
020 | |a 1783980745 |q (electronic bk.) | ||
020 | |z 1785286315 | ||
020 | |z 9781785286315 | ||
035 | |a (OCoLC)945554879 |z (OCoLC)1259219463 | ||
037 | |a CL0500000725 |b Safari Books Online | ||
037 | |a 63FFBA73-36AF-4027-A499-E4AFDEE6CB27 |b OverDrive, Inc. |n http://www.overdrive.com | ||
050 | 4 | |a QA278.2 | |
072 | 7 | |a MAT |x 003000 |2 bisacsh | |
072 | 7 | |a MAT |x 029000 |2 bisacsh | |
082 | 7 | |a 519.5/36 |2 23 | |
049 | |a MAIN | ||
100 | 1 | |a Massaron, Luca, |e author. | |
245 | 1 | 0 | |a Regression analysis with Python : |b learn the art of regression analysis with Python / |c Luca Massaron, Alberto Boschetti. |
264 | 1 | |a Birmingham, UK : |b Packt Publishing, |c 2016. | |
300 | |a 1 online resource (1 volume) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
347 | |a text file | ||
490 | 1 | |a Community experience distilled | |
588 | |a Description based on online resource; title from cover (viewed March 23, 2016). | ||
500 | |a Includes index. | ||
505 | 0 | |a Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Regression -- The Workhorse of Data Science; Regression analysis and data science; Exploring the promise of data science; The challenge; The linear models; What you are going to find in the book ; Python for data science; Installing Python; Choosing between Python 2 and Python 3; Step-by-step installation; Installing packages; Package upgrades; Scientific distributions; Introducing Jupyter or IPython; Python packages and functions for linear models ; NumPy; SciPy | |
505 | 8 | |a StatsmodelsScikit-learn; Summary; Chapter 2: Approaching Simple Linear Regression; Defining a regression problem; Linear models and supervised learning; Reflecting on predictive variables; Reflecting on response variables; The family of linear models; Preparing to discover simple linear regression; Starting from the basics; A measure of linear relationship; Extending to linear regression; Regressing with StatsModels; The coefficient of determination; Meaning and significance of coefficients; Evaluating the fitted values; Correlation is not causation; Predicting with a regression model | |
505 | 8 | |a Regressing with Scikit-learnMinimizing the cost function; Explaining the reason for using squared errors; Pseudoinverse and other optimization methods; Gradient Descent at work; Summary; Chapter 3: Multiple Regression in Action; Using multiple features; Model building with Statsmodels; Using formulas as an alternative; The correlation matrix; Revisiting gradient descent; Feature scaling; Unstandardizing coefficients; Estimating feature importance; Inspecting standardized coefficients; Comparing models by R-squared; Interaction models; Discovering interactions; Polynomial regression | |
505 | 8 | |a Testing linear versus cubic transformationGoing for higher-degree solutions; Introducing underfitting and overfitting; Summary; Chapter 4: Logistic Regression; Defining a classification problem; Formalization of the problem: binary classification; Assessing the classifier's performance; Defining a probability-based approach; More on the logistic and logit functions; Let's see some code; Pros and cons of logistic regression; Revisiting Gradient Descend; Multiclass Logistic Regression; An example; Summary; Chapter 5: Data Preparation; Numeric feature scaling; Mean centering; Standardization | |
505 | 8 | |a NormalizationThe logistic regression case; Qualitative feature encoding; Dummy coding with Pandas; DictVectorizer and one-hot encoding; Feature hasher; Numeric feature transformation; Observing residuals; Summarizations by binning; Missing data; Missing data imputation; Keeping track of missing values; Outliers; Outliers on the response; Outliers among the predictors; Removing or replacing outliers; Summary; Chapter 6: Achieving Generalization; Checking on out-of-sample data; Testing by sample split; Cross-validation; Bootstrapping; Greedy selection of features ; The Madelon dataset | |
520 | |a Learn the art of regression analysis with Python About This Book Become competent at implementing regression analysis in Python Solve some of the complex data science problems related to predicting outcomes Get to grips with various types of regression for effective data analysis Who This Book Is For The book targets Python developers, with a basic understanding of data science, statistics, and math, who want to learn how to do regression analysis on a dataset. It is beneficial if you have some knowledge of statistics and data science. What You Will Learn Format a dataset for regression and evaluate its performance Apply multiple linear regression to real-world problems Learn to classify training points Create an observation matrix, using different techniques of data analysis and cleaning Apply several techniques to decrease (and eventually fix) any overfitting problem Learn to scale linear models to a big dataset and deal with incremental data In Detail Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and take critical business decisions. Through the book, you will gain knowledge to use Python for building fast better linear models and to apply the results in Python or in any computer language you prefer. Style and approach This is a practical tutorial-based book. You will be given an example problem and then supplied with the relevant code and how to walk through it. The details are provided in a step by step manner, followed by a thorough explanation of the math underlying the solution. This approach will help you leverage your own data using the same techniques. | ||
650 | 0 | |a Regression analysis. |0 http://id.loc.gov/authorities/subjects/sh85112392 | |
650 | 0 | |a Python (Computer program language) |0 http://id.loc.gov/authorities/subjects/sh96008834 | |
650 | 2 | |a Regression Analysis |0 https://id.nlm.nih.gov/mesh/D012044 | |
650 | 6 | |a Analyse de régression. | |
650 | 6 | |a Python (Langage de programmation) | |
650 | 7 | |a COMPUTERS |x Data Visualization. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Programming |x Algorithms. |2 bisacsh | |
650 | 7 | |a MATHEMATICS / Applied |2 bisacsh | |
650 | 7 | |a MATHEMATICS / Probability & Statistics / General |2 bisacsh | |
650 | 7 | |a Python (Computer program language) |2 fast | |
650 | 7 | |a Regression analysis |2 fast | |
700 | 1 | |a Boschetti, Alberto, |e author. | |
758 | |i has work: |a Regression Analysis with Python (Text) |1 https://id.oclc.org/worldcat/entity/E39PCYKG6gqFYGd4DG6kC9V4hd |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
776 | |z 1-78528-631-5 | ||
830 | 0 | |a Community experience distilled. |0 http://id.loc.gov/authorities/names/no2011030603 | |
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1193721 |3 Volltext |
938 | |a Askews and Holts Library Services |b ASKH |n BDZ0027122886 | ||
938 | |a YBP Library Services |b YANK |n 12872677 | ||
938 | |a EBSCOhost |b EBSC |n 1193721 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Datensatz im Suchindex
DE-BY-FWS_katkey | ZDB-4-EBA-ocn945554879 |
---|---|
_version_ | 1816882343476985856 |
adam_text | |
any_adam_object | |
author | Massaron, Luca Boschetti, Alberto |
author_facet | Massaron, Luca Boschetti, Alberto |
author_role | aut aut |
author_sort | Massaron, Luca |
author_variant | l m lm a b ab |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | Q - Science |
callnumber-label | QA278 |
callnumber-raw | QA278.2 |
callnumber-search | QA278.2 |
callnumber-sort | QA 3278.2 |
callnumber-subject | QA - Mathematics |
collection | ZDB-4-EBA |
contents | Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Regression -- The Workhorse of Data Science; Regression analysis and data science; Exploring the promise of data science; The challenge; The linear models; What you are going to find in the book ; Python for data science; Installing Python; Choosing between Python 2 and Python 3; Step-by-step installation; Installing packages; Package upgrades; Scientific distributions; Introducing Jupyter or IPython; Python packages and functions for linear models ; NumPy; SciPy StatsmodelsScikit-learn; Summary; Chapter 2: Approaching Simple Linear Regression; Defining a regression problem; Linear models and supervised learning; Reflecting on predictive variables; Reflecting on response variables; The family of linear models; Preparing to discover simple linear regression; Starting from the basics; A measure of linear relationship; Extending to linear regression; Regressing with StatsModels; The coefficient of determination; Meaning and significance of coefficients; Evaluating the fitted values; Correlation is not causation; Predicting with a regression model Regressing with Scikit-learnMinimizing the cost function; Explaining the reason for using squared errors; Pseudoinverse and other optimization methods; Gradient Descent at work; Summary; Chapter 3: Multiple Regression in Action; Using multiple features; Model building with Statsmodels; Using formulas as an alternative; The correlation matrix; Revisiting gradient descent; Feature scaling; Unstandardizing coefficients; Estimating feature importance; Inspecting standardized coefficients; Comparing models by R-squared; Interaction models; Discovering interactions; Polynomial regression Testing linear versus cubic transformationGoing for higher-degree solutions; Introducing underfitting and overfitting; Summary; Chapter 4: Logistic Regression; Defining a classification problem; Formalization of the problem: binary classification; Assessing the classifier's performance; Defining a probability-based approach; More on the logistic and logit functions; Let's see some code; Pros and cons of logistic regression; Revisiting Gradient Descend; Multiclass Logistic Regression; An example; Summary; Chapter 5: Data Preparation; Numeric feature scaling; Mean centering; Standardization NormalizationThe logistic regression case; Qualitative feature encoding; Dummy coding with Pandas; DictVectorizer and one-hot encoding; Feature hasher; Numeric feature transformation; Observing residuals; Summarizations by binning; Missing data; Missing data imputation; Keeping track of missing values; Outliers; Outliers on the response; Outliers among the predictors; Removing or replacing outliers; Summary; Chapter 6: Achieving Generalization; Checking on out-of-sample data; Testing by sample split; Cross-validation; Bootstrapping; Greedy selection of features ; The Madelon dataset |
ctrlnum | (OCoLC)945554879 |
dewey-full | 519.5/36 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.5/36 |
dewey-search | 519.5/36 |
dewey-sort | 3519.5 236 |
dewey-tens | 510 - Mathematics |
discipline | Mathematik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>08304cam a2200733 i 4500</leader><controlfield tag="001">ZDB-4-EBA-ocn945554879</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr unu||||||||</controlfield><controlfield tag="008">160324s2016 enka o 001 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">UMI</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">UMI</subfield><subfield code="d">OCLCF</subfield><subfield code="d">KSU</subfield><subfield code="d">COO</subfield><subfield code="d">DEBBG</subfield><subfield code="d">DEBSZ</subfield><subfield code="d">YDX</subfield><subfield code="d">TEFOD</subfield><subfield code="d">REB</subfield><subfield code="d">UOK</subfield><subfield code="d">CEF</subfield><subfield code="d">NLE</subfield><subfield code="d">UKMGB</subfield><subfield code="d">WYU</subfield><subfield code="d">N$T</subfield><subfield code="d">AGLDB</subfield><subfield code="d">IGB</subfield><subfield code="d">UKAHL</subfield><subfield code="d">RDF</subfield><subfield code="d">QGK</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield><subfield code="d">OCLCQ</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBB742474</subfield><subfield code="2">bnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">018006799</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">1259219463</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781783980741</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1783980745</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">1785286315</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781785286315</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)945554879</subfield><subfield code="z">(OCoLC)1259219463</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">CL0500000725</subfield><subfield code="b">Safari Books Online</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">63FFBA73-36AF-4027-A499-E4AFDEE6CB27</subfield><subfield code="b">OverDrive, Inc.</subfield><subfield code="n">http://www.overdrive.com</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">QA278.2</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">MAT</subfield><subfield code="x">003000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">MAT</subfield><subfield code="x">029000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">519.5/36</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Massaron, Luca,</subfield><subfield code="e">author.</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Regression analysis with Python :</subfield><subfield code="b">learn the art of regression analysis with Python /</subfield><subfield code="c">Luca Massaron, Alberto Boschetti.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham, UK :</subfield><subfield code="b">Packt Publishing,</subfield><subfield code="c">2016.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (1 volume) :</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="347" ind1=" " ind2=" "><subfield code="a">text file</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Community experience distilled</subfield></datafield><datafield tag="588" ind1=" " ind2=" "><subfield code="a">Description based on online resource; title from cover (viewed March 23, 2016).</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Includes index.</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Regression -- The Workhorse of Data Science; Regression analysis and data science; Exploring the promise of data science; The challenge; The linear models; What you are going to find in the book ; Python for data science; Installing Python; Choosing between Python 2 and Python 3; Step-by-step installation; Installing packages; Package upgrades; Scientific distributions; Introducing Jupyter or IPython; Python packages and functions for linear models ; NumPy; SciPy</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">StatsmodelsScikit-learn; Summary; Chapter 2: Approaching Simple Linear Regression; Defining a regression problem; Linear models and supervised learning; Reflecting on predictive variables; Reflecting on response variables; The family of linear models; Preparing to discover simple linear regression; Starting from the basics; A measure of linear relationship; Extending to linear regression; Regressing with StatsModels; The coefficient of determination; Meaning and significance of coefficients; Evaluating the fitted values; Correlation is not causation; Predicting with a regression model</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Regressing with Scikit-learnMinimizing the cost function; Explaining the reason for using squared errors; Pseudoinverse and other optimization methods; Gradient Descent at work; Summary; Chapter 3: Multiple Regression in Action; Using multiple features; Model building with Statsmodels; Using formulas as an alternative; The correlation matrix; Revisiting gradient descent; Feature scaling; Unstandardizing coefficients; Estimating feature importance; Inspecting standardized coefficients; Comparing models by R-squared; Interaction models; Discovering interactions; Polynomial regression</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Testing linear versus cubic transformationGoing for higher-degree solutions; Introducing underfitting and overfitting; Summary; Chapter 4: Logistic Regression; Defining a classification problem; Formalization of the problem: binary classification; Assessing the classifier's performance; Defining a probability-based approach; More on the logistic and logit functions; Let's see some code; Pros and cons of logistic regression; Revisiting Gradient Descend; Multiclass Logistic Regression; An example; Summary; Chapter 5: Data Preparation; Numeric feature scaling; Mean centering; Standardization</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">NormalizationThe logistic regression case; Qualitative feature encoding; Dummy coding with Pandas; DictVectorizer and one-hot encoding; Feature hasher; Numeric feature transformation; Observing residuals; Summarizations by binning; Missing data; Missing data imputation; Keeping track of missing values; Outliers; Outliers on the response; Outliers among the predictors; Removing or replacing outliers; Summary; Chapter 6: Achieving Generalization; Checking on out-of-sample data; Testing by sample split; Cross-validation; Bootstrapping; Greedy selection of features ; The Madelon dataset</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Learn the art of regression analysis with Python About This Book Become competent at implementing regression analysis in Python Solve some of the complex data science problems related to predicting outcomes Get to grips with various types of regression for effective data analysis Who This Book Is For The book targets Python developers, with a basic understanding of data science, statistics, and math, who want to learn how to do regression analysis on a dataset. It is beneficial if you have some knowledge of statistics and data science. What You Will Learn Format a dataset for regression and evaluate its performance Apply multiple linear regression to real-world problems Learn to classify training points Create an observation matrix, using different techniques of data analysis and cleaning Apply several techniques to decrease (and eventually fix) any overfitting problem Learn to scale linear models to a big dataset and deal with incremental data In Detail Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and take critical business decisions. Through the book, you will gain knowledge to use Python for building fast better linear models and to apply the results in Python or in any computer language you prefer. Style and approach This is a practical tutorial-based book. You will be given an example problem and then supplied with the relevant code and how to walk through it. The details are provided in a step by step manner, followed by a thorough explanation of the math underlying the solution. This approach will help you leverage your own data using the same techniques.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Regression analysis.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85112392</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Python (Computer program language)</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh96008834</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Regression Analysis</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D012044</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Analyse de régression.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Python (Langage de programmation)</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">COMPUTERS</subfield><subfield code="x">Data Visualization.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">COMPUTERS</subfield><subfield code="x">Programming</subfield><subfield code="x">Algorithms.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">MATHEMATICS / Applied</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">MATHEMATICS / Probability & Statistics / General</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Python (Computer program language)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Regression analysis</subfield><subfield code="2">fast</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Boschetti, Alberto,</subfield><subfield code="e">author.</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">Regression Analysis with Python (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCYKG6gqFYGd4DG6kC9V4hd</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="776" ind1=" " ind2=" "><subfield code="z">1-78528-631-5</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Community experience distilled.</subfield><subfield code="0">http://id.loc.gov/authorities/names/no2011030603</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1193721</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">Askews and Holts Library Services</subfield><subfield code="b">ASKH</subfield><subfield code="n">BDZ0027122886</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">YBP Library Services</subfield><subfield code="b">YANK</subfield><subfield code="n">12872677</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">1193721</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection> |
id | ZDB-4-EBA-ocn945554879 |
illustrated | Illustrated |
indexdate | 2024-11-27T13:27:06Z |
institution | BVB |
isbn | 9781783980741 1783980745 |
language | English |
oclc_num | 945554879 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (1 volume) : illustrations |
psigel | ZDB-4-EBA |
publishDate | 2016 |
publishDateSearch | 2016 |
publishDateSort | 2016 |
publisher | Packt Publishing, |
record_format | marc |
series | Community experience distilled. |
series2 | Community experience distilled |
spelling | Massaron, Luca, author. Regression analysis with Python : learn the art of regression analysis with Python / Luca Massaron, Alberto Boschetti. Birmingham, UK : Packt Publishing, 2016. 1 online resource (1 volume) : illustrations text txt rdacontent computer c rdamedia online resource cr rdacarrier text file Community experience distilled Description based on online resource; title from cover (viewed March 23, 2016). Includes index. Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Regression -- The Workhorse of Data Science; Regression analysis and data science; Exploring the promise of data science; The challenge; The linear models; What you are going to find in the book ; Python for data science; Installing Python; Choosing between Python 2 and Python 3; Step-by-step installation; Installing packages; Package upgrades; Scientific distributions; Introducing Jupyter or IPython; Python packages and functions for linear models ; NumPy; SciPy StatsmodelsScikit-learn; Summary; Chapter 2: Approaching Simple Linear Regression; Defining a regression problem; Linear models and supervised learning; Reflecting on predictive variables; Reflecting on response variables; The family of linear models; Preparing to discover simple linear regression; Starting from the basics; A measure of linear relationship; Extending to linear regression; Regressing with StatsModels; The coefficient of determination; Meaning and significance of coefficients; Evaluating the fitted values; Correlation is not causation; Predicting with a regression model Regressing with Scikit-learnMinimizing the cost function; Explaining the reason for using squared errors; Pseudoinverse and other optimization methods; Gradient Descent at work; Summary; Chapter 3: Multiple Regression in Action; Using multiple features; Model building with Statsmodels; Using formulas as an alternative; The correlation matrix; Revisiting gradient descent; Feature scaling; Unstandardizing coefficients; Estimating feature importance; Inspecting standardized coefficients; Comparing models by R-squared; Interaction models; Discovering interactions; Polynomial regression Testing linear versus cubic transformationGoing for higher-degree solutions; Introducing underfitting and overfitting; Summary; Chapter 4: Logistic Regression; Defining a classification problem; Formalization of the problem: binary classification; Assessing the classifier's performance; Defining a probability-based approach; More on the logistic and logit functions; Let's see some code; Pros and cons of logistic regression; Revisiting Gradient Descend; Multiclass Logistic Regression; An example; Summary; Chapter 5: Data Preparation; Numeric feature scaling; Mean centering; Standardization NormalizationThe logistic regression case; Qualitative feature encoding; Dummy coding with Pandas; DictVectorizer and one-hot encoding; Feature hasher; Numeric feature transformation; Observing residuals; Summarizations by binning; Missing data; Missing data imputation; Keeping track of missing values; Outliers; Outliers on the response; Outliers among the predictors; Removing or replacing outliers; Summary; Chapter 6: Achieving Generalization; Checking on out-of-sample data; Testing by sample split; Cross-validation; Bootstrapping; Greedy selection of features ; The Madelon dataset Learn the art of regression analysis with Python About This Book Become competent at implementing regression analysis in Python Solve some of the complex data science problems related to predicting outcomes Get to grips with various types of regression for effective data analysis Who This Book Is For The book targets Python developers, with a basic understanding of data science, statistics, and math, who want to learn how to do regression analysis on a dataset. It is beneficial if you have some knowledge of statistics and data science. What You Will Learn Format a dataset for regression and evaluate its performance Apply multiple linear regression to real-world problems Learn to classify training points Create an observation matrix, using different techniques of data analysis and cleaning Apply several techniques to decrease (and eventually fix) any overfitting problem Learn to scale linear models to a big dataset and deal with incremental data In Detail Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and take critical business decisions. Through the book, you will gain knowledge to use Python for building fast better linear models and to apply the results in Python or in any computer language you prefer. Style and approach This is a practical tutorial-based book. You will be given an example problem and then supplied with the relevant code and how to walk through it. The details are provided in a step by step manner, followed by a thorough explanation of the math underlying the solution. This approach will help you leverage your own data using the same techniques. Regression analysis. http://id.loc.gov/authorities/subjects/sh85112392 Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Regression Analysis https://id.nlm.nih.gov/mesh/D012044 Analyse de régression. Python (Langage de programmation) COMPUTERS Data Visualization. bisacsh COMPUTERS Programming Algorithms. bisacsh MATHEMATICS / Applied bisacsh MATHEMATICS / Probability & Statistics / General bisacsh Python (Computer program language) fast Regression analysis fast Boschetti, Alberto, author. has work: Regression Analysis with Python (Text) https://id.oclc.org/worldcat/entity/E39PCYKG6gqFYGd4DG6kC9V4hd https://id.oclc.org/worldcat/ontology/hasWork 1-78528-631-5 Community experience distilled. http://id.loc.gov/authorities/names/no2011030603 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1193721 Volltext |
spellingShingle | Massaron, Luca Boschetti, Alberto Regression analysis with Python : learn the art of regression analysis with Python / Community experience distilled. Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Regression -- The Workhorse of Data Science; Regression analysis and data science; Exploring the promise of data science; The challenge; The linear models; What you are going to find in the book ; Python for data science; Installing Python; Choosing between Python 2 and Python 3; Step-by-step installation; Installing packages; Package upgrades; Scientific distributions; Introducing Jupyter or IPython; Python packages and functions for linear models ; NumPy; SciPy StatsmodelsScikit-learn; Summary; Chapter 2: Approaching Simple Linear Regression; Defining a regression problem; Linear models and supervised learning; Reflecting on predictive variables; Reflecting on response variables; The family of linear models; Preparing to discover simple linear regression; Starting from the basics; A measure of linear relationship; Extending to linear regression; Regressing with StatsModels; The coefficient of determination; Meaning and significance of coefficients; Evaluating the fitted values; Correlation is not causation; Predicting with a regression model Regressing with Scikit-learnMinimizing the cost function; Explaining the reason for using squared errors; Pseudoinverse and other optimization methods; Gradient Descent at work; Summary; Chapter 3: Multiple Regression in Action; Using multiple features; Model building with Statsmodels; Using formulas as an alternative; The correlation matrix; Revisiting gradient descent; Feature scaling; Unstandardizing coefficients; Estimating feature importance; Inspecting standardized coefficients; Comparing models by R-squared; Interaction models; Discovering interactions; Polynomial regression Testing linear versus cubic transformationGoing for higher-degree solutions; Introducing underfitting and overfitting; Summary; Chapter 4: Logistic Regression; Defining a classification problem; Formalization of the problem: binary classification; Assessing the classifier's performance; Defining a probability-based approach; More on the logistic and logit functions; Let's see some code; Pros and cons of logistic regression; Revisiting Gradient Descend; Multiclass Logistic Regression; An example; Summary; Chapter 5: Data Preparation; Numeric feature scaling; Mean centering; Standardization NormalizationThe logistic regression case; Qualitative feature encoding; Dummy coding with Pandas; DictVectorizer and one-hot encoding; Feature hasher; Numeric feature transformation; Observing residuals; Summarizations by binning; Missing data; Missing data imputation; Keeping track of missing values; Outliers; Outliers on the response; Outliers among the predictors; Removing or replacing outliers; Summary; Chapter 6: Achieving Generalization; Checking on out-of-sample data; Testing by sample split; Cross-validation; Bootstrapping; Greedy selection of features ; The Madelon dataset Regression analysis. http://id.loc.gov/authorities/subjects/sh85112392 Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Regression Analysis https://id.nlm.nih.gov/mesh/D012044 Analyse de régression. Python (Langage de programmation) COMPUTERS Data Visualization. bisacsh COMPUTERS Programming Algorithms. bisacsh MATHEMATICS / Applied bisacsh MATHEMATICS / Probability & Statistics / General bisacsh Python (Computer program language) fast Regression analysis fast |
subject_GND | http://id.loc.gov/authorities/subjects/sh85112392 http://id.loc.gov/authorities/subjects/sh96008834 https://id.nlm.nih.gov/mesh/D012044 |
title | Regression analysis with Python : learn the art of regression analysis with Python / |
title_auth | Regression analysis with Python : learn the art of regression analysis with Python / |
title_exact_search | Regression analysis with Python : learn the art of regression analysis with Python / |
title_full | Regression analysis with Python : learn the art of regression analysis with Python / Luca Massaron, Alberto Boschetti. |
title_fullStr | Regression analysis with Python : learn the art of regression analysis with Python / Luca Massaron, Alberto Boschetti. |
title_full_unstemmed | Regression analysis with Python : learn the art of regression analysis with Python / Luca Massaron, Alberto Boschetti. |
title_short | Regression analysis with Python : |
title_sort | regression analysis with python learn the art of regression analysis with python |
title_sub | learn the art of regression analysis with Python / |
topic | Regression analysis. http://id.loc.gov/authorities/subjects/sh85112392 Python (Computer program language) http://id.loc.gov/authorities/subjects/sh96008834 Regression Analysis https://id.nlm.nih.gov/mesh/D012044 Analyse de régression. Python (Langage de programmation) COMPUTERS Data Visualization. bisacsh COMPUTERS Programming Algorithms. bisacsh MATHEMATICS / Applied bisacsh MATHEMATICS / Probability & Statistics / General bisacsh Python (Computer program language) fast Regression analysis fast |
topic_facet | Regression analysis. Python (Computer program language) Regression Analysis Analyse de régression. Python (Langage de programmation) COMPUTERS Data Visualization. COMPUTERS Programming Algorithms. MATHEMATICS / Applied MATHEMATICS / Probability & Statistics / General Regression analysis |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1193721 |
work_keys_str_mv | AT massaronluca regressionanalysiswithpythonlearntheartofregressionanalysiswithpython AT boschettialberto regressionanalysiswithpythonlearntheartofregressionanalysiswithpython |