R data analysis projects :: build end to end analytics systems to get deeper insights from your data /
Get valuable insights from your data by building data analysis systems from scratch with R. About This Book A handy guide to take your understanding of data analysis with R to the next level Real-world projects that focus on problems in finance, network analysis, social media, and more From data man...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Birmingham, UK :
Packt Publishing,
2017.
|
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | Get valuable insights from your data by building data analysis systems from scratch with R. About This Book A handy guide to take your understanding of data analysis with R to the next level Real-world projects that focus on problems in finance, network analysis, social media, and more From data manipulation to analysis to visualization in R, this book will teach you everything you need to know about building end-to-end data analysis pipelines using R Who This Book Is For If you are looking for a book that takes you all the way through the practical application of advanced and effective analytics methodologies in R, then this is the book for you. A fundamental understanding of R and the basic concepts of data analysis is all you need to get started with this book. What You Will Learn Build end-to-end predictive analytics systems in R Build an experimental design to gather your own data and conduct analysis Build a recommender system from scratch using different approaches Use and leverage RShiny to build reactive programming applications Build systems for varied domains including market research, network analysis, social media analysis, and more Explore various R Packages such as RShiny, ggplot, recommenderlab, dplyr, and find out how to use them effectively Communicate modeling results using Shiny Dashboards Perform multi-variate time-series analysis prediction, supplemented with sensitivity analysis and risk modeling In Detail R offers a large variety of packages and libraries for fast and accurate data analysis and visualization. As a result, it's one of the most popularly used languages by data scientists and analysts, or anyone who wants to perform data analysis. This book will demonstrate how you can put to use your existing knowledge of data analysis in R to build highly efficient, end-to-end data analysis pipelines without any hassle. You'll start by building a content-based recommendation system, followed by building a project on sentiment analysis with tweets. You'll implement time-series modeling for anomaly detection, and understand cluster analysis of streaming data. You'll work through projects on performing efficient market data research, building recommendation systems, and analyzing networks accurately, all provided with easy to follow codes. With the help of these real-world projects, you'll get a better understanding of the challenges faced when building data analysis pipelines, and see how you can overcome them without comp ... |
Beschreibung: | 1 online resource (1 volume) : illustrations |
ISBN: | 9781788620574 1788620577 1788621875 9781788621878 |
Internformat
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-on1017754231 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr unu|||||||| | ||
008 | 180104s2017 enka o 000 0 eng d | ||
040 | |a UMI |b eng |e rda |e pn |c UMI |d IDEBK |d TOH |d STF |d OCLCF |d CEF |d KSU |d N$T |d DEBBG |d TEFOD |d G3B |d S9I |d UAB |d QGK |d OCLCQ |d OCLCO |d OCL |d NZAUC |d OCLCQ |d OCLCO |d OCLCL |d DXU | ||
020 | |a 9781788620574 |q (electronic bk.) | ||
020 | |a 1788620577 |q (electronic bk.) | ||
020 | |a 1788621875 | ||
020 | |a 9781788621878 | ||
020 | |z 9781788621878 | ||
035 | |a (OCoLC)1017754231 | ||
037 | |a CL0500000922 |b Safari Books Online | ||
037 | |a F0F13939-1C2A-4149-9BEC-142B4AE332AC |b OverDrive, Inc. |n http://www.overdrive.com | ||
050 | 4 | |a QA276.45.R3 | |
072 | 7 | |a MAT |x 003000 |2 bisacsh | |
072 | 7 | |a MAT |x 029000 |2 bisacsh | |
082 | 7 | |a 519.502855133 |2 23 | |
049 | |a MAIN | ||
100 | 1 | |a Subramanian, Gopi, |e author. |0 http://id.loc.gov/authorities/names/no2016051875 | |
245 | 1 | 0 | |a R data analysis projects : |b build end to end analytics systems to get deeper insights from your data / |c Gopi Subramanian. |
246 | 3 | 0 | |a Build end to end analytics systems to get deeper insights from your data |
264 | 1 | |a Birmingham, UK : |b Packt Publishing, |c 2017. | |
300 | |a 1 online resource (1 volume) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
347 | |a data file | ||
588 | 0 | |a Online resource; title from title page (viewed January 3, 2018). | |
520 | |a Get valuable insights from your data by building data analysis systems from scratch with R. About This Book A handy guide to take your understanding of data analysis with R to the next level Real-world projects that focus on problems in finance, network analysis, social media, and more From data manipulation to analysis to visualization in R, this book will teach you everything you need to know about building end-to-end data analysis pipelines using R Who This Book Is For If you are looking for a book that takes you all the way through the practical application of advanced and effective analytics methodologies in R, then this is the book for you. A fundamental understanding of R and the basic concepts of data analysis is all you need to get started with this book. What You Will Learn Build end-to-end predictive analytics systems in R Build an experimental design to gather your own data and conduct analysis Build a recommender system from scratch using different approaches Use and leverage RShiny to build reactive programming applications Build systems for varied domains including market research, network analysis, social media analysis, and more Explore various R Packages such as RShiny, ggplot, recommenderlab, dplyr, and find out how to use them effectively Communicate modeling results using Shiny Dashboards Perform multi-variate time-series analysis prediction, supplemented with sensitivity analysis and risk modeling In Detail R offers a large variety of packages and libraries for fast and accurate data analysis and visualization. As a result, it's one of the most popularly used languages by data scientists and analysts, or anyone who wants to perform data analysis. This book will demonstrate how you can put to use your existing knowledge of data analysis in R to build highly efficient, end-to-end data analysis pipelines without any hassle. You'll start by building a content-based recommendation system, followed by building a project on sentiment analysis with tweets. You'll implement time-series modeling for anomaly detection, and understand cluster analysis of streaming data. You'll work through projects on performing efficient market data research, building recommendation systems, and analyzing networks accurately, all provided with easy to follow codes. With the help of these real-world projects, you'll get a better understanding of the challenges faced when building data analysis pipelines, and see how you can overcome them without comp ... | ||
505 | 0 | |a Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Association Rule Mining -- Understanding the recommender systems -- Transactions -- Weighted transactions -- Our web application -- Retailer use case and data -- Association rule mining -- Support and confidence thresholds -- The cross-selling campaign -- Leverage -- Conviction -- Weighted association rule mining -- Hyperlink-induced topic search (HITS) -- Negative association rules -- Rules visualization -- Wrapping up -- Summary -- Chapter 2: Fuzzy Logic Induced Content-Based Recommendation -- Introducing content-based recommendation -- News aggregator use case and data -- Designing the content-based recommendation engine -- Building a similarity index -- Bag-of-words -- Term frequency -- Document frequency -- Inverse document frequency (IDF) -- TFIDF -- Why cosine similarity? -- Searching -- Polarity scores -- Jaccard's distance -- Jaccards distance/index -- Ranking search results -- Fuzzy logic -- Fuzzification -- Defining the rules -- Evaluating the rules -- Defuzzification -- Complete R Code -- Summary -- Chapter 3: Collaborative Filtering -- Collaborative filtering -- Memory-based approach -- Model-based approach -- Latent factor approach -- Recommenderlab package -- Popular approach -- Use case and data -- Designing and implementing collaborative filtering -- Ratings matrix -- Normalization -- Train test split -- Train model -- User-based models -- Item-based models -- Factor-based models -- Complete R Code -- Summary -- Chapter 4: Taming Time Series Data Using Deep Neural Networks -- Time series data -- Non-seasonal time series -- Seasonal time series -- Time series as a regression problem -- Deep neural networks -- Forward cycle -- Backward cycle. | |
505 | 8 | |a Introduction to the MXNet R package -- Symbolic programming in MXNet -- Softmax activation -- Use case and data -- Deep networks for time series prediction -- Training test split -- Complete R code -- Summary -- Chapter 5: Twitter Text Sentiment Classification Using Kernel Density Estimates -- Kernel density estimation -- Twitter text -- Sentiment classification -- Dictionary methods -- Machine learning methods -- Our approach -- Dictionary based scoring -- Text pre-processing -- Term-frequeny inverse document frequency (TFIDF) -- Delta TFIDF -- Building a sentiment classifier -- Assembling an RShiny application -- Complete R code -- Summary -- Chapter 6: Record Linkage -- Stochastic and Machine Learning Approaches -- Introducing our use case -- Demonstrating the use of RecordLinkage package -- Feature generation -- String features -- Phonetic features -- Stochastic record linkage -- Expectation maximization method -- Weights-based method -- Machine learning-based record linkage -- Unsupervised learning -- Supervised learning -- Building an RShiny application -- Complete R code -- Feature generation -- Expectation maximization method -- Weights-based method -- Machine learning method -- RShiny application -- Summary -- Chapter 7: Streaming Data Clustering Analysis in R -- Streaming data and its challenges -- Bounded problems -- Drift -- Single pass -- Real time -- Introducing stream clustering -- Macro-cluster -- Introducing the stream package -- Data stream data -- DSD as a static simulator -- DSD as a simulator with drift -- DSD connecting to memory, file, or database -- Inflight operation -- Can we connect this DSD to an actual data stream? -- Data stream task -- Use case and data -- Speed layer -- Batch layer -- Reservoir sampling -- Complete R code -- Summary -- Chapter 8: Analyze and Understand Networks Using R -- Graphs in R. | |
505 | 8 | |a Degree of a vertex -- Strength of a vertex -- Adjacency Matrix -- More networks in R -- Centrality of a vertex -- Farness and Closeness of a node -- Finding the shortest path between nodes -- Random walk on a graph -- Use case and data -- Data preparation -- Product network analysis -- Building a RShiny application -- The complete R script -- Summary -- Index. | |
650 | 0 | |a R (Computer program language) |0 http://id.loc.gov/authorities/subjects/sh2002004407 | |
650 | 0 | |a Domain-specific programming languages. |0 http://id.loc.gov/authorities/subjects/sh2007005223 | |
650 | 0 | |a Data mining |x Mathematics. | |
650 | 6 | |a R (Langage de programmation) | |
650 | 6 | |a Langages orientés problème (Informatique) | |
650 | 6 | |a Exploration de données (Informatique) |x Mathématiques. | |
650 | 7 | |a MATHEMATICS |x Applied. |2 bisacsh | |
650 | 7 | |a MATHEMATICS |x Probability & Statistics |x General. |2 bisacsh | |
650 | 7 | |a Data mining |x Mathematics |2 fast | |
650 | 7 | |a Domain-specific programming languages |2 fast | |
650 | 7 | |a R (Computer program language) |2 fast | |
758 | |i has work: |a R data analysis projects (Text) |1 https://id.oclc.org/worldcat/entity/E39PCFPG6rCFfJqTpVQwxqbYbm |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1636279 |3 Volltext |
938 | |a EBSCOhost |b EBSC |n 1636279 | ||
938 | |a ProQuest MyiLibrary Digital eBook Collection |b IDEB |n cis39387075 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Datensatz im Suchindex
DE-BY-FWS_katkey | ZDB-4-EBA-on1017754231 |
---|---|
_version_ | 1816882409612771328 |
adam_text | |
any_adam_object | |
author | Subramanian, Gopi |
author_GND | http://id.loc.gov/authorities/names/no2016051875 |
author_facet | Subramanian, Gopi |
author_role | aut |
author_sort | Subramanian, Gopi |
author_variant | g s gs |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | Q - Science |
callnumber-label | QA276 |
callnumber-raw | QA276.45.R3 |
callnumber-search | QA276.45.R3 |
callnumber-sort | QA 3276.45 R3 |
callnumber-subject | QA - Mathematics |
collection | ZDB-4-EBA |
contents | Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Association Rule Mining -- Understanding the recommender systems -- Transactions -- Weighted transactions -- Our web application -- Retailer use case and data -- Association rule mining -- Support and confidence thresholds -- The cross-selling campaign -- Leverage -- Conviction -- Weighted association rule mining -- Hyperlink-induced topic search (HITS) -- Negative association rules -- Rules visualization -- Wrapping up -- Summary -- Chapter 2: Fuzzy Logic Induced Content-Based Recommendation -- Introducing content-based recommendation -- News aggregator use case and data -- Designing the content-based recommendation engine -- Building a similarity index -- Bag-of-words -- Term frequency -- Document frequency -- Inverse document frequency (IDF) -- TFIDF -- Why cosine similarity? -- Searching -- Polarity scores -- Jaccard's distance -- Jaccards distance/index -- Ranking search results -- Fuzzy logic -- Fuzzification -- Defining the rules -- Evaluating the rules -- Defuzzification -- Complete R Code -- Summary -- Chapter 3: Collaborative Filtering -- Collaborative filtering -- Memory-based approach -- Model-based approach -- Latent factor approach -- Recommenderlab package -- Popular approach -- Use case and data -- Designing and implementing collaborative filtering -- Ratings matrix -- Normalization -- Train test split -- Train model -- User-based models -- Item-based models -- Factor-based models -- Complete R Code -- Summary -- Chapter 4: Taming Time Series Data Using Deep Neural Networks -- Time series data -- Non-seasonal time series -- Seasonal time series -- Time series as a regression problem -- Deep neural networks -- Forward cycle -- Backward cycle. Introduction to the MXNet R package -- Symbolic programming in MXNet -- Softmax activation -- Use case and data -- Deep networks for time series prediction -- Training test split -- Complete R code -- Summary -- Chapter 5: Twitter Text Sentiment Classification Using Kernel Density Estimates -- Kernel density estimation -- Twitter text -- Sentiment classification -- Dictionary methods -- Machine learning methods -- Our approach -- Dictionary based scoring -- Text pre-processing -- Term-frequeny inverse document frequency (TFIDF) -- Delta TFIDF -- Building a sentiment classifier -- Assembling an RShiny application -- Complete R code -- Summary -- Chapter 6: Record Linkage -- Stochastic and Machine Learning Approaches -- Introducing our use case -- Demonstrating the use of RecordLinkage package -- Feature generation -- String features -- Phonetic features -- Stochastic record linkage -- Expectation maximization method -- Weights-based method -- Machine learning-based record linkage -- Unsupervised learning -- Supervised learning -- Building an RShiny application -- Complete R code -- Feature generation -- Expectation maximization method -- Weights-based method -- Machine learning method -- RShiny application -- Summary -- Chapter 7: Streaming Data Clustering Analysis in R -- Streaming data and its challenges -- Bounded problems -- Drift -- Single pass -- Real time -- Introducing stream clustering -- Macro-cluster -- Introducing the stream package -- Data stream data -- DSD as a static simulator -- DSD as a simulator with drift -- DSD connecting to memory, file, or database -- Inflight operation -- Can we connect this DSD to an actual data stream? -- Data stream task -- Use case and data -- Speed layer -- Batch layer -- Reservoir sampling -- Complete R code -- Summary -- Chapter 8: Analyze and Understand Networks Using R -- Graphs in R. Degree of a vertex -- Strength of a vertex -- Adjacency Matrix -- More networks in R -- Centrality of a vertex -- Farness and Closeness of a node -- Finding the shortest path between nodes -- Random walk on a graph -- Use case and data -- Data preparation -- Product network analysis -- Building a RShiny application -- The complete R script -- Summary -- Index. |
ctrlnum | (OCoLC)1017754231 |
dewey-full | 519.502855133 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.502855133 |
dewey-search | 519.502855133 |
dewey-sort | 3519.502855133 |
dewey-tens | 510 - Mathematics |
discipline | Mathematik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>09331cam a2200625 i 4500</leader><controlfield tag="001">ZDB-4-EBA-on1017754231</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr unu||||||||</controlfield><controlfield tag="008">180104s2017 enka o 000 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">UMI</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">UMI</subfield><subfield code="d">IDEBK</subfield><subfield code="d">TOH</subfield><subfield code="d">STF</subfield><subfield code="d">OCLCF</subfield><subfield code="d">CEF</subfield><subfield code="d">KSU</subfield><subfield code="d">N$T</subfield><subfield code="d">DEBBG</subfield><subfield code="d">TEFOD</subfield><subfield code="d">G3B</subfield><subfield code="d">S9I</subfield><subfield code="d">UAB</subfield><subfield code="d">QGK</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCL</subfield><subfield code="d">NZAUC</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield><subfield code="d">DXU</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781788620574</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1788620577</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1788621875</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781788621878</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781788621878</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1017754231</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">CL0500000922</subfield><subfield code="b">Safari Books Online</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">F0F13939-1C2A-4149-9BEC-142B4AE332AC</subfield><subfield code="b">OverDrive, Inc.</subfield><subfield code="n">http://www.overdrive.com</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">QA276.45.R3</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">MAT</subfield><subfield code="x">003000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">MAT</subfield><subfield code="x">029000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">519.502855133</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Subramanian, Gopi,</subfield><subfield code="e">author.</subfield><subfield code="0">http://id.loc.gov/authorities/names/no2016051875</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">R data analysis projects :</subfield><subfield code="b">build end to end analytics systems to get deeper insights from your data /</subfield><subfield code="c">Gopi Subramanian.</subfield></datafield><datafield tag="246" ind1="3" ind2="0"><subfield code="a">Build end to end analytics systems to get deeper insights from your data</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham, UK :</subfield><subfield code="b">Packt Publishing,</subfield><subfield code="c">2017.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (1 volume) :</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="347" ind1=" " ind2=" "><subfield code="a">data file</subfield></datafield><datafield tag="588" ind1="0" ind2=" "><subfield code="a">Online resource; title from title page (viewed January 3, 2018).</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Get valuable insights from your data by building data analysis systems from scratch with R. About This Book A handy guide to take your understanding of data analysis with R to the next level Real-world projects that focus on problems in finance, network analysis, social media, and more From data manipulation to analysis to visualization in R, this book will teach you everything you need to know about building end-to-end data analysis pipelines using R Who This Book Is For If you are looking for a book that takes you all the way through the practical application of advanced and effective analytics methodologies in R, then this is the book for you. A fundamental understanding of R and the basic concepts of data analysis is all you need to get started with this book. What You Will Learn Build end-to-end predictive analytics systems in R Build an experimental design to gather your own data and conduct analysis Build a recommender system from scratch using different approaches Use and leverage RShiny to build reactive programming applications Build systems for varied domains including market research, network analysis, social media analysis, and more Explore various R Packages such as RShiny, ggplot, recommenderlab, dplyr, and find out how to use them effectively Communicate modeling results using Shiny Dashboards Perform multi-variate time-series analysis prediction, supplemented with sensitivity analysis and risk modeling In Detail R offers a large variety of packages and libraries for fast and accurate data analysis and visualization. As a result, it's one of the most popularly used languages by data scientists and analysts, or anyone who wants to perform data analysis. This book will demonstrate how you can put to use your existing knowledge of data analysis in R to build highly efficient, end-to-end data analysis pipelines without any hassle. You'll start by building a content-based recommendation system, followed by building a project on sentiment analysis with tweets. You'll implement time-series modeling for anomaly detection, and understand cluster analysis of streaming data. You'll work through projects on performing efficient market data research, building recommendation systems, and analyzing networks accurately, all provided with easy to follow codes. With the help of these real-world projects, you'll get a better understanding of the challenges faced when building data analysis pipelines, and see how you can overcome them without comp ...</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Association Rule Mining -- Understanding the recommender systems -- Transactions -- Weighted transactions -- Our web application -- Retailer use case and data -- Association rule mining -- Support and confidence thresholds -- The cross-selling campaign -- Leverage -- Conviction -- Weighted association rule mining -- Hyperlink-induced topic search (HITS) -- Negative association rules -- Rules visualization -- Wrapping up -- Summary -- Chapter 2: Fuzzy Logic Induced Content-Based Recommendation -- Introducing content-based recommendation -- News aggregator use case and data -- Designing the content-based recommendation engine -- Building a similarity index -- Bag-of-words -- Term frequency -- Document frequency -- Inverse document frequency (IDF) -- TFIDF -- Why cosine similarity? -- Searching -- Polarity scores -- Jaccard's distance -- Jaccards distance/index -- Ranking search results -- Fuzzy logic -- Fuzzification -- Defining the rules -- Evaluating the rules -- Defuzzification -- Complete R Code -- Summary -- Chapter 3: Collaborative Filtering -- Collaborative filtering -- Memory-based approach -- Model-based approach -- Latent factor approach -- Recommenderlab package -- Popular approach -- Use case and data -- Designing and implementing collaborative filtering -- Ratings matrix -- Normalization -- Train test split -- Train model -- User-based models -- Item-based models -- Factor-based models -- Complete R Code -- Summary -- Chapter 4: Taming Time Series Data Using Deep Neural Networks -- Time series data -- Non-seasonal time series -- Seasonal time series -- Time series as a regression problem -- Deep neural networks -- Forward cycle -- Backward cycle.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Introduction to the MXNet R package -- Symbolic programming in MXNet -- Softmax activation -- Use case and data -- Deep networks for time series prediction -- Training test split -- Complete R code -- Summary -- Chapter 5: Twitter Text Sentiment Classification Using Kernel Density Estimates -- Kernel density estimation -- Twitter text -- Sentiment classification -- Dictionary methods -- Machine learning methods -- Our approach -- Dictionary based scoring -- Text pre-processing -- Term-frequeny inverse document frequency (TFIDF) -- Delta TFIDF -- Building a sentiment classifier -- Assembling an RShiny application -- Complete R code -- Summary -- Chapter 6: Record Linkage -- Stochastic and Machine Learning Approaches -- Introducing our use case -- Demonstrating the use of RecordLinkage package -- Feature generation -- String features -- Phonetic features -- Stochastic record linkage -- Expectation maximization method -- Weights-based method -- Machine learning-based record linkage -- Unsupervised learning -- Supervised learning -- Building an RShiny application -- Complete R code -- Feature generation -- Expectation maximization method -- Weights-based method -- Machine learning method -- RShiny application -- Summary -- Chapter 7: Streaming Data Clustering Analysis in R -- Streaming data and its challenges -- Bounded problems -- Drift -- Single pass -- Real time -- Introducing stream clustering -- Macro-cluster -- Introducing the stream package -- Data stream data -- DSD as a static simulator -- DSD as a simulator with drift -- DSD connecting to memory, file, or database -- Inflight operation -- Can we connect this DSD to an actual data stream? -- Data stream task -- Use case and data -- Speed layer -- Batch layer -- Reservoir sampling -- Complete R code -- Summary -- Chapter 8: Analyze and Understand Networks Using R -- Graphs in R.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Degree of a vertex -- Strength of a vertex -- Adjacency Matrix -- More networks in R -- Centrality of a vertex -- Farness and Closeness of a node -- Finding the shortest path between nodes -- Random walk on a graph -- Use case and data -- Data preparation -- Product network analysis -- Building a RShiny application -- The complete R script -- Summary -- Index.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">R (Computer program language)</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh2002004407</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Domain-specific programming languages.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh2007005223</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Data mining</subfield><subfield code="x">Mathematics.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">R (Langage de programmation)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Langages orientés problème (Informatique)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Exploration de données (Informatique)</subfield><subfield code="x">Mathématiques.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">MATHEMATICS</subfield><subfield code="x">Applied.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">MATHEMATICS</subfield><subfield code="x">Probability & Statistics</subfield><subfield code="x">General.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Data mining</subfield><subfield code="x">Mathematics</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Domain-specific programming languages</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">R (Computer program language)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">R data analysis projects (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCFPG6rCFfJqTpVQwxqbYbm</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1636279</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">1636279</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ProQuest MyiLibrary Digital eBook Collection</subfield><subfield code="b">IDEB</subfield><subfield code="n">cis39387075</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection> |
id | ZDB-4-EBA-on1017754231 |
illustrated | Illustrated |
indexdate | 2024-11-27T13:28:09Z |
institution | BVB |
isbn | 9781788620574 1788620577 1788621875 9781788621878 |
language | English |
oclc_num | 1017754231 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (1 volume) : illustrations |
psigel | ZDB-4-EBA |
publishDate | 2017 |
publishDateSearch | 2017 |
publishDateSort | 2017 |
publisher | Packt Publishing, |
record_format | marc |
spelling | Subramanian, Gopi, author. http://id.loc.gov/authorities/names/no2016051875 R data analysis projects : build end to end analytics systems to get deeper insights from your data / Gopi Subramanian. Build end to end analytics systems to get deeper insights from your data Birmingham, UK : Packt Publishing, 2017. 1 online resource (1 volume) : illustrations text txt rdacontent computer c rdamedia online resource cr rdacarrier data file Online resource; title from title page (viewed January 3, 2018). Get valuable insights from your data by building data analysis systems from scratch with R. About This Book A handy guide to take your understanding of data analysis with R to the next level Real-world projects that focus on problems in finance, network analysis, social media, and more From data manipulation to analysis to visualization in R, this book will teach you everything you need to know about building end-to-end data analysis pipelines using R Who This Book Is For If you are looking for a book that takes you all the way through the practical application of advanced and effective analytics methodologies in R, then this is the book for you. A fundamental understanding of R and the basic concepts of data analysis is all you need to get started with this book. What You Will Learn Build end-to-end predictive analytics systems in R Build an experimental design to gather your own data and conduct analysis Build a recommender system from scratch using different approaches Use and leverage RShiny to build reactive programming applications Build systems for varied domains including market research, network analysis, social media analysis, and more Explore various R Packages such as RShiny, ggplot, recommenderlab, dplyr, and find out how to use them effectively Communicate modeling results using Shiny Dashboards Perform multi-variate time-series analysis prediction, supplemented with sensitivity analysis and risk modeling In Detail R offers a large variety of packages and libraries for fast and accurate data analysis and visualization. As a result, it's one of the most popularly used languages by data scientists and analysts, or anyone who wants to perform data analysis. This book will demonstrate how you can put to use your existing knowledge of data analysis in R to build highly efficient, end-to-end data analysis pipelines without any hassle. You'll start by building a content-based recommendation system, followed by building a project on sentiment analysis with tweets. You'll implement time-series modeling for anomaly detection, and understand cluster analysis of streaming data. You'll work through projects on performing efficient market data research, building recommendation systems, and analyzing networks accurately, all provided with easy to follow codes. With the help of these real-world projects, you'll get a better understanding of the challenges faced when building data analysis pipelines, and see how you can overcome them without comp ... Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Association Rule Mining -- Understanding the recommender systems -- Transactions -- Weighted transactions -- Our web application -- Retailer use case and data -- Association rule mining -- Support and confidence thresholds -- The cross-selling campaign -- Leverage -- Conviction -- Weighted association rule mining -- Hyperlink-induced topic search (HITS) -- Negative association rules -- Rules visualization -- Wrapping up -- Summary -- Chapter 2: Fuzzy Logic Induced Content-Based Recommendation -- Introducing content-based recommendation -- News aggregator use case and data -- Designing the content-based recommendation engine -- Building a similarity index -- Bag-of-words -- Term frequency -- Document frequency -- Inverse document frequency (IDF) -- TFIDF -- Why cosine similarity? -- Searching -- Polarity scores -- Jaccard's distance -- Jaccards distance/index -- Ranking search results -- Fuzzy logic -- Fuzzification -- Defining the rules -- Evaluating the rules -- Defuzzification -- Complete R Code -- Summary -- Chapter 3: Collaborative Filtering -- Collaborative filtering -- Memory-based approach -- Model-based approach -- Latent factor approach -- Recommenderlab package -- Popular approach -- Use case and data -- Designing and implementing collaborative filtering -- Ratings matrix -- Normalization -- Train test split -- Train model -- User-based models -- Item-based models -- Factor-based models -- Complete R Code -- Summary -- Chapter 4: Taming Time Series Data Using Deep Neural Networks -- Time series data -- Non-seasonal time series -- Seasonal time series -- Time series as a regression problem -- Deep neural networks -- Forward cycle -- Backward cycle. Introduction to the MXNet R package -- Symbolic programming in MXNet -- Softmax activation -- Use case and data -- Deep networks for time series prediction -- Training test split -- Complete R code -- Summary -- Chapter 5: Twitter Text Sentiment Classification Using Kernel Density Estimates -- Kernel density estimation -- Twitter text -- Sentiment classification -- Dictionary methods -- Machine learning methods -- Our approach -- Dictionary based scoring -- Text pre-processing -- Term-frequeny inverse document frequency (TFIDF) -- Delta TFIDF -- Building a sentiment classifier -- Assembling an RShiny application -- Complete R code -- Summary -- Chapter 6: Record Linkage -- Stochastic and Machine Learning Approaches -- Introducing our use case -- Demonstrating the use of RecordLinkage package -- Feature generation -- String features -- Phonetic features -- Stochastic record linkage -- Expectation maximization method -- Weights-based method -- Machine learning-based record linkage -- Unsupervised learning -- Supervised learning -- Building an RShiny application -- Complete R code -- Feature generation -- Expectation maximization method -- Weights-based method -- Machine learning method -- RShiny application -- Summary -- Chapter 7: Streaming Data Clustering Analysis in R -- Streaming data and its challenges -- Bounded problems -- Drift -- Single pass -- Real time -- Introducing stream clustering -- Macro-cluster -- Introducing the stream package -- Data stream data -- DSD as a static simulator -- DSD as a simulator with drift -- DSD connecting to memory, file, or database -- Inflight operation -- Can we connect this DSD to an actual data stream? -- Data stream task -- Use case and data -- Speed layer -- Batch layer -- Reservoir sampling -- Complete R code -- Summary -- Chapter 8: Analyze and Understand Networks Using R -- Graphs in R. Degree of a vertex -- Strength of a vertex -- Adjacency Matrix -- More networks in R -- Centrality of a vertex -- Farness and Closeness of a node -- Finding the shortest path between nodes -- Random walk on a graph -- Use case and data -- Data preparation -- Product network analysis -- Building a RShiny application -- The complete R script -- Summary -- Index. R (Computer program language) http://id.loc.gov/authorities/subjects/sh2002004407 Domain-specific programming languages. http://id.loc.gov/authorities/subjects/sh2007005223 Data mining Mathematics. R (Langage de programmation) Langages orientés problème (Informatique) Exploration de données (Informatique) Mathématiques. MATHEMATICS Applied. bisacsh MATHEMATICS Probability & Statistics General. bisacsh Data mining Mathematics fast Domain-specific programming languages fast R (Computer program language) fast has work: R data analysis projects (Text) https://id.oclc.org/worldcat/entity/E39PCFPG6rCFfJqTpVQwxqbYbm https://id.oclc.org/worldcat/ontology/hasWork FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1636279 Volltext |
spellingShingle | Subramanian, Gopi R data analysis projects : build end to end analytics systems to get deeper insights from your data / Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Association Rule Mining -- Understanding the recommender systems -- Transactions -- Weighted transactions -- Our web application -- Retailer use case and data -- Association rule mining -- Support and confidence thresholds -- The cross-selling campaign -- Leverage -- Conviction -- Weighted association rule mining -- Hyperlink-induced topic search (HITS) -- Negative association rules -- Rules visualization -- Wrapping up -- Summary -- Chapter 2: Fuzzy Logic Induced Content-Based Recommendation -- Introducing content-based recommendation -- News aggregator use case and data -- Designing the content-based recommendation engine -- Building a similarity index -- Bag-of-words -- Term frequency -- Document frequency -- Inverse document frequency (IDF) -- TFIDF -- Why cosine similarity? -- Searching -- Polarity scores -- Jaccard's distance -- Jaccards distance/index -- Ranking search results -- Fuzzy logic -- Fuzzification -- Defining the rules -- Evaluating the rules -- Defuzzification -- Complete R Code -- Summary -- Chapter 3: Collaborative Filtering -- Collaborative filtering -- Memory-based approach -- Model-based approach -- Latent factor approach -- Recommenderlab package -- Popular approach -- Use case and data -- Designing and implementing collaborative filtering -- Ratings matrix -- Normalization -- Train test split -- Train model -- User-based models -- Item-based models -- Factor-based models -- Complete R Code -- Summary -- Chapter 4: Taming Time Series Data Using Deep Neural Networks -- Time series data -- Non-seasonal time series -- Seasonal time series -- Time series as a regression problem -- Deep neural networks -- Forward cycle -- Backward cycle. Introduction to the MXNet R package -- Symbolic programming in MXNet -- Softmax activation -- Use case and data -- Deep networks for time series prediction -- Training test split -- Complete R code -- Summary -- Chapter 5: Twitter Text Sentiment Classification Using Kernel Density Estimates -- Kernel density estimation -- Twitter text -- Sentiment classification -- Dictionary methods -- Machine learning methods -- Our approach -- Dictionary based scoring -- Text pre-processing -- Term-frequeny inverse document frequency (TFIDF) -- Delta TFIDF -- Building a sentiment classifier -- Assembling an RShiny application -- Complete R code -- Summary -- Chapter 6: Record Linkage -- Stochastic and Machine Learning Approaches -- Introducing our use case -- Demonstrating the use of RecordLinkage package -- Feature generation -- String features -- Phonetic features -- Stochastic record linkage -- Expectation maximization method -- Weights-based method -- Machine learning-based record linkage -- Unsupervised learning -- Supervised learning -- Building an RShiny application -- Complete R code -- Feature generation -- Expectation maximization method -- Weights-based method -- Machine learning method -- RShiny application -- Summary -- Chapter 7: Streaming Data Clustering Analysis in R -- Streaming data and its challenges -- Bounded problems -- Drift -- Single pass -- Real time -- Introducing stream clustering -- Macro-cluster -- Introducing the stream package -- Data stream data -- DSD as a static simulator -- DSD as a simulator with drift -- DSD connecting to memory, file, or database -- Inflight operation -- Can we connect this DSD to an actual data stream? -- Data stream task -- Use case and data -- Speed layer -- Batch layer -- Reservoir sampling -- Complete R code -- Summary -- Chapter 8: Analyze and Understand Networks Using R -- Graphs in R. Degree of a vertex -- Strength of a vertex -- Adjacency Matrix -- More networks in R -- Centrality of a vertex -- Farness and Closeness of a node -- Finding the shortest path between nodes -- Random walk on a graph -- Use case and data -- Data preparation -- Product network analysis -- Building a RShiny application -- The complete R script -- Summary -- Index. R (Computer program language) http://id.loc.gov/authorities/subjects/sh2002004407 Domain-specific programming languages. http://id.loc.gov/authorities/subjects/sh2007005223 Data mining Mathematics. R (Langage de programmation) Langages orientés problème (Informatique) Exploration de données (Informatique) Mathématiques. MATHEMATICS Applied. bisacsh MATHEMATICS Probability & Statistics General. bisacsh Data mining Mathematics fast Domain-specific programming languages fast R (Computer program language) fast |
subject_GND | http://id.loc.gov/authorities/subjects/sh2002004407 http://id.loc.gov/authorities/subjects/sh2007005223 |
title | R data analysis projects : build end to end analytics systems to get deeper insights from your data / |
title_alt | Build end to end analytics systems to get deeper insights from your data |
title_auth | R data analysis projects : build end to end analytics systems to get deeper insights from your data / |
title_exact_search | R data analysis projects : build end to end analytics systems to get deeper insights from your data / |
title_full | R data analysis projects : build end to end analytics systems to get deeper insights from your data / Gopi Subramanian. |
title_fullStr | R data analysis projects : build end to end analytics systems to get deeper insights from your data / Gopi Subramanian. |
title_full_unstemmed | R data analysis projects : build end to end analytics systems to get deeper insights from your data / Gopi Subramanian. |
title_short | R data analysis projects : |
title_sort | r data analysis projects build end to end analytics systems to get deeper insights from your data |
title_sub | build end to end analytics systems to get deeper insights from your data / |
topic | R (Computer program language) http://id.loc.gov/authorities/subjects/sh2002004407 Domain-specific programming languages. http://id.loc.gov/authorities/subjects/sh2007005223 Data mining Mathematics. R (Langage de programmation) Langages orientés problème (Informatique) Exploration de données (Informatique) Mathématiques. MATHEMATICS Applied. bisacsh MATHEMATICS Probability & Statistics General. bisacsh Data mining Mathematics fast Domain-specific programming languages fast R (Computer program language) fast |
topic_facet | R (Computer program language) Domain-specific programming languages. Data mining Mathematics. R (Langage de programmation) Langages orientés problème (Informatique) Exploration de données (Informatique) Mathématiques. MATHEMATICS Applied. MATHEMATICS Probability & Statistics General. Data mining Mathematics Domain-specific programming languages |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1636279 |
work_keys_str_mv | AT subramaniangopi rdataanalysisprojectsbuildendtoendanalyticssystemstogetdeeperinsightsfromyourdata AT subramaniangopi buildendtoendanalyticssystemstogetdeeperinsightsfromyourdata |