Verfügbarkeit: The Reinforcement Learning Workshop

The Reinforcement Learning Workshop: Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.

With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intellige...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	Palmas, Alessandro
Weitere Verfasser:	Ghelfi, Emanuele, Petre, Alexandra Galina, Kulkarni, Mayur, N.S., Anand, Nguyen, Quan, Sen, Aritra, So, Anthony (Data scientist), Basak, Saikat
Format:	Elektronisch E-Book
Sprache:	English
Veröffentlicht:	Birmingham : Packt Publishing, Limited, 2020.
Schlagworte:	Reinforcement learning. Algorithms. Algorithms Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. Programming & scripting languages: general. Artificial intelligence. Neural networks & fuzzy systems. Computers > Intelligence (AI) & Semantics. Computers > Neural Networks. Computers > Programming Languages > Python. Reinforcement learning
Online-Zugang:	Volltext
Zusammenfassung:	With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease.
Beschreibung:	Description based upon print version of record. Batch Normalization.
Beschreibung:	1 online resource (821 p.)
ISBN:	9781800209961 1800209967

Internformat

MARC


LEADER	00000cam a2200000Mu 4500
001	ZDB-4-EBA-on1223099995
003	OCoLC
005	20241004212047.0
006	m o d
007	cr \|\|\|\|\|\|\|\|\|\|\|
008	201121s2020 xx o \|\|\| 0 eng d
040			\|a EBLCP \|b eng \|c EBLCP \|d YDX \|d NLW \|d TXI \|d UKAHL \|d UKMGB \|d OCLCF \|d N$T \|d YDXIT \|d OCLCO \|d OCLCQ \|d OCLCO \|d OCLCL
015			\|a GBC094821 \|2 bnb
015			\|a GBC101726 \|2 bnb
016	7		\|a 019859985 \|2 Uk
016	7		\|a 020052275 \|2 Uk
019			\|a 1191244885 \|a 1193132561 \|a 1196193586
020			\|a 9781800209961
020			\|a 1800209967
020			\|z 1800200455
020			\|z 9781800200456
035			\|a (OCoLC)1223099995 \|z (OCoLC)1191244885 \|z (OCoLC)1193132561 \|z (OCoLC)1196193586
037			\|a 9781800209961 \|b Packt Publishing Pvt. Ltd
050		4	\|a Q325.6 \|b .P35 2020
082	7		\|a 006.31 \|2 23
049			\|a MAIN
100	1		\|a Palmas, Alessandro.
245	1	4	\|a The Reinforcement Learning Workshop \|h [electronic resource] : \|b Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
260			\|a Birmingham : \|b Packt Publishing, Limited, \|c 2020.
300			\|a 1 online resource (821 p.)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
500			\|a Description based upon print version of record.
505	0		\|a Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment
505	8		\|a Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole
505	8		\|a Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes
505	8		\|a The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld
505	8		\|a Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation
500			\|a Batch Normalization.
520			\|a With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease.
650		0	\|a Reinforcement learning. \|0 http://id.loc.gov/authorities/subjects/sh92000704
650		0	\|a Algorithms. \|0 http://id.loc.gov/authorities/subjects/sh85003487
650		2	\|a Algorithms \|0 https://id.nlm.nih.gov/mesh/D000465
650		6	\|a Apprentissage par renforcement (Intelligence artificielle)
650		6	\|a Algorithmes.
650		7	\|a algorithms. \|2 aat
650		7	\|a Programming & scripting languages: general. \|2 bicssc
650		7	\|a Artificial intelligence. \|2 bicssc
650		7	\|a Neural networks & fuzzy systems. \|2 bicssc
650		7	\|a Computers \|x Intelligence (AI) & Semantics. \|2 bisacsh
650		7	\|a Computers \|x Neural Networks. \|2 bisacsh
650		7	\|a Computers \|x Programming Languages \|x Python. \|2 bisacsh
650		7	\|a Algorithms \|2 fast
650		7	\|a Reinforcement learning \|2 fast
700	1		\|a Ghelfi, Emanuele.
700	1		\|a Petre, Alexandra Galina.
700	1		\|a Kulkarni, Mayur.
700	1		\|a N.S., Anand.
700	1		\|a Nguyen, Quan.
700	1		\|a Sen, Aritra.
700	1		\|a So, Anthony \|c (Data scientist) \|1 https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3 \|0 http://id.loc.gov/authorities/names/no2021117553
700	1		\|a Basak, Saikat.
758			\|i has work: \|a The Reinforcement Learning Workshop (Text) \|1 https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP \|4 https://id.oclc.org/worldcat/ontology/hasWork
776	0	8	\|i Print version: \|a Palmas, Alessandro \|t The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems \|d Birmingham : Packt Publishing, Limited,c2020 \|z 9781800200456
856	4	0	\|l FWS01 \|p ZDB-4-EBA \|q FWS_PDA_EBA \|u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333 \|3 Volltext
938			\|a ProQuest Ebook Central \|b EBLB \|n EBL6318341
938			\|a YBP Library Services \|b YANK \|n 301466899
938			\|a Askews and Holts Library Services \|b ASKH \|n AH37507271
938			\|a EBSCOhost \|b EBSC \|n 2575333
994			\|a 92 \|b GEBAY
912			\|a ZDB-4-EBA
049			\|a DE-863

Datensatz im Suchindex

DE-BY-FWS_katkey	ZDB-4-EBA-on1223099995
_version_	1816882533549211648
adam_text
any_adam_object
author	Palmas, Alessandro
author2	Ghelfi, Emanuele Petre, Alexandra Galina Kulkarni, Mayur N.S., Anand Nguyen, Quan Sen, Aritra So, Anthony (Data scientist) Basak, Saikat
author2_role
author2_variant	e g eg a g p ag agp m k mk a n an q n qn a s as a s as s b sb
author_GND	http://id.loc.gov/authorities/names/no2021117553
author_facet	Palmas, Alessandro Ghelfi, Emanuele Petre, Alexandra Galina Kulkarni, Mayur N.S., Anand Nguyen, Quan Sen, Aritra So, Anthony (Data scientist) Basak, Saikat
author_role
author_sort	Palmas, Alessandro
author_variant	a p ap
building	Verbundindex
bvnumber	localFWS
callnumber-first	Q - Science
callnumber-label	Q325
callnumber-raw	Q325.6 .P35 2020
callnumber-search	Q325.6 .P35 2020
callnumber-sort	Q 3325.6 P35 42020
callnumber-subject	Q - General Science
collection	ZDB-4-EBA
contents	Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation
ctrlnum	(OCoLC)1223099995
dewey-full	006.31
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.31
dewey-search	006.31
dewey-sort	16.31
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Electronic eBook
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>06741cam a2200817Mu 4500</leader><controlfield tag="001">ZDB-4-EBA-on1223099995</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr \|\|\|\|\|\|\|\|\|\|\|</controlfield><controlfield tag="008">201121s2020 xx o \|\|\| 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">EBLCP</subfield><subfield code="b">eng</subfield><subfield code="c">EBLCP</subfield><subfield code="d">YDX</subfield><subfield code="d">NLW</subfield><subfield code="d">TXI</subfield><subfield code="d">UKAHL</subfield><subfield code="d">UKMGB</subfield><subfield code="d">OCLCF</subfield><subfield code="d">N$T</subfield><subfield code="d">YDXIT</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBC094821</subfield><subfield code="2">bnb</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBC101726</subfield><subfield code="2">bnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">019859985</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">020052275</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">1191244885</subfield><subfield code="a">1193132561</subfield><subfield code="a">1196193586</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781800209961</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1800209967</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">1800200455</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781800200456</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1223099995</subfield><subfield code="z">(OCoLC)1191244885</subfield><subfield code="z">(OCoLC)1193132561</subfield><subfield code="z">(OCoLC)1196193586</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">9781800209961</subfield><subfield code="b">Packt Publishing Pvt. Ltd</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">Q325.6</subfield><subfield code="b">.P35 2020</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Palmas, Alessandro.</subfield></datafield><datafield tag="245" ind1="1" ind2="4"><subfield code="a">The Reinforcement Learning Workshop</subfield><subfield code="h">[electronic resource] :</subfield><subfield code="b">Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.</subfield></datafield><datafield tag="260" ind1=" " ind2=" "><subfield code="a">Birmingham :</subfield><subfield code="b">Packt Publishing, Limited,</subfield><subfield code="c">2020.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (821 p.)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Description based upon print version of record.</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Batch Normalization.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Reinforcement learning.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh92000704</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Algorithms.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85003487</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Algorithms</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D000465</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Apprentissage par renforcement (Intelligence artificielle)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Algorithmes.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">algorithms.</subfield><subfield code="2">aat</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Programming & scripting languages: general.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Artificial intelligence.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Neural networks & fuzzy systems.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Intelligence (AI) & Semantics.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Neural Networks.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Programming Languages</subfield><subfield code="x">Python.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Algorithms</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Reinforcement learning</subfield><subfield code="2">fast</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Ghelfi, Emanuele.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Petre, Alexandra Galina.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kulkarni, Mayur.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">N.S., Anand.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Nguyen, Quan.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sen, Aritra.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">So, Anthony</subfield><subfield code="c">(Data scientist)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3</subfield><subfield code="0">http://id.loc.gov/authorities/names/no2021117553</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Basak, Saikat.</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">The Reinforcement Learning Workshop (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Palmas, Alessandro</subfield><subfield code="t">The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems</subfield><subfield code="d">Birmingham : Packt Publishing, Limited,c2020</subfield><subfield code="z">9781800200456</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ProQuest Ebook Central</subfield><subfield code="b">EBLB</subfield><subfield code="n">EBL6318341</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">YBP Library Services</subfield><subfield code="b">YANK</subfield><subfield code="n">301466899</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">Askews and Holts Library Services</subfield><subfield code="b">ASKH</subfield><subfield code="n">AH37507271</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">2575333</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection>
id	ZDB-4-EBA-on1223099995
illustrated	Not Illustrated
indexdate	2024-11-27T13:30:08Z
institution	BVB
isbn	9781800209961 1800209967
language	English
oclc_num	1223099995
open_access_boolean
owner	MAIN DE-863 DE-BY-FWS
owner_facet	MAIN DE-863 DE-BY-FWS
physical	1 online resource (821 p.)
psigel	ZDB-4-EBA
publishDate	2020
publishDateSearch	2020
publishDateSort	2020
publisher	Packt Publishing, Limited,
record_format	marc
spelling	Palmas, Alessandro. The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. Birmingham : Packt Publishing, Limited, 2020. 1 online resource (821 p.) text txt rdacontent computer c rdamedia online resource cr rdacarrier Description based upon print version of record. Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation Batch Normalization. With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease. Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast Ghelfi, Emanuele. Petre, Alexandra Galina. Kulkarni, Mayur. N.S., Anand. Nguyen, Quan. Sen, Aritra. So, Anthony (Data scientist) https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3 http://id.loc.gov/authorities/names/no2021117553 Basak, Saikat. has work: The Reinforcement Learning Workshop (Text) https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP https://id.oclc.org/worldcat/ontology/hasWork Print version: Palmas, Alessandro The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems Birmingham : Packt Publishing, Limited,c2020 9781800200456 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333 Volltext
spellingShingle	Palmas, Alessandro The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast
subject_GND	http://id.loc.gov/authorities/subjects/sh92000704 http://id.loc.gov/authorities/subjects/sh85003487 https://id.nlm.nih.gov/mesh/D000465
title	The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_auth	The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_exact_search	The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_full	The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_fullStr	The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_full_unstemmed	The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
title_short	The Reinforcement Learning Workshop
title_sort	reinforcement learning workshop learn how to apply cutting edge reinforcement learning algorithms to a wide range of control problems
title_sub	Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
topic	Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast
topic_facet	Reinforcement learning. Algorithms. Algorithms Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. Programming & scripting languages: general. Artificial intelligence. Neural networks & fuzzy systems. Computers Intelligence (AI) & Semantics. Computers Neural Networks. Computers Programming Languages Python. Reinforcement learning
url	https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333
work_keys_str_mv	AT palmasalessandro thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT ghelfiemanuele thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT petrealexandragalina thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT kulkarnimayur thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nsanand thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nguyenquan thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT senaritra thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT soanthony thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT basaksaikat thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT palmasalessandro reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT ghelfiemanuele reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT petrealexandragalina reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT kulkarnimayur reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nsanand reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nguyenquan reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT senaritra reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT soanthony reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT basaksaikat reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems

Verfügbarkeit

Es ist kein Print-Exemplar vorhanden.

Volltext öffnen

MARC

Datensatz im Suchindex

Es ist kein Print-Exemplar vorhanden.

Ähnliche Einträge