The Reinforcement Learning Workshop: Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.
With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intellige...
Gespeichert in:
1. Verfasser: | |
---|---|
Weitere Verfasser: | , , , , , , , |
Format: | Elektronisch E-Book |
Sprache: | English |
Veröffentlicht: |
Birmingham :
Packt Publishing, Limited,
2020.
|
Schlagworte: | |
Online-Zugang: | Volltext |
Zusammenfassung: | With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease. |
Beschreibung: | Description based upon print version of record. Batch Normalization. |
Beschreibung: | 1 online resource (821 p.) |
ISBN: | 9781800209961 1800209967 |
Internformat
MARC
LEADER | 00000cam a2200000Mu 4500 | ||
---|---|---|---|
001 | ZDB-4-EBA-on1223099995 | ||
003 | OCoLC | ||
005 | 20241004212047.0 | ||
006 | m o d | ||
007 | cr ||||||||||| | ||
008 | 201121s2020 xx o ||| 0 eng d | ||
040 | |a EBLCP |b eng |c EBLCP |d YDX |d NLW |d TXI |d UKAHL |d UKMGB |d OCLCF |d N$T |d YDXIT |d OCLCO |d OCLCQ |d OCLCO |d OCLCL | ||
015 | |a GBC094821 |2 bnb | ||
015 | |a GBC101726 |2 bnb | ||
016 | 7 | |a 019859985 |2 Uk | |
016 | 7 | |a 020052275 |2 Uk | |
019 | |a 1191244885 |a 1193132561 |a 1196193586 | ||
020 | |a 9781800209961 | ||
020 | |a 1800209967 | ||
020 | |z 1800200455 | ||
020 | |z 9781800200456 | ||
035 | |a (OCoLC)1223099995 |z (OCoLC)1191244885 |z (OCoLC)1193132561 |z (OCoLC)1196193586 | ||
037 | |a 9781800209961 |b Packt Publishing Pvt. Ltd | ||
050 | 4 | |a Q325.6 |b .P35 2020 | |
082 | 7 | |a 006.31 |2 23 | |
049 | |a MAIN | ||
100 | 1 | |a Palmas, Alessandro. | |
245 | 1 | 4 | |a The Reinforcement Learning Workshop |h [electronic resource] : |b Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
260 | |a Birmingham : |b Packt Publishing, Limited, |c 2020. | ||
300 | |a 1 online resource (821 p.) | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
500 | |a Description based upon print version of record. | ||
505 | 0 | |a Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment | |
505 | 8 | |a Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole | |
505 | 8 | |a Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes | |
505 | 8 | |a The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld | |
505 | 8 | |a Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation | |
500 | |a Batch Normalization. | ||
520 | |a With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease. | ||
650 | 0 | |a Reinforcement learning. |0 http://id.loc.gov/authorities/subjects/sh92000704 | |
650 | 0 | |a Algorithms. |0 http://id.loc.gov/authorities/subjects/sh85003487 | |
650 | 2 | |a Algorithms |0 https://id.nlm.nih.gov/mesh/D000465 | |
650 | 6 | |a Apprentissage par renforcement (Intelligence artificielle) | |
650 | 6 | |a Algorithmes. | |
650 | 7 | |a algorithms. |2 aat | |
650 | 7 | |a Programming & scripting languages: general. |2 bicssc | |
650 | 7 | |a Artificial intelligence. |2 bicssc | |
650 | 7 | |a Neural networks & fuzzy systems. |2 bicssc | |
650 | 7 | |a Computers |x Intelligence (AI) & Semantics. |2 bisacsh | |
650 | 7 | |a Computers |x Neural Networks. |2 bisacsh | |
650 | 7 | |a Computers |x Programming Languages |x Python. |2 bisacsh | |
650 | 7 | |a Algorithms |2 fast | |
650 | 7 | |a Reinforcement learning |2 fast | |
700 | 1 | |a Ghelfi, Emanuele. | |
700 | 1 | |a Petre, Alexandra Galina. | |
700 | 1 | |a Kulkarni, Mayur. | |
700 | 1 | |a N.S., Anand. | |
700 | 1 | |a Nguyen, Quan. | |
700 | 1 | |a Sen, Aritra. | |
700 | 1 | |a So, Anthony |c (Data scientist) |1 https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3 |0 http://id.loc.gov/authorities/names/no2021117553 | |
700 | 1 | |a Basak, Saikat. | |
758 | |i has work: |a The Reinforcement Learning Workshop (Text) |1 https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
776 | 0 | 8 | |i Print version: |a Palmas, Alessandro |t The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems |d Birmingham : Packt Publishing, Limited,c2020 |z 9781800200456 |
856 | 4 | 0 | |l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333 |3 Volltext |
938 | |a ProQuest Ebook Central |b EBLB |n EBL6318341 | ||
938 | |a YBP Library Services |b YANK |n 301466899 | ||
938 | |a Askews and Holts Library Services |b ASKH |n AH37507271 | ||
938 | |a EBSCOhost |b EBSC |n 2575333 | ||
994 | |a 92 |b GEBAY | ||
912 | |a ZDB-4-EBA | ||
049 | |a DE-863 |
Datensatz im Suchindex
DE-BY-FWS_katkey | ZDB-4-EBA-on1223099995 |
---|---|
_version_ | 1816882533549211648 |
adam_text | |
any_adam_object | |
author | Palmas, Alessandro |
author2 | Ghelfi, Emanuele Petre, Alexandra Galina Kulkarni, Mayur N.S., Anand Nguyen, Quan Sen, Aritra So, Anthony (Data scientist) Basak, Saikat |
author2_role | |
author2_variant | e g eg a g p ag agp m k mk a n an q n qn a s as a s as s b sb |
author_GND | http://id.loc.gov/authorities/names/no2021117553 |
author_facet | Palmas, Alessandro Ghelfi, Emanuele Petre, Alexandra Galina Kulkarni, Mayur N.S., Anand Nguyen, Quan Sen, Aritra So, Anthony (Data scientist) Basak, Saikat |
author_role | |
author_sort | Palmas, Alessandro |
author_variant | a p ap |
building | Verbundindex |
bvnumber | localFWS |
callnumber-first | Q - Science |
callnumber-label | Q325 |
callnumber-raw | Q325.6 .P35 2020 |
callnumber-search | Q325.6 .P35 2020 |
callnumber-sort | Q 3325.6 P35 42020 |
callnumber-subject | Q - General Science |
collection | ZDB-4-EBA |
contents | Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation |
ctrlnum | (OCoLC)1223099995 |
dewey-full | 006.31 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.31 |
dewey-search | 006.31 |
dewey-sort | 16.31 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>06741cam a2200817Mu 4500</leader><controlfield tag="001">ZDB-4-EBA-on1223099995</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr |||||||||||</controlfield><controlfield tag="008">201121s2020 xx o ||| 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">EBLCP</subfield><subfield code="b">eng</subfield><subfield code="c">EBLCP</subfield><subfield code="d">YDX</subfield><subfield code="d">NLW</subfield><subfield code="d">TXI</subfield><subfield code="d">UKAHL</subfield><subfield code="d">UKMGB</subfield><subfield code="d">OCLCF</subfield><subfield code="d">N$T</subfield><subfield code="d">YDXIT</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBC094821</subfield><subfield code="2">bnb</subfield></datafield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">GBC101726</subfield><subfield code="2">bnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">019859985</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">020052275</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">1191244885</subfield><subfield code="a">1193132561</subfield><subfield code="a">1196193586</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781800209961</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1800209967</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">1800200455</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781800200456</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1223099995</subfield><subfield code="z">(OCoLC)1191244885</subfield><subfield code="z">(OCoLC)1193132561</subfield><subfield code="z">(OCoLC)1196193586</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">9781800209961</subfield><subfield code="b">Packt Publishing Pvt. Ltd</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">Q325.6</subfield><subfield code="b">.P35 2020</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Palmas, Alessandro.</subfield></datafield><datafield tag="245" ind1="1" ind2="4"><subfield code="a">The Reinforcement Learning Workshop</subfield><subfield code="h">[electronic resource] :</subfield><subfield code="b">Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems.</subfield></datafield><datafield tag="260" ind1=" " ind2=" "><subfield code="a">Birmingham :</subfield><subfield code="b">Packt Publishing, Limited,</subfield><subfield code="c">2020.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (821 p.)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Description based upon print version of record.</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Batch Normalization.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Reinforcement learning.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh92000704</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Algorithms.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85003487</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Algorithms</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D000465</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Apprentissage par renforcement (Intelligence artificielle)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Algorithmes.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">algorithms.</subfield><subfield code="2">aat</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Programming & scripting languages: general.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Artificial intelligence.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Neural networks & fuzzy systems.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Intelligence (AI) & Semantics.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Neural Networks.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Computers</subfield><subfield code="x">Programming Languages</subfield><subfield code="x">Python.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Algorithms</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Reinforcement learning</subfield><subfield code="2">fast</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Ghelfi, Emanuele.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Petre, Alexandra Galina.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kulkarni, Mayur.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">N.S., Anand.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Nguyen, Quan.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sen, Aritra.</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">So, Anthony</subfield><subfield code="c">(Data scientist)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3</subfield><subfield code="0">http://id.loc.gov/authorities/names/no2021117553</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Basak, Saikat.</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">The Reinforcement Learning Workshop (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Palmas, Alessandro</subfield><subfield code="t">The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems</subfield><subfield code="d">Birmingham : Packt Publishing, Limited,c2020</subfield><subfield code="z">9781800200456</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">ProQuest Ebook Central</subfield><subfield code="b">EBLB</subfield><subfield code="n">EBL6318341</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">YBP Library Services</subfield><subfield code="b">YANK</subfield><subfield code="n">301466899</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">Askews and Holts Library Services</subfield><subfield code="b">ASKH</subfield><subfield code="n">AH37507271</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">2575333</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection> |
id | ZDB-4-EBA-on1223099995 |
illustrated | Not Illustrated |
indexdate | 2024-11-27T13:30:08Z |
institution | BVB |
isbn | 9781800209961 1800209967 |
language | English |
oclc_num | 1223099995 |
open_access_boolean | |
owner | MAIN DE-863 DE-BY-FWS |
owner_facet | MAIN DE-863 DE-BY-FWS |
physical | 1 online resource (821 p.) |
psigel | ZDB-4-EBA |
publishDate | 2020 |
publishDateSearch | 2020 |
publishDateSort | 2020 |
publisher | Packt Publishing, Limited, |
record_format | marc |
spelling | Palmas, Alessandro. The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. Birmingham : Packt Publishing, Limited, 2020. 1 online resource (821 p.) text txt rdacontent computer c rdamedia online resource cr rdacarrier Description based upon print version of record. Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation Batch Normalization. With the help of practical examples and engaging activities, The Reinforcement Learning Workshop takes you through reinforcement learning's core techniques and frameworks. Following a hands-on approach, it allows you to learn reinforcement learning at your own pace to develop your own intelligent applications with ease. Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast Ghelfi, Emanuele. Petre, Alexandra Galina. Kulkarni, Mayur. N.S., Anand. Nguyen, Quan. Sen, Aritra. So, Anthony (Data scientist) https://id.oclc.org/worldcat/entity/E39PCjGVCDWxcCx8xrFc47wmr3 http://id.loc.gov/authorities/names/no2021117553 Basak, Saikat. has work: The Reinforcement Learning Workshop (Text) https://id.oclc.org/worldcat/entity/E39PCG9J4QJbxqBWRt8TQKQ8YP https://id.oclc.org/worldcat/ontology/hasWork Print version: Palmas, Alessandro The Reinforcement Learning Workshop : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems Birmingham : Packt Publishing, Limited,c2020 9781800200456 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333 Volltext |
spellingShingle | Palmas, Alessandro The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. Cover -- FM -- Copyright -- Table of Contents -- Preface -- Chapter 1: Introduction to Reinforcement Learning -- Introduction -- Learning Paradigms -- Introduction to Learning Paradigms -- Supervised versus Unsupervised versus RL -- Classifying Common Problems into Learning Scenarios -- Predicting Whether an Image Contains a Dog or a Cat -- Detecting and Classifying All Dogs and Cats in an Image -- Playing Chess -- Fundamentals of Reinforcement Learning -- Elements of RL -- Agent -- Actions -- Environment -- Policy -- An Example of an Autonomous Driving Environment Exercise 1.01: Implementing a Toy Environment Using Python -- The Agent-Environment Interface -- What's the Agent? What's in the Environment? -- Environment Types -- Finite versus Continuous -- Deterministic versus Stochastic -- Fully Observable versus Partially Observable -- POMDP versus MDP -- Single Agents versus Multiple Agents -- An Action and Its Types -- Policy -- Stochastic Policies -- Policy Parameterizations -- Exercise 1.02: Implementing a Linear Policy -- Goals and Rewards -- Why Discount? -- Reinforcement Learning Frameworks -- OpenAI Gym -- Getting Started with Gym -- CartPole Gym Spaces -- Exercise 1.03: Creating a Space for Image Observations -- Rendering an Environment -- Rendering CartPole -- A Reinforcement Learning Loop with Gym -- Exercise 1.04: Implementing the Reinforcement Learning Loop with Gym -- Activity 1.01: Measuring the Performance of a Random Agent -- OpenAI Baselines -- Getting Started with Baselines -- DQN on CartPole -- Applications of Reinforcement Learning -- Games -- Go -- Dota 2 -- StarCraft -- Robot Control -- Autonomous Driving -- Summary -- Chapter 2: Markov Decision Processes and Bellman Equations -- Introduction -- Markov Processes The Markov Property -- Markov Chains -- Markov Reward Processes -- Value Functions and Bellman Equations for MRPs -- Solving Linear Systems of an Equation Using SciPy -- Exercise 2.01: Finding the Value Function in an MRP -- Markov Decision Processes -- The State-Value Function and the Action-Value Function -- Bellman Optimality Equation -- Solving the Bellman Optimality Equation -- Solving MDPs -- Algorithm Categorization -- Value-Based Algorithms -- Policy Search Algorithms -- Linear Programming -- Exercise 2.02: Determining the Best Policy for an MDP Using Linear Programming -- Gridworld Activity 2.01: Solving Gridworld -- Summary -- Chapter 3: Deep Learning in Practice with TensorFlow 2 -- Introduction -- An Introduction to TensorFlow and Keras -- TensorFlow -- Keras -- Exercise 3.01: Building a Sequential Model with the Keras High-Level API -- How to Implement a Neural Network Using TensorFlow -- Model Creation -- Model Training -- Loss Function Definition -- Optimizer Choice -- Learning Rate Scheduling -- Feature Normalization -- Model Validation -- Performance Metrics -- Model Improvement -- Overfitting -- Regularization -- Early Stopping -- Dropout -- Data Augmentation Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast |
subject_GND | http://id.loc.gov/authorities/subjects/sh92000704 http://id.loc.gov/authorities/subjects/sh85003487 https://id.nlm.nih.gov/mesh/D000465 |
title | The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_auth | The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_exact_search | The Reinforcement Learning Workshop Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_full | The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_fullStr | The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_full_unstemmed | The Reinforcement Learning Workshop [electronic resource] : Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
title_short | The Reinforcement Learning Workshop |
title_sort | reinforcement learning workshop learn how to apply cutting edge reinforcement learning algorithms to a wide range of control problems |
title_sub | Learn How to Apply Cutting-Edge Reinforcement Learning Algorithms to a Wide Range of Control Problems. |
topic | Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Algorithms. http://id.loc.gov/authorities/subjects/sh85003487 Algorithms https://id.nlm.nih.gov/mesh/D000465 Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. aat Programming & scripting languages: general. bicssc Artificial intelligence. bicssc Neural networks & fuzzy systems. bicssc Computers Intelligence (AI) & Semantics. bisacsh Computers Neural Networks. bisacsh Computers Programming Languages Python. bisacsh Algorithms fast Reinforcement learning fast |
topic_facet | Reinforcement learning. Algorithms. Algorithms Apprentissage par renforcement (Intelligence artificielle) Algorithmes. algorithms. Programming & scripting languages: general. Artificial intelligence. Neural networks & fuzzy systems. Computers Intelligence (AI) & Semantics. Computers Neural Networks. Computers Programming Languages Python. Reinforcement learning |
url | https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2575333 |
work_keys_str_mv | AT palmasalessandro thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT ghelfiemanuele thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT petrealexandragalina thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT kulkarnimayur thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nsanand thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nguyenquan thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT senaritra thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT soanthony thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT basaksaikat thereinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT palmasalessandro reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT ghelfiemanuele reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT petrealexandragalina reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT kulkarnimayur reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nsanand reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT nguyenquan reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT senaritra reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT soanthony reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems AT basaksaikat reinforcementlearningworkshoplearnhowtoapplycuttingedgereinforcementlearningalgorithmstoawiderangeofcontrolproblems |