Availability: Deep reinforcement learning hands-on

Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more

This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL met...

Bibliographic details
Main author:	Lapan, Maxim (Author)
Format:	Electronic eBook
Language:	English
Published:	Birmingham, UK : Packt Publishing, 2018.
Subjects:	Reinforcement learning; Machine learning; Natural language processing (Computer science); Artificial intelligence; Natural Language Processing; Artificial Intelligence; Machine Learning; Apprentissage par renforcement (Intelligence artificielle); Apprentissage automatique; Traitement automatique des langues naturelles; Intelligence artificielle; artificial intelligence; COMPUTERS > General; Artificial intelligence; Machine learning; Reinforcement learning; Electronic book.
Online access:	Full text
Summary:	This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL methods to training your agent to beat arcade ...
Item description:	"Expert insight."
Physical description:	1 online resource (1 volume) : illustrations
Bibliography:	Includes bibliographical references and index.
ISBN:	9781788839303 1788839307 1788834240 9781788834247

Internal format

MARC


LEADER	00000cam a2200000 i 4500
001	ZDB-4-EBA-on1046682461
003	OCoLC
005	20241004212047.0
006	m o d
007	cr unu||||||||
008	180731s2018 enka ob 001 0 eng d
040			|a UMI |b eng |e rda |e pn |c UMI |d STF |d TOH |d OCLCF |d EBLCP |d N$T |d MERUC |d ZCU |d NLE |d TEFOD |d CEF |d UKMGB |d OCLCQ |d G3B |d S9I |d UAB |d C6I |d OCLCQ |d UX1 |d K6U |d OCLCQ |d OCLCO |d AAA |d OCLCQ |d PSYSI |d OCLCQ |d OCLCO |d OCLCL |d SXB |d HOPLA
016	7		|a 018936109 |2 Uk
019			|a 1042318736 |a 1175638157
020			|a 9781788839303
020			|a 1788839307
020			|a 1788834240
020			|a 9781788834247
020			|z 9781788834247
024	3		|a 9781788834247
035			|a (OCoLC)1046682461 |z (OCoLC)1042318736 |z (OCoLC)1175638157
037			|a CL0500000982 |b Safari Books Online
050		4	|a Q325.5
072		7	|a COM |x 000000 |2 bisacsh
082	7		|a 006.31 |2 23
049			|a MAIN
100	1		|a Lapan, Maxim, |e author.
245	1	0	|a Deep reinforcement learning hands-on : |b apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / |c Maxim Lapan.
264		1	|a Birmingham, UK : |b Packt Publishing, |c 2018.
300			|a 1 online resource (1 volume) : |b illustrations
336			|a text |b txt |2 rdacontent
337			|a computer |b c |2 rdamedia
338			|a online resource |b cr |2 rdacarrier
347			|a data file
588	0		|a Online resource; title from cover (Safari, viewed July 30, 2018).
500			|a "Expert insight."
504			|a Includes bibliographical references and index.
505	0		|a Table of Contents: What is Reinforcement Learning?; OpenAI Gym; Deep Learning with PyTorch; The Cross-Entropy Method; Tabular Learning and the Bellman Equation; Deep Q-Networks; DQN Extensions; Stocks Trading Using RL; Policy Gradients -- An Alternative; The Actor-Critic Method; Asynchronous Advantage Actor-Critic; Chatbots Training with RL; Web Navigation; Continuous Action Space; Trust Regions -- TRPO, PPO, and ACKTR; Black-Box Optimization in RL; Beyond Model-Free -- Imagination; AlphaGo Zero.
520			|a This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL methods to training your agent to beat arcade ...
650		0	|a Reinforcement learning. |0 http://id.loc.gov/authorities/subjects/sh92000704
650		0	|a Machine learning. |0 http://id.loc.gov/authorities/subjects/sh85079324
650		0	|a Natural language processing (Computer science) |0 http://id.loc.gov/authorities/subjects/sh88002425
650		0	|a Artificial intelligence. |0 http://id.loc.gov/authorities/subjects/sh85008180
650		2	|a Natural Language Processing |0 https://id.nlm.nih.gov/mesh/D009323
650		2	|a Artificial Intelligence |0 https://id.nlm.nih.gov/mesh/D001185
650		2	|a Machine Learning |0 https://id.nlm.nih.gov/mesh/D000069550
650		6	|a Apprentissage par renforcement (Intelligence artificielle)
650		6	|a Apprentissage automatique.
650		6	|a Traitement automatique des langues naturelles.
650		6	|a Intelligence artificielle.
650		7	|a artificial intelligence. |2 aat
650		7	|a COMPUTERS |x General. |2 bisacsh
650		7	|a Artificial intelligence |2 fast
650		7	|a Machine learning |2 fast
650		7	|a Natural language processing (Computer science) |2 fast
650		7	|a Reinforcement learning |2 fast
655		4	|a Electronic book.
758			|i has work: |a Deep Reinforcement Learning Hands-On (Text) |1 https://id.oclc.org/worldcat/entity/E39PCXfrHBJd8R88mbQmX6bWpd |4 https://id.oclc.org/worldcat/ontology/hasWork
776	0	8	|i Print version: |a Lapan, Maxim. |t Deep Reinforcement Learning Hands-On : Apply Modern RL Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More. |d Birmingham : Packt Publishing Ltd, ©2018 |z 9781788834247
856	4	0	|l FWS01 |p ZDB-4-EBA |q FWS_PDA_EBA |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1837369 |3 Volltext
938			|a hoopla Digital |b HOPL |n MWT13589389
938			|a EBL - Ebook Library |b EBLB |n EBL5434975
938			|a EBSCOhost |b EBSC |n 1837369
994			|a 92 |b GEBAY
912			|a ZDB-4-EBA
049			|a DE-863

Record in the search index

DE-BY-FWS_katkey	ZDB-4-EBA-on1046682461
_version_	1816882467138699264
adam_text
any_adam_object
author	Lapan, Maxim
author_facet	Lapan, Maxim
author_role	aut
author_sort	Lapan, Maxim
author_variant	m l ml
building	Verbundindex
bvnumber	localFWS
callnumber-first	Q - Science
callnumber-label	Q325
callnumber-raw	Q325.5
callnumber-search	Q325.5
callnumber-sort	Q 3325.5
callnumber-subject	Q - General Science
collection	ZDB-4-EBA
contents	Table of Contents: What is Reinforcement Learning?; OpenAI Gym; Deep Learning with PyTorch; The Cross-Entropy Method; Tabular Learning and the Bellman Equation; Deep Q-Networks; DQN Extensions; Stocks Trading Using RL; Policy Gradients -- An Alternative; The Actor-Critic Method; Asynchronous Advantage Actor-Critic; Chatbots Training with RL; Web Navigation; Continuous Action Space; Trust Regions -- TRPO, PPO, and ACKTR; Black-Box Optimization in RL; Beyond Model-Free -- Imagination; AlphaGo Zero.
ctrlnum	(OCoLC)1046682461
dewey-full	006.31
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.31
dewey-search	006.31
dewey-sort	16.31
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Electronic eBook
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>04280cam a2200733 i 4500</leader><controlfield tag="001">ZDB-4-EBA-on1046682461</controlfield><controlfield tag="003">OCoLC</controlfield><controlfield tag="005">20241004212047.0</controlfield><controlfield tag="006">m o d </controlfield><controlfield tag="007">cr unu\|\|\|\|\|\|\|\|</controlfield><controlfield tag="008">180731s2018 enka ob 001 0 eng d</controlfield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">UMI</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">UMI</subfield><subfield code="d">STF</subfield><subfield code="d">TOH</subfield><subfield code="d">OCLCF</subfield><subfield code="d">EBLCP</subfield><subfield code="d">N$T</subfield><subfield code="d">MERUC</subfield><subfield code="d">ZCU</subfield><subfield code="d">NLE</subfield><subfield code="d">TEFOD</subfield><subfield code="d">CEF</subfield><subfield code="d">UKMGB</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">G3B</subfield><subfield code="d">S9I</subfield><subfield code="d">UAB</subfield><subfield code="d">C6I</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">UX1</subfield><subfield code="d">K6U</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">AAA</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">PSYSI</subfield><subfield code="d">OCLCQ</subfield><subfield code="d">OCLCO</subfield><subfield code="d">OCLCL</subfield><subfield code="d">SXB</subfield><subfield code="d">HOPLA</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">018936109</subfield><subfield code="2">Uk</subfield></datafield><datafield tag="019" ind1=" " ind2=" "><subfield code="a">1042318736</subfield><subfield code="a">1175638157</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield 
code="a">9781788839303</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1788839307</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1788834240</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781788834247</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9781788834247</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9781788834247</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1046682461</subfield><subfield code="z">(OCoLC)1042318736</subfield><subfield code="z">(OCoLC)1175638157</subfield></datafield><datafield tag="037" ind1=" " ind2=" "><subfield code="a">CL0500000982</subfield><subfield code="b">Safari Books Online</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">Q325.5</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">COM</subfield><subfield code="x">000000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="7" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">23</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">MAIN</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Lapan, Maxim,</subfield><subfield code="e">author.</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Deep reinforcement learning hands-on :</subfield><subfield code="b">apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /</subfield><subfield code="c">Maxim Lapan.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham, UK :</subfield><subfield code="b">Packt Publishing,</subfield><subfield code="c">2018.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (1 volume) 
:</subfield><subfield code="b">illustrations</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="347" ind1=" " ind2=" "><subfield code="a">data file</subfield></datafield><datafield tag="588" ind1="0" ind2=" "><subfield code="a">Online resource; title from cover (Safari, viewed July 30, 2018).</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">"Expert insight."</subfield></datafield><datafield tag="504" ind1=" " ind2=" "><subfield code="a">Includes bibliographical references and index.</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Table of ContentsWhat is Reinforcement Learning?OpenAI GymDeep Learning with PyTorchThe Cross-Entropy MethodTabular Learning and the Bellman EquationDeep Q-NetworksDQN ExtensionsStocks Trading Using RLPolicy Gradients -- An AlternativeThe Actor-Critic MethodAsynchronous Advantage Actor-CriticChatbots Training with RL Web NavigationContinuous Action SpaceTrust Regions -- TRPO, PPO, and ACKTRBlack-Box Optimization in RLBeyond Model-Free -- ImaginationAlphaGo Zero.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. 
Apply deep RL methods to training your agent to beat arcade ...</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Reinforcement learning.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh92000704</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Machine learning.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85079324</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Natural language processing (Computer science)</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh88002425</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Artificial intelligence.</subfield><subfield code="0">http://id.loc.gov/authorities/subjects/sh85008180</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Natural Language Processing</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D009323</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Artificial Intelligence</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D001185</subfield></datafield><datafield tag="650" ind1=" " ind2="2"><subfield code="a">Machine Learning</subfield><subfield code="0">https://id.nlm.nih.gov/mesh/D000069550</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Apprentissage par renforcement (Intelligence artificielle)</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Apprentissage automatique.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Traitement automatique des langues naturelles.</subfield></datafield><datafield tag="650" ind1=" " ind2="6"><subfield code="a">Intelligence artificielle.</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">artificial intelligence.</subfield><subfield code="2">aat</subfield></datafield><datafield tag="650" ind1=" " 
ind2="7"><subfield code="a">COMPUTERS</subfield><subfield code="x">General.</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Artificial intelligence</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Machine learning</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Natural language processing (Computer science)</subfield><subfield code="2">fast</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Reinforcement learning</subfield><subfield code="2">fast</subfield></datafield><datafield tag="655" ind1=" " ind2="4"><subfield code="a">Electronic book.</subfield></datafield><datafield tag="758" ind1=" " ind2=" "><subfield code="i">has work:</subfield><subfield code="a">Deep Reinforcement Learning Hands-On (Text)</subfield><subfield code="1">https://id.oclc.org/worldcat/entity/E39PCXfrHBJd8R88mbQmX6bWpd</subfield><subfield code="4">https://id.oclc.org/worldcat/ontology/hasWork</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Lapan, Maxim.</subfield><subfield code="t">Deep Reinforcement Learning Hands-On : Apply Modern RL Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More.</subfield><subfield code="d">Birmingham : Packt Publishing Ltd, ©2018</subfield><subfield code="z">9781788834247</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="l">FWS01</subfield><subfield code="p">ZDB-4-EBA</subfield><subfield code="q">FWS_PDA_EBA</subfield><subfield code="u">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1837369</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">hoopla Digital</subfield><subfield 
code="b">HOPL</subfield><subfield code="n">MWT13589389</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBL - Ebook Library</subfield><subfield code="b">EBLB</subfield><subfield code="n">EBL5434975</subfield></datafield><datafield tag="938" ind1=" " ind2=" "><subfield code="a">EBSCOhost</subfield><subfield code="b">EBSC</subfield><subfield code="n">1837369</subfield></datafield><datafield tag="994" ind1=" " ind2=" "><subfield code="a">92</subfield><subfield code="b">GEBAY</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-4-EBA</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-863</subfield></datafield></record></collection>
genre	Electronic book.
genre_facet	Electronic book.
id	ZDB-4-EBA-on1046682461
illustrated	Illustrated
indexdate	2024-11-27T13:29:04Z
institution	BVB
isbn	9781788839303 1788839307 1788834240 9781788834247
language	English
oclc_num	1046682461
open_access_boolean
owner	MAIN DE-863 DE-BY-FWS
owner_facet	MAIN DE-863 DE-BY-FWS
physical	1 online resource (1 volume) : illustrations
psigel	ZDB-4-EBA
publishDate	2018
publishDateSearch	2018
publishDateSort	2018
publisher	Packt Publishing,
record_format	marc
spelling	Lapan, Maxim, author. Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan. Birmingham, UK : Packt Publishing, 2018. 1 online resource (1 volume) : illustrations text txt rdacontent computer c rdamedia online resource cr rdacarrier data file Online resource; title from cover (Safari, viewed July 30, 2018). "Expert insight." Includes bibliographical references and index. Table of Contents: What is Reinforcement Learning?; OpenAI Gym; Deep Learning with PyTorch; The Cross-Entropy Method; Tabular Learning and the Bellman Equation; Deep Q-Networks; DQN Extensions; Stocks Trading Using RL; Policy Gradients -- An Alternative; The Actor-Critic Method; Asynchronous Advantage Actor-Critic; Chatbots Training with RL; Web Navigation; Continuous Action Space; Trust Regions -- TRPO, PPO, and ACKTR; Black-Box Optimization in RL; Beyond Model-Free -- Imagination; AlphaGo Zero. This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL methods to training your agent to beat arcade ... Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Machine learning. http://id.loc.gov/authorities/subjects/sh85079324 Natural language processing (Computer science) http://id.loc.gov/authorities/subjects/sh88002425 Artificial intelligence. http://id.loc.gov/authorities/subjects/sh85008180 Natural Language Processing https://id.nlm.nih.gov/mesh/D009323 Artificial Intelligence https://id.nlm.nih.gov/mesh/D001185 Machine Learning https://id.nlm.nih.gov/mesh/D000069550 Apprentissage par renforcement (Intelligence artificielle) Apprentissage automatique. Traitement automatique des langues naturelles. Intelligence artificielle. artificial intelligence. aat COMPUTERS General. bisacsh Artificial intelligence fast Machine learning fast Natural language processing (Computer science) fast Reinforcement learning fast Electronic book. has work: Deep Reinforcement Learning Hands-On (Text) https://id.oclc.org/worldcat/entity/E39PCXfrHBJd8R88mbQmX6bWpd https://id.oclc.org/worldcat/ontology/hasWork Print version: Lapan, Maxim. Deep Reinforcement Learning Hands-On : Apply Modern RL Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More. Birmingham : Packt Publishing Ltd, ©2018 9781788834247 FWS01 ZDB-4-EBA FWS_PDA_EBA https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1837369 Volltext
spellingShingle	Lapan, Maxim Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Table of Contents: What is Reinforcement Learning?; OpenAI Gym; Deep Learning with PyTorch; The Cross-Entropy Method; Tabular Learning and the Bellman Equation; Deep Q-Networks; DQN Extensions; Stocks Trading Using RL; Policy Gradients -- An Alternative; The Actor-Critic Method; Asynchronous Advantage Actor-Critic; Chatbots Training with RL; Web Navigation; Continuous Action Space; Trust Regions -- TRPO, PPO, and ACKTR; Black-Box Optimization in RL; Beyond Model-Free -- Imagination; AlphaGo Zero. Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Machine learning. http://id.loc.gov/authorities/subjects/sh85079324 Natural language processing (Computer science) http://id.loc.gov/authorities/subjects/sh88002425 Artificial intelligence. http://id.loc.gov/authorities/subjects/sh85008180 Natural Language Processing https://id.nlm.nih.gov/mesh/D009323 Artificial Intelligence https://id.nlm.nih.gov/mesh/D001185 Machine Learning https://id.nlm.nih.gov/mesh/D000069550 Apprentissage par renforcement (Intelligence artificielle) Apprentissage automatique. Traitement automatique des langues naturelles. Intelligence artificielle. artificial intelligence. aat COMPUTERS General. bisacsh Artificial intelligence fast Machine learning fast Natural language processing (Computer science) fast Reinforcement learning fast
subject_GND	http://id.loc.gov/authorities/subjects/sh92000704 http://id.loc.gov/authorities/subjects/sh85079324 http://id.loc.gov/authorities/subjects/sh88002425 http://id.loc.gov/authorities/subjects/sh85008180 https://id.nlm.nih.gov/mesh/D009323 https://id.nlm.nih.gov/mesh/D001185 https://id.nlm.nih.gov/mesh/D000069550
title	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /
title_auth	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /
title_exact_search	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /
title_full	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan.
title_fullStr	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan.
title_full_unstemmed	Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan.
title_short	Deep reinforcement learning hands-on :
title_sort	deep reinforcement learning hands on apply modern rl methods with deep q networks value iteration policy gradients trpo alphago zero and more
title_sub	apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /
topic	Reinforcement learning. http://id.loc.gov/authorities/subjects/sh92000704 Machine learning. http://id.loc.gov/authorities/subjects/sh85079324 Natural language processing (Computer science) http://id.loc.gov/authorities/subjects/sh88002425 Artificial intelligence. http://id.loc.gov/authorities/subjects/sh85008180 Natural Language Processing https://id.nlm.nih.gov/mesh/D009323 Artificial Intelligence https://id.nlm.nih.gov/mesh/D001185 Machine Learning https://id.nlm.nih.gov/mesh/D000069550 Apprentissage par renforcement (Intelligence artificielle) Apprentissage automatique. Traitement automatique des langues naturelles. Intelligence artificielle. artificial intelligence. aat COMPUTERS General. bisacsh Artificial intelligence fast Machine learning fast Natural language processing (Computer science) fast Reinforcement learning fast
topic_facet	Reinforcement learning. Machine learning. Natural language processing (Computer science) Artificial intelligence. Natural Language Processing Artificial Intelligence Machine Learning Apprentissage par renforcement (Intelligence artificielle) Apprentissage automatique. Traitement automatique des langues naturelles. Intelligence artificielle. artificial intelligence. COMPUTERS General. Artificial intelligence Machine learning Reinforcement learning Electronic book.
url	https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1837369
work_keys_str_mv	AT lapanmaxim deepreinforcementlearninghandsonapplymodernrlmethodswithdeepqnetworksvalueiterationpolicygradientstrpoalphagozeroandmore

Availability

No print copy is available.
