Deep Reinforcement Learning: Fundamentals, Research and Applications
Other authors: | Dong, Hao; Ding, Zihan; Zhang, Shanghang |
---|---|
Format: | Book |
Language: | English |
Published: | Singapore : Springer, [2020] |
Subjects: | |
Online access: | Table of contents |
Description: | xxvii, 514 pages : illustrations, diagrams |
ISBN: | 9789811540943 |
Internal format
MARC
LEADER | 00000nam a2200000zc 4500 | ||
---|---|---|---|
001 | BV047161029 | ||
003 | DE-604 | ||
005 | 20210329 | ||
007 | t | ||
008 | 210224s2020 a||| |||| 00||| eng d | ||
020 | |a 9789811540943 |c hbk |9 978-981-15-4094-3 | ||
035 | |a (OCoLC)1245330641 | ||
035 | |a (DE-599)BVBBV047161029 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-384 | ||
082 | 0 | |a 006.31 |2 23 | |
084 | |a QH 500 |0 (DE-625)141607: |2 rvk | ||
245 | 1 | 0 | |a Deep Reinforcement Learning |b Fundamentals, Research and Applications |c Hao Dong, Zihan Ding, Shanghang Zhang, Editors |
264 | 1 | |a Singapore |b Springer |c [2020] | |
300 | |a xxvii, 514 Seiten |b Illustrationen, Diagramme | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
650 | 4 | |a Machine Learning | |
650 | 4 | |a Data Mining and Knowledge Discovery | |
650 | 4 | |a Image Processing and Computer Vision | |
650 | 4 | |a Robotics | |
650 | 4 | |a Programming Techniques | |
650 | 4 | |a Natural Language Processing (NLP) | |
650 | 4 | |a Machine learning | |
650 | 4 | |a Data mining | |
650 | 4 | |a Optical data processing | |
650 | 4 | |a Robotics | |
650 | 4 | |a Computer programming | |
650 | 4 | |a Natural language processing (Computer science) | |
650 | 0 | 7 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Künstliche Intelligenz |0 (DE-588)4033447-8 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)4143413-4 |a Aufsatzsammlung |2 gnd-content | |
689 | 0 | 0 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |D s |
689 | 0 | 1 | |a Künstliche Intelligenz |0 (DE-588)4033447-8 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Dong, Hao |0 (DE-588)1221983482 |4 edt | |
700 | 1 | |a Ding, Zihan |4 edt | |
700 | 1 | |a Zhang, Shanghang |4 edt | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-981-154-095-0 |
856 | 4 | 2 | |m Digitalisierung UB Augsburg - ADAM Catalogue Enrichment |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=032566638&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-032566638 |
Record in the search index
_version_ | 1804182236166094848 |
---|---|
adam_text | Contents
Part I Fundamentals
1 Introduction to Deep Learning (Jingqing Zhang, Hang Yuan, and Hao Dong) 3
2 Introduction to Reinforcement Learning (Zihan Ding, Yanhua Huang, Hang Yuan, and Hao Dong) 47
3 Taxonomy of Reinforcement Learning Algorithms (Hongming Zhang and Tianyang Yu) 125
4 Deep Q-Networks (Yanhua Huang) 135
5 Policy Gradient (Ruitong Huang, Tianyang Yu, Zihan Ding and Shanghang Zhang) 161
6 Combine Deep Q-Networks with Actor-Critic (Hongming Zhang, Tianyang Yu and Ruitong Huang) 213
Part II Research
7 Challenges of Reinforcement Learning (Zihan Ding and Hao Dong) 249
8 Imitation Learning (Zihan Ding) 273
9 Integrating Learning and Planning (Huaqing Zhang, Ruitong Huang, and Shanghang Zhang) 307
10 Hierarchical Reinforcement Learning (Yanhua Huang) 317
11 Multi-Agent Reinforcement Learning (Huaqing Zhang and Shanghang Zhang) 335
12 Parallel Computing (Huaqing Zhang and Tianyang Yu) 347
Part III Applications
13 Learning to Run (Zihan Ding and Hao Dong) 367
14 Robust Image Enhancement (Yanhua Huang) 379
15 AlphaZero (Hongming Zhang and Tianyang Yu) 391
16 Robot Learning in Simulation (Zihan Ding and Hao Dong) 417
17 Arena Platform for Multi-Agent Reinforcement Learning (Zihan Ding) 443
18 Tricks of Implementation (Zihan Ding and Hao Dong) 467
Part IV Summary
19 Algorithm Table (Zihan Ding) 485
20 Algorithm Cheatsheet (Zihan Ding) 489
|
any_adam_object | 1 |
any_adam_object_boolean | 1 |
author2 | Dong, Hao Ding, Zihan Zhang, Shanghang |
author2_role | edt edt edt |
author2_variant | h d hd z d zd s z sz |
author_GND | (DE-588)1221983482 |
author_facet | Dong, Hao Ding, Zihan Zhang, Shanghang |
building | Verbundindex |
bvnumber | BV047161029 |
classification_rvk | QH 500 |
ctrlnum | (OCoLC)1245330641 (DE-599)BVBBV047161029 |
dewey-full | 006.31 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.31 |
dewey-search | 006.31 |
dewey-sort | 16.31 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik Wirtschaftswissenschaften |
discipline_str_mv | Informatik Wirtschaftswissenschaften |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02177nam a2200541zc 4500</leader><controlfield tag="001">BV047161029</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20210329 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">210224s2020 a||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9789811540943</subfield><subfield code="c">hbk</subfield><subfield code="9">978-981-15-4094-3</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1245330641</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV047161029</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-384</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">QH 500</subfield><subfield code="0">(DE-625)141607:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Deep Reinforcement Learning</subfield><subfield code="b">Fundamentals, Research and Applications</subfield><subfield code="c">Hao Dong, Zihan Ding, Shanghang Zhang, Editors</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Singapore</subfield><subfield code="b">Springer</subfield><subfield code="c">[2020]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xxvii, 514 Seiten</subfield><subfield code="b">Illustrationen, Diagramme</subfield></datafield><datafield tag="336" ind1=" " ind2=" 
"><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Machine Learning</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data Mining and Knowledge Discovery</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Image Processing and Computer Vision</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Robotics</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Programming Techniques</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Natural Language Processing (NLP)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Machine learning</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Optical data processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Robotics</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Computer programming</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Natural language processing (Computer science)</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4033447-8</subfield><subfield 
code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)4143413-4</subfield><subfield code="a">Aufsatzsammlung</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4033447-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Dong, Hao</subfield><subfield code="0">(DE-588)1221983482</subfield><subfield code="4">edt</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Ding, Zihan</subfield><subfield code="4">edt</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhang, Shanghang</subfield><subfield code="4">edt</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-981-154-095-0</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Augsburg - ADAM Catalogue Enrichment</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=032566638&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-032566638</subfield></datafield></record></collection> |
genre | (DE-588)4143413-4 Aufsatzsammlung gnd-content |
genre_facet | Aufsatzsammlung |
id | DE-604.BV047161029 |
illustrated | Illustrated |
index_date | 2024-07-03T16:40:42Z |
indexdate | 2024-07-10T09:04:20Z |
institution | BVB |
isbn | 9789811540943 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-032566638 |
oclc_num | 1245330641 |
open_access_boolean | |
owner | DE-384 |
owner_facet | DE-384 |
physical | xxvii, 514 Seiten Illustrationen, Diagramme |
publishDate | 2020 |
publishDateSearch | 2020 |
publishDateSort | 2020 |
publisher | Springer |
record_format | marc |
spelling | Deep Reinforcement Learning Fundamentals, Research and Applications Hao Dong, Zihan Ding, Shanghang Zhang, Editors Singapore Springer [2020] xxvii, 514 Seiten Illustrationen, Diagramme txt rdacontent n rdamedia nc rdacarrier Machine Learning Data Mining and Knowledge Discovery Image Processing and Computer Vision Robotics Programming Techniques Natural Language Processing (NLP) Machine learning Data mining Optical data processing Computer programming Natural language processing (Computer science) Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd rswk-swf Künstliche Intelligenz (DE-588)4033447-8 gnd rswk-swf (DE-588)4143413-4 Aufsatzsammlung gnd-content Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 s Künstliche Intelligenz (DE-588)4033447-8 s DE-604 Dong, Hao (DE-588)1221983482 edt Ding, Zihan edt Zhang, Shanghang edt Erscheint auch als Online-Ausgabe 978-981-154-095-0 Digitalisierung UB Augsburg - ADAM Catalogue Enrichment application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=032566638&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Deep Reinforcement Learning Fundamentals, Research and Applications Machine Learning Data Mining and Knowledge Discovery Image Processing and Computer Vision Robotics Programming Techniques Natural Language Processing (NLP) Machine learning Data mining Optical data processing Computer programming Natural language processing (Computer science) Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd Künstliche Intelligenz (DE-588)4033447-8 gnd |
subject_GND | (DE-588)4825546-4 (DE-588)4033447-8 (DE-588)4143413-4 |
title | Deep Reinforcement Learning Fundamentals, Research and Applications |
title_auth | Deep Reinforcement Learning Fundamentals, Research and Applications |
title_exact_search | Deep Reinforcement Learning Fundamentals, Research and Applications |
title_exact_search_txtP | Deep Reinforcement Learning Fundamentals, Research and Applications |
title_full | Deep Reinforcement Learning Fundamentals, Research and Applications Hao Dong, Zihan Ding, Shanghang Zhang, Editors |
title_fullStr | Deep Reinforcement Learning Fundamentals, Research and Applications Hao Dong, Zihan Ding, Shanghang Zhang, Editors |
title_full_unstemmed | Deep Reinforcement Learning Fundamentals, Research and Applications Hao Dong, Zihan Ding, Shanghang Zhang, Editors |
title_short | Deep Reinforcement Learning |
title_sort | deep reinforcement learning fundamentals research and applications |
title_sub | Fundamentals, Research and Applications |
topic | Machine Learning Data Mining and Knowledge Discovery Image Processing and Computer Vision Robotics Programming Techniques Natural Language Processing (NLP) Machine learning Data mining Optical data processing Computer programming Natural language processing (Computer science) Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd Künstliche Intelligenz (DE-588)4033447-8 gnd |
topic_facet | Machine Learning Data Mining and Knowledge Discovery Image Processing and Computer Vision Robotics Programming Techniques Natural Language Processing (NLP) Machine learning Data mining Optical data processing Computer programming Natural language processing (Computer science) Bestärkendes Lernen Künstliche Intelligenz Künstliche Intelligenz Aufsatzsammlung |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=032566638&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT donghao deepreinforcementlearningfundamentalsresearchandapplications AT dingzihan deepreinforcementlearningfundamentalsresearchandapplications AT zhangshanghang deepreinforcementlearningfundamentalsresearchandapplications |