Adaptive representations for reinforcement learning
Main author: | Whiteson, Shimon |
---|---|
Format: | Book |
Language: | English |
Published: | Berlin ; Heidelberg : Springer, 2010 |
Series: | Studies in computational intelligence ; 291 |
Subjects: | Reinforcement learning (artificial intelligence) |
Online access: | Content description, Table of contents |
Physical description: | XIII, 116 pp., diagrams |
ISBN: | 9783642139314 |
Internal format
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV036799987 | ||
003 | DE-604 | ||
007 | t | ||
008 | 101126s2010 d||| |||| 00||| eng d | ||
015 | |a 10,N21 |2 dnb | ||
016 | 7 | |a 1002680387 |2 DE-101 | |
020 | |a 9783642139314 |c GB. : EUR 106.95 (freier Pr.), sfr 155.50 (freier Pr.) |9 978-3-642-13931-4 | ||
024 | 3 | |a 9783642139314 | |
035 | |a (OCoLC)699733824 | ||
035 | |a (DE-599)DNB1002680387 | ||
040 | |a DE-604 |b ger | ||
041 | 0 | |a eng | |
049 | |a DE-11 |a DE-355 | ||
082 | 0 | |a 006.31 |2 22/ger | |
084 | |a ST 300 |0 (DE-625)143650: |2 rvk | ||
084 | |a 004 |2 sdnb | ||
100 | 1 | |a Whiteson, Shimon |e Verfasser |4 aut | |
245 | 1 | 0 | |a Adaptive representations for reinforcement learning |c Shimon Whiteson |
264 | 1 | |a Berlin ; Heidelberg |b Springer |c 2010 | |
300 | |a XIII, 116 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Studies in computational intelligence |v 291 | |
650 | 0 | 7 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |D s |
689 | 0 | |5 DE-604 | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |t Adaptive Representations for Reinforcement Learning |
830 | 0 | |a Studies in computational intelligence |v 291 |w (DE-604)BV020822171 |9 291 | |
856 | 4 | |q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm |3 Inhaltstext | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-020716201 |
Record in the search index
_version_ | 1805095025360502784 |
---|---|
adam_text |
Contents
1 Introduction
  1.1 Motivation
  1.2 Approach
  1.3 Overview
2 Reinforcement Learning
  2.1 Reinforcement Learning Framework
  2.2 Temporal Difference Methods
  2.3 Policy Search Methods
3 On-Line Evolutionary Computation
  3.1 ε-Greedy Evolution
  3.2 Softmax Evolution
  3.3 Interval Estimation Evolution
  3.4 Testbed Domains
    3.4.1 Mountain Car
    3.4.2 Server Job Scheduling
  3.5 Results
  3.6 Discussion
4 Evolutionary Function Approximation
  4.1 NEAT+Q
  4.2 Results
    4.2.1 Comparing Manual and Evolutionary Function Approximation
    4.2.2 Combining On-Line Evolution with Evolutionary Function Approximation
    4.2.3 Comparing to Other Approaches
    4.2.4 Comparing Darwinian and Lamarckian Approaches
    4.2.5 Continual Learning Tests
  4.3 Discussion
5 Sample-Efficient Evolutionary Function Approximation
  5.1 Sample-Efficient NEAT+Q
  5.2 Results
  5.3 Discussion
6 Automatic Feature Selection for Reinforcement Learning
  6.1 FS-NEAT
  6.2 Testbed Domain
  6.3 Results
  6.4 Discussion
7 Adaptive Tile Coding
  7.1 Background
    7.1.1 Tile Coding
  7.2 Method
    7.2.1 When to Split
    7.2.2 Where to Split
  7.3 Testbed Domains
  7.4 Results
  7.5 Discussion
8 Related Work
  8.1 Optimizing Representations
    8.1.1 Supervised Learning
    8.1.2 Reinforcement Learning
    8.1.3 Evolutionary Computation
  8.2 Combining Evolution and Learning
    8.2.1 Applications to Supervised Learning
    8.2.2 Applications to Reinforcement Learning
  8.3 Balancing Exploration and Exploitation
    8.3.1 k-Armed Bandit Problem
    8.3.2 Associative Search
    8.3.3 Reinforcement Learning
  8.4 Feature Selection
    8.4.1 Filters
    8.4.2 Wrappers
    8.4.3 FS-NEAT
9 Conclusion
  9.1 Primary Conclusions
  9.2 Negative Results
    9.2.1 Combining FS-NEAT with NEAT+Q
    9.2.2 Feature Selection in Adaptive Tile Coding
    9.2.3 Fitness Functions Based on Bellman Error
  9.3 Broader Implications
    9.3.1 Stochastic vs. Deterministic Domains
    9.3.2 The Value Function Gamble
    9.3.3 The Role of Search in Adaptive Representations
  9.4 Future Work
    9.4.1 Non-Stationarity
    9.4.2 Steady-State Evolutionary Computation
    9.4.3 Model-Based Reinforcement Learning
  9.5 Final Remarks
A Statistical Significance
References
Bibliographic information: http://d-nb.info/1002680387 |
any_adam_object | 1 |
author | Whiteson, Shimon |
author_facet | Whiteson, Shimon |
author_role | aut |
author_sort | Whiteson, Shimon |
author_variant | s w sw |
building | Verbundindex |
bvnumber | BV036799987 |
classification_rvk | ST 300 |
ctrlnum | (OCoLC)699733824 (DE-599)DNB1002680387 |
dewey-full | 006.31 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.31 |
dewey-search | 006.31 |
dewey-sort | 16.31 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 cb4500</leader><controlfield tag="001">BV036799987</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">101126s2010 d||| |||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">10,N21</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1002680387</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642139314</subfield><subfield code="c">GB. : EUR 106.95 (freier Pr.), sfr 155.50 (freier Pr.)</subfield><subfield code="9">978-3-642-13931-4</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783642139314</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)699733824</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1002680387</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-11</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 300</subfield><subfield code="0">(DE-625)143650:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Whiteson, Shimon</subfield><subfield code="e">Verfasser</subfield><subfield 
code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Adaptive representations for reinforcement learning</subfield><subfield code="c">Shimon Whiteson</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin ; Heidelberg</subfield><subfield code="b">Springer</subfield><subfield code="c">2010</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIII, 116 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">291</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="t">Adaptive Representations for Reinforcement Learning</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">291</subfield><subfield 
code="w">(DE-604)BV020822171</subfield><subfield code="9">291</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020716201</subfield></datafield></record></collection> |
id | DE-604.BV036799987 |
illustrated | Illustrated |
indexdate | 2024-07-20T10:52:43Z |
institution | BVB |
isbn | 9783642139314 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-020716201 |
oclc_num | 699733824 |
open_access_boolean | |
owner | DE-11 DE-355 DE-BY-UBR |
owner_facet | DE-11 DE-355 DE-BY-UBR |
physical | XIII, 116 S. graph. Darst. |
publishDate | 2010 |
publishDateSearch | 2010 |
publishDateSort | 2010 |
publisher | Springer |
record_format | marc |
series | Studies in computational intelligence |
series2 | Studies in computational intelligence |
spelling | Whiteson, Shimon Verfasser aut Adaptive representations for reinforcement learning Shimon Whiteson Berlin ; Heidelberg Springer 2010 XIII, 116 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Studies in computational intelligence 291 Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd rswk-swf Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 s DE-604 Erscheint auch als Online-Ausgabe Adaptive Representations for Reinforcement Learning Studies in computational intelligence 291 (DE-604)BV020822171 291 text/html http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm Inhaltstext DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Whiteson, Shimon Adaptive representations for reinforcement learning Studies in computational intelligence Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd |
subject_GND | (DE-588)4825546-4 |
title | Adaptive representations for reinforcement learning |
title_auth | Adaptive representations for reinforcement learning |
title_exact_search | Adaptive representations for reinforcement learning |
title_full | Adaptive representations for reinforcement learning Shimon Whiteson |
title_fullStr | Adaptive representations for reinforcement learning Shimon Whiteson |
title_full_unstemmed | Adaptive representations for reinforcement learning Shimon Whiteson |
title_short | Adaptive representations for reinforcement learning |
title_sort | adaptive representations for reinforcement learning |
topic | Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd |
topic_facet | Bestärkendes Lernen Künstliche Intelligenz |
url | http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV020822171 |
work_keys_str_mv | AT whitesonshimon adaptiverepresentationsforreinforcementlearning |