Availability: Adaptive representations for reinforcement learning

Adaptive representations for reinforcement learning

Bibliographic details
Main author:	Whiteson, Shimon (author)
Format:	Book
Language:	English
Published:	Berlin ; Heidelberg : Springer, 2010
Series:	Studies in computational intelligence 291
Subjects:	Bestärkendes Lernen > Künstliche Intelligenz (reinforcement learning > artificial intelligence)
Online access:	Content description; Table of contents
Description:	XIII, 116 pages, illustrations
ISBN:	9783642139314

Internal format

MARC


LEADER	00000nam a2200000 cb4500
001	BV036799987
003	DE-604
007	t
008	101126s2010 d||| |||| 00||| eng d
015			|a 10,N21 |2 dnb
016	7		|a 1002680387 |2 DE-101
020			|a 9783642139314 |c GB. : EUR 106.95 (freier Pr.), sfr 155.50 (freier Pr.) |9 978-3-642-13931-4
024	3		|a 9783642139314
035			|a (OCoLC)699733824
035			|a (DE-599)DNB1002680387
040			|a DE-604 |b ger
041	0		|a eng
049			|a DE-11 |a DE-355
082	0		|a 006.31 |2 22/ger
084			|a ST 300 |0 (DE-625)143650: |2 rvk
084			|a 004 |2 sdnb
100	1		|a Whiteson, Shimon |e Verfasser |4 aut
245	1	0	|a Adaptive representations for reinforcement learning |c Shimon Whiteson
264		1	|a Berlin ; Heidelberg |b Springer |c 2010
300			|a XIII, 116 S. |b graph. Darst.
336			|b txt |2 rdacontent
337			|b n |2 rdamedia
338			|b nc |2 rdacarrier
490	1		|a Studies in computational intelligence |v 291
650	0	7	|a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |2 gnd |9 rswk-swf
689	0	0	|a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |D s
689	0		|5 DE-604
776	0	8	|i Erscheint auch als |n Online-Ausgabe |t Adaptive Representations for Reinforcement Learning
830		0	|a Studies in computational intelligence |v 291 |w (DE-604)BV020822171 |9 291
856	4		|q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm |3 Inhaltstext
856	4	2	|m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis
943	1		|a oai:aleph.bib-bvb.de:BVB01-020716201

Record in the search index

_version_	1805095025360502784
adam_text	Contents
1 Introduction 1; 1.1 Motivation 2; 1.2 Approach 2; 1.3 Overview 4
2 Reinforcement Learning 7; 2.1 Reinforcement Learning Framework 7; 2.2 Temporal Difference Methods 9; 2.3 Policy Search Methods 11
3 On-Line Evolutionary Computation 17; 3.1 ε-Greedy Evolution 20; 3.2 Softmax Evolution 21; 3.3 Interval Estimation Evolution 22; 3.4 Testbed Domains 22; 3.4.1 Mountain Car 23; 3.4.2 Server Job Scheduling 24; 3.5 Results 26; 3.6 Discussion 28
4 Evolutionary Function Approximation 31; 4.1 NEAT+Q 33; 4.2 Results 34; 4.2.1 Comparing Manual and Evolutionary Function Approximation 34; 4.2.2 Combining On-Line Evolution with Evolutionary Function Approximation 37; 4.2.3 Comparing to Other Approaches 38; 4.2.4 Comparing Darwinian and Lamarckian Approaches 42; 4.2.5 Continual Learning Tests 43; 4.3 Discussion 45
5 Sample-Efficient Evolutionary Function Approximation 47; 5.1 Sample-Efficient NEAT+Q 47; 5.2 Results 49; 5.3 Discussion 52
6 Automatic Feature Selection for Reinforcement Learning 53; 6.1 FS-NEAT 54; 6.2 Testbed Domain 56; 6.3 Results 58; 6.4 Discussion 61
7 Adaptive Tile Coding 65; 7.1 Background 66; 7.1.1 Tile Coding 66; 7.2 Method 68; 7.2.1 When to Split 69; 7.2.2 Where to Split 70; 7.2.2.1; 7.2.2.2; 7.3 Testbed Domains 72; 7.4 Results 73; 7.5 Discussion 74
8 Related Work 77; 8.1 Optimizing Representations 77; 8.1.1 Supervised Learning 77; 8.1.2 Reinforcement Learning 78; 8.1.3 Evolutionary Computation 82; 8.2 Combining Evolution and Learning 84; 8.2.1 Applications to Supervised Learning 84; 8.2.2 Applications to Reinforcement Learning 86; 8.3 Balancing Exploration and Exploitation 89; 8.3.1 k-Armed Bandit Problem 89; 8.3.2 Associative Search 90; 8.3.3 Reinforcement Learning 90; 8.4 Feature Selection 92; 8.4.1 Filters 92; 8.4.2 Wrappers 93; 8.4.3 FS-NEAT 93
9 Conclusion 95; 9.1 Primary Conclusions 95; 9.2 Negative Results 96; 9.2.1 Combining FS-NEAT with NEAT+Q 96; 9.2.2 Feature Selection in Adaptive Tile Coding 97; 9.2.3 Fitness Functions Based on Bellman Error 97; 9.3 Broader Implications 98; 9.3.1 Stochastic vs. Deterministic Domains 98; 9.3.2 The Value Function Gamble 99; 9.3.3 The Role of Search in Adaptive Representations 100; 9.4 Future Work 102; 9.4.1 Non-Stationarity 102; 9.4.2 Steady-State Evolutionary Computation 102; 9.4.3 Model-Based Reinforcement Learning 103; 9.5 Final Remarks 104
A Statistical Significance 105
References 107
any_adam_object	1
author	Whiteson, Shimon
author_facet	Whiteson, Shimon
author_role	aut
author_sort	Whiteson, Shimon
author_variant	s w sw
building	Verbundindex
bvnumber	BV036799987
classification_rvk	ST 300
ctrlnum	(OCoLC)699733824 (DE-599)DNB1002680387
dewey-full	006.31
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.31
dewey-search	006.31
dewey-sort	16.31
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 cb4500</leader><controlfield tag="001">BV036799987</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">101126s2010 d||| |||| 00||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">10,N21</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1002680387</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642139314</subfield><subfield code="c">GB. : EUR 106.95 (freier Pr.), sfr 155.50 (freier Pr.)</subfield><subfield code="9">978-3-642-13931-4</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783642139314</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)699733824</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1002680387</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-11</subfield><subfield code="a">DE-355</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 300</subfield><subfield code="0">(DE-625)143650:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Whiteson, Shimon</subfield><subfield code="e">Verfasser</subfield><subfield 
code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Adaptive representations for reinforcement learning</subfield><subfield code="c">Shimon Whiteson</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin ; Heidelberg</subfield><subfield code="b">Springer</subfield><subfield code="c">2010</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIII, 116 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">291</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="t">Adaptive Representations for Reinforcement Learning</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">291</subfield><subfield 
code="w">(DE-604)BV020822171</subfield><subfield code="9">291</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&amp;prov=M&amp;dok%5Fvar=1&amp;dok%5Fext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&amp;doc_library=BVB01&amp;local_base=BVB01&amp;doc_number=020716201&amp;sequence=000001&amp;line_number=0001&amp;func_code=DB_RECORDS&amp;service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020716201</subfield></datafield></record></collection>
id	DE-604.BV036799987
illustrated	Illustrated
indexdate	2024-07-20T10:52:43Z
institution	BVB
isbn	9783642139314
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-020716201
oclc_num	699733824
open_access_boolean
owner	DE-11 DE-355 DE-BY-UBR
owner_facet	DE-11 DE-355 DE-BY-UBR
physical	XIII, 116 S. graph. Darst.
publishDate	2010
publishDateSearch	2010
publishDateSort	2010
publisher	Springer
record_format	marc
series	Studies in computational intelligence
series2	Studies in computational intelligence
spelling	Whiteson, Shimon Verfasser aut Adaptive representations for reinforcement learning Shimon Whiteson Berlin ; Heidelberg Springer 2010 XIII, 116 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier Studies in computational intelligence 291 Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd rswk-swf Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 s DE-604 Erscheint auch als Online-Ausgabe Adaptive Representations for Reinforcement Learning Studies in computational intelligence 291 (DE-604)BV020822171 291 text/html http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm Inhaltstext DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis
spellingShingle	Whiteson, Shimon Adaptive representations for reinforcement learning Studies in computational intelligence Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd
subject_GND	(DE-588)4825546-4
title	Adaptive representations for reinforcement learning
title_auth	Adaptive representations for reinforcement learning
title_exact_search	Adaptive representations for reinforcement learning
title_full	Adaptive representations for reinforcement learning Shimon Whiteson
title_fullStr	Adaptive representations for reinforcement learning Shimon Whiteson
title_full_unstemmed	Adaptive representations for reinforcement learning Shimon Whiteson
title_short	Adaptive representations for reinforcement learning
title_sort	adaptive representations for reinforcement learning
topic	Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd
topic_facet	Bestärkendes Lernen Künstliche Intelligenz
url	http://deposit.dnb.de/cgi-bin/dokserv?id=3480844&prov=M&dok%5Fvar=1&dok%5Fext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020716201&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA
volume_link	(DE-604)BV020822171
work_keys_str_mv	AT whitesonshimon adaptiverepresentationsforreinforcementlearning

Availability

No print copy is available.

Order via interlibrary loan. Note: not in the THWS collection.
