Recent advances in reinforcement learning: 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers
Other authors: | |
---|---|
Format: | Conference proceedings, book |
Language: | English |
Published: |
Berlin [et al.]
Springer
2012
|
Series: | Lecture notes in computer science
7188 : Lecture notes in artificial intelligence |
Subjects: | |
Online access: | Content text, table of contents |
Description: | XIII, 344 pp., illustrations, diagrams |
ISBN: | 3642299458 9783642299452 |
Internal format
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV040231762 | ||
003 | DE-604 | ||
005 | 20120619 | ||
007 | t| | ||
008 | 120604s2012 gw ad|| |||| 10||| eng d | ||
015 | |a 12,N15 |2 dnb | ||
016 | 7 | |a 1021362581 |2 DE-101 | |
020 | |a 3642299458 |9 3-642-29945-8 | ||
020 | |a 9783642299452 |c Pb. : ca. EUR 57.78 (DE) (freier Pr.), ca. EUR 59.40 (AT) (freier Pr.), ca. sfr 72.00 (freier Pr.) |9 978-3-642-29945-2 | ||
024 | 3 | |a 9783642299452 | |
028 | 5 | 2 | |a Best.-Nr.: 86095585 |
035 | |a (OCoLC)796258688 | ||
035 | |a (DE-599)DNB1021362581 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a gw |c XA-DE-BE | ||
049 | |a DE-706 |a DE-83 |a DE-91G | ||
082 | 0 | |a 006.31 |2 22/ger | |
084 | |a 004 |2 sdnb | ||
245 | 1 | 0 | |a Recent advances in reinforcement learning |b 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers |c Scott Sanner ... (eds.) |
264 | 1 | |a Berlin [u.a.] |b Springer |c 2012 | |
300 | |a XIII, 344 S. |b Ill., graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Lecture notes in computer science |v 7188 : Lecture notes in artificial intelligence | |
650 | 0 | 7 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |2 gnd |9 rswk-swf |
655 | 7 | |0 (DE-588)1071861417 |a Konferenzschrift |y 2011 |z Athen |2 gnd-content | |
689 | 0 | 0 | |a Bestärkendes Lernen |g Künstliche Intelligenz |0 (DE-588)4825546-4 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Sanner, Scott |4 edt | |
711 | 2 | |a EWRL |n 9 |d 2011 |c Athen |j Sonstige |0 (DE-588)1022897055 |4 oth | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-3-642-29946-9 |
830 | 0 | |a Lecture notes in computer science |v 7188 : Lecture notes in artificial intelligence |w (DE-604)BV000000607 |9 7188 | |
856 | 4 | 2 | |m X:MVB |q text/html |u http://deposit.dnb.de/cgi-bin/dokserv?id=4004433&prov=M&dok_var=1&dok_ext=htm |3 Inhaltstext |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025088168&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-025088168 |
Record in the search index
_version_ | 1820875387256700928 |
---|---|
adam_text |
TABLE OF CONTENTS
INVITED TALK ABSTRACTS
INVITED TALK: UCRL AND AUTONOMOUS EXPLORATION 1
PETER AUER
INVITED TALK: INCREASING REPRESENTATIONAL POWER AND SCALING INFERENCE IN REINFORCEMENT LEARNING 2
KRISTIAN KERSTING
INVITED TALK: PRISM - PRACTICAL RL: REPRESENTATION, INTERACTION, SYNTHESIS, AND MORTALITY 3
PETER STONE
INVITED TALK: TOWARDS ROBUST REINFORCEMENT LEARNING ALGORITHMS 4
CSABA SZEPESVÁRI
ONLINE REINFORCEMENT LEARNING
AUTOMATIC DISCOVERY OF RANKING FORMULAS FOR PLAYING WITH MULTI-ARMED BANDITS 5
FRANCIS MAES, LOUIS WEHENKEL, AND DAMIEN ERNST
GOAL-DIRECTED ONLINE LEARNING OF PREDICTIVE MODELS 18
SYLVIE C. W. ONG, YURI GRINBERG, AND JOELLE PINEAU
GRADIENT BASED ALGORITHMS WITH LOSS FUNCTIONS AND KERNELS FOR IMPROVED ON-POLICY CONTROL 30
MATTHEW ROBARDS AND PETER SUNEHAG
LEARNING AND EXPLORING MDPS
ACTIVE LEARNING OF MDP MODELS 42
MAURICIO ARAYA-LOPEZ, OLIVIER BUFFET, VINCENT THOMAS, AND FRANÇOIS CHARPILLET
HANDLING AMBIGUOUS EFFECTS IN ACTION LEARNING 54
BORIS LESNER AND BRUNO ZANUTTINI
FEATURE REINFORCEMENT LEARNING IN PRACTICE 66
PHUONG NGUYEN, PETER SUNEHAG, AND MARCUS HUTTER
FUNCTION APPROXIMATION METHODS FOR REINFORCEMENT LEARNING
REINFORCEMENT LEARNING WITH A BILINEAR Q FUNCTION 78
CHARLES ELKAN
ℓ1-PENALIZED PROJECTED BELLMAN RESIDUAL 89
MATTHIEU GEIST AND BRUNO SCHERRER
REGULARIZED LEAST SQUARES TEMPORAL DIFFERENCE LEARNING WITH NESTED ℓ2 AND ℓ1 PENALIZATION 102
MATTHEW W. HOFFMAN, ALESSANDRO LAZARIC, MOHAMMAD GHAVAMZADEH, AND RÉMI MUNOS
RECURSIVE LEAST-SQUARES LEARNING WITH ELIGIBILITY TRACES 115
BRUNO SCHERRER AND MATTHIEU GEIST
VALUE FUNCTION APPROXIMATION THROUGH SPARSE BAYESIAN MODELING 128
NIKOLAOS TZIORTZIOTIS AND KONSTANTINOS BLEKAS
MACRO-ACTIONS IN REINFORCEMENT LEARNING
AUTOMATIC CONSTRUCTION OF TEMPORALLY EXTENDED ACTIONS FOR MDPS USING BISIMULATION METRICS 140
PABLO SAMUEL CASTRO AND DOINA PRECUP
UNIFIED INTER AND INTRA OPTIONS LEARNING USING POLICY GRADIENT METHODS 153
KFIR Y. LEVY AND NAHUM SHIMKIN
OPTIONS WITH EXCEPTIONS 165
MUNU SAIRAMESH AND BALARAMAN RAVINDRAN
POLICY SEARCH AND BOUNDS
ROBUST BAYESIAN REINFORCEMENT LEARNING THROUGH TIGHT LOWER BOUNDS 177
CHRISTOS DIMITRAKAKIS
OPTIMIZED LOOK-AHEAD TREE SEARCH POLICIES 189
FRANCIS MAES, LOUIS WEHENKEL, AND DAMIEN ERNST
A FRAMEWORK FOR COMPUTING BOUNDS FOR THE RETURN OF A POLICY 201
COSMIN PADURARU, DOINA PRECUP, AND JOELLE PINEAU
MULTI-TASK AND TRANSFER REINFORCEMENT LEARNING
TRANSFERRING EVOLVED RESERVOIR FEATURES IN REINFORCEMENT LEARNING TASKS 213
KYRIAKOS C. CHATZIDIMITRIOU, IOANNIS PARTALAS, PERICLES A. MITKAS, AND IOANNIS VLAHAVAS
TRANSFER LEARNING VIA MULTIPLE INTER-TASK MAPPINGS 225
ANESTIS FACHANTIDIS, IOANNIS PARTALAS, MATTHEW E. TAYLOR, AND IOANNIS VLAHAVAS
MULTI-TASK REINFORCEMENT LEARNING: SHAPING AND FEATURE SELECTION 237
MATTHIJS SNEL AND SHIMON WHITESON
MULTI-AGENT REINFORCEMENT LEARNING
TRANSFER LEARNING IN MULTI-AGENT REINFORCEMENT LEARNING DOMAINS 249
GEORGIOS BOUTSIOUKIS, IOANNIS PARTALAS, AND IOANNIS VLAHAVAS
AN EXTENSION OF A HIERARCHICAL REINFORCEMENT LEARNING ALGORITHM FOR MULTIAGENT SETTINGS 261
IOANNIS LAMBROU, VASSILIS VASSILIADES, AND CHRIS CHRISTODOULOU
APPRENTICESHIP AND INVERSE REINFORCEMENT LEARNING
BAYESIAN MULTITASK INVERSE REINFORCEMENT LEARNING 273
CHRISTOS DIMITRAKAKIS AND CONSTANTIN A. ROTHKOPF
BATCH, OFF-POLICY AND MODEL-FREE APPRENTICESHIP LEARNING 285
EDOUARD KLEIN, MATTHIEU GEIST, AND OLIVIER PIETQUIN
REAL-WORLD REINFORCEMENT LEARNING
INTRODUCTION OF FIXED MODE STATES INTO ONLINE PROFIT SHARING AND ITS APPLICATION TO WAIST TRAJECTORY GENERATION OF BIPED ROBOT 297
SEIYA KURODA, KAZUTERU MIYAZAKI, AND HIROAKI KOBAYASHI
MAPREDUCE FOR PARALLEL REINFORCEMENT LEARNING 309
YUXI LI AND DALE SCHUURMANS
COMPOUND REINFORCEMENT LEARNING: THEORY AND AN APPLICATION TO FINANCE 321
TOHGOROH MATSUI, TAKASHI GOTO, KIYOSHI IZUMI, AND YU CHEN
PROPOSAL AND EVALUATION OF THE ACTIVE COURSE CLASSIFICATION SUPPORT SYSTEM WITH EXPLOITATION-ORIENTED LEARNING 333
KAZUTERU MIYAZAKI AND MASAAKI IDA
AUTHOR INDEX 345 |
any_adam_object | 1 |
author2 | Sanner, Scott |
author2_role | edt |
author2_variant | s s ss |
author_facet | Sanner, Scott |
building | Verbundindex |
bvnumber | BV040231762 |
classification_rvk | SS 4800 |
ctrlnum | (OCoLC)796258688 (DE-599)DNB1021362581 |
dewey-full | 006.31 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.31 |
dewey-search | 006.31 |
dewey-sort | 16.31 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Conference Proceeding Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 cb4500</leader><controlfield tag="001">BV040231762</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20120619</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">120604s2012 gw ad|| |||| 10||| eng d</controlfield><datafield tag="015" ind1=" " ind2=" "><subfield code="a">12,N15</subfield><subfield code="2">dnb</subfield></datafield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">1021362581</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">3642299458</subfield><subfield code="9">3-642-29945-8</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642299452</subfield><subfield code="c">Pb. : ca. EUR 57.78 (DE) (freier Pr.), ca. EUR 59.40 (AT) (freier Pr.), ca. sfr 72.00 (freier Pr.)</subfield><subfield code="9">978-3-642-29945-2</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9783642299452</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">Best.-Nr.: 86095585</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)796258688</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB1021362581</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">XA-DE-BE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-706</subfield><subfield code="a">DE-83</subfield><subfield code="a">DE-91G</subfield></datafield><datafield tag="082" 
ind1="0" ind2=" "><subfield code="a">006.31</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">004</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Recent advances in reinforcement learning</subfield><subfield code="b">9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers</subfield><subfield code="c">Scott Sanner ... (eds.)</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin [u.a.]</subfield><subfield code="b">Springer</subfield><subfield code="c">2012</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XIII, 344 S.</subfield><subfield code="b">Ill., graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Lecture notes in computer science</subfield><subfield code="v">7188 : Lecture notes in artificial intelligence</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="655" ind1=" " ind2="7"><subfield code="0">(DE-588)1071861417</subfield><subfield code="a">Konferenzschrift</subfield><subfield code="y">2011</subfield><subfield code="z">Athen</subfield><subfield code="2">gnd-content</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Bestärkendes 
Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sanner, Scott</subfield><subfield code="4">edt</subfield></datafield><datafield tag="711" ind1="2" ind2=" "><subfield code="a">EWRL</subfield><subfield code="n">9</subfield><subfield code="d">2011</subfield><subfield code="c">Athen</subfield><subfield code="j">Sonstige</subfield><subfield code="0">(DE-588)1022897055</subfield><subfield code="4">oth</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-3-642-29946-9</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Lecture notes in computer science</subfield><subfield code="v">7188 : Lecture notes in artificial intelligence</subfield><subfield code="w">(DE-604)BV000000607</subfield><subfield code="9">7188</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">X:MVB</subfield><subfield code="q">text/html</subfield><subfield code="u">http://deposit.dnb.de/cgi-bin/dokserv?id=4004433&prov=M&dok_var=1&dok_ext=htm</subfield><subfield code="3">Inhaltstext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025088168&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-025088168</subfield></datafield></record></collection> |
genre | (DE-588)1071861417 Konferenzschrift 2011 Athen gnd-content |
genre_facet | Konferenzschrift 2011 Athen |
id | DE-604.BV040231762 |
illustrated | Illustrated |
indexdate | 2025-01-10T15:14:49Z |
institution | BVB |
institution_GND | (DE-588)1022897055 |
isbn | 3642299458 9783642299452 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-025088168 |
oclc_num | 796258688 |
open_access_boolean | |
owner | DE-706 DE-83 DE-91G DE-BY-TUM |
owner_facet | DE-706 DE-83 DE-91G DE-BY-TUM |
physical | XIII, 344 S. Ill., graph. Darst. |
publishDate | 2012 |
publishDateSearch | 2012 |
publishDateSort | 2012 |
publisher | Springer |
record_format | marc |
series | Lecture notes in computer science |
series2 | Lecture notes in computer science |
spelling | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers Scott Sanner ... (eds.) Berlin [u.a.] Springer 2012 XIII, 344 S. Ill., graph. Darst. txt rdacontent n rdamedia nc rdacarrier Lecture notes in computer science 7188 : Lecture notes in artificial intelligence Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd rswk-swf (DE-588)1071861417 Konferenzschrift 2011 Athen gnd-content Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 s DE-604 Sanner, Scott edt EWRL 9 2011 Athen Sonstige (DE-588)1022897055 oth Erscheint auch als Online-Ausgabe 978-3-642-29946-9 Lecture notes in computer science 7188 : Lecture notes in artificial intelligence (DE-604)BV000000607 7188 X:MVB text/html http://deposit.dnb.de/cgi-bin/dokserv?id=4004433&prov=M&dok_var=1&dok_ext=htm Inhaltstext DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025088168&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers Lecture notes in computer science Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd |
subject_GND | (DE-588)4825546-4 (DE-588)1071861417 |
title | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers |
title_auth | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers |
title_exact_search | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers |
title_full | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers Scott Sanner ... (eds.) |
title_fullStr | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers Scott Sanner ... (eds.) |
title_full_unstemmed | Recent advances in reinforcement learning 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers Scott Sanner ... (eds.) |
title_short | Recent advances in reinforcement learning |
title_sort | recent advances in reinforcement learning 9th european workshop ewrl 2011 athens greece september 9 11 2011 revised selected papers |
title_sub | 9th european workshop, EWRL 2011, Athens, Greece, September 9 - 11, 2011 ; revised selected papers |
topic | Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd |
topic_facet | Bestärkendes Lernen Künstliche Intelligenz Konferenzschrift 2011 Athen |
url | http://deposit.dnb.de/cgi-bin/dokserv?id=4004433&prov=M&dok_var=1&dok_ext=htm http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=025088168&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV000000607 |
work_keys_str_mv | AT sannerscott recentadvancesinreinforcementlearning9theuropeanworkshopewrl2011athensgreeceseptember9112011revisedselectedpapers AT ewrlathen recentadvancesinreinforcementlearning9theuropeanworkshopewrl2011athensgreeceseptember9112011revisedselectedpapers |