Risk and reinforcement learning: concepts and dynamic programming
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Buch |
Sprache: | German |
Veröffentlicht: |
Bremen
ZKW
1994
|
Schriftenreihe: | ZKW-Bericht
1994,8 |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | 70 S. graph. Darst. |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV010115813 | ||
003 | DE-604 | ||
005 | 19951018 | ||
007 | t | ||
008 | 950320s1994 gw d||| |||| 00||| ger d | ||
016 | 7 | |a 943685508 |2 DE-101 | |
035 | |a (OCoLC)75597568 | ||
035 | |a (DE-599)BVBBV010115813 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a ger | |
044 | |a gw |c DE | ||
049 | |a DE-12 |a DE-91G | ||
100 | 1 | |a Heger, Matthias |e Verfasser |4 aut | |
245 | 1 | 0 | |a Risk and reinforcement learning |b concepts and dynamic programming |c Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen |
264 | 1 | |a Bremen |b ZKW |c 1994 | |
300 | |a 70 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a ZKW-Bericht |v 1994,8 | |
650 | 0 | 7 | |a Risikoverhalten |0 (DE-588)4050133-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Verstärkung |0 (DE-588)4130203-5 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Lernen |0 (DE-588)4035408-8 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Mathematisches Modell |0 (DE-588)4114528-8 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Risikoverhalten |0 (DE-588)4050133-4 |D s |
689 | 0 | 1 | |a Lernen |0 (DE-588)4035408-8 |D s |
689 | 0 | 2 | |a Verstärkung |0 (DE-588)4130203-5 |D s |
689 | 0 | 3 | |a Mathematisches Modell |0 (DE-588)4114528-8 |D s |
689 | 0 | |5 DE-604 | |
830 | 0 | |a ZKW-Bericht |v 1994,8 |w (DE-604)BV010115820 |9 1994,8 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=006716755&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-006716755 |
Datensatz im Suchindex
_version_ | 1807682400278282240 |
---|---|
adam_text |
CONTENTS
1
INTRODUCTION
3
1.1
UNCERTAINTY,
STATE
SPACE
COMPLEXITY
AND
RISK
.
3
1.2
ORGANIZATION
.
5
2
DECISION
MAKING
UNDER
RISK
AND
UNCERTAINTY
7
2.1
RISK
.
7
2.2
UTILITY
THEORY
.
8
2.3
DECISION
CRITERIA
.
10
3
ALPHA-VALUES
AND
DECISION
OPERATORS
11
3.1
ELEMENTS
OF
PROBABILITY
THEORY
.
11
3.2
ALPHA-SETS
.
16
3.3
ALPHA-VALUES
.
20
3.4
DECISION
OPERATORS
.
27
4
RISK-SENSITIVE
MARKOV
DECISION
TASKS
31
4.1
EPISODIC
MARKOV
DECISION
PROCESS
(EMDP)
.
31
4.2
THE
USUAL
MEASURE
OF
PERFORMANCE
FOR
POLICIES
.
39
4.3
DRAWBACKS
.
41
4.4
DO-OPTIMALITY
.
45
4.5
COMPLEXITY
IN
DIFFERENT
RISK-SENSITIVITIES
.
48
5
DYNAMIC
PROGRAMMING
FOR
MINIMAX-BASED
TASKS
51
5.1
RECURRENCE
RELATIONS
.
52
5.2
DYNAMIC
PROGRAMMING
OPERATORS
.
61
5.3
THE
BELLMAN
EQUATION
.
63
5.4
SUMMARY
AND
DISCUSSION
.
65
CONCLUSIONS
67
ACKNOWLEDGEMENTS
68
BIBLIOGRAPHY
69 |
any_adam_object | 1 |
author | Heger, Matthias |
author_facet | Heger, Matthias |
author_role | aut |
author_sort | Heger, Matthias |
author_variant | m h mh |
building | Verbundindex |
bvnumber | BV010115813 |
ctrlnum | (OCoLC)75597568 (DE-599)BVBBV010115813 |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>00000nam a2200000 cb4500</leader><controlfield tag="001">BV010115813</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">19951018</controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">950320s1994 gw d||| |||| 00||| ger d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">943685508</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)75597568</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV010115813</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">ger</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-12</subfield><subfield code="a">DE-91G</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Heger, Matthias</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Risk and reinforcement learning</subfield><subfield code="b">concepts and dynamic programming</subfield><subfield code="c">Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Bremen</subfield><subfield code="b">ZKW</subfield><subfield code="c">1994</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">70 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">ZKW-Bericht</subfield><subfield code="v">1994,8</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Risikoverhalten</subfield><subfield code="0">(DE-588)4050133-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Verstärkung</subfield><subfield code="0">(DE-588)4130203-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Lernen</subfield><subfield code="0">(DE-588)4035408-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Mathematisches Modell</subfield><subfield code="0">(DE-588)4114528-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Risikoverhalten</subfield><subfield code="0">(DE-588)4050133-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Lernen</subfield><subfield code="0">(DE-588)4035408-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="2"><subfield code="a">Verstärkung</subfield><subfield code="0">(DE-588)4130203-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="3"><subfield code="a">Mathematisches Modell</subfield><subfield code="0">(DE-588)4114528-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">ZKW-Bericht</subfield><subfield code="v">1994,8</subfield><subfield code="w">(DE-604)BV010115820</subfield><subfield code="9">1994,8</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=006716755&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-006716755</subfield></datafield></record></collection> |
id | DE-604.BV010115813 |
illustrated | Illustrated |
indexdate | 2024-08-18T00:17:56Z |
institution | BVB |
language | German |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-006716755 |
oclc_num | 75597568 |
open_access_boolean | |
owner | DE-12 DE-91G DE-BY-TUM |
owner_facet | DE-12 DE-91G DE-BY-TUM |
physical | 70 S. graph. Darst. |
publishDate | 1994 |
publishDateSearch | 1994 |
publishDateSort | 1994 |
publisher | ZKW |
record_format | marc |
series | ZKW-Bericht |
series2 | ZKW-Bericht |
spelling | Heger, Matthias Verfasser aut Risk and reinforcement learning concepts and dynamic programming Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen Bremen ZKW 1994 70 S. graph. Darst. txt rdacontent n rdamedia nc rdacarrier ZKW-Bericht 1994,8 Risikoverhalten (DE-588)4050133-4 gnd rswk-swf Verstärkung (DE-588)4130203-5 gnd rswk-swf Lernen (DE-588)4035408-8 gnd rswk-swf Mathematisches Modell (DE-588)4114528-8 gnd rswk-swf Risikoverhalten (DE-588)4050133-4 s Lernen (DE-588)4035408-8 s Verstärkung (DE-588)4130203-5 s Mathematisches Modell (DE-588)4114528-8 s DE-604 ZKW-Bericht 1994,8 (DE-604)BV010115820 1994,8 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=006716755&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Heger, Matthias Risk and reinforcement learning concepts and dynamic programming ZKW-Bericht Risikoverhalten (DE-588)4050133-4 gnd Verstärkung (DE-588)4130203-5 gnd Lernen (DE-588)4035408-8 gnd Mathematisches Modell (DE-588)4114528-8 gnd |
subject_GND | (DE-588)4050133-4 (DE-588)4130203-5 (DE-588)4035408-8 (DE-588)4114528-8 |
title | Risk and reinforcement learning concepts and dynamic programming |
title_auth | Risk and reinforcement learning concepts and dynamic programming |
title_exact_search | Risk and reinforcement learning concepts and dynamic programming |
title_full | Risk and reinforcement learning concepts and dynamic programming Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen |
title_fullStr | Risk and reinforcement learning concepts and dynamic programming Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen |
title_full_unstemmed | Risk and reinforcement learning concepts and dynamic programming Matthias Heger. Zentrum für Kognitionswissenschaften, Universität Bremen |
title_short | Risk and reinforcement learning |
title_sort | risk and reinforcement learning concepts and dynamic programming |
title_sub | concepts and dynamic programming |
topic | Risikoverhalten (DE-588)4050133-4 gnd Verstärkung (DE-588)4130203-5 gnd Lernen (DE-588)4035408-8 gnd Mathematisches Modell (DE-588)4114528-8 gnd |
topic_facet | Risikoverhalten Verstärkung Lernen Mathematisches Modell |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=006716755&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV010115820 |
work_keys_str_mv | AT hegermatthias riskandreinforcementlearningconceptsanddynamicprogramming |