Continuous time Markov decision processes: theory and applications
Main Authors: | Guo, Xianping ; Hernández-Lerma, Onésimo |
---|---|
Format: | Book |
Language: | English |
Published: | Berlin [et al.] : Springer, [2009] |
Series: | Stochastic modelling and applied probability ; 62 |
Subjects: | Markov-Entscheidungsprozess |
Online Access: | Table of contents |
Physical Description: | xvii, 231 pages |
ISBN: | 9783642025464 9783642025471 |
Internal format
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV035683332 | ||
003 | DE-604 | ||
005 | 20220107 | ||
007 | t | ||
008 | 090818s2009 |||| 00||| eng d | ||
016 | 7 | |a 994394683 |2 DE-101 | |
020 | |a 9783642025464 |c Druck |9 978-3-642-02546-4 | ||
020 | |a 9783642025471 |c eISBN |9 978-3-642-02547-1 | ||
035 | |a (OCoLC)502397459 | ||
035 | |a (DE-599)DNB994394683 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-91G |a DE-83 |a DE-11 |a DE-19 |a DE-703 |a DE-188 |a DE-29T |a DE-20 |a DE-824 | ||
082 | 0 | |a 519.233 |2 22/ger | |
084 | |a SK 620 |0 (DE-625)143249: |2 rvk | ||
084 | |a SK 820 |0 (DE-625)143258: |2 rvk | ||
084 | |a 60J27 |2 msc | ||
084 | |a 510 |2 sdnb | ||
084 | |a 93E20 |2 msc | ||
084 | |a MAT 607f |2 stub | ||
084 | |a 90C40 |2 msc | ||
084 | |a MAT 900f |2 stub | ||
100 | 1 | |a Guo, Xianping |0 (DE-588)140541888 |4 aut | |
245 | 1 | 0 | |a Continuous time Markov decision processes |b theory and applications |c Xianping Guo ; Onésimo Hernández-Lerma |
246 | 1 | 3 | |a Continuous-time Markov decision processes |
264 | 1 | |a Berlin [u.a.] |b Springer |c [2009] | |
264 | 4 | |c © 2009 | |
300 | |a xvii, 231 Seite | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Stochastic modelling and applied probability |v 62 | |
650 | 0 | 7 | |a Markov-Entscheidungsprozess |0 (DE-588)4168927-6 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Markov-Entscheidungsprozess |0 (DE-588)4168927-6 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Hernández-Lerma, Onésimo |d 1946- |0 (DE-588)111571081 |4 aut | |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-3-642-02547-1 |
830 | 0 | |a Stochastic modelling and applied probability |v 62 |w (DE-604)BV019623501 |9 62 | |
856 | 4 | 2 | |m DNB Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=017737558&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-017737558 |
Record in the search index
_version_ | 1804139391138922496 |
---|---|
adam_text | CONTENTS
1 INTRODUCTION AND SUMMARY 1
1.1 INTRODUCTION 1
1.2 PRELIMINARY EXAMPLES 1
1.3 SUMMARY OF THE FOLLOWING CHAPTERS 6
2 CONTINUOUS-TIME MARKOV DECISION PROCESSES 9
2.1 INTRODUCTION 9
2.2 THE CONTROL MODEL 10
2.3 CONTINUOUS-TIME MARKOV DECISION PROCESSES 13
2.4 BASIC OPTIMALITY CRITERIA 16
3 AVERAGE OPTIMALITY FOR FINITE MODELS 19
3.1 INTRODUCTION 19
3.2 N-BIAS OPTIMALITY CRITERIA 20
3.3 DIFFERENCE FORMULAS OF N-BIASES 23
3.4 CHARACTERIZATION OF N-BIAS POLICIES 29
3.5 COMPUTATION OF N-BIAS OPTIMAL POLICIES 36
3.5.1 THE POLICY ITERATION ALGORITHM FOR AVERAGE OPTIMALITY 36
3.5.2 THE 0-BIAS POLICY ITERATION ALGORITHM 39
3.5.3 N-BIAS POLICY ITERATION ALGORITHMS 43
3.6 THE LINEAR PROGRAMMING APPROACH 46
3.6.1 LINEAR PROGRAMMING FOR ERGODIC MODELS 46
3.6.2 LINEAR PROGRAMMING FOR MULTICHAIN MODELS 49
3.7 NOTES 52
4 DISCOUNT OPTIMALITY FOR NONNEGATIVE COSTS 55
4.1 INTRODUCTION 55
4.2 THE NONNEGATIVE MODEL 55
4.3 PRELIMINARIES 56
4.4 THE DISCOUNTED COST OPTIMALITY EQUATION 60
4.5 EXISTENCE OF OPTIMAL POLICIES 63
4.6 APPROXIMATION RESULTS 63
4.7 THE POLICY ITERATION APPROACH 66
4.8 EXAMPLES 68
4.9 NOTES 69
5 AVERAGE OPTIMALITY FOR NONNEGATIVE COSTS 71
5.1 INTRODUCTION 71
5.2 THE AVERAGE-COST CRITERION 72
5.3 THE MINIMUM NONNEGATIVE SOLUTION APPROACH 73
5.4 THE AVERAGE-COST OPTIMALITY INEQUALITY 76
5.5 THE AVERAGE-COST OPTIMALITY EQUATION 80
5.6 EXAMPLES 81
5.7 NOTES 84
6 DISCOUNT OPTIMALITY FOR UNBOUNDED REWARDS 87
6.1 INTRODUCTION 87
6.2 THE DISCOUNTED-REWARD OPTIMALITY EQUATION 89
6.3 DISCOUNT OPTIMAL STATIONARY POLICIES 95
6.4 A VALUE ITERATION ALGORITHM 98
6.5 EXAMPLES 98
6.6 NOTES 102
7 AVERAGE OPTIMALITY FOR UNBOUNDED REWARDS 105
7.1 INTRODUCTION 105
7.2 EXPONENTIAL ERGODICITY CONDITIONS 106
7.3 THE EXISTENCE OF AR OPTIMAL POLICIES 109
7.4 THE POLICY ITERATION ALGORITHM 113
7.5 EXAMPLES 119
7.6 NOTES 124
8 AVERAGE OPTIMALITY FOR PATHWISE REWARDS 127
8.1 INTRODUCTION 127
8.2 THE OPTIMAL CONTROL PROBLEM 129
8.3 OPTIMALITY CONDITIONS AND PRELIMINARIES 129
8.4 THE EXISTENCE OF PAR OPTIMAL POLICIES 131
8.5 POLICY AND VALUE ITERATION ALGORITHMS 138
8.6 AN EXAMPLE 139
8.7 NOTES 142
9 ADVANCED OPTIMALITY CRITERIA 143
9.1 BIAS AND WEAKLY OVERTAKING OPTIMALITY 143
9.
10.2 PRELIMINARIES 164
10.3 COMPUTATION OF THE AVERAGE VARIANCE 164
10.4 VARIANCE MINIMIZATION 170
10.5 EXAMPLES 171
10.6 NOTES 173
11 CONSTRAINED OPTIMALITY FOR DISCOUNT CRITERIA 175
11.1 THE MODEL WITH A CONSTRAINT 175
11.2 PRELIMINARIES 177
11.3 PROOF OF THEOREM 11.4 182
11.4 AN EXAMPLE 184
11.5 NOTES 186
12 CONSTRAINED OPTIMALITY FOR AVERAGE CRITERIA 187
12.1 AVERAGE OPTIMALITY WITH A CONSTRAINT 187
12.2 PRELIMINARIES 188
12.3 PROOF OF THEOREM 12.4 192
12.4 AN EXAMPLE 192
12.5 NOTES 194
A 195
A.1 LIMIT THEOREMS 195
A.2 RESULTS FROM MEASURE THEORY 197
B 203
B.1 CONTINUOUS-TIME MARKOV CHAINS 203
B.2 STATIONARY DISTRIBUTIONS AND ERGODICITY 206
C 209
C.1 THE CONSTRUCTION OF TRANSITION FUNCTIONS 209
C.2 ERGODICITY BASED ON THE Q-MATRIX 214
C.3 DYNKIN'S FORMULA 218
REFERENCES 221
INDEX 229
|
any_adam_object | 1 |
author | Guo, Xianping Hernández-Lerma, Onésimo 1946- |
author_GND | (DE-588)140541888 (DE-588)111571081 |
author_facet | Guo, Xianping Hernández-Lerma, Onésimo 1946- |
author_role | aut aut |
author_sort | Guo, Xianping |
author_variant | x g xg o h l ohl |
building | Verbundindex |
bvnumber | BV035683332 |
classification_rvk | SK 620 SK 820 |
classification_tum | MAT 607f MAT 900f |
ctrlnum | (OCoLC)502397459 (DE-599)DNB994394683 |
dewey-full | 519.233 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.233 |
dewey-search | 519.233 |
dewey-sort | 3519.233 |
dewey-tens | 510 - Mathematics |
discipline | Mathematik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02004nam a2200505 cb4500</leader><controlfield tag="001">BV035683332</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220107 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">090818s2009 |||| 00||| eng d</controlfield><datafield tag="016" ind1="7" ind2=" "><subfield code="a">994394683</subfield><subfield code="2">DE-101</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642025464</subfield><subfield code="c">Druck</subfield><subfield code="9">978-3-642-02546-4</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642025471</subfield><subfield code="c">eISBN</subfield><subfield code="9">978-3-642-02547-1</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)502397459</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)DNB994394683</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91G</subfield><subfield code="a">DE-83</subfield><subfield code="a">DE-11</subfield><subfield code="a">DE-19</subfield><subfield code="a">DE-703</subfield><subfield code="a">DE-188</subfield><subfield code="a">DE-29T</subfield><subfield code="a">DE-20</subfield><subfield code="a">DE-824</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">519.233</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 620</subfield><subfield code="0">(DE-625)143249:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" 
ind1=" " ind2=" "><subfield code="a">SK 820</subfield><subfield code="0">(DE-625)143258:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">60J27</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">510</subfield><subfield code="2">sdnb</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">93E20</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MAT 607f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">90C40</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MAT 900f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Guo, Xianping</subfield><subfield code="0">(DE-588)140541888</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Continuous time Markov decision processes</subfield><subfield code="b">theory and applications</subfield><subfield code="c">Xianping Guo ; Onésimo Hernández-Lerma</subfield></datafield><datafield tag="246" ind1="1" ind2="3"><subfield code="a">Continuous-time Markov decision processes</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin [u.a.]</subfield><subfield code="b">Springer</subfield><subfield code="c">[2009]</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">© 2009</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xvii, 231 Seite</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield 
code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Stochastic modelling and applied probability</subfield><subfield code="v">62</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Markov-Entscheidungsprozess</subfield><subfield code="0">(DE-588)4168927-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Markov-Entscheidungsprozess</subfield><subfield code="0">(DE-588)4168927-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Hernández-Lerma, Onésimo</subfield><subfield code="d">1946-</subfield><subfield code="0">(DE-588)111571081</subfield><subfield code="4">aut</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-3-642-02547-1</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Stochastic modelling and applied probability</subfield><subfield code="v">62</subfield><subfield code="w">(DE-604)BV019623501</subfield><subfield code="9">62</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=017737558&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-017737558</subfield></datafield></record></collection> 
|
id | DE-604.BV035683332 |
illustrated | Not Illustrated |
indexdate | 2024-07-09T21:43:20Z |
institution | BVB |
isbn | 9783642025464 9783642025471 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-017737558 |
oclc_num | 502397459 |
open_access_boolean | |
owner | DE-91G DE-BY-TUM DE-83 DE-11 DE-19 DE-BY-UBM DE-703 DE-188 DE-29T DE-20 DE-824 |
owner_facet | DE-91G DE-BY-TUM DE-83 DE-11 DE-19 DE-BY-UBM DE-703 DE-188 DE-29T DE-20 DE-824 |
physical | xvii, 231 Seite |
publishDate | 2009 |
publishDateSearch | 2009 |
publishDateSort | 2009 |
publisher | Springer |
record_format | marc |
series | Stochastic modelling and applied probability |
series2 | Stochastic modelling and applied probability |
spelling | Guo, Xianping (DE-588)140541888 aut Continuous time Markov decision processes theory and applications Xianping Guo ; Onésimo Hernández-Lerma Continuous-time Markov decision processes Berlin [u.a.] Springer [2009] © 2009 xvii, 231 Seite txt rdacontent n rdamedia nc rdacarrier Stochastic modelling and applied probability 62 Markov-Entscheidungsprozess (DE-588)4168927-6 gnd rswk-swf Markov-Entscheidungsprozess (DE-588)4168927-6 s DE-604 Hernández-Lerma, Onésimo 1946- (DE-588)111571081 aut Erscheint auch als Online-Ausgabe 978-3-642-02547-1 Stochastic modelling and applied probability 62 (DE-604)BV019623501 62 DNB Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=017737558&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Guo, Xianping Hernández-Lerma, Onésimo 1946- Continuous time Markov decision processes theory and applications Stochastic modelling and applied probability Markov-Entscheidungsprozess (DE-588)4168927-6 gnd |
subject_GND | (DE-588)4168927-6 |
title | Continuous time Markov decision processes theory and applications |
title_alt | Continuous-time Markov decision processes |
title_auth | Continuous time Markov decision processes theory and applications |
title_exact_search | Continuous time Markov decision processes theory and applications |
title_full | Continuous time Markov decision processes theory and applications Xianping Guo ; Onésimo Hernández-Lerma |
title_fullStr | Continuous time Markov decision processes theory and applications Xianping Guo ; Onésimo Hernández-Lerma |
title_full_unstemmed | Continuous time Markov decision processes theory and applications Xianping Guo ; Onésimo Hernández-Lerma |
title_short | Continuous time Markov decision processes |
title_sort | continuous time markov decision processes theory and applications |
title_sub | theory and applications |
topic | Markov-Entscheidungsprozess (DE-588)4168927-6 gnd |
topic_facet | Markov-Entscheidungsprozess |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=017737558&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV019623501 |
work_keys_str_mv | AT guoxianping continuoustimemarkovdecisionprocessestheoryandapplications AT hernandezlermaonesimo continuoustimemarkovdecisionprocessestheoryandapplications AT guoxianping continuoustimemarkovdecisionprocesses AT hernandezlermaonesimo continuoustimemarkovdecisionprocesses |