Further topics on discrete-time Markov control processes:
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Buch |
Sprache: | English |
Veröffentlicht: |
New York ; Berlin ; Heidelberg ; Barcelona ; Hong Kong ; London
Springer
1999
|
Schriftenreihe: | Applications of mathematics
42 |
Schlagworte: | |
Online-Zugang: | Inhaltsverzeichnis |
Beschreibung: | Literaturverz. S. 251 - 261 |
Beschreibung: | XII, 276 S. |
ISBN: | 0387986944 |
Internformat
MARC
LEADER | 00000nam a2200000 cb4500 | ||
---|---|---|---|
001 | BV012782993 | ||
003 | DE-604 | ||
005 | 20230302 | ||
007 | t | ||
008 | 990921s1999 gw |||| 00||| eng d | ||
020 | |a 0387986944 |c Pp. : DM 119.00 |9 0-387-98694-4 | ||
035 | |a (OCoLC)40654817 | ||
035 | |a (DE-599)BVBBV012782993 | ||
040 | |a DE-604 |b ger |e rakddb | ||
041 | 0 | |a eng | |
044 | |a gw |c DE | ||
049 | |a DE-824 |a DE-703 |a DE-91G |a DE-29T |a DE-634 |a DE-83 |a DE-11 |a DE-188 | ||
050 | 0 | |a QA274.7 | |
082 | 0 | |a 519.2/33 |2 21 | |
084 | |a SK 820 |0 (DE-625)143258: |2 rvk | ||
084 | |a SK 880 |0 (DE-625)143266: |2 rvk | ||
084 | |a SK 970 |0 (DE-625)143276: |2 rvk | ||
084 | |a 93E20 |2 msc | ||
084 | |a MAT 624f |2 stub | ||
084 | |a MAT 605f |2 stub | ||
084 | |a 90C40 |2 msc | ||
084 | |a 49L20 |2 msc | ||
084 | |a 93-02 |2 msc | ||
084 | |a 90C39 |2 msc | ||
100 | 1 | |a Hernández-Lerma, Onésimo |d 1946- |e Verfasser |0 (DE-588)111571081 |4 aut | |
245 | 1 | 0 | |a Further topics on discrete-time Markov control processes |c Onésimo Hernández-Lerma ; Jean Bernard Lasserre |
264 | 1 | |a New York ; Berlin ; Heidelberg ; Barcelona ; Hong Kong ; London |b Springer |c 1999 | |
300 | |a XII, 276 S. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 1 | |a Applications of mathematics |v 42 | |
500 | |a Literaturverz. S. 251 - 261 | ||
650 | 7 | |a Commande, Théorie de la |2 ram | |
650 | 7 | |a Controlesystemen |2 gtt | |
650 | 7 | |a Markov, Processus de |2 ram | |
650 | 7 | |a Markov-processen |2 gtt | |
650 | 7 | |a Numerieke wiskunde |2 gtt | |
650 | 7 | |a Processos de markov |2 larpcal | |
650 | 7 | |a Processos estocásticos |2 larpcal | |
650 | 7 | |a Systèmes échantillonnés |2 ram | |
650 | 4 | |a Control theory | |
650 | 4 | |a Discrete-time systems | |
650 | 4 | |a Markov processes | |
650 | 0 | 7 | |a Markov-Entscheidungsprozess |0 (DE-588)4168927-6 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Diskreter Markov-Prozess |0 (DE-588)4150185-8 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Markov-Entscheidungsprozess |0 (DE-588)4168927-6 |D s |
689 | 0 | 1 | |a Diskreter Markov-Prozess |0 (DE-588)4150185-8 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Lasserre, Jean-Bernard |d 1953- |e Verfasser |0 (DE-588)121290255 |4 aut | |
830 | 0 | |a Applications of mathematics |v 42 |w (DE-604)BV000895226 |9 42 | |
856 | 4 | 2 | |m HBZ Datenaustausch |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008694032&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
999 | |a oai:aleph.bib-bvb.de:BVB01-008694032 |
Datensatz im Suchindex
_version_ | 1804127454157078528 |
---|---|
adam_text | Contents
Preface vii
7 Ergodicity and Poisson s Equation 1
7.1 Introduction 1
7.2 Weighted norms and signed kernels 2
A. Weighted norm spaces 2
B. Signed kernels 3
C. Contraction maps 5
7.3 Recurrence concepts 7
A. Irreducibility and recurrence 7
B. Invariant measures 8
C. Conditions for irreducibility and recurrence 9
D. w Geometric ergodicity 11
7.4 Examples on w geometric ergodicity 17
7.5 Poisson s equation 24
A. The multichain case 26
B. The unichain P.E 31
C. Examples 34
8 Discounted Dynamic Programming with Weighted Norms 39
8.1 Introduction 39
8.2 The control model and control policies 40
8.3 The optimality equation 43
A. Assumptions 44
B. The discounted cost optimality equation 47
x Contents
C. The dynamic programming operator 48
D. Proof of Theorem 8.3.6 51
8.4 Further analysis of value iteration 55
A. Asymptotic discount optimality 56
B. Estimates of VI convergence 57
C. Rolling horizon procedures 58
D. Forecast horizons and elimination
of non optimal actions 59
8.5 The weakly continuous case 65
8.6 Examples 68
8.7 Further remarks 73
9 The Expected Total Cost Criterion 75
9.1 Introduction 75
9.2 Preliminaries 76
A. Extended real numbers 76
B. Integrability 78
9.3 The expected total cost 79
9.4 Occupation measures 84
A. Expected occupation measures 85
B. The sufficiency problem 88
9.5 The optimality equation 93
A. The optimality equation 93
B. Optimality criteria 95
C. Deterministic stationary policies 100
9.6 The transient case 103
A. Transient models 103
B. Optimality conditions 109
C. Reduction to deterministic policies 110
D. The policy iteration algorithm 113
10 Undiscounted Cost Criteria 117
10.1 Introduction 117
A. Undiscounted criteria 117
B. AC criteria 119
C. Outline of the chapter 120
10.2 Preliminaries 120
A. Assumptions 121
B. Corollaries 123
C. Discussion 124
10.3 From AC optimality to undiscounted criteria 126
A. The AC optimality inequality 128
B. The AC optimality equation 129
C. Uniqueness of the ACOE 131
D. Bias optimal policies 132
Contents xi
E. Undiscounted criteria 135
10.4 Proof of Theorem 10.3.1 137
A. Preliminary lemmas 137
B. Completion of the proof 141
10.5 Proof of Theorem 10.3.6 143
A. Proof of part (a) 143
B. Proof of part (b) 146
C. Policy iteration 146
10.6 Proof of Theorem 10.3.7 149
10.7 Proof of Theorem 10.3.10 150
10.8 Proof of Theorem 10.3.11 154
10.9 Examples 156
11 Sample Path Average Cost 163
11.1 Introduction 163
A. Definitions 163
B. Outline of the chapter 166
11.2 Preliminaries 167
A. Positive Harris recurrence 167
B. Limiting average variance 168
11.3 The w geometrically ergodic case 175
A. Optimality in UDS 177
B. Optimality in II 178
C. Variance minimization 180
D. Proof of Theorem 11.3.5 181
E. Proof of Theorem 11.3.8 185
11.4 Strictly unbounded costs 188
11.5 Examples 196
12 The Linear Programming Approach 203
12.1 Introduction 203
A. Outline of the chapter 204
12.2 Preliminaries 205
A. Dual pairs of vector spaces 205
B. Infinite linear programming 212
C. Approximation of linear programs 214
D. Tightness and invariant measures 215
12.3 Linear programs for the AC problem 218
A. The linear programs 219
B. Solvability of (P) 222
C. Absence of duality gap 224
D. The Farkas alternative 226
12.4 Approximating sequences and strong duality 233
A. Minimizing sequences for (P) 233
B. Maximizing sequences for (P*) 234
xii Contents
12.5 Finite LP approximations 238
A. Aggregation 238
B. Aggregation relaxation 239
C. Aggregation relaxion inner approximations 240
12.6 Proof of Theorems 12.5.3, 12.5.5, 12.5.7 242
References 251
Abbreviations 263
Glossary of notation 265
Index 271
|
any_adam_object | 1 |
author | Hernández-Lerma, Onésimo 1946- Lasserre, Jean-Bernard 1953- |
author_GND | (DE-588)111571081 (DE-588)121290255 |
author_facet | Hernández-Lerma, Onésimo 1946- Lasserre, Jean-Bernard 1953- |
author_role | aut aut |
author_sort | Hernández-Lerma, Onésimo 1946- |
author_variant | o h l ohl j b l jbl |
building | Verbundindex |
bvnumber | BV012782993 |
callnumber-first | Q - Science |
callnumber-label | QA274 |
callnumber-raw | QA274.7 |
callnumber-search | QA274.7 |
callnumber-sort | QA 3274.7 |
callnumber-subject | QA - Mathematics |
classification_rvk | SK 820 SK 880 SK 970 |
classification_tum | MAT 624f MAT 605f |
ctrlnum | (OCoLC)40654817 (DE-599)BVBBV012782993 |
dewey-full | 519.2/33 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.2/33 |
dewey-search | 519.2/33 |
dewey-sort | 3519.2 233 |
dewey-tens | 510 - Mathematics |
discipline | Mathematik |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>02557nam a2200661 cb4500</leader><controlfield tag="001">BV012782993</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20230302 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">990921s1999 gw |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">0387986944</subfield><subfield code="c">Pp. : DM 119.00</subfield><subfield code="9">0-387-98694-4</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)40654817</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV012782993</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakddb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">gw</subfield><subfield code="c">DE</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-824</subfield><subfield code="a">DE-703</subfield><subfield code="a">DE-91G</subfield><subfield code="a">DE-29T</subfield><subfield code="a">DE-634</subfield><subfield code="a">DE-83</subfield><subfield code="a">DE-11</subfield><subfield code="a">DE-188</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">QA274.7</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">519.2/33</subfield><subfield code="2">21</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 820</subfield><subfield code="0">(DE-625)143258:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 880</subfield><subfield code="0">(DE-625)143266:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SK 970</subfield><subfield code="0">(DE-625)143276:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">93E20</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MAT 624f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">MAT 605f</subfield><subfield code="2">stub</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">90C40</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">49L20</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">93-02</subfield><subfield code="2">msc</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">90C39</subfield><subfield code="2">msc</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Hernández-Lerma, Onésimo</subfield><subfield code="d">1946-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)111571081</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Further topics on discrete-time Markov control processes</subfield><subfield code="c">Onésimo Hernández-Lerma ; Jean Bernard Lasserre</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">New York ; Berlin ; Heidelberg ; Barcelona ; Hong Kong ; London</subfield><subfield code="b">Springer</subfield><subfield code="c">1999</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XII, 276 S.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Applications of mathematics</subfield><subfield code="v">42</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Literaturverz. S. 251 - 261</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Commande, Théorie de la</subfield><subfield code="2">ram</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Controlesystemen</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Markov, Processus de</subfield><subfield code="2">ram</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Markov-processen</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Numerieke wiskunde</subfield><subfield code="2">gtt</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Processos de markov</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Processos estocásticos</subfield><subfield code="2">larpcal</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">Systèmes échantillonnés</subfield><subfield code="2">ram</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Control theory</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Discrete-time systems</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Markov processes</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Markov-Entscheidungsprozess</subfield><subfield code="0">(DE-588)4168927-6</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Diskreter Markov-Prozess</subfield><subfield code="0">(DE-588)4150185-8</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Markov-Entscheidungsprozess</subfield><subfield code="0">(DE-588)4168927-6</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Diskreter Markov-Prozess</subfield><subfield code="0">(DE-588)4150185-8</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Lasserre, Jean-Bernard</subfield><subfield code="d">1953-</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)121290255</subfield><subfield code="4">aut</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Applications of mathematics</subfield><subfield code="v">42</subfield><subfield code="w">(DE-604)BV000895226</subfield><subfield code="9">42</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008694032&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-008694032</subfield></datafield></record></collection> |
id | DE-604.BV012782993 |
illustrated | Not Illustrated |
indexdate | 2024-07-09T18:33:36Z |
institution | BVB |
isbn | 0387986944 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-008694032 |
oclc_num | 40654817 |
open_access_boolean | |
owner | DE-824 DE-703 DE-91G DE-BY-TUM DE-29T DE-634 DE-83 DE-11 DE-188 |
owner_facet | DE-824 DE-703 DE-91G DE-BY-TUM DE-29T DE-634 DE-83 DE-11 DE-188 |
physical | XII, 276 S. |
publishDate | 1999 |
publishDateSearch | 1999 |
publishDateSort | 1999 |
publisher | Springer |
record_format | marc |
series | Applications of mathematics |
series2 | Applications of mathematics |
spelling | Hernández-Lerma, Onésimo 1946- Verfasser (DE-588)111571081 aut Further topics on discrete-time Markov control processes Onésimo Hernández-Lerma ; Jean Bernard Lasserre New York ; Berlin ; Heidelberg ; Barcelona ; Hong Kong ; London Springer 1999 XII, 276 S. txt rdacontent n rdamedia nc rdacarrier Applications of mathematics 42 Literaturverz. S. 251 - 261 Commande, Théorie de la ram Controlesystemen gtt Markov, Processus de ram Markov-processen gtt Numerieke wiskunde gtt Processos de markov larpcal Processos estocásticos larpcal Systèmes échantillonnés ram Control theory Discrete-time systems Markov processes Markov-Entscheidungsprozess (DE-588)4168927-6 gnd rswk-swf Diskreter Markov-Prozess (DE-588)4150185-8 gnd rswk-swf Markov-Entscheidungsprozess (DE-588)4168927-6 s Diskreter Markov-Prozess (DE-588)4150185-8 s DE-604 Lasserre, Jean-Bernard 1953- Verfasser (DE-588)121290255 aut Applications of mathematics 42 (DE-604)BV000895226 42 HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008694032&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis |
spellingShingle | Hernández-Lerma, Onésimo 1946- Lasserre, Jean-Bernard 1953- Further topics on discrete-time Markov control processes Applications of mathematics Commande, Théorie de la ram Controlesystemen gtt Markov, Processus de ram Markov-processen gtt Numerieke wiskunde gtt Processos de markov larpcal Processos estocásticos larpcal Systèmes échantillonnés ram Control theory Discrete-time systems Markov processes Markov-Entscheidungsprozess (DE-588)4168927-6 gnd Diskreter Markov-Prozess (DE-588)4150185-8 gnd |
subject_GND | (DE-588)4168927-6 (DE-588)4150185-8 |
title | Further topics on discrete-time Markov control processes |
title_auth | Further topics on discrete-time Markov control processes |
title_exact_search | Further topics on discrete-time Markov control processes |
title_full | Further topics on discrete-time Markov control processes Onésimo Hernández-Lerma ; Jean Bernard Lasserre |
title_fullStr | Further topics on discrete-time Markov control processes Onésimo Hernández-Lerma ; Jean Bernard Lasserre |
title_full_unstemmed | Further topics on discrete-time Markov control processes Onésimo Hernández-Lerma ; Jean Bernard Lasserre |
title_short | Further topics on discrete-time Markov control processes |
title_sort | further topics on discrete time markov control processes |
topic | Commande, Théorie de la ram Controlesystemen gtt Markov, Processus de ram Markov-processen gtt Numerieke wiskunde gtt Processos de markov larpcal Processos estocásticos larpcal Systèmes échantillonnés ram Control theory Discrete-time systems Markov processes Markov-Entscheidungsprozess (DE-588)4168927-6 gnd Diskreter Markov-Prozess (DE-588)4150185-8 gnd |
topic_facet | Commande, Théorie de la Controlesystemen Markov, Processus de Markov-processen Numerieke wiskunde Processos de markov Processos estocásticos Systèmes échantillonnés Control theory Discrete-time systems Markov processes Markov-Entscheidungsprozess Diskreter Markov-Prozess |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=008694032&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
volume_link | (DE-604)BV000895226 |
work_keys_str_mv | AT hernandezlermaonesimo furthertopicsondiscretetimemarkovcontrolprocesses AT lasserrejeanbernard furthertopicsondiscretetimemarkovcontrolprocesses |