Adaptive Markov Control Processes
Main author: | Hernández-Lerma, O. |
---|---|
Format: | Electronic E-Book |
Language: | English |
Published: | New York, NY: Springer New York, 1989 |
Series: | Applied Mathematical Sciences 79 |
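Subjects: | Mathematics; Distribution (Probability theory); Probability Theory and Stochastic Processes; Gesteuerter Markov-Prozess; Adaptivregelung; Markov-Prozess |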
Online access: | Full text |
Description: | This book is concerned with a class of discrete-time stochastic control processes known as controlled Markov processes (CMP's), also known as Markov decision processes or Markov dynamic programs. Starting in the mid-1950s with Richard Bellman, many contributions to CMP's have been made, and applications to engineering, statistics and operations research, among other areas, have also been developed. The purpose of this book is to present some recent developments on the theory of adaptive CMP's, i.e., CMP's that depend on unknown parameters. Thus at each decision time, the controller or decision-maker must estimate the true parameter values, and then adapt the control actions to the estimated values. We do not intend to describe all aspects of stochastic adaptive control; rather, the selection of material reflects our own research interests. The prerequisite for this book is a knowledge of real analysis and probability theory at the level of, say, Ash (1972) or Royden (1968), but no previous knowledge of control or decision processes is required. The presentation, on the other hand, is meant to be self-contained, in the sense that whenever a result from analysis or probability is used, it is usually stated in full and references are supplied for further discussion, if necessary. Several appendices are provided for this purpose. The material is divided into six chapters. Chapter 1 contains the basic definitions about the stochastic control problems we are interested in; a brief description of some applications is also provided. |
Physical description: | 1 online resource (XIV, 148 p.) |
ISBN: | 9781441987143 9781461264545 |
ISSN: | 0066-5452 |
DOI: | 10.1007/978-1-4419-8714-3 |
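
The description above states the adaptive principle only in words: at each decision time, estimate the unknown parameter, then act as if the estimate were true. The following is a minimal sketch of that idea and is not taken from the book. The toy inventory model, the cost constants, the Bernoulli demand with unknown parameter `theta`, and the function names are all illustrative assumptions, and demand is assumed to be observed even when it cannot be met.

```python
import random

MAX_STOCK = 3          # states are stock levels 0..MAX_STOCK (assumed)
ACTIONS = (0, 1)       # order nothing, or order one unit
DISCOUNT = 0.9         # discount factor (assumed)
ORDER_COST, HOLD_COST, SHORTAGE_COST = 0.5, 1.0, 10.0  # assumed costs


def step(stock, order, demand):
    """One transition of the toy model: returns (reward, next_stock)."""
    on_hand = stock + order
    cost = ORDER_COST * order + HOLD_COST * on_hand
    if demand > on_hand:
        cost += SHORTAGE_COST
    next_stock = min(max(on_hand - demand, 0), MAX_STOCK)
    return -cost, next_stock


def q_value(stock, order, theta, values):
    """Expected discounted return of `order` when demand ~ Bernoulli(theta)."""
    total = 0.0
    for demand, prob in ((0, 1.0 - theta), (1, theta)):
        reward, nxt = step(stock, order, demand)
        total += prob * (reward + DISCOUNT * values[nxt])
    return total


def greedy_policy(theta, sweeps=100):
    """Value iteration for the model with parameter theta, then greedy actions."""
    states = range(MAX_STOCK + 1)
    values = [0.0] * (MAX_STOCK + 1)
    for _ in range(sweeps):
        values = [max(q_value(s, a, theta, values) for a in ACTIONS)
                  for s in states]
    return [max(ACTIONS, key=lambda a: q_value(s, a, theta, values))
            for s in states]


def run_adaptive(true_theta, horizon=300, seed=0):
    """Estimate theta from observed demand and re-plan at every decision time."""
    rng = random.Random(seed)
    stock, demands_seen = 0, 0
    for t in range(horizon):
        theta_hat = (demands_seen + 1) / (t + 2)   # smoothed frequency estimate
        order = greedy_policy(theta_hat)[stock]    # act as if theta_hat were true
        demand = 1 if rng.random() < true_theta else 0
        _, stock = step(stock, order, demand)
        demands_seen += demand
    theta_hat = (demands_seen + 1) / (horizon + 2)
    return theta_hat, greedy_policy(theta_hat)


if __name__ == "__main__":
    estimate, policy = run_adaptive(true_theta=0.8)
    print(f"estimate {estimate:.3f}, policy by stock level {policy}")
```

Re-solving the whole model at every step keeps the sketch short and is affordable only because the state space is tiny. The estimate here does not depend on the actions taken, which sidesteps the exploration issues that arise when the parameter can be learned only through particular controls.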
Internal format
MARC
LEADER | 00000nmm a2200000zcb4500 | ||
---|---|---|---|
001 | BV042419295 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | cr|uuu---uuuuu | ||
008 | 150317s1989 |||| o||u| ||||||eng d | ||
020 | |a 9781441987143 |c Online |9 978-1-4419-8714-3 | ||
020 | |a 9781461264545 |c Print |9 978-1-4612-6454-5 | ||
024 | 7 | |a 10.1007/978-1-4419-8714-3 |2 doi | |
035 | |a (OCoLC)1184430489 | ||
035 | |a (DE-599)BVBBV042419295 | ||
040 | |a DE-604 |b ger |e aacr | ||
041 | 0 | |a eng | |
049 | |a DE-384 |a DE-703 |a DE-91 |a DE-634 | ||
082 | 0 | |a 519.2 |2 23 | |
084 | |a MAT 000 |2 stub | ||
100 | 1 | |a Hernández-Lerma, O. |e Verfasser |4 aut | |
245 | 1 | 0 | |a Adaptive Markov Control Processes |c by O. Hernández-Lerma |
264 | 1 | |a New York, NY |b Springer New York |c 1989 | |
300 | |a 1 Online-Ressource (XIV, 148 p) | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
490 | 0 | |a Applied Mathematical Sciences |v 79 |x 0066-5452 | |
500 | |a This book is concerned with a class of discrete-time stochastic control processes known as controlled Markov processes (CMP's), also known as Markov decision processes or Markov dynamic programs. Starting in the mid-1950s with Richard Bellman, many contributions to CMP's have been made, and applications to engineering, statistics and operations research, among other areas, have also been developed. The purpose of this book is to present some recent developments on the theory of adaptive CMP's, i.e., CMP's that depend on unknown parameters. Thus at each decision time, the controller or decision-maker must estimate the true parameter values, and then adapt the control actions to the estimated values. We do not intend to describe all aspects of stochastic adaptive control; rather, the selection of material reflects our own research interests. The prerequisite for this book is a knowledge of real analysis and probability theory at the level of, say, Ash (1972) or Royden (1968), but no previous knowledge of control or decision processes is required. The presentation, on the other hand, is meant to be self-contained, in the sense that whenever a result from analysis or probability is used, it is usually stated in full and references are supplied for further discussion, if necessary. Several appendices are provided for this purpose. The material is divided into six chapters. Chapter 1 contains the basic definitions about the stochastic control problems we are interested in; a brief description of some applications is also provided | ||
650 | 4 | |a Mathematics | |
650 | 4 | |a Distribution (Probability theory) | |
650 | 4 | |a Probability Theory and Stochastic Processes | |
650 | 4 | |a Mathematik | |
650 | 0 | 7 | |a Gesteuerter Markov-Prozess |0 (DE-588)4157165-4 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Adaptivregelung |0 (DE-588)4000457-0 |2 gnd |9 rswk-swf |
650 | 0 | 7 | |a Markov-Prozess |0 (DE-588)4134948-9 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Gesteuerter Markov-Prozess |0 (DE-588)4157165-4 |D s |
689 | 0 | 1 | |a Adaptivregelung |0 (DE-588)4000457-0 |D s |
689 | 0 | |8 1\p |5 DE-604 | |
689 | 1 | 0 | |a Adaptivregelung |0 (DE-588)4000457-0 |D s |
689 | 1 | 1 | |a Markov-Prozess |0 (DE-588)4134948-9 |D s |
689 | 1 | |8 2\p |5 DE-604 | |
856 | 4 | 0 | |u https://doi.org/10.1007/978-1-4419-8714-3 |x Verlag |3 Volltext |
912 | |a ZDB-2-SMA |a ZDB-2-BAE | ||
940 | 1 | |q ZDB-2-SMA_Archive | |
999 | |a oai:aleph.bib-bvb.de:BVB01-027854712 | ||
883 | 1 | |8 1\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk | |
883 | 1 | |8 2\p |a cgwrk |d 20201028 |q DE-101 |u https://d-nb.info/provenance/plan#cgwrk |
Record in search index
_version_ | 1804153089765146624 |
---|---|
any_adam_object | |
author | Hernández-Lerma, O. |
author_facet | Hernández-Lerma, O. |
author_role | aut |
author_sort | Hernández-Lerma, O. |
author_variant | o h l ohl |
building | Verbundindex |
bvnumber | BV042419295 |
classification_tum | MAT 000 |
collection | ZDB-2-SMA ZDB-2-BAE |
ctrlnum | (OCoLC)1184430489 (DE-599)BVBBV042419295 |
dewey-full | 519.2 |
dewey-hundreds | 500 - Natural sciences and mathematics |
dewey-ones | 519 - Probabilities and applied mathematics |
dewey-raw | 519.2 |
dewey-search | 519.2 |
dewey-sort | 3519.2 |
dewey-tens | 510 - Mathematics |
discipline | Mathematik |
doi_str_mv | 10.1007/978-1-4419-8714-3 |
format | Electronic eBook |
id | DE-604.BV042419295 |
illustrated | Not Illustrated |
indexdate | 2024-07-10T01:21:04Z |
institution | BVB |
isbn | 9781441987143 9781461264545 |
issn | 0066-5452 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-027854712 |
oclc_num | 1184430489 |
open_access_boolean | |
owner | DE-384 DE-703 DE-91 DE-BY-TUM DE-634 |
owner_facet | DE-384 DE-703 DE-91 DE-BY-TUM DE-634 |
physical | 1 Online-Ressource (XIV, 148 p) |
psigel | ZDB-2-SMA ZDB-2-BAE ZDB-2-SMA_Archive |
publishDate | 1989 |
publishDateSearch | 1989 |
publishDateSort | 1989 |
publisher | Springer New York |
record_format | marc |
series2 | Applied Mathematical Sciences |
spellingShingle | Hernández-Lerma, O. Adaptive Markov Control Processes Mathematics Distribution (Probability theory) Probability Theory and Stochastic Processes Mathematik Gesteuerter Markov-Prozess (DE-588)4157165-4 gnd Adaptivregelung (DE-588)4000457-0 gnd Markov-Prozess (DE-588)4134948-9 gnd |
subject_GND | (DE-588)4157165-4 (DE-588)4000457-0 (DE-588)4134948-9 |
title | Adaptive Markov Control Processes |
title_auth | Adaptive Markov Control Processes |
title_exact_search | Adaptive Markov Control Processes |
title_full | Adaptive Markov Control Processes by O. Hernández-Lerma |
title_fullStr | Adaptive Markov Control Processes by O. Hernández-Lerma |
title_full_unstemmed | Adaptive Markov Control Processes by O. Hernández-Lerma |
title_short | Adaptive Markov Control Processes |
title_sort | adaptive markov control processes |
topic | Mathematics Distribution (Probability theory) Probability Theory and Stochastic Processes Mathematik Gesteuerter Markov-Prozess (DE-588)4157165-4 gnd Adaptivregelung (DE-588)4000457-0 gnd Markov-Prozess (DE-588)4134948-9 gnd |
topic_facet | Mathematics Distribution (Probability theory) Probability Theory and Stochastic Processes Mathematik Gesteuerter Markov-Prozess Adaptivregelung Markov-Prozess |
url | https://doi.org/10.1007/978-1-4419-8714-3 |
work_keys_str_mv | AT hernandezlermao adaptivemarkovcontrolprocesses |