Verfügbarkeit: Sample Efficient Multiagent Learning in the Presence of Markovian Agents

Sample Efficient Multiagent Learning in the Presence of Markovian Agents:

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	Chakraborty, Doran (VerfasserIn)
Format:	Elektronisch E-Book
Sprache:	English
Veröffentlicht:	2014
Schriftenreihe:	Studies in Computational Intelligence 523
Schlagworte:	Engineering Artificial intelligence Computational Intelligence Artificial Intelligence (incl. Robotics) Ingenieurwissenschaften Künstliche Intelligenz
Online-Zugang:	BTU01 FHA01 FHI01 FHN01 FHR01 FKE01 FRO01 FWS01 FWS02 UBY01 Volltext Inhaltsverzeichnis Abstract
Beschreibung:	The problem of Multiagent Learning (or MAL) is concerned with the study of how intelligent entities can learn and adapt in the presence of other such entities that are simultaneously adapting. The problem is often studied in the stylized settings provided by repeated matrix games (a.k.a. normal form games). The goal of this book is to develop MAL algorithms for such a setting that achieve a new set of objectives which have not been previously achieved. In particular this book deals with learning in the presence of a new class of agent behavior that has not been studied or modeled before in a MAL context: Markovian agent behavior. Several new challenges arise when interacting with this particular class of agents. The book takes a series of steps towards building completely autonomous learning algorithms that maximize utility while interacting with such agents. Each algorithm is meticulously specified with a thorough formal treatment that elucidates its key theoretical properties
Beschreibung:	1 Online-Ressource (XVIII, 147 p.) 31 illus
ISBN:	9783319026060
DOI:	10.1007/978-3-319-02606-0

Internformat

MARC


LEADER	00000nmm a2200000zcb4500
001	BV041470995
003	DE-604
005	20140124
007	cr\|uuu---uuuuu
008	131210s2014 \|\|\|\| o\|\|u\| \|\|\|\|\|\|eng d
020			\|a 9783319026060 \|9 978-3-319-02606-0
024	7		\|a 10.1007/978-3-319-02606-0 \|2 doi
035			\|a (OCoLC)874381652
035			\|a (DE-599)BVBBV041470995
040			\|a DE-604 \|b ger \|e aacr
041	0		\|a eng
049			\|a DE-Aug4 \|a DE-92 \|a DE-634 \|a DE-859 \|a DE-898 \|a DE-573 \|a DE-861 \|a DE-706 \|a DE-863 \|a DE-862
082	0		\|a 006.3 \|2 23
100	1		\|a Chakraborty, Doran \|e Verfasser \|4 aut
245	1	0	\|a Sample Efficient Multiagent Learning in the Presence of Markovian Agents \|c by Doran Chakraborty
264		1	\|c 2014
300			\|a 1 Online-Ressource (XVIII, 147 p.) \|b 31 illus
336			\|b txt \|2 rdacontent
337			\|b c \|2 rdamedia
338			\|b cr \|2 rdacarrier
490	1		\|a Studies in Computational Intelligence \|v 523
500			\|a The problem of Multiagent Learning (or MAL) is concerned with the study of how intelligent entities can learn and adapt in the presence of other such entities that are simultaneously adapting. The problem is often studied in the stylized settings provided by repeated matrix games (a.k.a. normal form games). The goal of this book is to develop MAL algorithms for such a setting that achieve a new set of objectives which have not been previously achieved. In particular this book deals with learning in the presence of a new class of agent behavior that has not been studied or modeled before in a MAL context: Markovian agent behavior. Several new challenges arise when interacting with this particular class of agents. The book takes a series of steps towards building completely autonomous learning algorithms that maximize utility while interacting with such agents. Each algorithm is meticulously specified with a thorough formal treatment that elucidates its key theoretical properties
505	0		\|a Introduction -- Background -- Learn or Exploit in Adversary Induced Markov Decision Processes -- Convergence, Targeted Optimality and Safety in Multiagent Learning -- Maximizing -- Targeted Modeling of Markovian agents -- Structure Learning in Factored MDPs -- Related Work -- Conclusion and Future Work
650		4	\|a Engineering
650		4	\|a Artificial intelligence
650		4	\|a Computational Intelligence
650		4	\|a Artificial Intelligence (incl. Robotics)
650		4	\|a Ingenieurwissenschaften
650		4	\|a Künstliche Intelligenz
830		0	\|a Studies in Computational Intelligence \|v 523 \|w (DE-604)BV020822171 \|9 523
856	4	0	\|u https://doi.org/10.1007/978-3-319-02606-0 \|x Verlag \|3 Volltext
856	4	2	\|m Springer Fremddatenuebernahme \|q application/pdf \|u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA \|3 Inhaltsverzeichnis
856	4	2	\|m Springer Fremddatenuebernahme \|q application/pdf \|u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000003&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA \|3 Abstract
912			\|a ZDB-2-ENG
999			\|a oai:aleph.bib-bvb.de:BVB01-026917137
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l BTU01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FHA01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FHI01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FHN01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FHR01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FKE01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FRO01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FWS01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l FWS02 \|p ZDB-2-ENG \|x Verlag \|3 Volltext
966	e		\|u https://doi.org/10.1007/978-3-319-02606-0 \|l UBY01 \|p ZDB-2-ENG \|x Verlag \|3 Volltext

Datensatz im Suchindex

DE-BY-FWS_katkey	1016034
_version_	1824553640880242689
adam_text	SAMPLE EFFICIENT MULTIAGENT LEARNING IN THE PRESENCE OF MARKOVIAN AGENTS / CHAKRABORTY, DORAN : 2014 TABLE OF CONTENTS / INHALTSVERZEICHNIS INTRODUCTION BACKGROUND LEARN OR EXPLOIT IN ADVERSARY INDUCED MARKOV DECISION PROCESSES CONVERGENCE, TARGETED OPTIMALITY AND SAFETY IN MULTIAGENT LEARNING MAXIMIZING TARGETED MODELING OF MARKOVIAN AGENTS STRUCTURE LEARNING IN FACTORED MDPS RELATED WORK CONCLUSION AND FUTURE WORK DIESES SCHRIFTSTUECK WURDE MASCHINELL ERZEUGT. SAMPLE EFFICIENT MULTIAGENT LEARNING IN THE PRESENCE OF MARKOVIAN AGENTS / CHAKRABORTY, DORAN : 2014 ABSTRACT / INHALTSTEXT THE PROBLEM OF MULTIAGENT LEARNING (OR MAL) IS CONCERNED WITH THE STUDY OF HOW INTELLIGENT ENTITIES CAN LEARN AND ADAPT IN THE PRESENCE OF OTHER SUCH ENTITIES THAT ARE SIMULTANEOUSLY ADAPTING. THE PROBLEM IS OFTEN STUDIED IN THE STYLIZED SETTINGS PROVIDED BY REPEATED MATRIX GAMES (A.K.A. NORMAL FORM GAMES). THE GOAL OF THIS BOOK IS TO DEVELOP MAL ALGORITHMS FOR SUCH A SETTING THAT ACHIEVE A NEW SET OF OBJECTIVES WHICH HAVE NOT BEEN PREVIOUSLY ACHIEVED. IN PARTICULAR THIS BOOK DEALS WITH LEARNING IN THE PRESENCE OF A NEW CLASS OF AGENT BEHAVIOR THAT HAS NOT BEEN STUDIED OR MODELED BEFORE IN A MAL CONTEXT: MARKOVIAN AGENT BEHAVIOR. SEVERAL NEW CHALLENGES ARISE WHEN INTERACTING WITH THIS PARTICULAR CLASS OF AGENTS. THE BOOK TAKES A SERIES OF STEPS TOWARDS BUILDING COMPLETELY AUTONOMOUS LEARNING ALGORITHMS THAT MAXIMIZE UTILITY WHILE INTERACTING WITH SUCH AGENTS. EACH ALGORITHM IS METICULOUSLY SPECIFIED WITH A THOROUGH FORMAL TREATMENT THAT ELUCIDATES ITS KEY THEORETICAL PROPERTIES DIESES SCHRIFTSTUECK WURDE MASCHINELL ERZEUGT.
any_adam_object	1
author	Chakraborty, Doran
author_facet	Chakraborty, Doran
author_role	aut
author_sort	Chakraborty, Doran
author_variant	d c dc
building	Verbundindex
bvnumber	BV041470995
collection	ZDB-2-ENG
contents	Introduction -- Background -- Learn or Exploit in Adversary Induced Markov Decision Processes -- Convergence, Targeted Optimality and Safety in Multiagent Learning -- Maximizing -- Targeted Modeling of Markovian agents -- Structure Learning in Factored MDPs -- Related Work -- Conclusion and Future Work
ctrlnum	(OCoLC)874381652 (DE-599)BVBBV041470995
dewey-full	006.3
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.3
dewey-search	006.3
dewey-sort	16.3
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
doi_str_mv	10.1007/978-3-319-02606-0
format	Electronic eBook
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>04174nmm a2200565zcb4500</leader><controlfield tag="001">BV041470995</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20140124 </controlfield><controlfield tag="007">cr\|uuu---uuuuu</controlfield><controlfield tag="008">131210s2014 \|\|\|\| o\|\|u\| \|\|\|\|\|\|eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783319026060</subfield><subfield code="9">978-3-319-02606-0</subfield></datafield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/978-3-319-02606-0</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)874381652</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV041470995</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">aacr</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-Aug4</subfield><subfield code="a">DE-92</subfield><subfield code="a">DE-634</subfield><subfield code="a">DE-859</subfield><subfield code="a">DE-898</subfield><subfield code="a">DE-573</subfield><subfield code="a">DE-861</subfield><subfield code="a">DE-706</subfield><subfield code="a">DE-863</subfield><subfield code="a">DE-862</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.3</subfield><subfield code="2">23</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chakraborty, Doran</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Sample Efficient Multiagent Learning in the Presence of Markovian Agents</subfield><subfield code="c">by Doran Chakraborty</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2014</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (XVIII, 147 p.)</subfield><subfield code="b">31 illus</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Studies in Computational Intelligence</subfield><subfield code="v">523</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">The problem of Multiagent Learning (or MAL) is concerned with the study of how intelligent entities can learn and adapt in the presence of other such entities that are simultaneously adapting. The problem is often studied in the stylized settings provided by repeated matrix games (a.k.a. normal form games). The goal of this book is to develop MAL algorithms for such a setting that achieve a new set of objectives which have not been previously achieved. In particular this book deals with learning in the presence of a new class of agent behavior that has not been studied or modeled before in a MAL context: Markovian agent behavior. Several new challenges arise when interacting with this particular class of agents. The book takes a series of steps towards building completely autonomous learning algorithms that maximize utility while interacting with such agents. Each algorithm is meticulously specified with a thorough formal treatment that elucidates its key theoretical properties</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Introduction -- Background -- Learn or Exploit in Adversary Induced Markov Decision Processes -- Convergence, Targeted Optimality and Safety in Multiagent Learning -- Maximizing -- Targeted Modeling of Markovian agents -- Structure Learning in Factored MDPs -- Related Work -- Conclusion and Future Work</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Engineering</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Artificial intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Computational Intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Artificial Intelligence (incl. Robotics)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Ingenieurwissenschaften</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Künstliche Intelligenz</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Studies in Computational Intelligence</subfield><subfield code="v">523</subfield><subfield code="w">(DE-604)BV020822171</subfield><subfield code="9">523</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Springer Fremddatenuebernahme</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Springer Fremddatenuebernahme</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000003&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Abstract</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-2-ENG</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-026917137</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">BTU01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FHA01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FHI01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FHN01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FHR01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FKE01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FRO01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FWS01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">FWS02</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://doi.org/10.1007/978-3-319-02606-0</subfield><subfield code="l">UBY01</subfield><subfield code="p">ZDB-2-ENG</subfield><subfield code="x">Verlag</subfield><subfield code="3">Volltext</subfield></datafield></record></collection>
id	DE-604.BV041470995
illustrated	Not Illustrated
indexdate	2025-02-20T06:39:06Z
institution	BVB
isbn	9783319026060
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-026917137
oclc_num	874381652
open_access_boolean
owner	DE-Aug4 DE-92 DE-634 DE-859 DE-898 DE-BY-UBR DE-573 DE-861 DE-706 DE-863 DE-BY-FWS DE-862 DE-BY-FWS
owner_facet	DE-Aug4 DE-92 DE-634 DE-859 DE-898 DE-BY-UBR DE-573 DE-861 DE-706 DE-863 DE-BY-FWS DE-862 DE-BY-FWS
physical	1 Online-Ressource (XVIII, 147 p.) 31 illus
psigel	ZDB-2-ENG
publishDate	2014
publishDateSearch	2014
publishDateSort	2014
record_format	marc
series	Studies in Computational Intelligence
series2	Studies in Computational Intelligence
spellingShingle	Chakraborty, Doran Sample Efficient Multiagent Learning in the Presence of Markovian Agents Studies in Computational Intelligence Introduction -- Background -- Learn or Exploit in Adversary Induced Markov Decision Processes -- Convergence, Targeted Optimality and Safety in Multiagent Learning -- Maximizing -- Targeted Modeling of Markovian agents -- Structure Learning in Factored MDPs -- Related Work -- Conclusion and Future Work Engineering Artificial intelligence Computational Intelligence Artificial Intelligence (incl. Robotics) Ingenieurwissenschaften Künstliche Intelligenz
title	Sample Efficient Multiagent Learning in the Presence of Markovian Agents
title_auth	Sample Efficient Multiagent Learning in the Presence of Markovian Agents
title_exact_search	Sample Efficient Multiagent Learning in the Presence of Markovian Agents
title_full	Sample Efficient Multiagent Learning in the Presence of Markovian Agents by Doran Chakraborty
title_fullStr	Sample Efficient Multiagent Learning in the Presence of Markovian Agents by Doran Chakraborty
title_full_unstemmed	Sample Efficient Multiagent Learning in the Presence of Markovian Agents by Doran Chakraborty
title_short	Sample Efficient Multiagent Learning in the Presence of Markovian Agents
title_sort	sample efficient multiagent learning in the presence of markovian agents
topic	Engineering Artificial intelligence Computational Intelligence Artificial Intelligence (incl. Robotics) Ingenieurwissenschaften Künstliche Intelligenz
topic_facet	Engineering Artificial intelligence Computational Intelligence Artificial Intelligence (incl. Robotics) Ingenieurwissenschaften Künstliche Intelligenz
url	https://doi.org/10.1007/978-3-319-02606-0 http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=026917137&sequence=000003&line_number=0002&func_code=DB_RECORDS&service_type=MEDIA
volume_link	(DE-604)BV020822171
work_keys_str_mv	AT chakrabortydoran sampleefficientmultiagentlearninginthepresenceofmarkovianagents

Verfügbarkeit

Volltext öffnen

MARC

Datensatz im Suchindex

Ähnliche Einträge