Availability: Multi-agent machine learning

Multi-agent machine learning: a reinforcement approach

Bibliographic Details
Main Author:	Schwartz, Howard M. (Author)
Format:	Book
Language:	English
Published:	Hoboken, New Jersey: Wiley, 2014
Subjects:	TECHNOLOGY & ENGINEERING / Electronics / General; Reinforcement learning; Differential games; Swarm intelligence; Machine learning; Mehragentensystem; Bestärkendes Lernen > Künstliche Intelligenz; Maschinelles Lernen; Schwarmintelligenz
Online Access:	Cover image; Table of contents
Summary:	"Multi-Agent Machine Learning: A Reinforcement Learning Approach is a framework to understanding different methods and approaches in multi-agent machine learning. It also provides cohesive coverage of the latest advances in multi-agent differential games and presents applications in game theory and robotics. Framework for understanding a variety of methods and approaches in multi-agent machine learning. Discusses methods of reinforcement learning such as a number of forms of multi-agent Q-learning. Applicable to research professors and graduate students studying electrical and computer engineering, computer science, and mechanical and aerospace engineering."
Physical Description:	xi, 242 pages, diagrams
ISBN:	9781118362082

Internal Format

MARC


LEADER	00000nam a2200000 c 4500
001	BV042186941
003	DE-604
005	20220112
007	t
008	141114s2014 xxu\|\|\|\| \|\|\|\| 00\|\|\| eng d
010			\|a 014016950
020			\|a 9781118362082 \|9 978-1-118-36208-2
035			\|a (OCoLC)897814171
035			\|a (DE-599)BVBBV042186941
040			\|a DE-604 \|b ger \|e rda
041	0		\|a eng
044			\|a xxu \|c US
049			\|a DE-384
050		0	\|a Q325.6
082	0		\|a 519.3 \|2 23
084			\|a ST 300 \|0 (DE-625)143650: \|2 rvk
100	1		\|a Schwartz, Howard M. \|e Verfasser \|0 (DE-588)1059786249 \|4 aut
245	1	0	\|a Multi-agent machine learning \|b a reinforcement approach \|c Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University
264		1	\|a Hoboken, New Jersey \|b Wiley \|c 2014
300			\|a xi, 242 Seiten \|b Diagramme
336			\|b txt \|2 rdacontent
337			\|b n \|2 rdamedia
338			\|b nc \|2 rdacarrier
520			\|a "Multi-Agent Machine Learning: A Reinforcement Learning Approach is a framework to understanding different methods and approaches in multi-agent machine learning. It also provides cohesive coverage of the latest advances in multi-agent differential games and presents applications in game theory and robotics. Framework for understanding a variety of methods and approaches in multi-agent machine learning. Discusses methods of reinforcement learning such as a number of forms of multi-agent Q-learning Applicable to research professors and graduate students studying electrical and computer engineering, computer science, and mechanical and aerospace engineering"..
650		7	\|a TECHNOLOGY & ENGINEERING / Electronics / General \|2 bisacsh
650		4	\|a Reinforcement learning
650		4	\|a Differential games
650		4	\|a Swarm intelligence
650		4	\|a Machine learning
650		4	\|a TECHNOLOGY & ENGINEERING / Electronics / General
650	0	7	\|a Mehragentensystem \|0 (DE-588)4389058-1 \|2 gnd \|9 rswk-swf
650	0	7	\|a Bestärkendes Lernen \|g Künstliche Intelligenz \|0 (DE-588)4825546-4 \|2 gnd \|9 rswk-swf
650	0	7	\|a Maschinelles Lernen \|0 (DE-588)4193754-5 \|2 gnd \|9 rswk-swf
650	0	7	\|a Schwarmintelligenz \|0 (DE-588)4793676-9 \|2 gnd \|9 rswk-swf
689	0	0	\|a Mehragentensystem \|0 (DE-588)4389058-1 \|D s
689	0	1	\|a Maschinelles Lernen \|0 (DE-588)4193754-5 \|D s
689	0		\|5 DE-604
689	1	0	\|a Bestärkendes Lernen \|g Künstliche Intelligenz \|0 (DE-588)4825546-4 \|D s
689	1	1	\|a Schwarmintelligenz \|0 (DE-588)4793676-9 \|D s
689	1	2	\|a Maschinelles Lernen \|0 (DE-588)4193754-5 \|D s
689	1		\|8 1\p \|5 DE-604
856	4		\|u http://catalogimages.wiley.com/images/db/jimages/9781118362082.jpg \|3 Cover image
856	4	2	\|m HBZ Datenaustausch \|q application/pdf \|u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027626064&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA \|3 Inhaltsverzeichnis
999			\|a oai:aleph.bib-bvb.de:BVB01-027626064
883	1		\|8 1\p \|a cgwrk \|d 20201028 \|q DE-101 \|u https://d-nb.info/provenance/plan#cgwrk

Record in the Search Index

_version_	1804152699155906560
adam_text	Title: Multi-agent machine learning. Author: Schwartz, Howard M. Year: 2014.
Contents
Preface
Chapter 1 A Brief Review of Supervised Learning: 1.1 Least Squares Estimates; 1.2 Recursive Least Squares; 1.3 Least Mean Squares; 1.4 Stochastic Approximation; References
Chapter 2 Single-Agent Reinforcement Learning: 2.1 Introduction; 2.2 n-Armed Bandit Problem; 2.3 The Learning Structure; 2.4 The Value Function; 2.5 The Optimal Value Functions; 2.5.1 The Grid World Example; 2.6 Markov Decision Processes; 2.7 Learning Value Functions; 2.8 Policy Iteration; 2.9 Temporal Difference Learning; 2.10 TD Learning of the State-Action Function; 2.11 Q-Learning; 2.12 Eligibility Traces; References
Chapter 3 Learning in Two-Player Matrix Games: 3.1 Matrix Games; 3.2 Nash Equilibria in Two-Player Matrix Games; 3.3 Linear Programming in Two-Player Zero-Sum Matrix Games; 3.4 The Learning Algorithms; 3.5 Gradient Ascent Algorithm; 3.6 WoLF-IGA Algorithm; 3.7 Policy Hill Climbing (PHC); 3.8 WoLF-PHC Algorithm; 3.9 Decentralized Learning in Matrix Games; 3.10 Learning Automata; 3.11 Linear Reward-Inaction Algorithm; 3.12 Linear Reward-Penalty Algorithm; 3.13 The Lagging Anchor Algorithm; 3.14 L_{R-I} Lagging Anchor Algorithm; 3.14.1 Simulation; References
Chapter 4 Learning in Multiplayer Stochastic Games: 4.1 Introduction; 4.2 Multiplayer Stochastic Games; 4.3 Minimax-Q Algorithm; 4.3.1 2x2 Grid Game; 4.4 Nash Q-Learning; 4.4.1 The Learning Process; 4.5 The Simplex Algorithm; 4.6 The Lemke-Howson Algorithm; 4.7 Nash-Q Implementation; 4.8 Friend-or-Foe Q-Learning; 4.9 Infinite Gradient Ascent; 4.10 Policy Hill Climbing; 4.11 WoLF-PHC Algorithm; 4.12 Guarding a Territory Problem in a Grid World; 4.12.1 Simulation and Results; 4.13 Extension of L_{R-I} Lagging Anchor Algorithm to Stochastic Games; 4.14 The Exponential Moving-Average Q-Learning (EMA Q-Learning) Algorithm; 4.15 Simulation and Results Comparing EMA Q-Learning to Other Methods; 4.15.1 Matrix Games; 4.15.2 Stochastic Games; References
Chapter 5 Differential Games: 5.1 Introduction; 5.2 A Brief Tutorial on Fuzzy Systems; 5.2.1 Fuzzy Sets and Fuzzy Rules; 5.2.2 Fuzzy Inference Engine; 5.2.3 Fuzzifier and Defuzzifier; 5.2.4 Fuzzy Systems and Examples; 5.3 Fuzzy Q-Learning; 5.4 Fuzzy Actor-Critic Learning; 5.5 Homicidal Chauffeur Differential Game; 5.6 Fuzzy Controller Structure; 5.7 Q(λ)-Learning Fuzzy Inference System; 5.8 Simulation Results for the Homicidal Chauffeur; 5.9 Learning in the Evader-Pursuer Game with Two Cars; 5.10 Simulation of the Game of Two Cars; 5.11 Differential Game of Guarding a Territory; 5.12 Reward Shaping in the Differential Game of Guarding a Territory; 5.13 Simulation Results; 5.13.1 One Defender Versus One Invader; 5.13.2 Two Defenders Versus One Invader; References
Chapter 6 Swarm Intelligence and the Evolution of Personality Traits: 6.1 Introduction; 6.2 The Evolution of Swarm Intelligence; 6.3 Representation of the Environment; 6.4 Swarm-Based Robotics in Terms of Personalities; 6.5 Evolution of Personality Traits; 6.6 Simulation Framework; 6.7 A Zero-Sum Game Example; 6.7.1 Convergence; 6.7.2 Simulation Results; 6.8 Implementation for Next Sections; 6.9 Robots Leaving a Room; 6.10 Tracking a Target; 6.11 Conclusion; References
Index
any_adam_object	1
author	Schwartz, Howard M.
author_GND	(DE-588)1059786249
author_facet	Schwartz, Howard M.
author_role	aut
author_sort	Schwartz, Howard M.
author_variant	h m s hm hms
building	Verbundindex
bvnumber	BV042186941
callnumber-first	Q - Science
callnumber-label	Q325
callnumber-raw	Q325.6
callnumber-search	Q325.6
callnumber-sort	Q 3325.6
callnumber-subject	Q - General Science
classification_rvk	ST 300
ctrlnum	(OCoLC)897814171 (DE-599)BVBBV042186941
dewey-full	519.3
dewey-hundreds	500 - Natural sciences and mathematics
dewey-ones	519 - Probabilities and applied mathematics
dewey-raw	519.3
dewey-search	519.3
dewey-sort	3519.3
dewey-tens	510 - Mathematics
discipline	Informatik Mathematik
format	Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03030nam a2200565 c 4500</leader><controlfield tag="001">BV042186941</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220112 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">141114s2014 xxu\|\|\|\| \|\|\|\| 00\|\|\| eng d</controlfield><datafield tag="010" ind1=" " ind2=" "><subfield code="a">014016950</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781118362082</subfield><subfield code="9">978-1-118-36208-2</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)897814171</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV042186941</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="044" ind1=" " ind2=" "><subfield code="a">xxu</subfield><subfield code="c">US</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-384</subfield></datafield><datafield tag="050" ind1=" " ind2="0"><subfield code="a">Q325.6</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">519.3</subfield><subfield code="2">23</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 300</subfield><subfield code="0">(DE-625)143650:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Schwartz, Howard M.</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1059786249</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Multi-agent machine learning</subfield><subfield code="b">a 
reinforcement approach</subfield><subfield code="c">Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, New Jersey</subfield><subfield code="b">Wiley</subfield><subfield code="c">2014</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xi, 242 Seiten</subfield><subfield code="b">Diagramme</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">"Multi-Agent Machine Learning: A Reinforcement Learning Approach is a framework to understanding different methods and approaches in multi-agent machine learning. It also provides cohesive coverage of the latest advances in multi-agent differential games and presents applications in game theory and robotics. Framework for understanding a variety of methods and approaches in multi-agent machine learning. 
Discusses methods of reinforcement learning such as a number of forms of multi-agent Q-learning Applicable to research professors and graduate students studying electrical and computer engineering, computer science, and mechanical and aerospace engineering"..</subfield></datafield><datafield tag="650" ind1=" " ind2="7"><subfield code="a">TECHNOLOGY & ENGINEERING / Electronics / General</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Reinforcement learning</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Differential games</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Swarm intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Machine learning</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">TECHNOLOGY & ENGINEERING / Electronics / General</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Mehragentensystem</subfield><subfield code="0">(DE-588)4389058-1</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Maschinelles Lernen</subfield><subfield code="0">(DE-588)4193754-5</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Schwarmintelligenz</subfield><subfield code="0">(DE-588)4793676-9</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Mehragentensystem</subfield><subfield 
code="0">(DE-588)4389058-1</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2="1"><subfield code="a">Maschinelles Lernen</subfield><subfield code="0">(DE-588)4193754-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="689" ind1="1" ind2="0"><subfield code="a">Bestärkendes Lernen</subfield><subfield code="g">Künstliche Intelligenz</subfield><subfield code="0">(DE-588)4825546-4</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="1"><subfield code="a">Schwarmintelligenz</subfield><subfield code="0">(DE-588)4793676-9</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2="2"><subfield code="a">Maschinelles Lernen</subfield><subfield code="0">(DE-588)4193754-5</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="5">DE-604</subfield></datafield><datafield tag="856" ind1="4" ind2=" "><subfield code="u">http://catalogimages.wiley.com/images/db/jimages/9781118362082.jpg</subfield><subfield code="3">Cover image</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">HBZ Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027626064&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-027626064</subfield></datafield><datafield tag="883" ind1="1" ind2=" "><subfield code="8">1\p</subfield><subfield code="a">cgwrk</subfield><subfield code="d">20201028</subfield><subfield code="q">DE-101</subfield><subfield 
code="u">https://d-nb.info/provenance/plan#cgwrk</subfield></datafield></record></collection>
id	DE-604.BV042186941
illustrated	Not Illustrated
indexdate	2024-07-10T01:14:52Z
institution	BVB
isbn	9781118362082
language	English
lccn	014016950
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-027626064
oclc_num	897814171
open_access_boolean
owner	DE-384
owner_facet	DE-384
physical	xi, 242 Seiten Diagramme
publishDate	2014
publishDateSearch	2014
publishDateSort	2014
publisher	Wiley
record_format	marc
spelling	Schwartz, Howard M. Verfasser (DE-588)1059786249 aut Multi-agent machine learning a reinforcement approach Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University Hoboken, New Jersey Wiley 2014 xi, 242 Seiten Diagramme txt rdacontent n rdamedia nc rdacarrier "Multi-Agent Machine Learning: A Reinforcement Learning Approach is a framework to understanding different methods and approaches in multi-agent machine learning. It also provides cohesive coverage of the latest advances in multi-agent differential games and presents applications in game theory and robotics. Framework for understanding a variety of methods and approaches in multi-agent machine learning. Discusses methods of reinforcement learning such as a number of forms of multi-agent Q-learning Applicable to research professors and graduate students studying electrical and computer engineering, computer science, and mechanical and aerospace engineering".. TECHNOLOGY & ENGINEERING / Electronics / General bisacsh Reinforcement learning Differential games Swarm intelligence Machine learning TECHNOLOGY & ENGINEERING / Electronics / General Mehragentensystem (DE-588)4389058-1 gnd rswk-swf Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd rswk-swf Maschinelles Lernen (DE-588)4193754-5 gnd rswk-swf Schwarmintelligenz (DE-588)4793676-9 gnd rswk-swf Mehragentensystem (DE-588)4389058-1 s Maschinelles Lernen (DE-588)4193754-5 s DE-604 Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 s Schwarmintelligenz (DE-588)4793676-9 s 1\p DE-604 http://catalogimages.wiley.com/images/db/jimages/9781118362082.jpg Cover image HBZ Datenaustausch application/pdf http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027626064&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA Inhaltsverzeichnis 1\p cgwrk 20201028 DE-101 https://d-nb.info/provenance/plan#cgwrk
spellingShingle	Schwartz, Howard M. Multi-agent machine learning a reinforcement approach TECHNOLOGY & ENGINEERING / Electronics / General bisacsh Reinforcement learning Differential games Swarm intelligence Machine learning TECHNOLOGY & ENGINEERING / Electronics / General Mehragentensystem (DE-588)4389058-1 gnd Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd Maschinelles Lernen (DE-588)4193754-5 gnd Schwarmintelligenz (DE-588)4793676-9 gnd
subject_GND	(DE-588)4389058-1 (DE-588)4825546-4 (DE-588)4193754-5 (DE-588)4793676-9
title	Multi-agent machine learning a reinforcement approach
title_auth	Multi-agent machine learning a reinforcement approach
title_exact_search	Multi-agent machine learning a reinforcement approach
title_full	Multi-agent machine learning a reinforcement approach Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University
title_fullStr	Multi-agent machine learning a reinforcement approach Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University
title_full_unstemmed	Multi-agent machine learning a reinforcement approach Howard M. Schwartz, Department of Systems and Computer Engineering Carleton University
title_short	Multi-agent machine learning
title_sort	multi agent machine learning a reinforcement approach
title_sub	a reinforcement approach
topic	TECHNOLOGY & ENGINEERING / Electronics / General bisacsh Reinforcement learning Differential games Swarm intelligence Machine learning TECHNOLOGY & ENGINEERING / Electronics / General Mehragentensystem (DE-588)4389058-1 gnd Bestärkendes Lernen Künstliche Intelligenz (DE-588)4825546-4 gnd Maschinelles Lernen (DE-588)4193754-5 gnd Schwarmintelligenz (DE-588)4793676-9 gnd
topic_facet	TECHNOLOGY & ENGINEERING / Electronics / General Reinforcement learning Differential games Swarm intelligence Machine learning Mehragentensystem Bestärkendes Lernen Künstliche Intelligenz Maschinelles Lernen Schwarmintelligenz
url	http://catalogimages.wiley.com/images/db/jimages/9781118362082.jpg http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=027626064&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA
work_keys_str_mv	AT schwartzhowardm multiagentmachinelearningareinforcementapproach

Availability

No print copy is available.

Order via interlibrary loan. Note: Not in the THWS holdings! Table of contents