Staff View: Strength or Accuracy: Credit Assignment in Learning Classifier Systems

Strength or Accuracy: Credit Assignment in Learning Classifier Systems

Classifier systems are an intriguing approach to a broad range of machine learning problems, based on automated generation and evaluation of condition/action rules. In reinforcement learning tasks they simultaneously address the two major problems of learning a policy and generalising over it (and...

Bibliographic Details
Main Author:	Kovacs, Tim (Author)
Format:	Electronic eBook
Language:	English
Published:	London: Springer London, 2004
Edition:	1st ed. 2004
Series:	Distinguished Dissertations
Subjects:	Artificial Intelligence; Algorithm Analysis and Problem Complexity; Computer Appl. in Administrative Data Processing; Artificial intelligence; Algorithms; Application software; Hochschulschrift (thesis)
Online Access:	UBY01 Full text
Summary:	Classifier systems are an intriguing approach to a broad range of machine learning problems, based on automated generation and evaluation of condition/action rules. In reinforcement learning tasks they simultaneously address the two major problems of learning a policy and generalising over it (and related objects, such as value functions). Despite over 20 years of research, however, classifier systems have met with mixed success, for reasons which were often unclear. Finally, in 1995 Stewart Wilson claimed a long-awaited breakthrough with his XCS system, which differs from earlier classifier systems in a number of respects, the most significant of which is the way in which it calculates the value of rules for use by the rule generation system. Specifically, XCS (like most classifier systems) employs a genetic algorithm for rule generation, and the way in which it calculates rule fitness differs from earlier systems. Wilson described XCS as an accuracy-based classifier system and earlier systems as strength-based. The two differ in that in strength-based systems the fitness of a rule is proportional to the return (reward/payoff) it receives, whereas in XCS it is a function of the accuracy with which return is predicted. The difference is thus one of credit assignment, that is, of how a rule's contribution to the system's performance is estimated. XCS is a Q-learning system; in fact, it is a proper generalisation of tabular Q-learning, in which rules aggregate states and actions. In XCS, as in other Q-learners, Q-values are used to weight action selection
Physical Description:	1 online resource (XVI, 307 p)
ISBN:	9780857294166
DOI:	10.1007/978-0-85729-416-6
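
Illustration: the abstract above contrasts strength-based fitness (proportional to the return a rule receives) with accuracy-based fitness (a function of how accurately return is predicted). The following is a minimal sketch of that contrast, not code from the book. The class `Rule`, the two fitness functions, the payoff sequences, and the parameter values `BETA`, `EPS0`, `ALPHA` and `NU` are all assumptions chosen for illustration, and the accuracy function is a simplified form that omits the relative-accuracy and fitness-averaging steps a full XCS implementation would use.

```python
from dataclasses import dataclass

# All parameter values below are illustrative assumptions.
BETA = 0.2    # learning rate for the running estimates
EPS0 = 10.0   # error threshold below which a rule counts as fully accurate
ALPHA = 0.1   # scale of the accuracy fall-off above the threshold
NU = 5.0      # exponent of the accuracy fall-off


@dataclass
class Rule:
    prediction: float = 0.0  # running estimate of return ("strength")
    error: float = 0.0       # running estimate of absolute prediction error

    def update(self, payoff: float) -> None:
        # Move both estimates a fraction BETA towards the new observation.
        self.error += BETA * (abs(payoff - self.prediction) - self.error)
        self.prediction += BETA * (payoff - self.prediction)


def strength_fitness(rule: Rule) -> float:
    # Strength-based view: fitness follows the return the rule receives.
    return rule.prediction


def accuracy_fitness(rule: Rule) -> float:
    # Accuracy-based view: fitness depends only on how well return is predicted.
    if rule.error < EPS0:
        return 1.0
    return ALPHA * (rule.error / EPS0) ** -NU


# A rule that always receives a modest payoff, and an overgeneral rule whose
# payoff alternates between high and zero (hypothetical data).
consistent = Rule()
overgeneral = Rule()
for step in range(200):
    consistent.update(100.0)
    overgeneral.update(1000.0 if step % 2 == 0 else 0.0)

print("strength:", strength_fitness(consistent), strength_fitness(overgeneral))
print("accuracy:", accuracy_fitness(consistent), accuracy_fitness(overgeneral))
```

Under these assumed payoffs the overgeneral rule ranks higher by strength, because its average return is larger, while the consistent rule ranks higher by accuracy, because its return is predictable.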

Staff View

MARC


LEADER	00000nmm a2200000zc 4500
001	BV047064030
003	DE-604
005	00000000000000.0
007	cr\|uuu---uuuuu
008	201216s2004 \|\|\|\| o\|\|u\| \|\|\|\|\|\|eng d
020			\|a 9780857294166 \|9 978-0-85729-416-6
024	7		\|a 10.1007/978-0-85729-416-6 \|2 doi
035			\|a (ZDB-2-SCS)978-0-85729-416-6
035			\|a (OCoLC)1227479216
035			\|a (DE-599)BVBBV047064030
040			\|a DE-604 \|b ger \|e aacr
041	0		\|a eng
049			\|a DE-706
082	0		\|a 006.3 \|2 23
100	1		\|a Kovacs, Tim \|e Verfasser \|4 aut
245	1	0	\|a Strength or Accuracy: Credit Assignment in Learning Classifier Systems \|c by Tim Kovacs
250			\|a 1st ed. 2004
264		1	\|a London \|b Springer London \|c 2004
300			\|a 1 Online-Ressource (XVI, 307 p)
336			\|b txt \|2 rdacontent
337			\|b c \|2 rdamedia
338			\|b cr \|2 rdacarrier
490	0		\|a Distinguished Dissertations
520			\|a Classifier systems are an intriguing approach to a broad range of machine learning problems, based on automated generation and evaluation of condition/action rules. In reinforcement learning tasks they simultaneously address the two major problems of learning a policy and generalising over it (and related objects, such as value functions). Despite over 20 years of research, however, classifier systems have met with mixed success, for reasons which were often unclear. Finally, in 1995 Stewart Wilson claimed a long-awaited breakthrough with his XCS system, which differs from earlier classifier systems in a number of respects, the most significant of which is the way in which it calculates the value of rules for use by the rule generation system. Specifically, XCS (like most classifier systems) employs a genetic algorithm for rule generation, and the way in which it calculates rule fitness differs from earlier systems. Wilson described XCS as an accuracy-based classifier system and earlier systems as strength-based. The two differ in that in strength-based systems the fitness of a rule is proportional to the return (reward/payoff) it receives, whereas in XCS it is a function of the accuracy with which return is predicted. The difference is thus one of credit assignment, that is, of how a rule's contribution to the system's performance is estimated. XCS is a Q-learning system; in fact, it is a proper generalisation of tabular Q-learning, in which rules aggregate states and actions. In XCS, as in other Q-learners, Q-values are used to weight action selection
650		4	\|a Artificial Intelligence
650		4	\|a Algorithm Analysis and Problem Complexity
650		4	\|a Computer Appl. in Administrative Data Processing
650		4	\|a Artificial intelligence
650		4	\|a Algorithms
650		4	\|a Application software
655		7	\|0 (DE-588)4113937-9 \|a Hochschulschrift \|2 gnd-content
776	0	8	\|i Erscheint auch als \|n Druck-Ausgabe \|z 9781447110583
776	0	8	\|i Erscheint auch als \|n Druck-Ausgabe \|z 9781852337704
776	0	8	\|i Erscheint auch als \|n Druck-Ausgabe \|z 9780857294173
856	4	0	\|u https://doi.org/10.1007/978-0-85729-416-6 \|x Verlag \|z URL des Erstveröffentlichers \|3 Volltext
912			\|a ZDB-2-SCS
940	1		\|q ZDB-2-SCS_2000/2004
999			\|a oai:aleph.bib-bvb.de:BVB01-032471142
966	e		\|u https://doi.org/10.1007/978-0-85729-416-6 \|l UBY01 \|p ZDB-2-SCS \|q ZDB-2-SCS_2000/2004 \|x Verlag \|3 Volltext

Record in Search Index

_version_	1804182061529956352
adam_txt
any_adam_object
any_adam_object_boolean
author	Kovacs, Tim
author_facet	Kovacs, Tim
author_role	aut
author_sort	Kovacs, Tim
author_variant	t k tk
building	Verbundindex
bvnumber	BV047064030
collection	ZDB-2-SCS
ctrlnum	(ZDB-2-SCS)978-0-85729-416-6 (OCoLC)1227479216 (DE-599)BVBBV047064030
dewey-full	006.3
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.3
dewey-search	006.3
dewey-sort	16.3
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
discipline_str_mv	Informatik
doi_str_mv	10.1007/978-0-85729-416-6
edition	1st ed. 2004
format	Electronic eBook
genre	(DE-588)4113937-9 Hochschulschrift gnd-content
genre_facet	Hochschulschrift
id	DE-604.BV047064030
illustrated	Not Illustrated
index_date	2024-07-03T16:12:21Z
indexdate	2024-07-10T09:01:34Z
institution	BVB
isbn	9780857294166
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-032471142
oclc_num	1227479216
open_access_boolean
owner	DE-706
owner_facet	DE-706
physical	1 Online-Ressource (XVI, 307 p)
psigel	ZDB-2-SCS ZDB-2-SCS_2000/2004 ZDB-2-SCS ZDB-2-SCS_2000/2004
publishDate	2004
publishDateSearch	2004
publishDateSort	2004
publisher	Springer London
record_format	marc
series2	Distinguished Dissertations
spellingShingle	Kovacs, Tim Strength or Accuracy: Credit Assignment in Learning Classifier Systems Artificial Intelligence Algorithm Analysis and Problem Complexity Computer Appl. in Administrative Data Processing Artificial intelligence Algorithms Application software
subject_GND	(DE-588)4113937-9
title	Strength or Accuracy: Credit Assignment in Learning Classifier Systems
title_auth	Strength or Accuracy: Credit Assignment in Learning Classifier Systems
title_exact_search	Strength or Accuracy: Credit Assignment in Learning Classifier Systems
title_exact_search_txtP	Strength or Accuracy: Credit Assignment in Learning Classifier Systems
title_full	Strength or Accuracy: Credit Assignment in Learning Classifier Systems by Tim Kovacs
title_fullStr	Strength or Accuracy: Credit Assignment in Learning Classifier Systems by Tim Kovacs
title_full_unstemmed	Strength or Accuracy: Credit Assignment in Learning Classifier Systems by Tim Kovacs
title_short	Strength or Accuracy: Credit Assignment in Learning Classifier Systems
title_sort	strength or accuracy credit assignment in learning classifier systems
topic	Artificial Intelligence Algorithm Analysis and Problem Complexity Computer Appl. in Administrative Data Processing Artificial intelligence Algorithms Application software
topic_facet	Artificial Intelligence Algorithm Analysis and Problem Complexity Computer Appl. in Administrative Data Processing Artificial intelligence Algorithms Application software Hochschulschrift
url	https://doi.org/10.1007/978-0-85729-416-6
work_keys_str_mv	AT kovacstim strengthoraccuracycreditassignmentinlearningclassifiersystems
