Random-TD Function Approximator

In this paper, adaptive controller architecture based on a combination of temporal-difference (TD) learning and an on-line variant of Random Forest (RF) classifier is proposed. We call this implementation Random-TD. The approach iteratively improves its control strategies by exploiting only relevant...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Journal of advanced computational intelligence and intelligent informatics Ročník 13; číslo 2; s. 155 - 161
Hlavní autor:	Osman, Hassab Elgawi
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	01.03.2009
ISSN:	1343-0130, 1883-8014
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Abstract	In this paper, adaptive controller architecture based on a combination of temporal-difference (TD) learning and an on-line variant of Random Forest (RF) classifier is proposed. We call this implementation Random-TD. The approach iteratively improves its control strategies by exploiting only relevant parts of action and is able to learn completely in on-line mode. Such capability of on-line adaptation would take us closer to the goal of more robust and adaptable control. To illustrate this and to demonstrate the applicability of the approach, it has been applied to a non-linear, non-stationary control task, Cart-Pole balancing and on high-dimensional control problems –Ailerons, Elevator, Kinematics, and Friedman–. The results demonstrate that our hybrid approach is adaptable and can significantly improves the performance of TD methods while speeding up the learning process.
AbstractList	In this paper, adaptive controller architecture based on a combination of temporal-difference (TD) learning and an on-line variant of Random Forest (RF) classifier is proposed. We call this implementation Random-TD. The approach iteratively improves its control strategies by exploiting only relevant parts of action and is able to learn completely in on-line mode. Such capability of on-line adaptation would take us closer to the goal of more robust and adaptable control. To illustrate this and to demonstrate the applicability of the approach, it has been applied to a non-linear, non-stationary control task, Cart-Pole balancing and on high-dimensional control problems –Ailerons, Elevator, Kinematics, and Friedman–. The results demonstrate that our hybrid approach is adaptable and can significantly improves the performance of TD methods while speeding up the learning process.
Author	Osman, Hassab Elgawi
Author_xml	– sequence: 1 givenname: Hassab Elgawi surname: Osman fullname: Osman, Hassab Elgawi
BookMark	eNp9j81KAzEUhYNUsNa-gBv7Aqk3uUkmsyz1FwqC1HXIJBlIaSdDZgR9e9PWlQtX99zFdzjfNZl0qQuE3DJYcqiVvN9ZF2MsD9TLHpiUF2TKtEaqgYlJySiQAkO4IvNh2AGUzBUINiV377bz6UC3D4unz86NMXWLVd_n9BUPdkz5hly2dj-E-e-dkY-nx-36hW7enl_Xqw11iHKkztdceKmVk2gFBCkkoMemqVwjKwlatYqD9rxRTtXBV4KzULe-8q0sWxBnhJ97XU7DkENr-lwW5G_DwJwszdnSHC3NybJA-g_k4miPDmO2cf8f-gN1HloO
CitedBy_id	crossref_primary_10_1016_j_neucom_2016_08_155
Cites_doi	10.1023/A:1010933404324 10.1109/TNN.1998.712192 10.1109/CVPRW.2008.4563065 10.1145/1143844.1143901 10.1177/105971230501300301 10.1016/j.neucom.2007.11.026 10.1007/BF00115009 10.1214/aos/1176347963 10.1109/TSMC.1983.6313077
ContentType	Journal Article
CorporateAuthor	Image Science and Engineering Lab, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226-8503, Japan
CorporateAuthor_xml	– name: Image Science and Engineering Lab, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226-8503, Japan
DBID	AAYXX CITATION
DOI	10.20965/jaciii.2009.p0155
DatabaseName	CrossRef
DatabaseTitle	CrossRef
DatabaseTitleList	CrossRef
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1883-8014
EndPage	161
ExternalDocumentID	10_20965_jaciii_2009_p0155
GroupedDBID	AAYXX ALMA_UNASSIGNED_HOLDINGS CITATION GROUPED_DOAJ ISHAI P2P TUS
ID	FETCH-LOGICAL-c335t-cd924d586c53a40e54503d3bb7cb575086f6208d2b6c69ed7421e9fd7df513233
ISICitedReferencesCount	2
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000448658100013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	1343-0130
IngestDate	Tue Nov 18 22:33:07 EST 2025 Sat Nov 29 06:43:30 EST 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	2
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c335t-cd924d586c53a40e54503d3bb7cb575086f6208d2b6c69ed7421e9fd7df513233
OpenAccessLink	https://doi.org/10.20965/jaciii.2009.p0155
PageCount	7
ParticipantIDs	crossref_primary_10_20965_jaciii_2009_p0155 crossref_citationtrail_10_20965_jaciii_2009_p0155
PublicationCentury	2000
PublicationDate	2009-03-01
PublicationDateYYYYMMDD	2009-03-01
PublicationDate_xml	– month: 03 year: 2009 text: 2009-03-01 day: 01
PublicationDecade	2000
PublicationTitle	Journal of advanced computational intelligence and intelligent informatics
PublicationYear	2009
References	key-10.20965/jaciii.2009.p0155-3 key-10.20965/jaciii.2009.p0155-2 key-10.20965/jaciii.2009.p0155-1 key-10.20965/jaciii.2009.p0155-11 key-10.20965/jaciii.2009.p0155-12 key-10.20965/jaciii.2009.p0155-7 key-10.20965/jaciii.2009.p0155-6 key-10.20965/jaciii.2009.p0155-5 key-10.20965/jaciii.2009.p0155-4 key-10.20965/jaciii.2009.p0155-9 key-10.20965/jaciii.2009.p0155-8 key-10.20965/jaciii.2009.p0155-10
References_xml	– ident: key-10.20965/jaciii.2009.p0155-2 doi: 10.1023/A:1010933404324 – ident: key-10.20965/jaciii.2009.p0155-4 – ident: key-10.20965/jaciii.2009.p0155-9 doi: 10.1109/TNN.1998.712192 – ident: key-10.20965/jaciii.2009.p0155-5 doi: 10.1109/CVPRW.2008.4563065 – ident: key-10.20965/jaciii.2009.p0155-6 doi: 10.1145/1143844.1143901 – ident: key-10.20965/jaciii.2009.p0155-11 doi: 10.1177/105971230501300301 – ident: key-10.20965/jaciii.2009.p0155-10 – ident: key-10.20965/jaciii.2009.p0155-12 doi: 10.1016/j.neucom.2007.11.026 – ident: key-10.20965/jaciii.2009.p0155-7 doi: 10.1007/BF00115009 – ident: key-10.20965/jaciii.2009.p0155-8 – ident: key-10.20965/jaciii.2009.p0155-3 doi: 10.1214/aos/1176347963 – ident: key-10.20965/jaciii.2009.p0155-1 doi: 10.1109/TSMC.1983.6313077
SSID	ssj0001326041 ssib051641541
Score	1.7579434
Snippet	In this paper, adaptive controller architecture based on a combination of temporal-difference (TD) learning and an on-line variant of Random Forest (RF)...
SourceID	crossref
SourceType	Enrichment Source Index Database
StartPage	155
Title	Random-TD Function Approximator
Volume	13
WOSCitedRecordID	wos000448658100013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1883-8014 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001326041 issn: 1343-0130 databaseCode: DOA dateStart: 20070101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1883-8014 dateEnd: 99991231 omitProxy: false ssIdentifier: ssib051641541 issn: 1343-0130 databaseCode: M~E dateStart: 19970101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1JS8NAFB60evDiLu7m4K0EZzLZ5ljU4qmKVOitZJZIocZiq_bn-2ZJmtYFPXgJ4ZF5ZPoeb5m-fB9C50lOwFNy4kMyFT5EP-yzDId-JHKmGCQJyi3ZRNLppL0eu3ND7GNDJ5AURTqdstG_mhpkYGz96ewfzF0pBQHcg9HhCmaH668Mf58VEgJf96rZhpxlzNvSwOHTwVNmoYW_qkarWQBhaB7KI8JBHbHTAjWVAs0t4Cre2cD87didqN5AUZ7x5vXwMXsfzB0tsNlslYuGNNQC98eJsrI0pTqthXMhlNZcJajFQ2IxeF1qJRZ3fTFqBxqBxtAFCA2oYTBER7haWofIXkhd1UAhtDJGS9_q0OSarG90LKOVIIkYq3XbEGoi6BKheCSz4zgoY3Fo23O3afuJlVF78enVamVMrR7pbqJ1ZzqvZR1gCy2pYhttlCQdnovZO-is8gev9Aev7g-76KF93b288R0rhi8ojSa-kNAyyyiNRUSzECsogTGVlPNEcKi9oUXN4wCnMuCxiJmSSRgQxXKZyDyCXVK6hxrFc6H2kScCWJWrNA-pCglOeSQVUZhTCjpjzg4QKTfZFw4yXjOXDPvf_94HqFmtGVnAlB-ePvzT00dobeamx6gxeXlVJ2hVvE0G45dTY-APwmVheA
linkProvider	Directory of Open Access Journals
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Random-TD+Function+Approximator&rft.jtitle=Journal+of+advanced+computational+intelligence+and+intelligent+informatics&rft.au=Osman%2C+Hassab+Elgawi&rft.date=2009-03-01&rft.issn=1343-0130&rft.eissn=1883-8014&rft.volume=13&rft.issue=2&rft.spage=155&rft.epage=161&rft_id=info:doi/10.20965%2Fjaciii.2009.p0155&rft.externalDBID=n%2Fa&rft.externalDocID=10_20965_jaciii_2009_p0155
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1343-0130&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1343-0130&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1343-0130&client=summon