Document Detail


Approximate Dynamic Programming for Optimal Stationary Control with Control-Dependent Noise.
MedLine Citation:
PMID:  21954203     Owner:  NLM     Status:  Publisher    
Abstract/OtherAbstract:
This brief studies the stochastic optimal control problem via reinforcement learning and approximate/adaptive dynamic programming (ADP). A policy iteration algorithm is derived in the presence of both additive and multiplicative noise using Itô calculus. The expectation of the approximated cost matrix is guaranteed to converge to the solution of some algebraic Riccati equation that gives rise to the optimal cost value. Moreover, the covariance of the approximated cost matrix can be reduced by increasing the length of time interval between two consecutive iterations. Finally, a numerical example is given to illustrate the efficiency of the proposed ADP methodology.
Authors:
Yu Jiang; Zhong-Ping Jiang
Related Documents :
20561763 - Evaluating verbal and non-verbal communication skills, in an ethnogeriatric osce.
19638243 - Six year follow-up of students who participated in a school-based physical activity int...
1442793 - Health and safety education for worker empowerment.
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2011-9-26
Journal Detail:
Title:  IEEE transactions on neural networks / a publication of the IEEE Neural Networks Council     Volume:  -     ISSN:  1941-0093     ISO Abbreviation:  -     Publication Date:  2011 Sep 
Date Detail:
Created Date:  2011-9-28     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101211035     Medline TA:  IEEE Trans Neural Netw     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Provision of essential newborn care training to midwives in Zambia is a low-cost intervention that r...
Next Document:  Estimating the Ultimate Bound and Positively Invariant Set for a Class of Hopfield Networks.