Deep feature-action processing with mixture of updates

Altahhan, A (2015) Deep feature-action processing with mixture of updates. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9492. pp. 1-10. ISSN 0302-9743 DOI: https://doi.org/10.1007/978-3-319-26561-2_1

Abstract

© Springer International Publishing Switzerland 2015. This paper explores the possibility of combining an actor and a critic in one architecture and uses a mixture of updates to train them. It describes a model for robot navigation whose architecture is similar to an actor-critic reinforcement learning architecture. The actor is set up as one layer, followed by a second layer that deduces the value function. The effect is to obtain an outcome similar to that of a critic, combined with the actor in one network. The model can hence serve as the base for a truly deep reinforcement learning architecture that can be explored in the future. More importantly, this work explores the results of mixing conjugate gradient updates with gradient updates for this architecture. The reward signal is backpropagated from the critic to the actor through a conjugate gradient eligibility trace for the second layer combined with a gradient eligibility trace for the first layer. We show that this mixture of updates seems to work well for this model. The feature layer has been deeply trained by applying a simple PCA to the whole set of image histograms acquired during the first running episode. The model is also able to adapt to a reduced feature dimension autonomously. Initial experimental results on a real robot show that the agent achieved a good success rate in reaching a goal location.
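
The feature step mentioned in the abstract (PCA applied to image histograms) can be illustrated with a minimal sketch. This is not the paper's implementation: the bin count, image size, number of components, and the function names `image_histogram`, `fit_pca`, and `project` are all assumptions made for illustration, and random arrays stand in for the robot's camera frames.

```python
import numpy as np


def image_histogram(image, bins=16):
    """Return a normalized intensity histogram for one grayscale image."""
    counts, _ = np.histogram(image, bins=bins, range=(0, 256))
    return counts / counts.sum()


def fit_pca(histograms, n_components):
    """Fit PCA via SVD; return the mean and the top principal directions."""
    mean = histograms.mean(axis=0)
    centered = histograms - mean
    # Rows of vt are principal directions ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]


def project(histogram, mean, components):
    """Map one histogram to its reduced feature vector."""
    return components @ (histogram - mean)


# Hypothetical data: 50 random 32x32 "images" standing in for camera frames
# collected during a first episode (assumption, not the paper's data).
rng = np.random.default_rng(0)
images = rng.integers(0, 256, size=(50, 32, 32))
histograms = np.stack([image_histogram(img) for img in images])

mean, components = fit_pca(histograms, n_components=5)
features = project(histograms[0], mean, components)
```

In a setup like this, the projection would be fitted once on the first episode's histograms and then reused to turn each new frame into a low-dimensional feature vector for the learning layers.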

Identification Number:	https://doi.org/10.1007/978-3-319-26561-2_1
Status:	Published
Refereed:	Yes
Publisher:	Springer
Uncontrolled Keywords:	08 Information And Computing Sciences, Artificial Intelligence & Image Processing
Depositing User (symplectic):	Deposited by Altahhan, Abdulrahman
Date Deposited:	29 Nov 2017 16:15
Last Modified:	28 Aug 2025 21:39
Item Type:	Article

Download

Restricted to Repository staff only.
Due to copyright restrictions, this file is not available for public download. For more information please email openaccess@leedsbeckett.ac.uk.
