Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Reinforcement learning or active inference?
Organization Unit
Authors
  • K J Friston
  • Jean Daunizeau
  • S J Kiebel
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title PLoS ONE
Publisher Public Library of Science (PLoS)
Geographical Reach international
ISSN 1932-6203
Volume 4
Number 7
Page Range e6421
Date 2009
Abstract Text This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain.
Free access at PubMed ID
Digital Object Identifier 10.1371/journal.pone.0006421
PubMed ID 19641614
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)