Transcript Slide 1
Brain Awareness Week at the European Parliament
How our brain works:
Recent advances in the theory of brain function
1
“Objects are always imagined as being present in the field of
vision as would have to be there in order to produce the same
impression on the nervous mechanism” - Hermann Ludwig
Ferdinand von Helmholtz
Geoffrey Hinton
From the Helmholtz machine to
the Bayesian brain and selforganization
Thomas Bayes
Richard Feynman
Hermann Haken
2
temperature
What is the difference between a
snowflake and a bird?
Phase-boundary
…a bird can avoid surprises
3
What is the difference between snowfall and a flock of birds?
Ensemble dynamics and swarming
…birds (biological agents) stay in the same place
They resist the second law of thermodynamics, which says
that their entropy should increase
4
But what is the entropy?
T
…entropy is just average surprise H dt S (t ) S (t ) ln p( s | m)
0
s
A
Low surprise (I am usually here)
High surprise (I am never here)
This means biological agents must self-organize to minimise surprise. In
other words, to ensure they occupy a limited number of (attracting) states
5
But there is a small problem… agents cannot measure their surprise
s
?
But they can measure their free-energy, which is always bigger than surprise
F (t ) S (t )
This means agents should minimize their free-energy. So what does this mean?
6
What is free-energy?
…free-energy is basically prediction error
sensations – predictions
= prediction error
where small errors mean low surprise
7
How can we minimize prediction error (free-energy)?
sensations – predictions
Prediction error
Change
sensory input
Change
predictions
Action
Perception
…prediction errors drive action and perception to suppress themselves
8
But where do predictions come from?
…they come from the brain’s model of the world
Sensory input
Models (hypotheses)
Prediction
error
This means the brain models and predicts its sensations
(cf, a Helmholtz machine).
9
So how do prediction errors change predictions?
sensory input
Forward connections convey
feedback
Adjust hypotheses
Prediction errors
Predictions
prediction
Backward connections return
predictions
…by hierarchical message passing in the brain
10
Why hierarchical message passing?
cortical layers
Specialised
cortical areas
…because the brain is organized hierarchically, where each
level predicts the level below
11
David Mumford
Hierarchical message passing in the brain
Forward prediction error
(i ) v
(i 2)v
(i 1)v
(i 1) x
(i ) x
(i 1)v
(i ) v
s(t )
(i ) x
(i1) x
Backward predictions
12
What about action?
predictions
Reflexes to action
dorsal root
sensory error
s(a)
action
ventral horn
a
Action can only suppress (sensory) prediction error.
This means action fulfils our (sensory) predictions
13
Summary
•Biological agents resist the second law of thermodynamics
•They must minimize their average surprise (entropy)
•They minimize surprise by suppressing prediction error (free-energy)
•Prediction error can be reduced by changing predictions (perception)
•Prediction error can be reduced by changing sensations (action)
•Perception entails recurrent message passing in the brain to optimise predictions
•Action makes predictions come true (and minimises surprise)
Examples:
Perception (birdsongs)
Action (goal-directed reaching)
Policies (the mountain car problem)
14
Making bird songs with Lorenz attractors
Syrinx
Sonogram
Frequency
Vocal centre
v1
v
v2
0.5 1
1.5
time (sec)
causal states
hidden states
15
Perception and
message passing
prediction and error
20
15
10
5
0
-5
10
v
20
30
40
50
60
causal states
Backward predictions
20
x
15
stimulus
10
5000
5
4500
s(t )
4000
Forward prediction error
v
0
x
-5
3500
-10
3000
hidden states
10
20
30
40
50
60
20
2500
2000
15
0.2
0.4
0.6
time (seconds)
0.8
10
5
0
-5
10
20
30
40
50
60
16
Frequency (Hz)
Perceptual categorization
Song a
Song b
Song c
time (seconds)
17
Hierarchical models of birdsong: sequences of sequences
v1(1)
v2(1)
sonogram
Syrinx
Frequency (KHz)
Neuronal hierarchy
0.5
1
1.5
Time (sec)
18
Simulated lesions and hallucinations
percept
LFP
Frequency (Hz)
LFP (micro-volts)
60
40
20
0
-20
-40
no top-down messages
LFP
Frequency (Hz)
LFP (micro-volts)
60
40
20
0
-20
-40
-60
no lateral messages
Frequency (Hz)
LFP (micro-volts)
LFP
0.5
1
1.5
time (seconds)
60
40
20
0
-20
-40
-60
0
500
1000
1500
2000
peristimulus time (ms)
19
(1) x
Action, predictions and priors
(2)v
(1)v
(1) x
(1)v
visual input
V
s w
J
(0, 0)
x1
Descending
sensory prediction
error
J1
proprioceptive input
(1)v
x
s 1w
x2
x2
J2
V (v1 , v2 , v3 )
a
20
The mountain car problem
The environment
The cost-function
0.7
Adriaan Fokker
Max Planck
0.6
height
0.5
( x)
0.4
0.3
c ( x, h )
0.2
0.1
0
-2
-1
0
1
2
position
True equations of motion
x
x
f
1
x (a) x 8 x
( x)
(h)
position
happiness
Policy (predicted motion)
x x
f
x cx x
“I expect to move faster when cost is positive”
21
a
Exploring & exploiting the environment
With cost
(i.e., exploratory
dynamics)
22
Policies and prior expectations
Using just the free-energy principle and a simple gradient ascent scheme, we have solved
a benchmark problem in optimal control theory using a handful of learning trials.
If priors are so important, where do they come from?
23
…we inherit them
Darwinian evolution of virtual block
creatures. A population of several
hundred creatures is created within a
supercomputer, and each creature is
tested for their ability to perform a given
task, such the ability to swim in a
simulated water environment. The
successful survive, and their virtual genes
are copied, combined, and mutated to
make offspring. The new creatures are
again tested, and some may be
improvements on their parents. As this
cycle of variation and selection continues,
creatures with more and more successful
behaviours can emerge.
The selection of adaptive predictions
24
Time-scale
Free-energy minimisation leading to…
10 3 s
Perception and Action: The optimisation of neuronal and
neuromuscular activity to suppress prediction errors (or freeenergy) based on generative models of sensory data.
100 s
103 s
106 s
1015 s
Learning and attention: The optimisation of synaptic gain and
efficacy over seconds to hours, to encode the precisions of
prediction errors and causal structure in the sensorium. This
entails suppression of free-energy over time.
Neurodevelopment: Model optimisation through activitydependent pruning and maintenance of neuronal connections that
are specified epigenetically
Evolution: Optimisation of the average free-energy (free-fitness)
over time and individuals of a given class (e.g., conspecifics) by
selective pressure on the epigenetic specification of their
generative models.
25
Thank you
And thanks to collaborators:
Jean Daunizeau
Lee Harrison
Stefan Kiebel
James Kilner
Klaas Stephan
And colleagues:
Peter Dayan
Jörn Diedrichsen
Paul Verschure
Florentin Wörgötter
26