How We Cognize Space Zenon Pylyshyn Rutgers Center fir Cognitive Science Rutgers University, New Brunswick, NJ.

Download Report

Transcript How We Cognize Space Zenon Pylyshyn Rutgers Center fir Cognitive Science Rutgers University, New Brunswick, NJ.

How We Cognize Space
Zenon Pylyshyn
Rutgers Center fir Cognitive Science
Rutgers University,
New Brunswick, NJ
How we cognize space
Is There an Image Space in the Head?


The most common approach to the
question of how we represent a spatial
layout is that we represent it in the form of
a mental image. The format of mental
images is supposed to be particularly
suited for representing spatial information.
The mind’s great illusion: That you see a
world inside your head when you imagine
 the “Picture Theory” of mental imagery
The four-part plan of this lecture:
1. First I will talk a little about “the imagery debate” and
introduce recent neuroscience evidence.
2. I will then focus on what I consider the core of the
debate: How mental images represent space.
3. Then I will talk about the special case of images that
are projected onto the perceived world.
4. To show the generality of projected images that I will
need to spend a few minutes to introduce the idea of
visual indexes or FINSTs, as a type of a deictic or
demonstrative reference.
5. Finally I will combine the idea of indexes with the
evidence on mental imagery and suggest how
generalized indexes may allow spatial properties of the
currently-perceived world to translate into apparent
spatial properties of images.
I. The imagery debate:
A capsule overview
The main question is whether thoughts
experienced as “mental images” or as “seeing
with the mind’s eye” are different from other
thoughts, and if so how.
 The dominant view is the “picture theory” of
mental images, which assumes that images
stored in a spatial medium and are examined
by the visual system the same way that the
original scene would be.

Does imagery use the visual
system and if so, what does that
tell us about the nature of images?



There is some evidence that the visual system is
“active” during imagery
This has led to the view that the visual system
must be examining some not-yet-interpreted
image, just as it was thought to do in visual
perception.
But the last step is unwarranted because even if
the visual system was involved, it would only
mean that both vision and imagery use some of
the same processes and the same kind of
representations, but neither need be pictorial.
Failure of the picture-theory in vision
In vision the picture theory was meant to explain why
our perception is panoramic and stable while the visual
inputs are highly local, partial and constantly changing
But the picture theory of vision has been thoroughly
discredited: There is no rich panoramic display in vision
(e.g., see change blindness, superposition studies, …)
The picture theory of vision
is a non-starter, even for cats
(Cartoon by Kliban)
A more plausible theory of vision
(even for cats)
II. The newest round of
the imagery debate


In recent years the picture theory has been revived,
due largely to two neuroscience findings:
1. The visual cortex (V1) is activated during imagery
2. The visual cortex is retinotopically organized (i.e.,
it appears to map the retina in a topographically
continuous or homeomorphic manner).
From this, people have concluded that mental
imagery uses a literal spatial display, located in V1.
The goal of neuroscience research on
mental imagery is to find a display of the
imagined pattern in visual cortex
We already know that
there is a topographical
projection of retinal
activity in visual cortex
The tool of choice has
been the use of brain
scans (esp fMRI, PET)
Tootell, R. B., Silverman, M.
S., Switkes, E., & de Valois, R.
L. (1982). Deoxyglucose
analysis of retinotopic
organization in primate striate
cortex. Science, 218(4575),
What do recent neuroscience results
tell us about mental imagery?
None of the brain-scan (fMRI, PET) results
supports the picture theory of mental
images for reasons that I will discuss next
1. Even if there is a 2D mapping of retinal
activity in visual cortex (V1), this should
not be identified with the mental image.
2. Patterns in V1 do not function the same
way as mental images for several reasons.
3. Even if dynamic 3D patterns were found in
V1 it would not explain most mental
imagery research findings.
The topographical structure of the visual
cortex could not support mental images
1. Even of there is a 2D mapping of retinal
activity in V1, this cannot be identified with
the mental image which is panoramic,
3-dimensional, dynamic and has many other
properties that could not be mapped onto V1,
so we would need a different theory for them.
Why activity in visual cortex could
not correspond to a mental image
2. Patterns in V1 are different from mental images:
a) Patterns in V1 are foveal and retinocentric while
mental images are panoramic and allocentric
b) There is no spontaneous 3D interpretation of
patterns in mental images <parallelogram example>
c) There is no amodal completion of patterns in
mental images <Kanizsa example>
d) Order of access of information in a mental image
is not free <name letters of a familiar word backwords>
e) Emmert’s law does not hold for images <unlike
afterimages>
f) There is no visual (re)interpretation of images
<Slezak example>
Why activity in visual cortex could
not correspond to a mental image
2. Patterns in V1 are different from mental images:
a) Patterns in V1 are foveal and retinocentric while
mental images are panoramic allocentric
b) There is no spontaneous 3D interpretation of
patterns in mental images <parallelogram example>
c) There is no amodal completion of patterns in mental
images <Kanizsa example>
d) Order of access of information in a mental image is
not free <letter reading example>
e) Emmert’s law does not hold for images
f) There is no visual (re)interpretation of images
<Slezak example>
Imagine two parallelograms (as
below) one above the another
Close your eyes and then imagine
these two parallelograms
Connect the corresponding top and bottom vertices
What do you see? Keep looking to see if anything changes
Did you see this? Did it flip?
Why activity in visual cortex could
not correspond to a mental image
2. Patterns in V1 are different from mental images:
a) Patterns in V1 are foveal and retinocentric while
mental images are panoramic allocentric
b) There is no spontaneous 3D interpretation of
patterns in mental images <parallelogram example>
c) There is no amodal completion of patterns in
mental images <Kanizsa example>
d) Order of access of information in a mental image is
not free <letter reading example>
e) Emmert’s law does not hold for images
f) There is no visual (re)interpretation of images
<Slezak example>
Amodal completion in imagery?
Amodal completion in imagery?
Why activity in visual cortex could
not correspond to a mental image
2. Patterns in V1 are different from mental images:
a) Patterns in V1 are foveal and retinocentric while
mental images are panoramic allocentric
b) There is no spontaneous 3D interpretation of
patterns in mental images <parallelogram example>
c) There is no amodal completion of patterns in mental
images <Kanizsa example>
d) Order of access of information in a mental image is
not free <letter reading example>
e) Emmert’s law does not hold for images
f) There is no visual (re)interpretation of images
<Slezak example>
Slezak figures
Pick one (or two) of these animals and
memorize what they look like. Now
rotate it in your mind by 90 degrees
clockwise and see what it looks like.
Rotated Slezak figures
 No subject was able to recognize
the mentally rotated figure
 Subjects remembered the figures
well enough so if they drew it they
could recognize the rotated figure
Even if patterns in visual cortex were
isomorphic to those in the mental
image, it still would not explain most
results of mental imagery research!
3. The reason that patterns of activation in
striate cortex would not explain most of
the results of mental imagery research is
that the results are largely cognitively
penetrable and therefore require the
appeal to knowledge, goals, utilities, etc
and inferences over them. In other words
they require a cognitive explanation.
Task Demands and the tacit
knowledge explanation
The task of “imagining X” is the task of
pretending that you are seeing X and simulating
as much of that event as seems relevant to the
task using your tacit knowledge about how the
event might unfold. The task also requires
certain other skills (e.g., estimating time-tocollision, generating time intervals, etc) but it
does not require that you use a spatial display.
 Examples…

There are many examples showing
that the result that was attributed to
the mental image format is actually
due to tacit knowledge



Color mixing example to illustrate the difference
between the two sources of observations <slide>
Imagine dropping weights from different heights
Mental Image size (It has been shown that it
takes longer to report small details from a
small image than from a large on. What does
this mean? What would you think if the
result showed the opposite?)

Mental scanning <example slide>
Color mixing example
Studies of mental scanning
A window on the mind?
2
1.8
1.6
Latency (secs)
1.4
scan image
imagine lights
show direction
1.2
1
0.8
0.6
0.4
0.2
0
Relative distance on image
(Pylyshyn & Bannon. See Pylyshyn, 1981)
Mental representation of space:
The core of the imagery debate


It seems to be almost impossible to deny that thinking
using mental images exploits spatial properties of images
in some important sense. In what sense? (Do images
“preserve metrical spatial properties”?)
It is always possible to encode spatial relations in any
form of representation that has a numeral system, so why
assume that the representation of space is itself spatial?
1) Phenomenology: we see things as “laid out in space”!
2) Psychophysical evidence from projected images
(illusions, S-R compatibility…)
3) Clinical evidence (visual/imaginal neglect)
Use of real visible space in
“projected” mental imagery

“Projected images” serve to directing attention and to
associate thoughts with selected visible objects. Examples:
 Robust version of mental scanning (scanning with eyes open)
 Visual illusions involving projected images <Bernbaum & Chung,
1981>
 Projected memory images act like displays <Podgorny & Shepherd>
 S-R Compatibility with images (Tlauka & McKenna, 1998)
 Visuomotor (prism) adaptation from mental images <Finke, 1980>
Shepard & Podgorny experiment
Both when the displays are seen and when the F is
imagined, RT to say whether the dot was on the F was
fastest when the dot was at the vertex of the F, then when
on an arm of the F, then when far away from the F – and
slowest when one square off the F.
S-R Compatibility Effect with display
S-R Compatibility Effect with Images
Might all spatial images
work like projected images?

There are three key ideas behind the proposal that
spatial mental images are the projection of the spatial
layout of imagined objects onto a perceived scene
1. Recognition that the spatial properties exhibited in
experiments with projected images depend only on the
location of a few items and not on other visual properties
2. The idea of a limited-capacity amodal indexing
mechanism or deictic reference: FINSTs and Anchors.
3. The idea of a primitive amodal spatial sense that allows
us to perceive and recall the location of objects in an
allocentric frame of reference, independent of the
objects’ perceptual properties or of sense modality, and
automatically updated by our movements
We don’t need a spatial display in our
head if we have the right kind of deictic
contact with real (perceived) space



None of the experiments that are alleged to show the
existence of a spatial display (in visual cortex) need to
appeal to anything more than a small number of
imagined locations. (e.g., Shepherd & Podgorny, Finke, Tlauka,…)
If we can index a small number of (occupied) locations
in real space (using FINSTs) we can use them to
allocate attention or to program motor commands.
If these indexed objects are also bound to objects of
thought this will result in our thoughts (i.e. images)
having persisting spatial relations.
Aside on Visual Index (FINST) Theory

FINSTs are direct, unmediated, nonconceptual connections
between objects in the world and mental symbols
 FINSTs serve as visual demonstratives (like “this” or “that”).


Such direct references are essential for solving the
correspondence problem in vision – especially in the case
of visual representations built up incrementally over
different glances or “noticings”.
Some instances where we need Indexes:
 Visual stability, recognizing n-place relations, subitizing, and
multiple-object tracking
Several objects must be picked out
at once in relational judgments

When we judge that certain objects are
collinear, we must have picked out the relevant
individual objects first.
Several objects must be picked out
at once in relational judgments

The same is true for other relational judgments
like inside or on-the-same-contour… etc. We
must pick out the relevant individual objects first.
A concrete demonstration of
what visual indexes can do
Multiple Object Tracking studies (MOT)
 Basic finding: People can track up to 5 individual
objects that do not have a unique description
 We have shown that it is unlikely that the tracking
is done by updating locations but rather that
individuating and keeping track of certain kinds of
individuals is a primitive visual operation
 Tracking is primitive and likely both preconceptual
and preattemtive
 The mechanism for tracking is the same as the
mechanism that is used for picking out elements
when images are “projected” onto a scene.
How do we do it? What properties
of individual objects do we use?
How do we do it? What properties
of individual objects do we use?
But you can also imagine in the
dark or with your eyes closed!


Does imagery work differently in the dark or
with eyes closed?
Must indexes be visual?
The Sense of Space




This phrase is meant to denote an extremely welldeveloped human capacity to recall and orient to
locations in space; a space that is independent of
modality and is anchored to real allocentric space.
There is a major difference between a sense of space
and a visual image. The sense of space is not a
subjective experience but a skill that is largely
unconscious. There has long been a suspicion that what
has been studied under the name “mental imagery” is
really spatial ability (e.g., unconscious images?).
The sense of space does not need an internal spatial
medium; it can derive spatial properties by binding
mental particulars to real perceived space.
Perceptual Indexes (I.e., FINSTs and Anchors) are
mechanisms that allow representations to inherit some
of the spatial properties of the perceived world.
Some illustrations of the sense of space
Many phenomena that have been cited in support of the
picture theory of mental imagery only implicate a spatial
sense, not the visual perception of a mental display

Sense of space is not specific to (or parasitic on) vision
 Blind people exhibit all the observed phenomena of mental
imagery

Responses to images exhibit S-R compatibility and the
Simon effect – i.e., reactions made towards a stimulus are
faster than ones made away from it.
 The space that is relevant to the Simon effect is amodal (you get
cross-modal Simon effects)


Hemispatial Neglect is a deficit in orienting attention to
real locations – that’s why it may be mirrored in imagery
Mental Images can induce visuomotor adaptation
 But only location, not visual pattern, plays a role (R. Finke)

Observations such as the mental scanning effect, when
they are not due to task demands, can be explained in
terms of scanning through perceived space
Conclusion




Many of the “mental imagery” findings in the literature are the
result of subjects using their tacit knowledge to simulate what it
would be like to see the situation described.
The neuroscience evidence does not show that there is a 2D
display in visual cortex on which we “draw” images when we
imagine. The activity in visual cortex is of the wrong kind to
underwrite mental imagery.
More interesting are the studies in which people project images
onto perceived scenes because these studies do show the
involvement of spatial properties. But these experiments never
need to assume that a picture-like pattern is projected. All they
need to assume is that a few objects in the visual scene are
indexed and associated with objects of thought. The rest of the
spatial properties come from perception.
Although the clear cases are when images are projected onto a
visual scene, the same is likely true of other modalities that
contribute to our sense of space.
Representing space
The spatial character of mental
images (and other “spatial”
representations) comes from
binding objects of thought to
real objects in 3D space. The
space in mental imagery comes
from real concurrently-perceived
spatial relations, which give us
our exquisite sense of space.
References
Bernbaum, K., & Chung, C. S. (1981). Müller-Lyer illusion induced by imagination. Journal of Mental Imagery,
5(1), 125-128.
Kanizsa, G., & Gerbino, W. (1982). Amodal completion: Seeing or thinking? In B. Beck (Ed.), Organization and
Representation in Perception (pp. 167-190). Hillsdale, NJ: Erlbaum.
Pylyshyn, Z. W. (1973). What the Mind's Eye Tells the Mind's Brain: A Critique of Mental Imagery.
Psychological Bulletin, 80, 1-24.
Pylyshyn, Z. W. (1981). The imagery debate: Analogue media versus tacit knowledge. Psychological Review, 88,
16-45.
Pylyshyn, Z. W. (1989). The role of location indexes in spatial perception: A sketch of the FINST spatial-index
model. Cognition, 32, 65-97.
Pylyshyn, Z. W. (2000). Situating vision in the world. Trends in Cognitive Sciences, 4(5), 197-207.
Pylyshyn, Z. W. (2001). Visual indexes, preconceptual objects, and situated vision. Cognition, 80(1/2), 127-158.
Pylyshyn, Z. W. (2002). Mental Imagery: In search of a theory. Behavioral and Brain Sciences, 25(2), 157-237.
Pylyshyn, Z. W. (2003). Return of the Mental Image: Are there really pictures in the brain? Trends in Cognitive
Sciences, 7(3), 113-118.
Pylyshyn, Z. W. (in press). Seeing and visualizing: It's not what you think. Cambridge, MA: MIT Press/Bradford
Books.
Pylyshyn, Z. W., & Storm, R. W. (1988). Tracking multiple independent targets: evidence for a parallel tracking
mechanism. Spatial Vision, 3(3), 1-19.
Shepard, R. N., & Metzler, J. (1971). Mental rotation of three dimensional objects. Science, 171, 701-703.
Shepard, R. N., & Podgorny, P. (1978). Cognitive processes that resemble perceptual processes. In W. K. Estes
(Ed.), Handbook of learning and cognitive processes (Vol. 5). Hillsdale, NJ: Erlbaum.
Slezak, P. (1995). The `philosophical' case against visual imagery. In P. Slezak, T. Caelli & R. Clark (Eds.),
Perspective on Cognitive Science: Theories, Experiments and Foundations (pp. 237-271). Stamford, CT: Ablex.
Tlauka, M., & McKenna, F. P. (1998). Mental imagery yields stimulus-response compatibility. Acta
Psychologica, 67-79.