Recognition I: Extended Gaussian Images Andrew Nashel COMP 290-075: Computer Vision
Download
Report
Transcript Recognition I: Extended Gaussian Images Andrew Nashel COMP 290-075: Computer Vision
Recognition I:
Extended Gaussian Images
Andrew Nashel
COMP 290-075: Computer Vision
http://www.cs.unc.edu/~nashel/290-075/
Overview
Motivation
Gaussian Image
Extended Gaussian Image
– Gaussian Curvature
– Extended Circular Image
Complex EGI
Motivation
Object recognition is often one of the ultimate goals
for vision systems.
It is necessary for real world interactions such as:
– Navigation through environments
– Robotic handling of objects
– Object inspection
It brings together many components of computer
vision:
– Depth extraction
– Image segmentation
– Geometric modeling
The Gaussian Image
Surface normal information for any object may be
mapped onto a unit (Gaussian) sphere by finding the
point on the sphere with the same surface normal:
n̂
~
n̂
~
Properties of the Gaussian Image
This mapping is called the Gaussian image of the
object when the surface normals for each point on the
object are placed such that:
– tails lie at the center of the Gaussian sphere
– heads lie on the sphere at the matching normal point
In areas of convex objects with positive curvature, no
two points will have the same normal.
Patches on the surface with zero curvature (lines or
areas) may correspond to a single point on the sphere.
Rotations of the object correspond to rotations of the
sphere.
The Extended Gaussian Image
We can extend the Gaussian image by
– placing a mass at each point on the sphere equal to the area
of the surface having the given normal
– masses are represented by vectors parallel to the normals,
with length equal to the mass
An example:
Block
EGI of Block
Using the EGI
EGIs for different objects or object types may be
computed and stored in a model database as a
surface normal vector histogram.
Given a depth image, surface normals may be
extracted by plane fitting.
By comparing EGI histogram of the extracted
normals and those in the database, the identity and
orientation of the object may be found.
Problems with the EGI
EGIs only uniquely define convex objects.
An infinite number of non-convex objects may have
the same EGI:
Areas: a+b=c
d+e=f
b
d
b
d
e
a
e
a
e
c
Gaussian Curvature
Formally, we will develop the extended Gaussian
image based upon the Gaussian curvature of the
object.
Consider a patch of area
O on the object, and the
corresponding area S on
the Gaussian sphere:
Defining Gaussian Curvature
Given patches O and S, we define Gaussian
curvature K as the limit of the ratio of the two areas
as they approach zero:
dS
S
K = lim
=
O0 O
dO
– If the object surface is strongly curved, then the corresponding
points on the Gaussian sphere will be spread out.
– If the surface is planar, the normals will be parallel and will
map to a single point on the Gaussian sphere.
Defining Curvature Continued
If we integrate over a patch O on the object we have:
∫∫O K dO = ∫∫S dS = S
where S is the area of the patch on the Gaussian sphere.
We call the expression on the left the integral
curvature. This allows us to handle surfaces with
discontinuities in surface normals.
Defining Curvature Continued
Similarly, if we integrate over a patch S:
∫∫S 1/K dS = ∫∫O dO = O
where O is the area of the patch on the object.
This relationship suggests that the inverse of the
curvature will be used to define the extended
Gaussian image.
Defining EGI
Let u and v be used to specify points on the original
surface, and let and specify points on the
Gaussian sphere.
We now define the extended Gaussian image as:
G(,) =
1
K(u,v)
the inverse of the Gaussian curvature, where (,) is the
point on the Gaussian sphere corresponding to the
point (u,v) on the object.
The Discrete Case EGI
To represent the information of the Gaussian sphere
in a computer, the sphere is divided into cells:
For each image cell on the left, a surface orientation
is found and accumulated in the corresponding cell of
the sphere.
Discrete Approximation
In an actual implementation of a discrete EGI, we
start with a surface orientation map.
Shown here is a needle
diagram of an inclined
torus obtained by
photometric stereo:
Orientation Histogram
The discrete approximation of the EGI is called the
orientation histogram.
The needle diagram of the
torus is projected onto a
tessellated unit sphere to
create an orientation
histogram, displayed as a
set of spikes:
The Extended Circular Image
The extended circular image is the 2-D equivalent of
the extended Gaussian image.
Polygon Morphing with the ECI
An alternative to pixel-based morphing algorithms for
convex polygons:
– First compute the ECI representation of the source and
target polygons.
– Match source and target normals on the ECI circle to create
source-target pairs.
– Interpolate weights and angles between pairs to find the ECI
of intermediate steps.
– Reconstruct the convex polygon from the ECI.
Java implementation:
http://web.mit.edu/manoli/ecimorph/www/code/MMorph.html
The Complex EGI
Another problem with the EGI is that the weights in
the representation only encode area information and
not positional data, thus it is impossible to determine
translation.
The Complex EGI is an alternative formulation in
which the weight at each discrete cell is a complex
number:
– The magnitude at each cell is the surface area as in the
standard EGI.
– The phase is the signed distance of the surface patch from a
designated origin along the normal.
The Complex EGI
We see that it is a simple modification to handle
displacement determination:
Conclusions
The extended Gaussian image is a useful technique
for representing the shape of an object, and even its
position (complex EGI).
However, it is only a component tool to be used in a
shape recognition process which also includes:
–
–
–
–
Surface orientation determination
Image segmentation into objects
Prototypical models/object databases
System control - what object to handle/inspect?
References
Horn, B.K.P. 1984. Extended Gaussian images. In Proceedings
of the IEEE 72, 12 (Dec.), pp. 1656-1678.
Horn, B.K.P. 1986. Robot Vision. MIT Press, Cambridge, MA,
pp. 365-399.
Kamvysselis, M. 1997. 2D Polygon Morphing using the
Extended Gaussian Image.
http://web.mit.edu/manoli/ecimorph/www/ecimorph.html
Kang, S.B. and K. Ikeuchi. 1990. 3-D Object Pose
Determination Using Complex EGI. tech. report CMU-RI-TR-9018, Robotics Institute, Carnegie Mellon University.