Super-Patches Image Tessellation with Arbitrarily Shaped Patches Eddie K. H. Ng, University of Toronto Joint work with Brendan Frey.
Download
Report
Transcript Super-Patches Image Tessellation with Arbitrarily Shaped Patches Eddie K. H. Ng, University of Toronto Joint work with Brendan Frey.
Super-Patches
Image Tessellation with Arbitrarily
Shaped Patches
Eddie K. H. Ng, University of Toronto
Joint work with Brendan Frey.
Motivations
Patch-based models have been successfully
applied to many different areas of computer
vision.
Patch size / shape selection remains a skillful
labor and confined to simple geometric shapes.
Causes problems in part-based object
recognition.
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Goals
Learn a set of coherent patches of arbitrary
shapes and sizes.
Use the resulting patches to enhance
performance of various tasks in computer
vision.
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Inspiration
Super-pixels [Ren
& Malik, 2003]
Create a local,
coherent entity while
maintaining overall
structure
Reduce computational
complexity.
X. Ren and J. Malik. Learning a classification model for segmentation. In Proc. 9th Int. Conf. Computer Vision, volume 1,
pages 10-17, 2003.
* Image taken from http://www.cs.sfu.ca/~mori/research/superpixels/.
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Basic Concepts [1]
The shape and size of each patch is specified
through a kernel function (e.g. Gaussian (m, s)).
The set of kernel functions compete for the
ownership each pixel.
Initial Conditions
Input Image
Eddie K. H. Ng
Patches after Competition
Input Image
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Basic Concepts [2]
The tessellation is then a map showing the
respective ownerships.
Free parameter - number of patches *
Patches after Competition
Input Image
Eddie K. H. Ng
Tessellated Image
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Basic Concepts [4]
But how does the patches compete with each
other?
Each patch is mapped to a region in the Epitome.
Prior knowledge of Image Classes
(epitome)
Input Image
Eddie K. H. Ng
Tessellated Image
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Basic Concepts [5]
Cost Function:
C
R ,G , B
l
2
N pixels
N patches
~
~
I l xi ki l Tk xi
k
i
e zki
ki
z ji
j e
1 ~ ~ T 1 ~ ~
zki xi m k k xi m k
2
Optimize cost function using conjugate gradient
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
Run Demo
Other Explored Paths
Variations on the cost function:
Garbage Model
Isotropic Patches
Shape Regularization
Grow-and-Merge
Different formulation of the problem
Other kernel functions
Other learning algorithms
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006
References
Epitome
Low-level vision
W.T. Freeman, E.C. Pasztor, O. T. Carmichael, Learning Low-level vision, International Journal of
Computer Vision 40(1), 25-47, 2000
Super-pixels
N. Jojic, B. J. Frey, A. Kannan, Epitomic analysis of appearance and shape , ICCV 2003.
X. Ren and J. Malik. Learning a classification model for segmentation. In Proc. 9th Int. Conf. Computer
Vision, volume 1, pages 10-17, 2003.
Web resources
www.psi.utoronto.ca
http://research.microsoft.com/~jojic/epitome.htm
Eddie K. H. Ng
CIAR Summer School on Learning and Vision
Aug 15 to 19, 2006