Transcript Outline

Multimedia Information Systems
Vijay Atluri
atluri@andromeda
Office 200R Ackerson Hall
Phone: 973-353-1642
Office Hours: 2 hours after the class and by
appointment
1
What are the most known media?

Images
 2-d
color images, gray scale medical images 2-d (X-ray) or 3-d
(MRI scans)
 Captured images

Captured images need to be analyzed
 Synthesized


images / visualizations (Artificially created)
Synthetic images do not need to be analyzed ( featured information is
already available).
Text
 sequential
vs. hypertext
 semistructured

Organized into chapters, paragraphs etc. or may be indexed using
HTML/SGML/XML
2
What are the most known media?

Video
 video
clips, movies
 captured
 interactive

Audio
 digitized

voice or music
1-d time-series
 financial,
marketing, production time series data such as stock
prices, sales numbers

Handwritten
 electronic


notes
Traditional data
Scientific data
 collections
of sensor data, e.g.,
<x,y,z,t,pressure,temperature,resistivity…>
3
MM Applications

Travel Industry
 intelligent

travel agent, multimedia presentation,
Entertainment Industry
 Film
clip database, video-on-demand, pay-per-view, interactive
TV, in-flight entertainment, video game database, video dating
services
 Users will be able to select using a mix of query/retrieval and
browsing capabilities

Education and training
 classroom
without walls: distance learning, teleclassrooms,
interactive training, self education, employee reeducation
multimedia training is 40% more effective, retention rate is 30% higher,
learning curve is 30% shorter (study conducted by DoD)
 approximately US spends $56.6 billion every year (Marketing research by
Training Magazine)

4
MM Applications

Expert Advice
 Auto

repair, medical advice, ..
Home Shopping
 multimedia







presentation of goods sold, sales information
Medical databases
Text and photograph archives
Digital Libraries
Office automation
Electronic encyclopedia
DNA databases
Geographic Information Systems
5
MMDBMS: Requirements

Ability to uniformly query multimedia data
 query
and seamlessly integrate data contained in different
databases (that may possibly use different schema), flat files,
object-oriented databases, spatial databases, arbitrary legacy
sources
 elicit the content of the media data (a challenge by itself) which is
highly dependent on the media type and storage format
 merge results from different data sources and media types

Ability to retrieve media objects from a local storage
device in a smooth, jitter-free manner
 considering
large storage requirements, highly compressed
format, secondary and tertiary storage devices, mix of storage
devices with different performance characteristics
6
MMDBMS: Requirements


Ability to develop a presentation in audiovisual media from
the answer generated by the query
Ability to deliver the presentation that satisfies various
quality of service requirements
 synchronization,
fidelity, temporal constraints
 does not suffer from jitter and hiccups
 limited buffer availability and bandwidth (output devices may
reside at distributed network nodes)
7
What kind of queries can users ask?









Find all images which are created by J. Smith
Find all images with the same color, shape and texture
Find all images which look like this image
Find all images which look like this sketch
Find all images with the same color distribution like a
sunset photograph
Find all images which contains a part which looks like this
image or sketch
Find companies whose stock prices move similarly
Find other companies that have similar sales patterns with
our company
Find cases in the past that resemble last month’s sales
pattern of our product
8
What kind of queries can users ask?






Find past days in which the solar magnetic wind showed
patterns similar to today’s pattern
Find similar music scores or video clips
Find all images of “sunny days” (we are getting into
semantics)
Find all images which contain a car
Find all images which contain a car and a man who look
like this
Find all image pairs which contain similar objects. (data
mining)
9
Sample Multimedia Scenario (from the text)

Consider a police investigation of a large-scale drug
operation:
 Video
data captured by surveillance cameras that record the
activities taking place at various locations
 Audio data captured by legally authorized telephone wiretaps
 Image data consisting of still photographs taken by investigators
 Document data seized by the police when raiding one or more
places
 Structured relational data containing background information,
bank records, etc., of the suspects involved
 Geographic information systems data containing geographic data
relevant to the drug investigation being conducted
10
Example of Queries for the MM Scenario

A police officer, Tom, has a photograph in front of him. He
wants to find the identity of the person in picture.
 Q1:
“Retrieve all images from the image library in which the
person appearing in the photograph appears”

Tom wants to examine pictures of a suspect Dick.
 Q2:
“Retrieve all images from the image library in which Dick
appears”

Two types of queries
 Image-based
 Keyword-based
11
Example of Queries for the MM Scenario

Q1: input is an image, output is a ranked list of images that
are similar to the query image
 Need
to know what
“similarity” means
 “ranking” means

 Need

to efficiently support these two operations
Q2: input is a keyword, output is an image whose name
attribute is Dick
 Need
to know how to associate different attributes with images
 Need to know how to effectively index and retrieve images based
on such attributes
12
Example of Queries for the MM Scenario

Tom is listening to an audio surveillance tape.It contains a
conversation between two individuals A and B
 Q3:

Tom wants to review all audio logs that Dick participated
during some specified time period
 Q4:

“Find the identity of B, given that A is Dick
“Find all audio tapes in which Dick was a participant”
Tom is browsing an archive of text documents (old
newspaper archives, police dept files on old unsolved
murder cases, witness statements)
 Q5:
“Find all documents that deal with financial transactions with
ABC corporation”

Similar queries may be posed on video data
13
Example of Queries for the MM Scenario

MM Query
 Q6:
“Find all individuals who have been photographed with Dick
and who have been convicted of attempted murder in North
America and who have recently had electronic fund transfers
made into their bank account from ABC Corp.”
Need to access heterogeneous database systems
 Need to access several MM databases:

•
•
•
•
Mugshot database containing the pictures and names of individuals
Surveillance photograph database of still images
Surveillance video database
Image processing algorithms to determine who is present in which video or
still photograph
14
MM Research Issues

Queries
 Need
a single language
with which MM data of different types can be accessed
 with which one should be able to specify operations to combine different
media types (just like join, union, intersection, difference, Cartesian
product)
 that must be able to access metadata as well as raw data
 that must be able to merge, manipulate, and join together results from
different media sources

 After
devising such a language, we need techniques to
optimize a single query
 develop servers that can optimize processing of a set of queries

15
MM Research Issues

Content
 What
is meant by content?
 Under what conditions can it be described textually?
 Under what conditions it must be described directly through the
original media type?
 How can we extract the content of

an image, a video-clip, an audio-clip, a free/structured text document?
 How
should we index the results of the extracted content?
 What is retrieval similarity?
 What algorithms can be used to efficiently retrieve media data on
the basis of similarity?
16
MM Research Issues

Storage
 How
well disks, CD-ROM, tape systems and tape libraries work?
 How do we design disk/CD-ROM/tape servers so as to optimally
satisfy different clients concurrently when they execute

playback, rewind, fast forward, pause, etc.
17
MM Research Issues

Presentation and Delivery
 How
do we specify the content of MM presentations?
 How do we specify the form (temporal/spatial layout, fidelity) of
this content?
 How do we create a presentation schedule that satisfies these
temporal, spatial and fidelity requirements?
 How can we deliver MM presentation to users when there is
a need to interact with other remote servers to assemble the presentation
 a bound on the buffer, bandwidth, load, and other resources
 a mismatch between the host server’s capabilities and the customer’s
machine capabilities, preferences, etc.?

 How
can such presentations optimize Quality of Service?
18