- A Powerful Computing Technology
Department of Computer Science
Wayne State University
1
Road Map
Overview
Recommender Systems
Clustering
Classification
Association Analysis
PageRank
Social Networks
2
Different Forms of Data
Text Data
3
Different Forms of Data
Image Data
4
Different Forms of Data
Video Data
5
Different Forms of Data
Network Data
6
Why Is Data Mining Important?
It is difficult to identify patterns in big data.
We need to extract only the WANTED data within a short time.
We are drowning in data, but starving for knowledge!
7
How Can Data Mining Help?
We do not care if GOOGLE has more than a billion
web pages.
We only care about the information that is useful
to us.
8
What is Data Mining
The analysis of data to extract useful patterns or
information from a large data collection.
Automated Analysis of Massive Data
Also known as: Knowledge Discovery in Databases
Learn More: http://en.wikipedia.org/wiki/Data_mining
9
Applications
10
Data Miner
An educational tool that teaches you Data Mining
techniques.
Consists of two basic parts:
Demonstration
Explains how to work with the interactive part.
Interactive part
Teaching data mining through user interaction.
11
Recommender Systems
Goal:
present information items that are likely to be of interest to
the user.
Lots of online products, books, movies, etc.
Reduce my choices…please!!!!
Learn More: http://pespmc1.vub.ac.be/collfilt.html
12
Recommender Systems
Netflix Recommender System
13
Do you watch movies using Netflix?
If you have watched this movie, then you might like...
Or maybe you like...
And so on: you might like these too.
This might catch your interest too.
14
Amazon Recommender System
15
Data Miner - Recommender System
Recommendation based on content
16
Recommendation
17
Finding a Friend With Similar Taste
YOU
See what they like
Measure the similarity
Select your Neighbors
18
Measuring the Similarity
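One common way to measure the similarity between two users is cosine similarity over the items both have rated. The sketch below is only an illustration of the "find a friend with similar taste" steps: the user names, movie titles, ratings, and the "rated 4 or higher" rule are all made-up assumptions, not part of the Data Miner tool.

```python
import math

# Hypothetical ratings (1-5); every name and number here is invented.
ratings = {
    "you":   {"Movie A": 5, "Movie B": 4, "Movie C": 1},
    "alice": {"Movie A": 5, "Movie B": 5, "Movie C": 1, "Movie D": 5},
    "bob":   {"Movie A": 1, "Movie B": 2, "Movie C": 5, "Movie D": 1},
}

def cosine_similarity(a, b):
    """Cosine similarity over the items that both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = math.sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = math.sqrt(sum(b[i] ** 2 for i in shared))
    return dot / (norm_a * norm_b)

def nearest_neighbors(user, k=1):
    """Select the k other users whose ratings are most similar to `user`."""
    scored = [(cosine_similarity(ratings[user], ratings[other]), other)
              for other in ratings if other != user]
    scored.sort(reverse=True)
    return [name for _, name in scored[:k]]

def recommend(user, k=1):
    """Suggest items the neighbors rated 4 or higher that `user` has not rated."""
    seen = set(ratings[user])
    suggestions = set()
    for neighbor in nearest_neighbors(user, k):
        for item, score in ratings[neighbor].items():
            if item not in seen and score >= 4:
                suggestions.add(item)
    return sorted(suggestions)

print(nearest_neighbors("you"))
print(recommend("you"))
```

With this toy data, "alice" rates the shared movies almost the same way as "you", so she is selected as the neighbor and her highly rated unseen movie becomes the recommendation.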
19
Cluster Analysis
Cluster:
A collection of data objects
Cluster Analysis:
Grouping given objects by their attributes so that they are:
Similar (or related) to one another within the same group
Dissimilar (or unrelated) to the objects in other groups
Learn More: http://home.dei.polimi.it/matteucc/Clustering/tutorial_html
20
Cluster Analysis
Data Set:
Clusters:
Flowers
Fruit
21
Clustering
Now you have seen Flowers and Fruits visually.
Flowers
Fruit
So, to which cluster would you add this object?
Yes, to FRUIT!!
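The same decision can be sketched in code by assigning a new object to the cluster whose center is closest. This is a minimal nearest-centroid sketch, not the method used by Data Miner; the two numeric features and all values are hypothetical.

```python
import math

# Hypothetical 2-D feature vectors for objects already grouped into clusters.
clusters = {
    "Flowers": [(1.0, 8.0), (1.5, 9.0), (2.0, 8.5)],
    "Fruit":   [(8.0, 1.0), (9.0, 2.0), (8.5, 1.5)],
}

def centroid(points):
    """Mean of the points along each dimension."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))

def assign(obj):
    """Return the name of the cluster whose centroid is nearest to `obj`."""
    return min(clusters, key=lambda name: math.dist(obj, centroid(clusters[name])))

print(assign((7.5, 2.0)))
```

The new object (7.5, 2.0) lies much closer to the center of the Fruit group than to the center of the Flowers group, so it is added to Fruit.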
22
Classification
Assigning given items to a known class whose items have
similar attributes.
Explained through Decision Trees.
23
Classification
PURE Classification.
Each branch contains animals that belong to a single CLASS.
24
Classification
You have learned what a Mammal is and what a Bird is.
Can you tell what this is?
Yes, this is indeed a BIRD!!
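A decision tree reaches such an answer by asking one question per node and following the matching branch until it arrives at a class. The tree below is a hand-built sketch; the attribute names (has_feathers, gives_milk) and the tree shape are assumptions chosen for illustration, not the tree shown in the slides.

```python
# A hand-built decision tree. Inner nodes are dicts that ask a yes/no
# question about one attribute; leaves are class names (strings).
# The attributes and the shape of the tree are hypothetical.
tree = {
    "question": "has_feathers",
    "yes": "Bird",
    "no": {
        "question": "gives_milk",
        "yes": "Mammal",
        "no": "Unknown",
    },
}

def classify(animal, node=tree):
    """Walk from the root to a leaf, following the branch that matches."""
    while isinstance(node, dict):
        node = node["yes"] if animal[node["question"]] else node["no"]
    return node

mystery_animal = {"has_feathers": True, "gives_milk": False}
print(classify(mystery_animal))
```

Every leaf here holds a single class, which is what the earlier slide calls a PURE classification.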
25
Association Analysis
Discover interesting relationships in a set of transactions.
Understand relationships between items.
E.g.
If a customer buys shoes, then 10% of the time they also buy socks.
60% of all shoppers who purchase a pint of milk will also buy
bread.
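Percentages like these are usually expressed as the confidence of a rule, computed from how often item sets appear together in the transactions. The sketch below uses the standard definitions of support and confidence; the transactions themselves are invented for illustration.

```python
# Hypothetical shopping baskets; each transaction is a set of items.
transactions = [
    {"milk", "bread"},
    {"milk", "bread", "eggs"},
    {"milk", "eggs"},
    {"bread"},
    {"milk", "bread", "butter"},
]

def support(itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent):
    """Of the transactions containing `antecedent`, the fraction that also
    contain `consequent`: support(A and C) / support(A)."""
    both = set(antecedent) | set(consequent)
    return support(both) / support(antecedent)

print(support({"milk"}))
print(confidence({"milk"}, {"bread"}))
```

In this toy data, milk appears in 4 of 5 baskets and milk with bread in 3 of 5, so the rule "milk, then bread" has a confidence of about 75%.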
26
Association Analysis
Items:
Transactions:
27
PageRank
Links from popular and related web sites increase the popularity of the given
web site.
[Figure: link graph of web sites: Amazon, Yahoo, Pillsbury, YouTube, Billboard, Pandora, Dominos Pizza, Crayola, Pizza Hut, Danskin, Shelfari]
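The idea that links from popular sites raise a site's popularity can be sketched with a simple iterative computation. This is a simplified version of PageRank under stated assumptions: the four-page link graph is hypothetical (it is not the graph in the figure), the damping factor 0.85 and the iteration count are conventional illustrative choices, and every link target is assumed to appear as a key in the graph.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively share each page's rank among the pages it links to.

    `links` maps a page to the list of pages it links to; every target
    is assumed to also appear as a key.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page gets a small base share regardless of incoming links.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outgoing in links.items():
            if outgoing:
                share = damping * rank[page] / len(outgoing)
                for target in outgoing:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank

# Hypothetical link graph: page -> pages it links to.
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}
ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

Page C is linked from three other pages and ends up with the highest score, while page D, which nothing links to, ends up with the lowest.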
28
Search Results
When you search on Google, it
lists the web sites related to the
input text, ordered by their
importance.
29
Social Networks
Social networking websites allow users to be part of a
virtual community.
E.g. Facebook, Twitter, MySpace
They provide users with simple tools to create a
custom profile with text and pictures.
Users can share their lives with other people through
these networks.
30
Social Networks
Learn More:
http://en.wikipedia.org/wiki/Social_network
http://pc.net/glossary/definition/socialnetworking
31
Thank You !!
Enjoy the Day…
32