- A Powerful Computing Technology Department of Computer Science Wayne State University.

Download Report

Transcript - A Powerful Computing Technology Department of Computer Science Wayne State University.

- A Powerful Computing Technology
Department of Computer Science
Wayne State University
1
Road Map
 Overview
 Recommender Systems
 Clustering
 Classification
 Association Analysis
 PageRank
 Social Networks
2
Different Forms of Data
 Text Data
3
Different Forms of Data
 Image Data
4
Different Forms of Data
 Video Data
5
Different Forms of Data
 Network Data
6
Why Data Mining is Important?
 Difficulty of identifying patterns in big data.
 Extracting only WANTED data within a short time.
We are drowning in data, but starving for knowledge!
7
How Data Mining can help?
 We do not care if GOOGLE has more than billion
web pages.
 We only care about the information that is useful
for us.
8
What is Data Mining
 The analysis of data to extract useful patterns or
information from a large data collection.
Automated Analysis of Massive Data
 Also known as: Knowledge Discovery in Databases
 Learn More: http://en.wikipedia.org/wiki/Data_mining
9
Applications
10
Data Miner
 An educational tool that teaches you Data Mining
techniques.
 Consists of two basic parts such that,
 Demonstration

Explains how to work with the interactive part.
 Interactive part

Teaching data mining through user interaction.
11
Recommender Systems
 Goal:
present information items that are likely to be of interest to
the user.
 Lots of online products, books, movies, etc.
 Reduce my choices…please!!!!
 Learn More: http://pespmc1.vub.ac.be/collfilt.html
12
Recommender Systems
 Netflix Recommender System
13
Do you watch movies using
Then you
might like
If you have
watched
this movie
Or may be
you like
So on you might
like these too
This might catch
your interest too
14
Amazon Recommender System
 Amazon Recommender System
15
Data Miner - Recommender System
 Recommendation based on content
16
Recommendation
17
Finding a Friend With Similar Taste
YOU
See what they like
Measure the similarity
Select your Neighbors
18
Measuring the Similarity
19
Cluster Analysis
 Cluster:
 A collection of data objects
 Cluster Analysis:
 Grouping some given objects with similar attributes.
 Similar (or related) to one another within the same group
 Dissimilar (or unrelated) to the objects in other groups
 Learn More: http://home.dei.polimi.it/matteucc/Clustering/tutorial_html
20
Cluster Analysis
 Data Set:
 Clusters:
Flowers
Fruit
21
Clustering
 Now you have seen Flowers and Fruits visually.
Flowers
Fruit
 So to which cluster, would you add this object?
Yes, to FRUIT!!
22
Classification
 Assigning given items to a known class which have items
with similar attributes.
 Explains through Decision Trees.
23
Classification
 PURE Classification.
 Each branch contains animals belong to a single CLASS.
24
Classification
 You have learned what is Mammal and what is Bird.
 Can you tell what is this?
Yes, this is indeed a BIRD!!
25
Association Analysis
 Discover interesting relationships in a set of transactions.
 Understand relationships between items.
E.g.
 If a customers buys shoes, then 10% of the times they also buys socks.
 60% of all shoppers will buy bread when they also purchase a pint of
milk.
26
Association Analysis
 Items:
 Transactions:
27
PageRank
 Links from popular and related web sites increases the popularity of the given
web site.
Amazon
Yahoo
Pillsbury
YouTube
Billboard
Pandora
Dominos
Pizza
Crayola
Pizza
Hut
Danskin
Shelfari
28
Search Results
 When searching on Google, it
will list web sites related to the
input text according to their
importance.
29
Social Networks
 Social networking websites allow users to be part of a
virtual community.
E.g. Facebook, Twitter, MySpace
 They provide users with simple tools to create a
custom profile with text and pictures.
 Users can share their lives with other people through
these networks.
30
Social Networks
 Learn More:
 http://en.wikipedia.org/wiki/Social_network
 http://pc.net/glossary/definition/socialnetworking
31
Thank You !!
Enjoy the Day…
32