COMP 4332, RMBI 4330 Advanced Data Mining (Spring 2012) Qiang Yang Hong Kong University of Science and Technology [email protected] http://www.cs.ust.hk 2015/11/6 Course Introduction.

Download Report

Transcript COMP 4332, RMBI 4330 Advanced Data Mining (Spring 2012) Qiang Yang Hong Kong University of Science and Technology [email protected] http://www.cs.ust.hk 2015/11/6 Course Introduction.

COMP 4332, RMBI 4330
Advanced Data Mining (Spring
2012)
Qiang Yang
Hong Kong University of Science and Technology
[email protected]
http://www.cs.ust.hk
2015/11/6
Course Introduction
1
Topics


Review of Basics
Practical Data Mining







Imbalanced Data
Streaming and Time Series Data
Big Data
Social Recommendation
Social Media and Social Networks
Hands on: 2 Major Projects
Student Presentations
2015/11/6
Course Introduction
2
Outcome and Objective




Student will know the current state of
the art in Data Mining
Student will be able to implement a
practical data mining project
Student will be able to present their
ideas well
Prepared for PG study, Internship, etc.
2015/11/6
Course Introduction
3
Projects:

Project 1:


KDDCUPs on credit rating and customer
retention (KDDCUP 2009)
Project 2:


based on KDDCUPs
Yahoo! Music Recommendation (KDDCUP
2011)
Project 3 (Optional): KDDCUP 2012
2015/11/6
Course Introduction
4
KDDCUP Examples
—
KDDCUP from past years
—
In general, we wish to
2007:
—
Predict if a user is going to rate a movie?
—
Predict how many users are going to rate a
movie?
—
—
2006:
—
—
Input: Data
—
Output:
—
Build model
—
Apply model to future data
Predict if a patient has cancer from
medical images
—
2005:
—
Given a web query (“Apple”), predict
the categories (IT, Food)
—
1998:
—
Given a person, predict if this
person is going to donate money
2015/11/6
Course Introduction
5
5
Important Sites
 Instructor Web Site
 http://www.cse.ust.hk/~qyang/4332
 TA: Yin Zhu and Kaixiang Mo
 Assignment Hand-in: online
 [email protected]
 Course Discussion Site:
 Check out the web cite…
2015/11/6
Course Introduction
6
Prerequisites

Statistics and Probability would help,


Machine Learning/Pattern Recognition would
help,


But will be reviewed in class
We will review some most important algorithms
One programming language

We will teach new languages in the tutorial
2015/11/6
Course Introduction
7
Grading
 Assignments 10%
 Course Projects and Presentations: 50%
 Final Exam 40%
2015/11/6
Course Introduction
8
More info

Textbooks:
 Listed on Course Website

2015/11/6
Buy them online if you wish
Course Introduction
9