Transcript Slide 1

Alexander Gelbukh
Moscow, Russia
Mexico
Computing Research Center (CIC),
Mexico
Chung-Ang University, Korea
Electronic Commerce and
Internet Application Lab
Special Topics in Computer Science
Advanced Topics in
Information Retrieval
Alexander Gelbukh
www.Gelbukh.com
Information Retrieval
• In a huge amount
• of poorly structured information
• find the information that you need
• when you don’t know exactly what you need
• or can’t explain it
Key concepts:
• The Web
• User information need
• Ranking
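As an illustration of the "Ranking" concept above, here is a minimal sketch of ordering documents by relevance to a query using a simple TF-IDF score. It is not taken from the course materials: the function names `tokenize` and `rank`, the sample documents, and the scoring formula are assumptions chosen for illustration only.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace; a deliberately crude tokenizer."""
    return text.lower().split()


def rank(query, documents):
    """Return (score, index) pairs for the documents, best match first.

    Score = sum over query terms of tf(term, doc) * idf(term), where
    idf(term) = log(N / df(term)). Query terms that occur in no document
    contribute nothing. This formula is an illustrative assumption.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scored = []
    for index, tokens in enumerate(tokenized):
        tf = Counter(tokens)
        score = sum(
            tf[term] * math.log(n_docs / df[term])
            for term in tokenize(query)
            if df[term] > 0
        )
        scored.append((score, index))
    # Sort by descending score; ties keep the original document order.
    scored.sort(key=lambda pair: -pair[0])
    return scored


# Hypothetical toy collection, invented for this example.
docs = [
    "senate speeches about the budget",
    "medical data collections and patient records",
    "corporate memory and the corporate knowledge base",
]

for score, index in rank("corporate knowledge", docs):
    print(f"{score:.3f}  {docs[index]}")
```

The user never states exactly which document is wanted; the system only orders the collection so that the most likely matches come first. Real systems would add stemming, stop-word handling, and length normalization.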
Importance
• Knowledge: the main treasure of man
• Web: Repository? Cemetery of information!
• Natural language and multimedia information
  o Poorly structured, badly written
• Corporate and organizational document bases
  o Senate speeches: Mexico
  o Medical data collections
  o Corporate memory. Microsoft knowledge base
• Future: data explosion → increasing importance
Perspectives
• Corporations: corporate databases
• Organizations: document bases
• Government
  o European Union multilingual problem
  o The same in Asia
• Academia
  o Lots of open research topics
  o Web topics
  o Computational Linguistics topics
  o Intelligent technologies, AI
Textbook
http://sunsite.dcc.uchile.cl/irbook/
Contents
1. Introduction
2. Modeling
3. Retrieval Evaluation
4. Query Languages
5. Query Operations
6. Text and Multimedia Languages and Properties
7. Text Operations
8. Indexing and Searching
9. Parallel and Distributed IR
10. User Interfaces and Visualization
11. Multimedia IR: Models and Languages
12. Multimedia IR: Indexing and Searching
13. Searching the Web
14. Libraries and Bibliographical Systems
15. Digital Libraries
Calendar
 1.  March 4    Presentation of the course
 2.  March 11   Chapter 1: Introduction. Paper presentation.
 3.  March 18   Chapter 2: Modeling. Paper presentation.
 4.  March 25   Chapter 3: Retrieval evaluation. Paper presentation.
 5.  April 1    Chapters 4-7. Paper presentation.
 6.  April 8    Chapter 8: Indexing and Searching. Paper presentation.
 7.  April 15   Chapter 9: Parallel and Distributed IR.
 8.  April 22   Midterm exam. Consultations. Discussion.
 9.  April 29   Chapter 11: Multimedia IR: Models and Languages.
10.  May 6      Paper presentation and discussion.
11.  May 13     Chapter 12: Multimedia IR: Indexing and Searching.
12.  May 20     Paper presentation and discussion.
13.  May 27     Thesis presentation.
14.  June 3     Thesis presentation.
15.  June 10    Paper presentation and discussion.
16.  June 17    Final exam. Consultations. Discussion.
Class structure
Main course: Information Retrieval
• Discussion of previous chapter. Questions.
• I briefly present a new chapter
Research seminar: Natural Language Processing
• Discussion of previous paper. Questions.
  o Identification of possible research topics
• Presentation of a new paper or current work
• Discussion and questions
• Goal: publications!
Evaluation
• Oral tests
• Written test
• Activity in paper presentations and discussions
• Preparation of papers for publication
Papers for the next classes
• March 11:
  Challenges in the Interaction of Information Retrieval and Natural Language Processing
  Ricardo Baeza-Yates
• March 18:
  Head/Modifier Frames for Information Retrieval
  Cornelis H.A. Koster
Thank you!
See you on March 11!