CS588: Deep Learning based Image Search (Spring 2024)
Instructor: Sung-eui Yoon
- When: 10:30-12:00 on Mon. and Wed.
- Where: Lecture room 3444, Information Science and Electronics Bldg (E3)
- First class: Feb. 26 (Mon)
- Textbook: In-class handouts and ongoing draft (web), ongoing draft (pdf) on image search
- Board: KLMS
- Question Page: Question Submission
- Paper Submission Page: Paper Summary Submission (before every Mon. class)
Outline
- Course overview
- Lectures and tentative schedule
- Student presentations
- Additional reference materials
Course overview
Thanks to rapid advances of digital camera and various image processing tools, we can easily create new pictures, images, and videos for various purposes. This in turn results in a huge amount of images in the internet and even in personal computers. For example, flickr, an image hosting website, contains more than five billion images and flickr members update more than three thousands image every minute.
These huge image databases pose numerous technical challenges in terms of image processing, searching, storing, etc. In this class we will discuss various scalable techniques for web-scale image/video databases and novel applications that can utilize such data.
In summary, what you will get at the end of the course:
- Broad understanding on image/video retrieval techniques
- In-depth knowledge on recent methods that can handle web-scale data
- Study novel applications that utilize web-data
What you will do:
- Choose and present a few papers from recent conferences.
- Final project: come up with your own idea related to the topic, (optionally) implement it to improve the state-of-the-art techniques
- Mid-term exam: reviewing basic image retrieval methods
Lecture schedule (subject to change)
Date | Topics and slides | Related material(s) |
---|---|---|
|
Overview on the course and course policy | |
|
Classical Keypoint Localization | |
|
Scale Invariant Region Selection and SIFT |
|
|
Deep Learning based Image Search | |
|
||
|
Re-Ranking and Inverted Index |
|
|
Hashing Techniques | |
|
Person Re-identification |
|
|
Pixel Retrieval | |
|
Diffusion for Objects Retrieval | |
|
||
|
Applications of Adversarial Attacks on Matching-based Algorithms |
|
|
Optical Flow | |
|
No class due to the general election | |
|
No class (midterm week) | |
|
Midterm Exam | |
|
Paper Presentation I: 1. Kyubeom Han 2. Sheikh Shafayat |
|
|
Paper Presentation I: 1. FILIPPO MOMENTE 2. Suhyeon Ha |
|
|
Paper Presentation I: 1. Jinhwan Seo 2. Jumin Lee |
|
|
Midterm Project Presentation: 1. T1 (Kyubeom Han, Jinhwan Seo) 2. T2 (Sheikh Shafayat) |
|
|
No class due to the substitute holiday | |
|
Midterm Project Presentation: 1. T3 (FILIPPO MOMENTE) 2. T4 (Suhyeon Ha, Jumin Lee) |
|
|
No class due to ICRA attendance | |
|
No class due to Buddha's birthday | |
|
Paper Presentation II: 1. Sheikh Shafayat 2. FILIPPO MOMENTE |
|
|
Paper Presentation II: 1. Jinhwan Seo 2. Jumin Lee |
|
|
Paper Presentation II: 1. Kyubeom Han 2. Suhyeon Ha |
|
|
Reserved | |
|
Final Project Presentation: 1. T1 (Kyubeom Han, Jinhwan Seo) 2. T2 (Sheikh Shafayat) |
|
|
Final Project Presentation: 1. T3 (FILIPPO MOMENTE) 2. T4 (Suhyeon Ha, Jumin Lee) |
|
|
Reserved (Final exam period) |
Student presentations and reports
For your presentations, please use the this powerpoint template; paper presentation
guideline is available.
For your final report, please use the this latex template
Additional reference materials and links
- CS688 homepage at spring of 2021
- CS688 homepage at spring of 2020
- CS688 homepage at fall of 2018
- CS688 homepage at fall of 2016
- WST665/CS688 homepage at fall of 2014
- WST665/CS688 homepage at fall of 2012
- WST665/CS770 homepage at fall of 2011
Computer vision resources (papers, videos, code, datasets, etc.):
- CVPapers, Vision talk videos
- Video lectures:
- Multimedia Information Retrieval
- VLFeat: contains popular computer vision algorithms including SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, and quick shift
Paper search:
Acknowledgements: The course materials are based on those of Prof. Fei-Fei Li, Stanford. Thank you so much!