KAIST Logo

Department of Computer Science
Div. of Web Science and Technology
Korea Advanced Institute of Science and Technology (KAIST)

CS688/WST665: Web-Scale Image Retrieval (Fall 2012)


CS688/WST665: Web-Scale Image Retrieval (Fall 2012)

Instructor: Sung-eui Yoon


When and where: 1:00-2:15pm on Tue. and Thur. at Room 3444 in the CS building
First class: Sep-4 (please come to first class for more information)
Please refer to the first lecture slide for other course information
Textbook: In-class handouts (no textbooks)
Board: Noah board


Line

Outline

  • Course overview
  • Lectures and tentative schedule
  • Student presentations
  • Additional reference materials
  • Line

    Course overview

    We extract feature points between two similar images and match them, followed by overlaying them together based on the matched points in the bottom row.

    Thanks to rapid advances of digital camera and various image processing tools, we can easily create new pictures, images, and videos for various purposes. This in turn results in a huge amount of images in the internet and even in personal computers. For example, flickr, an image hosting website, contains more than five billion images and flickr members update more than three thousands image every minute.

    These huge image databases pose numerous technical challenges in terms of image processing, searching, storing, etc. In this class we will discuss various scalable techniques for web-scale image/video databases and novel applications that can utilize such data.

    In summary, what you will get at the end of the course:

  • Broad understanding on image/video retrieval techniques
  • In-depth knowledge on recent methods that can handle web-scale data
  • Study novel applications that utilize web-data
  • What you will do:

  • Choose and present a few papers from a paper list
  • Final project: come up with your own idea related to the topic, (optionally) implement it to improve the state-of-the-art techniques
  • Mid-term exam: reviewing basic image retrieval methods
  • Line

    Lecture schedule (subject to change)

    # of lecture, date Topics and slides Related material(s)
    Sep-4 (Th) Overview on the course and course policy Q and A for the course (frequently updated)
    Sep-6 (Th)
    Sep-11 (T)
    Keypoint localization
    Sep-13 (Th) Scale-invariant region detector
    Sep-18 (T) Descriptors PA1: get to know basic libraries
    Sep-25 (T.)
    Sep-27 (Th)
    Intro. to object detection
    Oct-4 (Th)
    Oct-9 (T)
    Bag-of-Word approach PA2: basic image retrieval, Image database
    Oct-11 (Th)
    Oct-16 (T)
    Oct-18 (Th)
    Recent image retrieval systems
    Oct-23 (T) Mid-term exam
    Oct-30 (T) Seungwoo, Mingyang
    Nov-1 (Th) Soyoung, Suwon
    Nov-6 (T) Kwangheum, Junghyuk
    Nov-8 (Th) Gayeon, Aurelie
    Nov-13 (T) Project feedback
    Nov-15 (Th) Project feedback
    Nov-20 (T) Florent, Gangwook
    Nov-22 (Th) Jongwook, Donghyun
    Nov-27 (T) Jongin, Soonmin
    Nov-29 (Th) No class, because of attending a conf. (SIGGRAPH Asia)
    Dec-4 (T) Kyunghyun
    Dec-6 (Th) Changhoon, Sungheon
    Dec-11 (T)
    Dec-13 (Th)
    Final project presentation
    Line

    Student presentations and reports

    For your presentations, please use the this powerpoint template; paper presentation guideline is available.

    For your final report, please use the this latex template

    Line

    Additional reference materials and links

  • WST665/CS770 homepage at fall of 2011

  • Computer vision resources (papers, code, datasets, etc.):

  • CVPapers
  • VLFeat: contains popular computer vision algorithms including SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, and quick shift
  • Paper search:

  • Google scholar
  • Tim Rowley's graphics paper collections
  • Ke-Sen Huang's graphics paper collections
  • Line

    Acknowledgements: The course materials are based on those of Prof. Fei-Fei Li, Stanford. Thank you so much! Line

    Copyright 2012. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the author.

    This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.