Instructor: Prof. Fei-Fei Li

Office: Room 246 Gates Bldg

Email: feifeili [at] cs [dot] stanford [dot] edu

Office hours: email cs231b-spr1213-staff [at] lists [dot] stanford [dot] edu for appointments with Prof. Li


Teaching Assistant: Philipp Krähenbühl

Office: Room 286 Gates Bldg

Email: philkr [at] stanford [dot] edu

Office hours: Th 3:30pm-4:30pm


Teaching Assistant: Bangpeng Yao

Office: Room 244 Gates Bldg

Email: bangpeng [at] cs [dot] stanford [dot] edu

Office hours: Mo 4:00pm-5:00pm


Class Time and Location:

Tuesday & Thursday, 9:30am-10:45am, Gates 200

Course Description:

More than one-third of the brain is engaged in visual processing, the most sophisticated human sensory system. Yet visual recognition technology has not fundamentally influenced our lives on the same scale and scope as text-based technology has through Google, Twitter, Facebook, etc.
This course is designed for students who are interested in cutting-edge computer vision research and/or aspire to be entrepreneurs using vision technology. During the 10-week course, we will guide students through the design and implementation of three core vision technologies (segmentation, detection, and classification) on three highly practical, real-world problems. We will teach the fundamental theory, detailed algorithms, and practical engineering insights, and guide students in developing state-of-the-art systems evaluated on modern, standard benchmark datasets.

Grading policy:

  Paper presentation and participation: 15%
  Course projects (including code, write up, presentation): 85%
     -   Project 1: 25%;   Project 2: 25%;   Project 3: 35%



Prerequisites:

CS231A or equivalent (requires the instructor's approval), and a good machine learning background (e.g. CS221, CS228, CS229).
Coding skills: fluency in MATLAB and C/C++.


Textbook:

None required.