Please scroll down for all the paper downloads.

 

 

Lecture

Date

Description

Readings

Presenter

1

Wed, Sep 23

 

Class introduction

 

 

Wed, Sep 30

Class cancelled

Make-up Session: 9am - 12pm, Fri Oct 9

 

 

2

Wed, Oct 7

Course project papers

Piyush;

Louis;

3

Fri, Oct 9

9am - 12pm

Object recognition tutorial

Fei-Fei;

4

Wed, Oct 14

Pictorial structure

I-Ting;

5

Wed, Oct 21
3D object categorization

Siddharth;

6

Wed, Oct 28

Object in context;

Project proposal due

Jaewon;

7

Wed, Nov 4

Natural scene understanding

Zixuan;

8

Wed, Nov 11

Total scene understanding

Georgios;

Haider;

9

Fri, Nov 20

Human action recognition

Amir;

 

Wed, Nov 25

NO CLASS,

Thanksgiving break

 

 

10

Wed, Dec 2
Video analysis

Ashish;

Michael;

11

TBA

Course project presentation

   

 

Fri, Dec 11

Course project due

 


 

References

 

Lecture #2:

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei. (2009) ImageNet: A Large-Scale Hierarchical Image Database, To Appear in IEEE Computer Vision and Pattern Recognition (CVPR).

N. Ikizler and D.A. Forsyth (2008) Searching for Complex Human Activities with No Visual Examples, Int. J. Computer Vision. Vol. 80, no. 3, pp. 337-357.


Lecture #3:

L. Fei-fei, R. Fergus, and A. Torralba. (2006) Recognizing and learning object categories, http://people.csail.mit.edu/torralba/iccv2005, Tutorial presented at ICCV 2005 pages visited Feb. 7, 2006.

 

Lecture #4:

P. Felzenszwalb, D. Huttenlocher (2005). Pictorial Structures for Object Recognition, International Journal of Computer Vision, Vol. 61, No. 1, January 2005.

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan. (2009) Object Detection with Discriminatively Trained Part-Based Models, IEEE Pattern Analysis and Machine Intelligence (PAMI). Accepted for publication.


Lecture #5:

S. Savarese and L. Fei-Fei. (2007) 3D generic object categorization, localization and pose estimation, IEEE International Conference in Computer Vision (ICCV). 2007.

*M. Sun, *H. Su, S. Savarese and L. Fei-Fei. (2009) A Multi-View Probabilistic Model for 3D Object Classes, To appear in IEEE Computer Vision and Pattern Recognition (CVPR) (*indicates equal contributions)

 

Lecture #6:

D. Hoiem, A. Efros, and M. Herbert. (2006) Putting Objects in Perspective, Proc. IEEE International Conf. Computer Vision and Pattern Recognition (CVPR).

A. Gupta, L. Davis. (2008) Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, Proceedings of the 10th European Conference on Computer Vision: Part I.

 

Lecture #7:
L. Fei-Fei and P. Perona. (2005) A Bayesian Hierarchical Model for Learning Natural Scene Categories. IEEE Comp. Vis. Patt. Recog.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, New York, June 2006, vol. II, pp. 2169-2178.

 

Lecture #8:
L.-J. Li, R. Socher and L. Fei-Fei. (2009) Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework, To appear in Computer Vision and Pattern Recognition (CVPR). (Oral)

B. Yao, X. Yang, Liang Lin, M.W. Lee, and S.C. Zhu. (2009) I2T: Image Parsing to Text Description, Proceedings of IEEE, (under review, invited for the special issue on Internet Vision).

Z.W. Tu, X.R. Chen, A.L. Yuille, and S.C. Zhu,(2005) Image parsing: unifying segmentation, detection and recognition, Int'l J. of Computer Vision, 63(2), 113-140.

 

Lecture #9:
I. Laptev, M. Marszałek, C. Schmid and B. Rozenfeld. (2008) Learning realistic human actions from movies, in Proc. CVPR'08, Anchorage, US.

B. Babenko, M.H. Yang, S.J. Belongie. (2009) Visual tracking with online Multiple Instance Learning, in Proc. CVPR'09, pp. 983-990.

 

Lecture #10:

A. Gupta, P. Srinivasan, J. B. Shi, L.S. Davis. (2009) Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos, in Proc. CVPR'09 pp. 2012-2019.

Y. Ke, R. Sukthankar, and M. Hebert. Event Detection in Crowded Videos,  ICCV, 2007.