Please scroll down for all the paper downloads.
Lecture |
Date |
Description |
Readings |
Presenter |
1 |
Thur, Sep 27 |
Class introduction |
|
Fei-Fei Li |
2 |
Thur, Oct 4 |
Still images - Classification |
Yang et al, CVPR 2010; |
Kevin Tang; |
3 |
Thur, Oct 11 |
Still images - Human & Object Interaction |
Delaitre et al, NIPS 2011; |
Aman Sikka; |
4 |
Thur, Oct 18 |
Videos - Features |
Wang & Mori, NIPS 2008; |
Evan Shieh; |
5 |
Thur, Oct 25 |
Videos - Tracking and Recognition |
Yang & Nevatia, ECCV 2012; |
Cameron Schaeffer; |
6 |
Thur, Nov 1 |
Videos - Temporal models |
Gaidon et al, CVPR 2011; |
Evan Shieh; |
7 |
Thur, Nov 8 |
Videos - High-level |
Fouhey et al, ECCV 2012; |
David Held; |
8 |
Thur, Nov 15 |
Social interactions |
Kevin Tang; |
|
9 |
Thur, Nov 22 |
Thanksgiving holiday, no class |
|
|
10 |
Thur, Nov 29 |
Depth images |
Kyunghee Kim; |
|
11 |
Thur, Dec 6 |
Project presentation |
|
All students |
References
Lecture #2:
W. Yang, Y. Wang, and G. Mori. Recognizing human actions from still images with latent poses. CVPR 2010.
S. Maji, L. Bourdev, and J. Malik. Action recognition from a distributed representation of pose and appearance. CVPR 2011.
Lecture #3:
V. Delaitre, J. Sivic, and I. Laptev. Learning person-object interactions for action recognition in still images. NIPS 2011.
A. Prest, C. Schmid, and V. Ferrari. Weakly supervised learning of interactions between humans and objects. PAMI 2012.
Lecture #4:
Y. Wang and G. Mori. Learning a discriminative hidden part model for human action recognition. NIPS 2008.
S. Sadanand and J. Corso. Action bank: A high-level representation of activity in video. CVPR 2012.
Lecture #5:
B. Yang and R. Nevatia. Online learned discriminative part-based appearance models for multi-human tracking. ECCV 2012.
W. Choi and S. Savarese. A unified framework for multi-target tracking and collective activity recognition. ECCV 2012.
Lecture #6:
A. Gaidon, Z. Harchaoui, and C. Schmid. Actom sequence models for efficient action detection. CVPR 2011.
N. Ikizler and D. Forsyth. Searching video for complex activities with finite state models. CVPR 2007.
Lecture #7:
D. Fouhey, V. Delaitre, A. Gupta, A. Efros, I. Laptev, and J. Sivic. People watching: human actions as a cue for single view geometry. ECCV 2012.
K. Kitani, B. Ziebart, J. Bagnell, and M. Hebert. Activity forecasting. ECCV 2012.
Lecture #8:
A. Fathi, J.Hodgins, and J. Rehg. Social interactions: A first-person perspective. CVPR 2012
T. Lan, L. Sigal, and G. Mori. Social Roles in Hierarchical Models for Human Activity Recognition. CVPR 2012.
Lecture #10:
J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, and A. Blake. Real-time human pose recognition in parts from a single depth image. CVPR 2011.
J. Wang, Z. Liu, Y. Wu, and J. Yuan. Mining actionlet ensemble for action recognition with depth cameras. CVPR 2012.