Multimedia Laboratory

Projects / Deep Learning

Deep learning is a sub-field of machine learning that is based on learning several levels of representations, corresponding to a hierarchy of features or factors or concepts, where higher-level concepts are defined from lower-level ones, and the same lower-level concepts can help to define many higher-level concepts.

Below we list down some of the recent advances and developments in the deep learning field:

We are trying our best effort to collect the most recent deep learning papers, which are published in major computer vision and machine learning conferences. Please have a look here.

We design new deep architecture for facial point detection, pedestrian detection, face parsing, and segmentation.

Pedestrian detection

  1. P. Luo, Y. Tian, X. Wang, and X. Tang, "Switchable Deep Network for Pedestrian Detection", IEEE Conf. on Computer Vision and Pattern Recognition, June 2014
  2. W. Ouyang and X. Wang, "Joint Deep Learning for Pedestrian Detection," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF] [Project Page]
  3. X. Zeng, W. Ouyang and X. Wang, "Multi-Stage Contextual Deep Learning for Pedestrian Detection," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF]
  4. W. Ouyang, X. Zeng and X. Wang, "Modeling Mutual Visibility Relationship with a Deep Model in Pedestrian Detection," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3222-3229, 2013 [PDF] [Project Page]
  5. W. Ouyang, and X. Wang, "A Discriminative Deep Model for Pedestrian Detection with Occlusion Handling," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3258-3265, 2012 [PDF]

Face analysis

  1. Y. Sun, X. Wang, and X. Tang. "Deep learning face representation from predicting 10,000 classes", IEEE Conf. on Computer Vision and Pattern Recognition, June 2014
  2. Y. Sun, X. Wang, and X. Tang, "Hybrid Deep Learning for Computing Face Similarities," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF]
  3. Z. Zhu, P. Luo, X. Wang, and X. Tang, "Deep Learning Identity Preserving Face Space," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF]
  4. P. Luo, X. Wang, and X. Tang, "A Deep Sum-Product Architecture for Robust Facial Attributes Analysis," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF]
  5. Y. Sun, X. Wang and X. Tang, "Deep Convolutional Network Cascade for Facial Point Detection," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3476-3483, 2013 [PDF] [Project Page]
  6. P. Luo, X. Wang, and X. Tang, "Hierarchical Face Parsing via Deep Learning", in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2480-2487, 2012 [PDF]

Person parsing

  1. W. Ouyang, X. Chu, and X. Wang, "Multi-source Deep Learning for Human Pose Estimation", IEEE Conf. on Computer Vision and Pattern Recognition, June 2014
  2. P. Luo, X. Wang, and X. Tang, "Pedestrian Parsing via Deep Decompositional Neural Network," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF] [Project Page]

Person re-identification

  1. W. Li, R. Zhao, T. Xiao, and X. Wang, "DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification", IEEE Conf. on Computer Vision and Pattern Recognition, June 2014


Back to Top

Description Download
A demo code that allows you to input a pedestrian image and then compute the label map.

Reference:
  1. P. Luo, X. Wang, and X. Tang, "Pedestrian Parsing via Deep Decompositional Neural Network," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF] [Project Page]
Zip
A demo code that shows you how the frontal-view face image of a query face image is reconstructed.

Reference:
  1. Z. Zhu, P. Luo, X. Wang, and X. Tang, "Deep Learning Identity Preserving Face Space," in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF] [Project Page]
Zip
Matlab training and testing source code for pedestrian detection using the proposed approach. Models trained on INRIA and Caltech are provided.

Reference:
  1. Wanli Ouyang, Xiaogang Wang, "Joint Deep Learning for Pedestrian Detection", in Proceedings of IEEE International Conference on Computer Vision (ICCV) 2013 [PDF] [Project Page]
  2. Wanli Ouyang, Xiaogang Wang, "A Discriminative Deep Model for Pedestrian Detection with Occlusion Handling", in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2012 [PDF] [Project Page]
Webpage
Executable files for the face detector and facial point detector.

Reference:
  1. Y. Sun, X. Wang and X. Tang, "Deep Convolutional Network Cascade for Facial Point Detection," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3476-3483, 2013 [PDF] [Project Page]
Webpage
Description Download
Slides that provides an overview of recent deep learning research in our lab [Presented by Xiaogang Wang]. PDF