Multimedia Lab >> Research >> Projects
IDFace: Identification and Detection of Face
Automatic face identification and detection has great potential in a large array of application areas, including banking and security system access authentication, video surveillance, mugshot matching for law enforcement, duplicate ID card identification, face-based video compression for video conferencing, and multimedia information retrieval. In this project, we will develop a core set of essential tools for face recognition system development. In addition, several prototype application systems utilizing the developed software packages will be implemented to demonstrate the superiority of the technology.
3D Object Reconstruction from Single 2D Line Drawings
Multimedia applications extensively use 3D models. A 2D line drawing is the simplest and most straightforward way of illustrating a 3D object. It is very helpful if such a drawing can be used for generating a 3D model directly. Unfortunately, current CAD tools cannot do it, denying designers a convenient means of input. Therefore, it is highly desirable to develop algorithms that can convert a design sketch into a 3D model. Given a line drawing representing a 3D object, our approach to the conversion problem is finding face topology first and then doing 3D geometry reconstruction. Now we have developed a tool that can reconstruct complex 3D objects with planar faces and common 3D objects with curved faces from single 2D line drawings.
 
Image Segmentation by Iterative Optimization

Image segmentation plays an important role in computer vision and image analysis. We formulate image segmentation as a probability maximization problem and solve it using an iterative optimization scheme. Our algorithm can automatically segment an image into regions with relevant textures or colors without the need to know the number of regions in advance. Its results match image edges very well and are consistent with human perception. (The Program)

 

Chinese News Video Retrieval Engine

As digital video libraries and archives of immense size are becoming available over data networks, efficient video retrieval and browsing have become crucially important. In this project, we construct a Chinese news video retrieval engine using a range of newly developed technology, including automatic news video story parsing, video caption recognition, and speech recognition.

 

Video-based Chinese Character Recognition
We develop a handwritten Chinese character recognition system using a video camera. The system combines the advantages of both online and offline approaches. It allows users to write on any regular paper just like using an off-line system. At the same time, using a video camera attached on the computer, the system can capture the stroke temporal information similar to an on-line system.
 
Real Time Feature Based 3-D Deformable Face Tracking
We develop a novel framework for 3D tracking of the non-rigid face deformation from a single camera. The proposed algorithm integrates three types of features which discriminate face deformation across different views: 1) the semantic features which provide constant correspondences between 3D model points and major facial features; 2) the silhouette features which provide dynamic correspondences between 3D model points and facial silhouette under varying views; 3) the online tracking features that provide redundant correspondences between 3D model points and salient image features. In order to estimate the high dimensional 3D deformation parameters, we develop a hierarchical parameter estimation algorithm to robustly estimate both rigid and non-rigid 3D parameters. (Demo)