

Data Analysis
The WearCam Data Analysis involves many steps, depending on the features we want to extract from the WearCam video. Computer Vision algorithms are used to detect and track faces and objects, and statistical analysis is applied to the results. This data can then be used to study the child's visual behaviour (see WearCam & Autism).
Gaze Direction Detection
The forward direction of the field of view of the Wearcam is only an approximation of what the child is actually seeing. The center of attention of the visual field varies significantly when the child is looking sideways. A small mirror placed on the bottom of the wearcam captures the eyes region of the face.
Gaze direction detection is computed using Support Vector Regression. The system creates a mapping between the appearance of the eyes and the direction of the gaze in the image. There is no calibration required by the user, as detection is performed offline.
The system can detect the direction of the gaze with an accuracy of about 2.5° for children and 2° for adult subjects. The video shows a normally developing child wearing the WearCam and playing a touch-screen game used for gaze tracking performance estimation.
Object Detection
Object detection is performed in succesive steps:
Keyword Spotting
Keyword spotting makes use of the audio data recorded with the WearCam to recognize when specific keywords have been uttered during the recording. This can be used in conjunction with face or object detection to correlate utterance of specific words with specific gaze behaviour (like for example looking at an object or person).
Keyword spotting is based on a left-right model for each keyword, and a fully connected model for garbage (non-keywords). Keyword recognition is then made by comparing the log-likelihood of each model for the uttered speech.
Videos
|
Gaze & Object Detection Applied to an Interaction Scene |
Selected publications
People involved in this project