This Issue

writing-guides   ealerts
Connect with WS
 


USING LABELED AND UNLABELED DATA FOR PROBABILISTIC MODELING OF FACE ORIENTATION

SHUMEET BALUJA

Justsystem Pittsburgh Research Center, 4616 Henry Street, Pittsburgh, PA 15213, USA

This paper describes probabilistic modeling methods to solve the problem of discriminating between five facial orientations with very little labeled data. Three models are explored. The first model maintains no inter-pixel dependencies, the second model is capable of modeling a set of arbitrary pair-wise dependencies, and the last model allows dependencies only between neighboring pixels. We show that for all three of these models, the accuracy of the learned models can be greatly improved by augmenting a small number of labeled training images with a large set of unlabeled images using Expectation–Maximization. This is important because it is often difficult to obtain image labels, while many unlabeled images are readily available. Through a large set of empirical tests, we examine the benefits of unlabeled data for each of the models. By using only two randomly selected labeled examples per class, we can discriminate between the five facial orientations with an accuracy of 94%; with six labeled examples, we achieve an accuracy of 98%.

Keywords: Probabilistic modeling; face orientation discrimination; expectation maximization; unlabeled data; Bayesian models; machine learning; computer vision
Cited by (2):
, , , , . (2016) Fuzziness based semi-supervised learning approach for intrusion detection system. Information Sciences. Online publication date: 1-May-2016. [CrossRef]
, , , , , . (2011) Comparing two video-based techniques for driver fatigue detection: classification versus optical flow approach. Machine Vision and Applications 22:4, 597-618. Online publication date: 1-Jul-2011. [CrossRef]