World Scientific
  • Search
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×
Our website is made possible by displaying certain online content using javascript.
In order to view the full content, please disable your ad blocker or whitelist our website www.worldscientific.com.

System Upgrade on Tue, Oct 25th, 2022 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at [email protected] for any enquiries.

LEARNING SPARSE MIXTURE MODELS FOR DISCRIMINATIVE CLASSIFICATION

    Recently Saul and Lee proposed a mixture model for discriminative classification of non-negative data via non-negative matrix factorization for feature extraction. In order to improve the generalization, this paper considers a sparse version of the model. The basic idea is to minimize the sum of the weights of un-normalized mixture models for posterior distributions according to regularization method. Experiments on CBCL face database and USPS digit data set assess the validity of the proposed approach.

    References

    • CBCL Face Database #1, MIT Center For Biological and Computation Learning, http://www.ai.mit.edu/projects/cbcl . Google Scholar
    • Z. Chen and S. Haykin, Neural Comput. 14, 2791 (2002). Crossref, ISIGoogle Scholar
    • R. O.   Duda , P. E.   Hart and D. G.   Stork , Pattern Classification , 2nd edn. ( John Wiley & Sons , 2001 ) . Google Scholar
    • K.   Fukunaga , Introduction to Statistical Pattern Recognition , 2nd edn. ( Morgan Kaufmann , 1990 ) . Google Scholar
    • P. O. Hoyer, Non-negative sparse coding, Neural Networks for Signal Processing XII, Proc. IEEE Workshop on Neural Networks for Signal Processing (2002) pp. 557–565. Google Scholar
    • T. S. Jaakkola and D. Haussler, Advances in Neural Information Processing Systems (1999) pp. 487–493. Google Scholar
    • A.   Klautau , N.   Jevtic and A.   Orlitsky , Discriminative Gaussian mixture models: a comparison with Kernel classifiers , Proc. Twentieth Int. Conf. Machine Learning (ICML-2003) ( 2003 ) . Google Scholar
    • D. Keysers, F. J. Och and H. Ney, Maximum entropy and Gaussian models for image object recognition, DAGM 2002, Pattern Recognition, 24th DAGM Symp., Lecture Notes in Computer Science 2449 (Springer-Verlag, 2002) pp. 498–506. Google Scholar
    • D. D. Lee and L. K. Saul, Advances in Neural Information Processing Systems (2001) pp. 556–562. Google Scholar
    • D. D. Lee and H. S. Seung, Nature 401, 788 (1999). Crossref, ISIGoogle Scholar
    • S. Z.   Li , X. W.   Hou and H. J.   Zhang , Learning spatially localized parts-based representation , Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition ( 2001 ) . Google Scholar
    • W. Liu, N. Zheng and X. Lu, Non-negative matrix factorization for visual coding, Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (2003) pp. III_293–III_296. Google Scholar
    • D. J. C. Mackay, Bayesian methods for adaptive models, Ph.D. thesis, California Institute of Technology, USA (1991) . Google Scholar
    • J. B. Paris, Synthese 117, 75 (1999). Crossref, ISIGoogle Scholar
    • G. Rätsch, T. Onoda and K.-R. Müller, Mach. Learn. 42(3), 287 (2001). Crossref, ISIGoogle Scholar
    • S.   Raudys , Statistical and Neural Classifiers: An Integrated Approach to Design ( Springer , London , 2001 ) . CrossrefGoogle Scholar
    • Y. D. Rubinstein and T. Hastie, Discriminative vs informative learning, Proc. Third Int. Conf. Knowledge Discovery and Data Mining (1997) pp. 49–53. Google Scholar
    • W. S. Sarle, Neural Network FAQ, Part 3 of 7: Generalization, Periodic Posting to the Usenet Newsgroup Comp.ai.neural-nets, ftp://ftp.sas.com/pub/neural/FAQ.html (2001) . Google Scholar
    • L. K.   Saul and D. D.   Lee , Advances in Neural Information Processing Systems ( 2002 ) . Google Scholar
    • R. Schlüter, B. Müller and H. Ney, Speech Commun. 34, 287 (2001). Crossref, ISIGoogle Scholar
    • USPS data set, available on http://www.kernel-machines.org/data.html . Google Scholar
    • V. N.   Vapnik , The Nature of Statistical Learning Theory , 2nd edn. ( Springer , NY , 2000 ) . CrossrefGoogle Scholar
    • P. M. Williams, Neural Comput. 1, 425 (1994). Google Scholar
    • M. H. Yang, D. Kriegman and N. Ahuja, IEEE Trans. Patt. Anal. Mach. Intell. 24, 34 (2002). Crossref, ISIGoogle Scholar
    • T. Zhang, Advances in Neural Information Processing Systems (2001) pp. 703–709. Google Scholar