Statistics and Its Interface

Volume 4 (2011)

Number 1

Class-specific variable selection for multicategory support vector machines

Pages: 19 – 26

DOI: http://dx.doi.org/10.4310/SII.2011.v4.n1.a3

Author

Jian Guo (Department of Statistics, University of Michigan, Ann Arbor, Mich., U.S.A.)

Abstract

This paper proposes a class-specific variable selection method for multicategory support vector machines (MSVMs). Different from existing variable selection methods for MSVMs, the proposed method not only captures the important variables for classification, but also identifies the discriminable and nondiscriminable classes so as to enhance the interpretation for multicategory classification problems. Specifically, it minimizes the hinge loss of MSVMs coupled with a pairwise fusion penalty. For each variable, this penalty identifies nondiscriminable classes by imposing their associated coefficients in the decision functions to some identical value. Several simulated and real examples demonstrate that the proposed method provides better interpretation through class-specific variable selection while preserving comparable prediction performance with other MSVM methods.

Keywords

fusion penalty, lasso, support vector machine, variable selection

2010 Mathematics Subject Classification

62F07

Full Text (PDF format)