P-packSVM: Parallel Primal grAdient desCent Kernel SVM

Zeyuan Allen-Zhu; Weizhu Chen; Gang Wang; Chenguang Zhu; Zheng Chen

P-packSVM: Parallel Primal grAdient desCent Kernel SVM

Zeyuan Allen-Zhu ,
Weizhu Chen ,
Gang Wang ,
Chenguang Zhu ,
Zheng Chen

ICDM 2009 | December 2009

Published by IEEE Computer Society

Download BibTex

It is an extreme challenge to produce a nonlinear SVM classifier on very large scale data. In this paper we describe a novel P-packSVM algorithm that can solve the Support Vector Machine (SVM) optimization problem with an arbitrary kernel. This algorithm embraces the best known stochastic gradient descent method to optimize the primal objective, and has 1\/ϵ dependency in complexity to obtain a solution of optimization error ϵ. The algorithm can be highly paralleled with a special packing strategy, and experiences sub-linear speed-up with hundreds of processors. We demonstrate that P-packSVM achieves accuracy sufficiently close to that of SVM-light, and overwhelms the state-of-the-art parallel SVM trainer PSVM in both accuracy and efficiency. As an illustration, our algorithm trains CCAT dataset with 800k samples in 13 minutes and 95% accuracy, while PSVM needs 5 hours but only has 92% accuracy. We at last demonstrate the capability of P-packSVM on 8 million training samples.

Copyright © 2007 IEEE. Reprinted from IEEE Computer Society.This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org.By choosing to view this document, you agree to all provisions of the copyright laws protecting it.