svm-subset - a subset selection tool for LIBSVM

Author

       Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging)

Bugs

       Please report bugs to the Debian BTS.

Description

       Training on large data sets is time-consuming, so it is often useful to work on a smaller subset
       first. The Python script subset.py randomly selects a specified number of samples. For
       classification data, we provide a stratified selection that keeps the class distribution of the
       subset the same as that of the full data set.
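
       As an illustration of what stratified selection means, the sketch below draws from each class
       in proportion to its share of the data. It is not the code of subset.py; the function name
       stratified_subset, its parameters, and the largest-remainder rounding are assumptions made
       for this example.

```python
import random
from collections import defaultdict


def stratified_subset(labels, subset_size, seed=0):
    """Return sorted indices of a subset that keeps class proportions.

    Hypothetical helper for illustration only; not part of LIBSVM.
    """
    total = len(labels)
    if not 0 <= subset_size <= total:
        raise ValueError("subset_size must be between 0 and len(labels)")

    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    # Integer part of each class's proportional share.
    quotas = {c: subset_size * len(idx) // total for c, idx in by_class.items()}
    # Remainders decide which classes receive the leftover slots.
    remainders = {c: subset_size * len(idx) % total for c, idx in by_class.items()}
    leftover = subset_size - sum(quotas.values())
    for c in sorted(remainders, key=remainders.get, reverse=True)[:leftover]:
        quotas[c] += 1

    # Sample without replacement inside each class.
    chosen = []
    for c, idx in by_class.items():
        chosen.extend(rng.sample(idx, quotas[c]))
    return sorted(chosen)
```

       With 60 samples of one class and 40 of another, asking for 10 samples yields 6 and 4
       respectively, so the 60/40 ratio is preserved.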

Examples

              svm-subset heart_scale 100 file1 file2

       From heart_scale, 100 samples are randomly selected and stored in file1. All remaining
       instances are stored in file2.
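
       The split into a subset and a remainder can be pictured with the following minimal sketch of
       plain random selection. It is an assumption-laden illustration, not the implementation of
       subset.py: the helper name random_split and its seed parameter are hypothetical, and it
       works on a list of lines already read from the dataset.

```python
import random


def random_split(lines, subset_size, seed=0):
    """Split lines into (subset, rest), keeping the original order in both.

    Hypothetical helper for illustration only; not part of LIBSVM.
    """
    rng = random.Random(seed)
    # Pick subset_size distinct line positions without replacement.
    picked = set(rng.sample(range(len(lines)), subset_size))
    subset = [line for i, line in enumerate(lines) if i in picked]
    rest = [line for i, line in enumerate(lines) if i not in picked]
    return subset, rest
```

       In terms of the example above, subset corresponds to the contents of file1 and rest to the
       contents of file2; every input line ends up in exactly one of the two.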

Files

       See svm-train(1) for the dataset format.

Name

       svm-subset - a subset selection tool for LIBSVM

Options

       -s method
              0      -- stratified selection (classification only) (default)
              1      -- random selection

       output1
              The subset. If output1 is omitted, the subset will be printed on the screen.

       output2
              The rest of the data.

Synopsis

       svm-subset [-s method] dataset number [output1] [output2]
