mlpack_preprocess_binarize - binarize data
Contents
Additional Information
For further information, including relevant papers, citations, and theory, consult the documentation
found at http://www.mlpack.org or included with your distribution of mlpack.
mlpack-4.5.1 29 January 2025 mlpack_preprocess_binarize(1)
Description
This utility takes a dataset and binarizes the variables into either 0 or 1 given threshold. User can
apply binarization on a dimension or the whole dataset. The dimension to apply binarization to can be
specified using the ’--dimension (-d)' parameter; if left unspecified, every dimension will be binarized.
The threshold for binarization can also be specified with the ’--threshold (-t)' parameter; the default
threshold is 0.0.
The binarized matrix may be saved with the '--output_file (-o)' output parameter.
For example, if we want to set all variables greater than 5 in the dataset ’X.csv' to 1 and variables
less than or equal to 5.0 to 0, and save the result to 'Y.csv', we could run
$ mlpack_preprocess_binarize--input_file X.csv --threshold 5 --output_file Y.csv
But if we want to apply this to only the first (0th) dimension of 'X.csv', we could instead run
$ mlpack_preprocess_binarize--input_file X.csv --threshold 5 --dimension 0 --output_file Y.csv
Name
mlpack_preprocess_binarize - binarize data
Optional Input Options
--dimension(-d)[int]
Dimension to apply the binarization. If not set, the program will binarize every dimension by
default. Default value 0.
--help(-h)[bool]
Default help info.
--info[string]
Print help on a specific option. Default value ''.
--threshold(-t)[double]
Threshold to be applied for binarization. If not set, the threshold defaults to 0.0. Default value
0.
--verbose(-v)[bool]
Display informational messages and the full list of parameters and timers at the end of execution.
--version(-V)[bool]
Display the version of mlpack.
Optional Output Options
--output_file (-o) [unknown] Matrix in which to save the output.
Required Input Options
--input_file(-i)[unknown]
Input data matrix.
Synopsis
mlpack_preprocess_binarize-iunknown [-dint] [-tdouble] [-Vbool] [-ounknown] [-h-v]
