step_svast
creates a specification of a recipe
step that will perform x-VAST scaling on the columns
step_xvast( recipe, ..., scaling = "autoscale", role = NA, trained = FALSE, outcome = NULL, means = NULL, sds = NULL, cvs = NULL, na_rm = TRUE, skip = FALSE, id = rand_id("xvast") ) # S3 method for step_xvast tidy(x, ...)
recipe | A recipe object. The step will be added to the sequence of operations for this recipe. |
---|---|
... | One or more selector functions to choose which
variables are affected by the step. See |
scaling | Either |
role | Not used by this step since no new variables are created. |
trained | A logical to indicate if the quantities for preprocessing have been estimated. |
outcome | When a single outcome is available, character
string or call to |
means | A named numeric vector of means. This
is |
sds | A named numeric vector of stadard deviations. This
is |
cvs | A named numeric vector of variation coeficients. This
is |
na_rm | A logical value indicating whether |
skip | A logical. Should the step be skipped when the
recipe is baked by |
id | A character string that is unique to this step to identify it. |
x | A |
An updated version of recipe
with the new step
added to the sequence of existing steps (if any). For the
tidy
method, a tibble with columns terms
(the
selectors or variables selected), value
(the
standard deviations and means), and statistic
for the type of value.
supervised maximum Variable Stability (x-VAST) scaling preforms centering and scaling followed by a weighting of each variable by the maximum of the class-specific variation coeficients.
The argument scaling
controls which scaling method should be used before
variable weighting. autoscale
will perform mean-centering and standard deviation
scaling while pareto
will scale by the square-root of the standard deviation.
Yang, J., Zhao, X., Lu, X., Lin, X., & Xu, G. (2015). A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis. Frontiers in molecular biosciences, 2, 4. https://doi.org/10.3389/fmolb.2015.00004 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428451/
library(tidySpectR) library(recipes) autoscale_xvast <- recipe(Species ~. , iris) %>% step_xvast(all_predictors(), scaling = 'autoscale', outcome = 'Species') pareto_svast <- recipe(Species ~. , iris) %>% step_xvast(all_predictors(), scaling = 'pareto', outcome = 'Species')