step_svast creates a specification of a recipe
step that will perform x-VAST scaling on the columns
step_xvast( recipe, ..., scaling = "autoscale", role = NA, trained = FALSE, outcome = NULL, means = NULL, sds = NULL, cvs = NULL, na_rm = TRUE, skip = FALSE, id = rand_id("xvast") ) # S3 method for step_xvast tidy(x, ...)
| recipe | A recipe object. The step will be added to the sequence of operations for this recipe.  | 
    
|---|---|
| ... | One or more selector functions to choose which
variables are affected by the step. See   | 
    
| scaling | Either   | 
    
| role | Not used by this step since no new variables are created.  | 
    
| trained | A logical to indicate if the quantities for preprocessing have been estimated.  | 
    
| outcome | When a single outcome is available, character
string or call to   | 
    
| means | A named numeric vector of means. This
is   | 
    
| sds | A named numeric vector of stadard deviations. This
is   | 
    
| cvs | A named numeric vector of variation coeficients. This
is   | 
    
| na_rm | A logical value indicating whether   | 
    
| skip | A logical. Should the step be skipped when the
recipe is baked by   | 
    
| id | A character string that is unique to this step to identify it.  | 
    
| x | A   | 
    
An updated version of recipe with the new step
added to the sequence of existing steps (if any). For the
tidy method, a tibble with columns terms (the
selectors or variables selected), value (the
standard deviations and means), and statistic for the type of value.
supervised maximum Variable Stability (x-VAST) scaling preforms centering and scaling followed by a weighting of each variable by the maximum of the class-specific variation coeficients.
The argument scaling controls which scaling method should be used before
variable weighting. autoscale will perform mean-centering and standard deviation
scaling while pareto will scale by the square-root of the standard deviation.
Yang, J., Zhao, X., Lu, X., Lin, X., & Xu, G. (2015). A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis. Frontiers in molecular biosciences, 2, 4. https://doi.org/10.3389/fmolb.2015.00004 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428451/
library(tidySpectR) library(recipes) autoscale_xvast <- recipe(Species ~. , iris) %>% step_xvast(all_predictors(), scaling = 'autoscale', outcome = 'Species') pareto_svast <- recipe(Species ~. , iris) %>% step_xvast(all_predictors(), scaling = 'pareto', outcome = 'Species')