Generate a variable used to select a training sample

make_train(n = 0.7, nr = NULL, blocks = NULL, seed = 1234)

Arguments

n

Number of observations to label as training, or the fraction of observations when the value is less than 1

nr

Number of rows in the dataset

blocks

A vector to use for blocking or a data.frame from which to construct a blocking vector

seed

Random seed

Value

A 0/1 vector for filtering, where 1 marks an observation as part of the training sample
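The sketch below illustrates how a 0/1 vector of this kind could be built and then used to split a data frame. It covers only the unblocked case. `make_train_sketch` is a hypothetical stand-in written in base R for illustration, not the package's source code, and it assumes that `n` below 1 is read as a fraction of `nr`.

```r
# Hypothetical stand-in for illustration only; not the package implementation.
make_train_sketch <- function(n = 0.7, nr, seed = 1234) {
  set.seed(seed)
  # Assumption: n below 1 is a fraction of nr, otherwise a count of rows
  n_train <- if (n < 1) round(n * nr) else n
  training <- rep(0L, nr)
  # Mark n_train randomly chosen rows as training
  training[sample.int(nr, n_train)] <- 1L
  training
}

# Use the 0/1 vector to split a data frame into training and test rows
train <- make_train_sketch(0.7, nrow(mtcars))
training_set <- mtcars[train == 1, ]
test_set <- mtcars[train == 0, ]
```

Because the seed is set inside the function, repeated calls with the same arguments return the same split.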

Examples

make_train(.5, 10)
#>  [1] 1 1 0 0 0 1 0 1 1 0
make_train(.5, 10) %>% table()
#> .
#> 0 1 
#> 5 5 
make_train(100, 1000) %>% table()
#> .
#>   0   1 
#> 900 100 
make_train(.15, blocks = mtcars$vs) %>% table() / nrow(mtcars)
#> .
#>       0       1 
#> 0.84375 0.15625 
make_train(.10, blocks = iris$Species) %>% table() / nrow(iris)
#> .
#>   0   1 
#> 0.9 0.1 
make_train(.5, blocks = iris[, c("Petal.Width", "Species")]) %>% table()
#> .
#>  0  1 
#> 75 75
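The blocked examples above keep the training share roughly constant within each block. One way to get that behavior is sketched below in base R. `stratified_train_sketch` is a hypothetical helper, and rounding the share within each block is an assumption. The package may allocate rows to blocks differently, so counts can differ from the output shown above.

```r
# Hypothetical helper for illustration only; not the package implementation.
stratified_train_sketch <- function(frac, blocks, seed = 1234) {
  set.seed(seed)
  # Collapse a data.frame of blocking columns into one key per row
  if (is.data.frame(blocks)) blocks <- do.call(paste, c(blocks, sep = "-"))
  training <- rep(0L, length(blocks))
  # Within each block, mark round(frac * block size) randomly chosen rows
  for (idx in split(seq_along(blocks), blocks)) {
    if (length(idx) == 0) next
    k <- round(frac * length(idx))
    training[idx[sample.int(length(idx), k)]] <- 1L
  }
  training
}

train <- stratified_train_sketch(0.10, iris$Species)
table(iris$Species, train)
```

Sampling positions with `sample.int` rather than calling `sample` on the row indices avoids the base R behavior where `sample(x, k)` treats a single number `x` as the range `1:x`.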