15.3 Creating Training and Testing Sets and the Resampling Control
- Use the rsample package with stratified sampling to split the data
- Use repeated cross-validation for resampling
library(rsample)
library(caret)
set.seed(999) #for reproducibility (can pick your own seed, but keep it consistent)
idx = initial_split(data = data_cr, prop = 0.8, strata = "Creditability") #80/20 split, stratified on the outcome
d_train1 = training(idx)
d_test1 = testing(idx)
prop.table(table(d_train1$Creditability))
0 1
0.3 0.7
prop.table(table(d_test1$Creditability))
0 1
0.3 0.7
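The matching class proportions above are what stratification is meant to achieve. As a rough illustration of the idea (not of how rsample implements it internally), the base R sketch below samples 80% of the rows within each class separately. The data frame toy_df and its columns are hypothetical stand-ins, not the credit data used in this section.

```r
# Base R sketch of a stratified 80/20 split on synthetic data.
# toy_df is a hypothetical data frame with a 30/70 class balance.
set.seed(999)
toy_df <- data.frame(
  x = rnorm(1000),
  Creditability = factor(rep(c(0, 1), times = c(300, 700)))
)

# Group the row indices by class, then sample 80% within each class
rows_by_class <- split(seq_len(nrow(toy_df)), toy_df$Creditability)
train_rows <- unlist(
  lapply(rows_by_class, function(r) sample(r, size = round(0.8 * length(r)))),
  use.names = FALSE
)

toy_train <- toy_df[train_rows, ]
toy_test  <- toy_df[-train_rows, ]

# Both sets keep the original class balance
prop.table(table(toy_train$Creditability))
prop.table(table(toy_test$Creditability))
```

Because each class is sampled on its own, neither set can end up with a class balance that drifts far from the full data, which a purely random split does not guarantee.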
cntrl1 = trainControl(method = "repeatedcv", number = 10, repeats = 2) #repeated 10-fold cross-validation, repeated twice
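To make the resampling scheme in the trainControl call concrete, the sketch below uses caret's createMultiFolds on a synthetic outcome to show how many resamples 10 folds repeated twice produce. The vector y_toy is a hypothetical stand-in for the training outcome, and this only illustrates the fold structure; it does not fit a model.

```r
library(caret)

set.seed(999)
# Hypothetical outcome vector with a 30/70 class balance
y_toy <- factor(rep(c(0, 1), times = c(240, 560)))

# Each list element holds the training row indices for one resample;
# 10 folds repeated 2 times gives 10 * 2 = 20 resamples
folds <- createMultiFolds(y_toy, k = 10, times = 2)
length(folds)

# Every resample trains on roughly 90% of the rows and holds out the rest
summary(lengths(folds))
```

Each model is therefore fitted and evaluated 20 times, and the reported performance is averaged over those resamples. More repeats give a more stable estimate at the cost of longer run time.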