I've parked a 10Kx20, approximately, dataset. To make it grouped, just cbind an equal number of rows from a seq, such as
classes <- sample(c(1,2),10000,replace = TRUE)
See the caret package to do the split.
ggplot2 can do the plotting easily for x,y pairs with `fill = classes"
caret::confusionMatrix
How to train a model is simpler than deciding what model to train. Come back with a more directed question for #5?