Why shouldn't resampling be automated?

here is link.

One thing that should not be automated is the choice of resampling method. The code templates require the user to choose the rsample function that is appropriate.

Why shouldn't resampling be automated?
I think, Do I need to adjust the ratio depending on the amount of data, because it will have a negative impact on the generalization performance?

Additional question.

Is it possible that tuning automation will appear in the future for functions for sampling as well?
Will there be a package like tune?
Shouldn't that theoretically be automated too?

thank you!

The sampling you want to do will depend on what kind of data you have and the task at hand you are trying to accomplish. Assuming you want to estimate predictive performance and go with cross-validation, you still need to decide which type of cross-validation you want to do.

You could do a

There are also cases where you don't want to do cross-validation as your resampling scheme and you want to do bootstrapping instead.

There is no one best resampling method that works all the time hence why this field is left for the user to specify.

3 Likes

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.

If you have a query related to it or one of the replies, start a new topic and refer back with a link.