Hello guys,
I am relatively new to machine learning generally and R specifically.
Is the (caret) package a stronger tool for using random forest rather than the (randomForest) package?
From my understanding, caret helps optimize the hyperparameters of a random forest model, is the (randomForest) package capable of finding the optimum hyperparameters as well? or they can be found after multiple iterations in the generated code from that package? e.g.
I'm biased but yes it is appropriate for that and other models. As is the tidymodels.
caret, mlr, and tidymodels follow a methodology that is a lot less risky than repeated calling of the same function using different parameters. You could be setting yourself up for overfitting otherwise.
Poor accuracy could be a function of many things (including not having informative predictors). It might help to describe what you are trying to do, the type of data, etc.
That's a warning (not an error). There are cases where the model predicts the same value for all samples. The result is that R2 can't be calculated and produce an NA. It's not a warning that should stop you form using the results; the models with that issue are no good anyway.