Let's say the stack had an R-squared of 0.98 on the training set and 0.9 on the test set. Is that considered underperformance? Or does the test R-squared have to be significantly lower than the training R-squared before it counts as underperformance?
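To make the comparison concrete, here is a minimal sketch of the quantity I'm asking about, in plain Python. The helper names `r_squared` and `train_test_gap` are hypothetical, made up for illustration, and the 0.98 and 0.90 values are just the example figures from this question, not output from a real model.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: error of always predicting the mean.
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def train_test_gap(r2_train, r2_test):
    """Absolute drop in R-squared from training data to test data."""
    return r2_train - r2_test


# Example figures from the question (assumed, not measured).
gap = train_test_gap(0.98, 0.90)
print(f"train/test R-squared gap: {gap:.2f}")
```

So the question is whether a gap of this size is already a problem, or whether only a much larger gap would be.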
Thank you