Of course, you can tell whether you have overfit you ranking system if you have been using it for a while. You can tell if you have added a feature (or features) that has caused overfitting.
For example, there are some systems that I have been using since 2014.
I simply do a rank performance test from 2014 until now and look at the performance. I then remove a factor (just a single factor one each time) and normalize the other factors.
If your performance improves over the out-of-sample period when you remove one of the factors then perhaps you have overfit you model.
This is cross-validation. More specifically, it is cross-validation for feature selection.
So, some might debate what to do going forward. Removing the factor and re-optimizing the system over the max period (2000 or 2005 to now) would be one possible way.
But whatever you do, you may not need the “experts” at P123 to tell you whether you have overfit your system. And you can do this with a hold-out period before you start to run the system live.
If you do this with a hold-out period you would optimize from 2000 to 2013, say. Once optimized you would run the system from 2014 to now and remove factors one-at-a-time and normalize the others—as outlined above.
If the system’s performance improves after removing some factors you would consider (it is up to you) not using those factors in your final system. You would then re-optimize the system over the max period, form 2000 (or 2005) till now—without the factors that might cause overfitting.
Or not. But if not, you must not believe that overfitting is a potential problem. You must think it is just a fun topic for the forum, I guess.
Make no mistake, this is from Statistics 101 or a first course in Econometrics. It is a simple topic that belongs in the forum.
And honestly, someone (anyone) at P123 should be more than familiar with this topic. Upper division undergraduate stuff—along with a few things that use a standard deviation–would not be a bad thing.
In any case, there should be no resistance to an idea that is USED EVERYWHERE.
To be fair this has been on the forum before. Denny Halwes advocated this but he used even/odd universes. This is not appropriate because of a non-PIT type of problem. Support for this being a problem can be found in “Advances in Financial Machine Learning” by Marcos López de Prado. Even/odd works with some types of data but not stock data.
Denny was ahead of his time. But I wonder if his Designer Models didn’t have some overfitting problems. Perhaps, they had problems from the use of even/odd universes.
Whether you use multiple regression, kernel regression, Econometrics or the P123 method–with stock data—a pro or academic will use something like what is outlined above. Walk-forward validation being a more advance, but similar, method. The main advantage of Walk-Forward and other methods being they avoid the non-PIT problem
-Jim