Like many of his designer models that did not perform well before, Andreas has removed this AI model from the P123 DM Library (I just checked - selection bias?).
Regards
James
I believe there were other reasons for the removal - the model's performance was actually close to the benchmark, so not necessarily poor.
That said, I do agree that removing Designer Models after launch shouldn't be allowed, as it can introduce selection bias. It would also encourage creators to be more thoughtful before publishing.
If you'd like to support this suggestion, feel free to vote here: Designer Models enhancements
I agree. It's inherently difficult to draw strong conclusions - positive or negative - about any single model's performance.
One major issue is the multiple comparisons problem, especially when models are evaluated in the context of many others. If we're selecting the best-performing models (or even just a few top ones), we run into the issue addressed by the Bonferroni correction: the more comparisons we make, the higher the likelihood that any apparent outperformance is due to chance.
That said, I have a practical suggestion related to this - not just a theoretical point:
It would be helpful if P123 simply displayed the number of models each designer has submitted, along with the total number of designer models submitted overall. That basic context alone would allow users to make more informed assessments about whether a model's performance is likely meaningful - or just the result of random variation.
Importantly, this wouldn't require revealing how the other models performed. To apply a Bonferroni correction, you only need the number of comparisons - not the individual results.
Even if P123 isn't ready to publish complete data on all designer models, sharing just these submission counts would still represent a big step forward in transparency and statistical integrity.
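To make that point concrete, here is a minimal sketch of the correction. The function names and the numbers (a raw p-value of 0.01, 50 submitted models) are hypothetical examples, not P123 data:

```python
# Sketch: a Bonferroni correction needs only the number of comparisons (m),
# not the results of the other models. All numbers below are hypothetical.

def bonferroni_alpha(alpha, m):
    """Per-test significance threshold after correcting for m comparisons."""
    return alpha / m

def bonferroni_p(p, m):
    """Adjusted p-value for one model tested alongside m - 1 others (capped at 1)."""
    return min(1.0, p * m)

# A model with a raw p-value of 0.01 looks significant at alpha = 0.05 in
# isolation, but if it was one of 50 submitted models, the corrected
# threshold is 0.05 / 50 = 0.001, and the adjusted p-value is 0.01 * 50 = 0.5,
# so the apparent outperformance is no longer significant.
threshold = bonferroni_alpha(0.05, 50)
adjusted = bonferroni_p(0.01, 50)
```

This is exactly why publishing the submission counts alone would be enough: `m` is the only extra input a user needs.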
It is pretty straightforward to see whether a particular designer model's performance is positive or negative.
There is a sort function, and if the designer model underperforms the 3M, 1Y, and 2Y benchmarks by a significant margin, it probably means performance is not great.
Here is a screenshot below.
Regards
James
James,
I thought I was actually agreeing with your broader (previous) point, which is that you cannot just look at whether the returns are positive or negative for a small amount of data; rather, as you said, survivorship bias (and other considerations) comes into play.
Maybe I missed the point of one of your posts?