As requested, here is the performance of Numerai's two market-neutral hedge funds through the end of 2023, so that everyone can see that even Numerai can get it badly wrong (especially since both are supposed to be market-neutral funds). It may be due to overfitting of their AI/ML models.
I think the 2023 performance was so bad that they have now stopped publishing the monthly numbers.
The machine learning community has always loved overfitting. That's why XGBoost is always the algorithm used by the winners on Kaggle, but it doesn't work well in P123's AI system. The secret is that the Kaggle winners just got lucky!
The fact is that even the simplest linear models can do well. We need more shrinkage, not more fitting.
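A minimal sketch of what "more shrinkage" means in practice: as the Ridge penalty `alpha` grows, the fitted coefficients are pulled toward zero. The data here are synthetic and for illustration only.

```python
import numpy as np
from sklearn.linear_model import Ridge

# Synthetic data: only the first two features carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = X @ np.array([1.0, 0.5, 0.0, 0.0, 0.0]) + rng.normal(scale=0.5, size=200)

# Total coefficient magnitude shrinks as the penalty increases.
norms = []
for alpha in (0.01, 10.0, 1000.0):
    coef = Ridge(alpha=alpha).fit(X, y).coef_
    norms.append(float(np.abs(coef).sum()))

print(norms)  # decreasing: heavier shrinkage, smaller coefficients
```

The point is that shrinkage trades a little bias for a large reduction in variance, which is exactly what noisy financial data rewards.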
Nice point that I had not fully considered in this context.
High-variance models get lucky and will win when there are a lot of trials (or entrants in a Kaggle competition), for sure.
Just as small schools do better (but also worse) when you look at their results (more variance due to the smaller student sample), a 5-stock model will get lucky and be the best if you run a lot of trials (and that used to be quite the rage at P123).
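The small-school / 5-stock effect above can be simulated in a few lines: with many trials, the most extreme average outcomes almost always come from the smallest samples, purely because of their higher variance. The numbers are synthetic.

```python
import numpy as np

rng = np.random.default_rng(42)
n_trials = 2000

# Each row is one "model": the mean return of 5 stocks vs. 100 stocks,
# drawn from the same zero-mean distribution (no real skill anywhere).
small = rng.normal(size=(n_trials, 5)).mean(axis=1)
large = rng.normal(size=(n_trials, 100)).mean(axis=1)

# The best-looking trial comes from the 5-stock group, not because it is
# better, but because its averages are noisier.
print("best 5-stock:", small.max(), " best 100-stock:", large.max())
print("std 5-stock:", small.std(), " std 100-stock:", large.std())
```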
For Extra Trees you reduce the variance by setting min_samples_split or min_samples_leaf to a high number, I think. I believe this extension to Extra Trees Regressors is consistent with ZGWZ's point, if you like the Extra Trees Regressor.
It seems that the default Extra Trees hyperparameters are good enough that my further tweaking only makes things worse. And for some reason, my highest-ranked models are always linear models.
IMO, XGBoost can be very strong on P123 as well. But like Jrinne mentioned with extra trees, there are a lot of hyperparams that are important to help with the overfitting e.g. min_child_weight, subsample, colsample_bytree, gamma, alpha, lambda.
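As a hedged illustration of the hyperparameters listed above, here is what a regularization-heavy starting point might look like using XGBoost's scikit-learn parameter names (`alpha` and `lambda` become `reg_alpha` and `reg_lambda` in that API). The values are illustrative assumptions, not tuned for any particular dataset.

```python
# Illustrative anti-overfitting settings for an XGBoost regressor.
# Every value here is an assumption for demonstration, not a recommendation.
params = {
    "min_child_weight": 10,   # require more weight per leaf before splitting
    "subsample": 0.7,         # row subsampling per tree
    "colsample_bytree": 0.5,  # feature subsampling per tree
    "gamma": 1.0,             # minimum loss reduction required to split
    "reg_alpha": 1.0,         # L1 penalty on leaf weights ("alpha")
    "reg_lambda": 5.0,        # L2 penalty on leaf weights ("lambda")
    "max_depth": 3,           # shallow trees further limit variance
    "learning_rate": 0.05,
}

# With the xgboost package installed, this would be passed as:
#   xgboost.XGBRegressor(**params).fit(X, y)
print(sorted(params))
```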
Hyperparameter search is just another source of overfitting. I would rather use the default parameters for robustness.
The triumph of XGBoost on Kaggle is the result of a combination of the competition format's inability to adequately account for model uncertainty and XGBoost's very large hyperparameter space.
FWIW, my Extra Trees Regressor does best with min_samples_split=4000.
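A sketch of that high-bias / low-variance setup with scikit-learn. The 4000 figure above presumably assumes a large panel of stock-date rows; the toy dataset here is much smaller, so the split threshold is scaled down accordingly (an assumption for the demo, not the poster's actual pipeline).

```python
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor

# Synthetic data: one informative feature plus noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
y = X[:, 0] + 0.1 * rng.normal(size=1000)

# A large min_samples_split forces shallow, smooth trees (less variance).
# 200 on n=1000 is the toy-scale analogue of 4000 on a large panel.
model = ExtraTreesRegressor(
    n_estimators=100,
    min_samples_split=200,
    random_state=0,
)
model.fit(X, y)

preds = model.predict(X[:5])
print(preds.shape)
```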
Note that the Extra Trees Regressor is sensitive to noise variables: because it does not select features according to the optimal split, it can be more likely to split on a noise variable. So it can overfit to noise variables if you include them in your feature set. Random forests, which do select features using the best split, have their own problems with overfitting, as you know.
Linear models, especially with regularization, are inherently resistant to overfitting, as I am sure you already know.
But ZGWZ, you seem to understand this well, and you are probably familiar with the "No Free Lunch Theorem." As I understand it, this is a mathematical proof and therefore unarguably true. It states that there is no single best model for every situation. That certainly applies to the Extra Trees Regressor not always being the best model.
BTW, Claude 3 says this about the No Free Lunch Theorem proving that the Extra Trees Regressor is not always the best (and I hope I have never implied that in my posts): "The theorem is proven mathematically and applies to all optimization problems, including supervised learning tasks in machine learning."
Financial data seems to be well suited to fitting with linear models. Even in the case where I used the logarithm of the raw data as features and expected a nonlinear model to shine, the best model was still Lasso/ENet.
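A minimal sketch of the Lasso/ENet approach described above, with log-transformed features. The data, feature count, and cross-validation settings are all illustrative assumptions, not the poster's actual setup.

```python
import numpy as np
from sklearn.linear_model import ElasticNetCV

# Synthetic positive "raw" data, log-transformed into features,
# with signal only in the first two columns.
rng = np.random.default_rng(1)
raw = rng.lognormal(size=(500, 8))
X = np.log(raw)
y = X[:, 0] - 0.5 * X[:, 1] + rng.normal(scale=0.2, size=500)

# ElasticNetCV chooses the penalty strength by cross-validation and
# zeroes out coefficients of uninformative features.
enet = ElasticNetCV(l1_ratio=[0.1, 0.5, 0.9], cv=5).fit(X, y)
n_selected = int(np.sum(enet.coef_ != 0))
print("features kept:", n_selected, "of", X.shape[1])
```

The L1 component of the penalty is what produces the sparsity: noise features get exactly zero weight rather than small nonzero ones.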