It uses only one feature for prediction (the SMA of the same feature that is being targeted)
It is parametric, assuming linearity and normality
Most uses of SMA assume trending, while we found that mean reversion dominates
I ran a similar analysis using k-nearest neighbors (KNN), this time using the SMAs of all features to predict the future returns of a single target feature, and repeated this for each target feature.
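As a rough sketch of this setup (synthetic weekly factor returns, an assumed 4-week SMA window, and an arbitrary choice of k, none of which come from the analysis above), the loop below fits a KNN regressor on the SMAs of all factors to predict each factor's forward 4-week return and reports a walk-forward cross-validated R²:

```python
import numpy as np
import pandas as pd
from sklearn.neighbors import KNeighborsRegressor
from sklearn.model_selection import TimeSeriesSplit, cross_val_score

# Synthetic stand-in for weekly factor returns: rows are weeks, columns are factors.
rng = np.random.default_rng(0)
factor_returns = pd.DataFrame(
    rng.normal(0, 0.01, size=(520, 5)),
    columns=[f"factor_{i}" for i in range(5)],
)

WINDOW = 4  # assumed 4-week window for both the SMA and the forward return

# Predictors: trailing 4-week SMA of every factor's returns.
smas = factor_returns.rolling(WINDOW).mean()

def forward_return(series, horizon=WINDOW):
    # Sum of returns over the next `horizon` weeks, aligned to the prediction date.
    return series.rolling(horizon).sum().shift(-horizon)

scores = {}
for target in factor_returns.columns:
    y = forward_return(factor_returns[target])
    data = pd.concat([smas, y.rename("y")], axis=1).dropna()
    X, y_clean = data.drop(columns="y"), data["y"]

    knn = KNeighborsRegressor(n_neighbors=20)
    cv = TimeSeriesSplit(n_splits=5)  # walk-forward folds: train only on earlier data
    scores[target] = cross_val_score(knn, X, y_clean, cv=cv, scoring="r2").mean()

print(pd.Series(scores).round(3))
```

TimeSeriesSplit is used here instead of a shuffled split so each fold trains only on data that precedes its test window, which matters when the features and targets are built from overlapping return windows.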
Implications:
Using all factors together provides marginal improvement in predictive power
Non-linear relationships may be present but are still quite weak
Even with more sophisticated methods, factor predictability remains challenging
The improvement in R² values suggests there is some benefit to considering multiple factors together, but the overall low values still indicate limited predictability in factor returns.
Specifically, I used 4 weeks of returns to predict the next 4 weeks of returns. Here is an example of a factor that mean-reverted: the regression line had a negative slope. At the far left of the graph, the 5 data points with the worst trailing 4 weeks (the x-axis) went on to have some of the best 4-week returns going forward (the y-axis), and none of those 5 data points showed negative forward returns.
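A minimal sketch of that slope check, assuming a pandas Series of weekly returns for one factor (the data and the helper name are illustrative, not taken from the original analysis):

```python
import numpy as np
import pandas as pd
from scipy import stats

def mean_reversion_slope(weekly_returns, window=4):
    """Regress forward 4-week returns on trailing 4-week returns.

    A negative slope is consistent with mean reversion, a positive slope
    with trending. Note the overlapping 4-week windows induce autocorrelation,
    so the p-value reported here is optimistic.
    """
    trailing = weekly_returns.rolling(window).sum()
    forward = weekly_returns.rolling(window).sum().shift(-window)
    pairs = pd.concat([trailing.rename("x"), forward.rename("y")], axis=1).dropna()
    result = stats.linregress(pairs["x"], pairs["y"])
    return result.slope, result.pvalue

# Illustrative use with synthetic weekly returns standing in for one factor.
rng = np.random.default_rng(1)
returns = pd.Series(rng.normal(0, 0.01, 520))
slope, pvalue = mean_reversion_slope(returns)
print(f"slope={slope:.3f}  p={pvalue:.3f}")
```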
But one could argue for a U-shaped curve in this data, which would be one reason to use KNN, since it does not assume linearity. Looking at the scatter plot, extreme negative returns (left side) and extreme positive returns (right side) both tend to be followed by positive returns, suggesting a U-shaped relationship. However, it becomes harder to classify a factor as simply trending or mean-reverting when a U-shaped curve appears. While less descriptive, KNN methods may have more predictive power when the data is not linear.
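To illustrate the point with synthetic data rather than the actual factor returns, the sketch below builds a U-shaped relationship and compares an ordinary least-squares fit with a KNN fit. The linear model's R² stays near zero because the relationship is symmetric around zero, while KNN's local averaging can follow the curve:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neighbors import KNeighborsRegressor

# Synthetic U-shape: extreme trailing returns of either sign are followed by
# positive forward returns, while mild trailing returns are followed by ~zero.
rng = np.random.default_rng(2)
x = rng.normal(0, 0.05, 500).reshape(-1, 1)
y = 5.0 * x.ravel() ** 2 + rng.normal(0, 0.005, 500)

linear = LinearRegression().fit(x, y)
knn = KNeighborsRegressor(n_neighbors=25).fit(x, y)

print("linear R^2:", round(linear.score(x, y), 3))  # near zero: a slope cannot capture the U
print("KNN R^2:   ", round(knn.score(x, y), 3))     # higher: local averaging traces the curve
```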