P123,
P123 is the premier platform for machine learning.
P123 is integrating machine learning into its platform—as I understand it. I have little specific infromation on this but I anxiously await the roll-out.
In addition, Dan Parquette is continuously improving the API that some members use for machine learning. Or a whole host of algorithms that may be entirely separate from machine learning (I guess). Obviously, bootstrapping is entirely separate from machine learning. Obviously.
Yep, P123 is the premier platform for machine learning for retail investors (and some professionals).
James (ustonapc) made me aware of this site: numerai
I probably will not participate in this but it may be interesting for several reasons:
- Probably easier that Quantopian because the data is closer to that array I keep requesting: Array with columns of features and target (excess returns). Numari sees the wisdom in this. Heck, upload it into SPSS (or JASP), run a multiple regression and see if you win. Total time required: 10 minutes. Better yet, just do it in Excel. Probably the ultimate goal for P123 as Dan seems to be progressively moving toward this—recently providing Google spreadsheets with easy loading into Colab.
[color=firebrick]Yep. From the image: “# Training data contains features and targets”[/color] This is definitely just a csv file with factors (features) and targets. What a great idea! And thank you Dan for moving in that direction.
-
Probably similar to how a Kaggle competition handles data. Kaggle may be a large target audience for P123 marketing.
-
Useful for P123 to gauge interest and find its market for machine learning.
-
Supports what Steve Auger (InspectorSector) has said and what P123 MAY be doing with machine learning. Namely, Boosting is often an optimal or near optimal machine learning tool. Or, at least, MAY belong in marketing literature. Image loading XGBoost at numari below.
More importantly, Boosting is ideally suited for using ranks as factors (features). Something that Marco understands, I believe. Few other methods can claim this. Arguably neural nets can be constructed that can use ranking data effectively.
[color=firebrick]This is a pretty good example of how P123 should ultimately want to handle the data, I think.[/color] Down to keeping a hold-out set with no target to be used at the very end to test the model. Apparently done by numari (and not the user) here. But important no matter how you look at it.
Not too many lines of code either.
Jim