Maybe I’m mistaken, but since the 2020 switchover of data providers, it has become my impression that Portfolio123 powered by FactSet might NO LONGER be accurately characterized as Point-In-Time Data. If a database includes significant errors on the day it’s published, we, as disciplined, systematic investors, must assess if it is reliable enough for our needs. Fortunately, in my case, the use of factors from the Fundamental Category has dropped off significantly since 2016, when I concentrated my investment efforts and advisory services 100% on ETFs. ETFs are far superior investment vehicles than common stocks for an evidence-based, data-driven investment process, IMO.
It seems obvious that FactSet (and every other data provider) would want to eliminate ERRORS in the data to the fullest extent possible. After all, if you operate a business hawking farm-fresh chicken eggs each morning from a pedal-powered cart, you wouldn’t stay in business long if you let slightly rotten eggs get through to the customer. The quality of goods sold must be high for any business to be successful. The entire data-use continuum is affected by rotten data, from FactSet, to Portfolio123, to ETFOptimize, and finally, the end-user-investor can all be affected by a subpar quality of goods sold from the perspective of Point-In-Time data.
The two most critical, data-quality characteristics that are necessary for any systematic investment process to be accurate and successful are: 1) Data is POINT-IN-TIME - meaning that historical data is as accurate as possible at the time posted, and 2) there is NO SURVIVORSHIP BIAS in the data.
Eliminating Survivorship Bias means that at any backtested point in time, the companies or ETFs included in the database are the ones that existed at that time. For example, suppose you were to assess today’s list of the ~ 500 constituents of the S&P 500 Index/ETF (SPY). In that case, that list might be significantly different than the list of the 500 companies that were included in the index in 2007 (before the Financial Crisis).
However, if we used the list of today’s companies to assess historical market conditions, today’s list only includes corporate winners. It would not have the terrible stock performances during the 2007-2009 Financial Crisis when many publicly traded companies failed or were delisted from the S&P 500. Businesses such as Bear Stearns, IndyMac, Washington Mutual, Lehman Brothers, and AIG are no longer around, so it may be said that this list would exhibit Survivorship Bias when examining the historical performance of today’s S&P 500 companies.
To me, the elimination of Survivorship Bias is another of many examples where accurate Point-In-Time data is invaluable. Quantitative designers must have a precise list of active public companies or ETFs constituting the Universe as it evolves, with correct constituents of the Universe on each specific date. Right now, this is most important to me because I am creating a plethora of breadth indicators. However, to Jim’s point below, I also use earnings estimates for the constituents of ETFs.

