TEJ Point-in-Time Audited Financial Database  – Rejecting “Peek-ahead” Backtesting

Introduction

In quantitative investment research, financial data is the cornerstone of strategy development and validation. Every factor or model design must be built on reliable and complete financial data. If the data itself contains flaws or omissions, even the most sophisticated strategies may fail in the real market. Therefore, establishing a financial database that can faithfully restore the true historical picture is crucial for quantitative researchers. 

TEJ Point-in-Time Audited Financial Database designed specifically for quantitative strategy research, provides Point-in-Time (PIT) data as the foundation for backtesting

Why does backtesting performance always get “discounted” in live trading?

In many research scenarios, traditional databases or self-collected data yield impressive results in backtesting, yet the performance in real markets frequently falls short of expectations. This gap does not stem from flaws in strategy logic, but rather from two fatal issues hidden in the raw data: look-ahead bias and survivorship bias. 

  • Look-ahead Bias 
    Backtests may mistakenly include information that was only published later. Such data did not exist at that point in history, but was incorrectly fed into the model, creating misleadingly strong results. 
  • Survivorship Bias 
    Tranditional databases that only keep currently listed firms exclude those that went bankrupt, delisted, or were merged. This makes the sample inconsistent with the real market, overstating returns and understating risks. 

These two pain points are the most easily overlooked in quantitative research and strategy backtesting, yet they are precisely the key factors that determine whether a strategy can truly work in the market. 

The foundation of a reliable strategy: Point-in-Time 

To build a strategy that truly reflects reality and withstands the test of the market, its foundation must be established on a data structure that can “reproduce the information available at the historical moment.” This is the core spirit of Point-in-Time (PIT). 

TEJ, with 30 years of data processing experience, has specially developed the Financial Database for Investment -TEJ PIT Audited Financial Database  designed for rigorous quantitative research and institutional-level backtesting.

TEJ PIT Audited Financial Database:Restoring the True Historical Picture

The core of TEJ PIT Audited Financial Database is a rigorous Point-in-Time (PIT) data framework. Our data source is based on the public information disclosed via the Market Observation Post System (MOPS). However, since official data are overwritten once corrections are made, such data cannot be directly used for backtesting. 

The value of TEJ lies in the subsequent two core processes, ensuring that your research can faithfully reconstruct the complete historical landscape. 

Eliminating Look-ahead Bias:Announcement Timestamp & Full Version Retention

For researchers, attempting to manually reconstruct history from MOPS is an extremely difficult task. Two major challenges arise: 

  • Overwriting of structured data:The financial statement items available on MOPS are always the latest version. Once a company reissues or reclassifies a report, the old figures are overwritten and become irretrievable. 
  • Cumbersome manual tracking:Even if changes are identified, researchers must manually navigate multiple MOPS sections—such as  “Electronic Books  ,” or “Material Information”—to locate updated reports. This process is not only time-consuming and error-prone, but also greatly reduces research efficiency. 

The PIT framework of TEJ systematically resolves these issues and guarantees the most accurate historical snapshot: 

  • Full version retention: All versions of submitted financial statements are collected and preserved without omission. 
  • Precise announcement timestampsEach version is labeled with its exact announcement time, allowing backtesting programs at any point T to automatically retrieve only the correct financial version announced before T. 

Case1: TRANS-SUN (6967.TW) 

The company mistakenly reported the number of “total treasury shares “. On August 14, 2025 , it issued a correction via MOPS “Material Information,” revising the figure from 25,637 shares to 500,000 shares. 

In TEJ PIT Audited Financial Database, we retained both versions with their respective timestamps. 

  • Any backtest prior to 2025/08/14 would still read the old figure of 25,637 shares. 
  • Only after that exact timestamp would the corrected figure of 500,000 shares take effect. 

This precise treatment ensures that no backtest uses future information and that strategies reflect the reality known to the market at the time.   

Figure 1:Correction notice for TRANS-SUN (6967.TW)’s Q2 2025 financial report

這張圖片的 alt 屬性值為空,它的檔案名稱為 image-707.png
Source: MOPS Material Information 

Figure 2:TEJ database retains complete versions of TRANS-SUN (6967.TW) ‘s financial data for Q2 2025, both before and after auditor certification. 

這張圖片的 alt 屬性值為空,它的檔案名稱為 6967.TW_-1024x423.png
Source: TEJ PIT Audited Financial Database 

More importantly, this correction was not fully disclosed in the English version or other electronic filings, since the original financial PDF itself contained no error. This highlights the fragmentation and inconsistency of official information sources, making it easy for researchers to miss such critical changes. TEJ’s systematic collection process is designed to resolve such gaps.  

📌 You may intetesting: TEJ Factor Library-100+ factors built for Taiwan market

Case 2: TSMC Financial Reports 

TEJ’s handling of announcement dates balances historical completeness with maximum precision. Since 2013 (based on the cover date of reports), announcement times are recorded down to hours, minutes, and seconds. This granularity even allows backtesting accuracy to extend to intraday price reactions. 

Figure 3:TSMC (2330) financial report release dates in TEJ database, 2013 

這張圖片的 alt 屬性值為空,它的檔案名稱為 2330.TW_-1024x439.png
Source: TEJ PIT Audited Financial Database 

Ensuring Integrity: Building a Complete and Unified Sample Space 

A real market includes successful firms, delisted companies, and industries with very different models. If the sample is incomplete, strategy returns will be distorted. TEJ PIT Audited Financial Database secures sample integrity from two angles: 

  • Time – removing survivorship bias: Covers all companies ever listed since 2005, including delisted firms. 
  • Industry dimension – financial and non-financial firms use different reporting structures, and raw statements are usually split across separate tables. TEJ database consolidates all industries, including financials, into a single unified module. 

This design ensures the strategy universe always matches the actual market at any point in history. It provides a complete, unbiased, and efficient foundation for market averages and cross-industry factor tests. 

Example: Both Siliconware Precision (2325, delisted in 2018) and Cathay Financial Holdings (2882, still listed) are fully preserved in the database. 

Figure 4:TEJ PIT Audited Financial Database provides complete listed, delisted, financial and non-financial firms

這張圖片的 alt 屬性值為空,它的檔案名稱為 2325.TW_-1024x396.png
Source: TEJ PIT Audited Financial Database 

TEJ’s Value-added Features: From Data to Research Efficiency 

TEJ PIT Audited Financial Database not only ensures unbiased backtesting but also provides value-added features that turn raw statements into ready-to-use research materials, enabling focus on strategy development instead of data preprocessing. 

Standardized Accounting Items: Achieving True Comparability 

Quantitative research relies not only on raw financial items but also on numerous ratios derived from them. Researchers often need to extract multiple fields and calculate them manually, a process that is tedious and prone to error when handling different reporting periods. 

To address this, TEJ PIT Audited Financial Database builds on standardized accounting items from MOPS and pre-computes more than 300 commonly used ratios and indicators, covering: 

  • Financial statements:balance sheet, income statement, cash flow statement.
  • Ratios:ROA, ROE, inventory turnover, asset turnover, revenue growth, operating income growth, current ratio, operating expense ratio, BPS, EPS, etc.
  • Financial sector specificsDiscounts and loans – Banking, insurance commission income, net securities brokering income, etc. 

Most importantly, all derived indicators adopt the Point-in-Time methodology, ensuring calculations across periods are also free from look-ahead bias. 

Bridging Accounting Standards: GAAP Adjusted to IFRS 

Taiwan adopted IFRS for all listed companies in 2013, creating a discontinuity in long-term data. For researchers needing 15–20 years of history, reconciling GAAP with IFRS manually is both complex and time-consuming. 

TEJ resolves this by adjusting all pre-2012 GAAP reports into IFRS-equivalent formats. This enables seamless long-term analysis across accounting regimes, ensuring strategy validity is not distorted by changes in reporting standards. 

Rich Time Frequencies: Quarterly, Accumulated, and TTM 

MOPS does not directly provide Q4 standalone data or Trailing Twelve Months (TTM) values. TEJ fills these gaps by offering three reporting frequencies, clearly marked as A / Q / TTM: 

  • Accumulated (A):As originally reported, e.g., Q3 accumulated covers January–September. 
  • Quarterly (Q):Derived by subtracting prior cumulative values, ensuring consistency across quarters. This includes calculated Q4 data absent from MOPS. 
  • TTM (Trailing Twelve Months):Sum of the most recent four quarters, filtering out seasonality and revealing long-term trends. 

These flexible timeframes empower more robust factor design for both value and growth strategies. 

Figure 5:TEJ database provides complete annual, quarterly, and trailing twelve-month (TTM) data 

這張圖片的 alt 屬性值為空,它的檔案名稱為 2330.TW_1-1024x280.png
Sample:TSMC (2330.TW)  Source: TEJ PIT Audited Financial Database 

Industry-specific Details: Supporting Financial Sector Accounting 

Different industries operate with distinct models and key indicators, and generic items are insufficient to evaluate financial firms. To address this, TEJ PIT Audited Financial Database includes sector-specific accounts for banks, insurers, holding companies, and securities firms. This allows factors to be built closer to industry fundamentals, expanding both the depth and scope of analysis. 

Figure 6:Examples of accounting categories specific to the financial industry 

這張圖片的 alt 屬性值為空,它的檔案名稱為 2882.TW_-1024x304.png
Source: TEJ PIT Audited Financial Database

Case Study:  XPEC Entertainment  (3662.TW) – Seeing the Value of Point-in-Time 

The well-known case of XPEC Entertainment (3662.TW) was not only a failed merger but also a litmus test for the rigor of backtesting data. 

In 2016, the market was driven into frenzy after Japanese investor “Bai Chi Gan Tou Digital Entertainment” announced a high-premium tender offer for XPEC. This dream quickly collapsed when the acquirer defaulted at the end of August, leading to a stock price crash and severe investor losses. 

 Figure7:Stock price XPEC (3662.TW) of and timeline of events 

這張圖片的 alt 屬性值為空,它的檔案名稱為 3662.TW_-1024x508.png
(Source:TEJ stock price database) 

More critical issues emerged afterward. On May 16, 2016, XPEC originally reported Q1 EPS as positive (0.1). Yet, after the fraud was exposed, the company retroactively announced on August 14, 2017, that the true Q1 EPS was -16.55, a massive loss. 

Figure 8Multiple historical versions of XPEC (3662.TW) ‘s EPS data for Q1 2016

這張圖片的 alt 屬性值為空,它的檔案名稱為 3662.TW_1-1024x541.png
Source: TEJ PIT Audited Financial Database 

This case highlights two very different backtesting outcomes: 

  1. Contaminated backtest (typical databases): 
    If the database only keeps the “latest version,” any backtest conducted after August 2017 would show Q1 2016 EPS as -16.55. A model might appear to have avoided XPEC, but that is a false success—information not available in 2016 was wrongly used. 
  2. True backtest (TEJ PIT Audited Financial Database): 
    With full version retention and precise announcement dates, the database shows only EPS = 0.1 during 2016/05/16–2017/08/13. Only from 2017/08/14 does EPS = -16.55 enter the dataset. This ensures strategies reflect the market reality at the time, not future knowledge. 

This example demonstrates the core value of Point-in-Time: fair and credible backtesting based solely on information available to the market at each historical moment. 

From Data Users to Data Masters 

The success of quantitative research starts with the quality of data. Even a seemingly minor flaw can invalidate an otherwise sound strategy. TEJ PIT Audited Financial Database directly addresses the most critical challenges with three key strengths: 

  1. Point-in-Time with full version retention and announcement timestamps
  2. Complete sample coverage, including all listed and delisted companies
  3. Standardized indicators, GAAP-to-IFRS adjustment, and multiple frequencies

Through these advantages, TEJ delivers not just data but a complete, verifiable solution that accelerates research and strategy deployment. With the right tools, researchers can move beyond data traps and focus on strategy design and factor discovery. TEJ PIT Audited Financial Database stands as the most reliable cornerstone for building sustainable alpha in today’s highly competitive markets. 

💡Contact us to request a custom Quantitative database demo

Back
Processing...