Factor Research – Tracking Smart Money Footprints via Foreign Institutional Concentration – QFII Part 1

Jun 15 2026

Keywords：

Chip Analysis Factor Investing Factor Library institutional investors market data qfii

Table of Contents

Foreign-Institutional Inflows and Outflows: Deciphering “Smart Money” Footprints in Taiwan Blue-Chip Stocks

In Taiwan stock market, Qualified Foreign Institutional Investors (QFIIs) serve as critical marginal capital for large-cap weight stocks. Taking Taiwan Semiconductor Manufacturing Company (TSMC, 2330.TW) as an example, statistics from the TEJ Broker Trading Database show that from 2023 to May 2026, an average of 57% of TSMC’s daily trading volume originated from foreign institutional brokers. This indicates that the asset allocation direction and trading dynamics of foreign capital exert a significant influence on the pricing of large-cap stocks.

This article focuses on an analytical research of the Foreign-Institutional Trading Concentration (conc_qfii) factor within the TEJ Factor Library. By calculating the ratio of a stock’s daily trading volume from foreign brokers to the total trading volume across all brokers, we measure the extent to which a stock’s trading activity is dominated by foreign institutional main players.

Empirical results indicate that the conc_qfii factor possesses strong predictive power for future expected returns in Taiwan stock market. Under specific size stratification and value-weighted frameworks, its cross-sectional stock-picking performance effectively outperforms the market benchmark (Formosa Return Index, IR0078). This article will systematically examine the theoretical foundation, cross-sectional distribution characteristics, and Information Coefficients (IC) of this factor. By clarifying the critical heterogeneity of foreign institutional chips, we lay a solid theoretical foundation for subsequent quantitative trading strategies.

➡️ Explore the TEJ Factor Library deeper to unlock more quantitative research insights!

Foreign-Institutional Trading Concentration: The Informed Trading Hypothesis and Critical “Size-Conditionality”

The core logic behind the Foreign-Institutional Trading Concentration factor stems from the “information asymmetry” and “informed trading“ hypotheses in financial markets: informed traders leave observable footprints in the cross-sectional distribution of trading volume when trading frequently, and trading volume variance rises as the share of informed trading increases (Lof & Van Bommel, 2023).

In Taiwan stock market, foreign institutions are typical informed traders and hold substantial amounts of capital (commonly referred to as “smart money”). Therefore, the degree of foreign capital dominance in an individual stock’s trading can be viewed as the focus of institutional attention and allocation intensity. Herd behavior (co-directional trading) among institutions accelerates price adjustments (Wermers, 1999), whereas retail investors are typically driven by attention biases to buy stocks, a behavioral bias from which institutional investors are relatively immune (Barber & Odean, 2008). Consequently, focusing on foreign institutional volume rather than the entire market preserves a purer informed signal.

However, this informed signal possesses a critical characteristic in the Taiwan market: Size-conditionality. Due to the inherent trading structure of mega-cap stocks (e.g., TSMC), where foreign institutional brokers naturally account for a structurally higher proportion of volume, the performance of conc_qfii in predicting returns is completely reversed across different market capitalization scales: it exhibits a positive relationship in large-cap stocks but a negative relationship in small-cap stocks. Therefore, the true value of this factor cannot be captured in a broad-market equal-weighted pool; it must be manifested within a framework of size stratification and value weighting.

Factor Definition and Data Sources

Data Sources

The trading data and multi-factor indicators used in this study are sourced from the TEJ Database:

Chip Data: TEJ Broker Trading Details (used to observe inflows/outflows via foreign institutional broker channels).
Trading Data: Stock prices, market capitalization, industry classifications, and market multi-factors.
Benchmark Index: Formosa Return Index (IR0078).

The sample period spans from December 2020 to May 2026, and the research subjects cover all common stocks listed on the Taiwan Stock Exchange (TWSE) and the Taipei Exchange (TPEx).

➡️ TEJ broker ‘s trading details

Variable Construction

On each trading day, the buy amount and sell amount from foreign brokers for stock i are summed up, divided by the total buy and sell amount of all brokers for that stock on the same day, and multiplied by 100:

The factor value ranges between 0 and 100. Higher values indicate that the stock’s trading volume is heavily dominated by foreign brokers. Notably, this formula sums both buys and sells, making it a direction-neutral indicator. It measures the participation weight of foreign main players in the stock’s turnover rather than their net buying direction; thus, large-scale liquidations or selling pressure will equally drive up the concentration score.

Descriptive Statistics: Heavily Right-Skewed

On each trading day, stocks are sorted by their factor values from smallest to largest and divided into ten equal deciles (P1 lowest to P10 highest). As shown in Table 1, the cross-sectional distribution of the factor is heavily right-skewed. The average concentration of deciles P1 to P9 rises gently from 1.65% to 27.86%, with intra-group standard deviations strictly below 5%.

However, for P10 (the highest decile), the mean abruptly jumps to 43.71%, with a standard deviation of 11.63%, and the maximum value touches 100%.

Cross-sectional checks reveal that the market value weight of P10 is highly concentrated in a few mega-cap dragon stocks (the top three stocks alone account for over 50% of the total market capitalization weight in P10), re-emphasizing the profound importance of size stratification.

Table 1: Descriptive Statistics of conc_qfii Deciles

Decile	Minimum (Min)	Maximum (Max)	Mean	Standard Deviation (Std)	Observations (Count)	Percentage (%)
P1	0.00	6.83	1.65	1.10	171,176	10.03
P2	0.36	12.38	4.79	1.64	170,622	10.00
P3	1.08	16.29	7.89	2.03	170,537	9.99
P4	2.00	20.05	10.72	2.30	170,628	10.00
P5	3.19	24.07	13.34	2.54	170,750	10.00
P6	4.74	27.76	15.95	2.81	170,406	9.98
P7	6.69	31.12	18.82	3.11	170,531	9.99
P8	9.17	43.43	22.37	3.54	170,637	10.00
P9	12.57	61.03	27.86	4.53	170,521	9.99
P10	16.63	100.00	43.71	11.63	171,062	10.02

Note: Statistics are aggregated from all daily observations within each decile over the sample period; concentration units are in %; data period: 2020/12–2026/05; Taiwan listed common stocks)

Return Performance and Risk Factor Testing (Fama-French Alpha)

Average Expected Return and Cumulative Return Analysis

To systematically evaluate the factor’s stock-picking efficacy, we analyze the daily average expected return over a future 21-day holding period. We cross-examine four distinct combinations: “Equal Weight vs. Value Weight” across “All Stocks vs. Large-Cap Stocks”.

The empirical results in Table 2 unveil a critical phenomenon: if an “equal-weighted” scheme is applied, the long-short spread return (P10 – P1) is strictly negative for both all-stocks and large-cap stock universes, forming an inverted U-shaped profile (implying that higher concentration leads to lower returns). However, once we switch to “value weighting”, the long-short spread immediately flips to positive: +0.031% for all stocks and reaching a maximum of +0.040% for the large-cap value-weighted combination.

Table 2: Average Daily Expected Return of conc_qfii Deciles — Four Portfolio Matrix (21D)

Decile	Equal Weight – All Stocks	Value Weight – All Stocks	Equal Weight – Large Cap	Value Weight – Large Cap
P1	0.079%	0.069%	0.062%	0.063%
P2	0.062%	0.064%	0.073%	0.083%
P3	0.059%	0.055%	0.075%	0.097%
P4	0.063%	0.063%	0.073%	0.092%
P5	0.066%	0.073%	0.068%	0.085%
P6	0.069%	0.085%	0.063%	0.086%
P7	0.069%	0.088%	0.063%	0.086%
P8	0.063%	0.082%	0.057%	0.080%
P9	0.059%	0.082%	0.056%	0.082%
P10 (Highest)	0.055%	0.100%	0.056%	0.104%
Long-Short Spread (P10 – P1)	−0.024%	+0.031%	−0.006%	+0.040%
Monotonic ρ	−0.42	+0.79	−0.72	+0.21

Note: Monotonic p represents the Spearman rank correlation between the decile ranks P1 ->P10 and their average returns; > 0 indicates returns increase with concentration, < 0 indicates a decrease; data period: 2020/12–2026/05; returns do not account for transaction costs

Figure 1: Average Daily Expected Return of conc_qfii Deciles (Value-Weighted · Large-Cap)

*Note: Corresponding to the “Value Weight – Large Cap” column in Table 2; P10 clearly dominates, turning the long-short spread positive; returns do not account for transaction costs*

If we expand our view to the cumulative return across the entire sample period, the conclusion remains robust: under the framework of value weighting and large-cap constraints, the highest decile group (P10) demonstrates long-term, stable outperformance over the lowest decile group (P1) as well as the benchmark index.

Figure 2: Cumulative Return of conc_qfii ，Value-Weighted X Large-Cap: Highest Decile (P10), Lowest Decile (P1), and Benchmark Index (IR0078)

*Note: Transaction costs are not considered*

Risk Factor Regression (Fama-French Alpha)

To further verify whether the returns of the conc_qfii long-short portfolio are merely compensations for exposure to known systematic risk factors, we run time-series regressions of the long-short portfolios (P10 – P1) against CAPM, the Fama-French three-factor (FF3), and five-factor (FF5) models (applying Newey-West adjusted t-statistics).

Table 3: Risk Factor Regression Alpha Comparison for conc_qfii Long-Short Portfolio (P10−P1)

Model	Equal Weight – All Stocks	Value Weight – All Stocks	Equal Weight – Large Cap	Value Weight – Large Cap
CAPM	−0.978%*** (−5.01)	−0.703%** (−2.12)	−0.732%* (−1.95)	+0.775% (+1.24)
FF3	−0.880%*** (−4.59)	−0.295% (−1.40)	−0.640% (−1.35)	+1.343% (+2.03)**
FF5	−0.935%*** (−5.37)	−0.400% (−1.59)	−1.010%* (−1.94)	+0.886% (+1.40)

Note: alpha is the monthly regression intercept in %; transaction costs are not considered; numbers in parentheses indicate Newey-West adjusted t-statistics; significance levels: *** p<0.01, p<0.05, * p<0.1; data period: 2020/12–2026/05)

The regression findings (Table 3) mirror the return profiles precisely:

Negative Bias Under Equal Weighting: The long-short Alpha for “Equal Weight – All Stocks” is intensely and significantly negative across all models (FF5 alpha = -0.935%). This reflects that under an equal-weighted blueprint, the right-tailed factor is dragged into reverse by a few mega-caps and volatile small-cap stocks.
Premium in Value-Weighted Large-Caps: Conversely, only the “Value Weight – Large Cap” configuration yields strictly positive Alphas across all three models, with the FF3 model capturing statistical significance at the 5% level (alpha = +1.343%, t = 2.03).
Elimination of Size and Value Camouflage: Crucially, after controlling for Size and Value factors (moving from CAPM to FF3), the long-short alpha increases rather than shrinks (from +0.775% to +1.343%). If this premium were merely an implicit proxy for size or value risk, controlling for them should absorb and diminish the intercept. Its expansion successfully dispels doubts of “size or value factor camouflage,” demonstrating that foreign concentration provides standalone predictive information.

Information Coefficient (IC) Analysis

The Information Coefficient (IC) employs Spearman’s Rank Correlation to calculate the sorting consistency between individual stock factor values and future returns. Operating independently of intra-group weights or extreme outliers, it stands as the cleanest tool for diagnosing raw factor direction.

The IC summary in Table 4 shows that all holding periods possess significantly positive mean ICs, with a 21-day win rate touching 60%. Yet, this unearths a classic quantitative paradox: “If the broad-market IC mean is significantly positive, why is the equal-weighted long-short spread return negative?”

The culprit lies in the weight distortions of extreme deciles combined with small-cap noise interference. While Rank IC establishes the cross-sectional reality that “higher concentration dictates a higher subsequent return rank,” the equal-weighted dollar return is heavily skewed and distorted by idiosyncratic small-cap behaviors.

Once the asset universe is narrowed down to the top 30% of large-cap stocks by market capitalization, the 21-day mean IC doubles from 0.0186 to 0.0403, the risk-adjusted IC (Information Ratio) elevates to 0.3864, and the 21-day win rate reaches 64.48%. This serves as definitive proof that the predictive muscle of this factor is highly concentrated within large-cap equities, while its performance in the total market is simply diluted by opposing signals from small-cap stocks.

Table 4: Statistical Summary of conc_qfii Information Coefficients (IC) — All Stocks vs. Large-Cap Stocks

	All Stocks (1D)				Large-Cap （ the top 30% stocks by market capitalization ）
Metric	1D	5D	10D	21D	1D	5D	10D	21D
IC Mean	0.0187	0.0216	0.0201	0.0186	0.0212	0.0312	0.0371	0.0403
IC Std	0.0896	0.0844	0.0779	0.0744	0.1267	0.1124	0.1066	0.1043
Risk Adjusted IC	0.2089	0.2557	0.2577	0.2496	0.1671	0.2773	0.3477	0.3864
IC > 0 (%)	57.29	59.04	60.15	60.06	55.90	60.89	63.75	64.48
IC > 0.03 (%)	44.10	45.30	44.83	44.28	47.60	50.09	52.95	54.52
IC > 0.05 (%)	35.06	36.72	35.52	33.76	40.31	44.10	45.48	47.32
IC t-value	6.88	8.42	8.48	8.22	5.50	9.13	11.45	12.72
IC p-value	<0.001***	<0.001***	<0.001***	<0.001***	<0.001***	<0.001***	<0.001***	<0.001***
IC Skewness	0.1287	0.1068	0.0239	0.0040	0.0082	0.0658	−0.0995	−0.1253
IC Kurtosis	0.1940	−0.1189	0.0045	−0.1251	−0.1741	−0.0297	−0.1138	−0.1442

Note: IC is the daily cross-sectional Spearman rank correlation between factor value and expected return; Risk-Adjusted IC = IC mean / IC std; sample period: 2020/12–2026/05; significance levels: *** p<0.01, p<0.05, * p<0.1)

Conclusion: Unlocking Large-Cap Chip Secrets — How to Build a Real-World Tactical Strategy?

Through a systematic portfolio sorting methodology and rigorous IC diagnostic testing, we have successfully verified that Foreign-Institutional Trading Concentration (conc_qfii) is an exceptional Alpha source for tracking the “smart money” within Taiwan’s large-cap space. However, this factor is not a silver bullet to be blindly followed; its stark “size-conditionality” mapping proves that it functions as a strong positive predictor among large caps but reverses into a negative one among small caps. This emphasizes that only by adopting a dedicated “large-cap focus” combined with a “value-weighted” allocation blueprint can the true profitable power of this factor be entirely unleashed.

Now that we have rigorously established the factor’s stock-picking efficacy in theory and cross-sectional data, the next logical step is to step onto the high-friction battlefield of active execution. In Part 2 (Strategy and Empirical Backtesting), we will unveil the complete process of translating these insights into actionable trading strategies and dissect their performance against the real world!

➡️ Unlock Part 2: Strategy & Backtesting

Was this viewpoint helpful to you?

Back