What a credible result would probably show
Without inventing market Sharpe numbers, the prior from the literature is fairly clear.
First, clustered buys should outperform isolated buys on a risk-adjusted basis in most markets with usable data.
Second, clustered sales should be weaker and less consistent, unless the sample is carefully filtered to remove mechanical and compensation-related selling.
Third, the signal should be stronger in smaller, less-covered firms and in markets where information diffuses more slowly.
Fourth, post-disclosure alpha should be lower where filings are faster, cleaner and more widely followed, because the market processes them more efficiently.
None of these are controversial. They are the sort of results that survive because they are tied to economic mechanisms rather than statistical whimsy.
Why some markets may look better than they really are
A high Sharpe in one country can reflect genuine inefficiency. It can also reflect one of four less romantic explanations.
Sparse events
If a market produces few clusters, a handful of successful episodes can dominate the estimate.
Small-cap concentration
Equal-weighted portfolios in small-cap-heavy markets often look brilliant on paper and expensive in reality.
Publication quirks
If filings are timestamped or aggregated in unusual ways, event windows can accidentally capture pre-existing momentum or stale disclosure.
Survivorship and mapping bias
If issuer identifiers are cleaner for surviving firms than delisted ones, backtests can drift upward. Delistings have an unhelpful habit of being relevant precisely when insiders are informative.
How to run the 17-market test without embarrassing yourself
A revisit worthy of the name should be conservative. The original insight is strong enough that it does not need decorative optimisation.
Event definition
Use issuer-level events where at least three distinct insiders file same-direction discretionary trades within 30 calendar days. Distinct means distinct beneficial owners after identity resolution, not three line items from one executive and his family office.
Signal timestamp
Form the signal on the public filing date, using the close of that day or the next tradable close depending on timestamp granularity. This avoids the common sin of trading on information before it was public.
Portfolio construction
Build market-neutral or benchmark-relative long-short portfolios if shorting is feasible, otherwise long-only on clustered buys. Equal-weight and value-weight both deserve reporting. If only one is shown, equal-weight is usually the more academically sensitive and less institutionally realistic choice.
Risk adjustment
Report raw returns, volatility, Sharpe, hit rate and factor-adjusted alpha where local factor models exist. At minimum, compare against market and size exposures. A cluster strategy that is secretly just a small-cap rebound strategy is still interesting, but it should not wear a false moustache.
Robustness checks
Use subperiod splits, pre and post MAR for European markets where relevant, and alternative cluster thresholds such as 2-in-30 and 3-in-60. If the signal survives mild perturbations, confidence rises. If it disappears the moment you change one parameter, you have found a PowerPoint effect.
Capacity and costs
Estimate turnover and implementation costs. The signal may be statistically elegant and economically tiny. This happens more often than quants admit in public.
The literature’s message is robust, but the modern question is decay
The useful debate is no longer whether clustered insider buying contains information. It does. The more interesting question is how much of that information remains harvestable after disclosure reforms, data commoditisation and the spread of event-driven systematic strategies.
Has the edge been arbitraged away?
Possibly in the largest, cleanest markets, at least partly. U.S. insider data is easy to access and heavily mined. That tends to compress the post-filing window. But “compressed” is not the same as “gone”. Signals tied to genuine private information often degrade slowly because they are capacity-limited and episodic.
In Europe and other non-U.S. markets, the answer may differ by country. Where data remains fragmented or publication formats are poor, the informational edge can persist longer, though implementation costs also rise. Alpha often hides in places where CSV files fear to tread.
Why the cross-market view still matters
A 17-market comparison can tell us whether the signal is fundamentally human or mostly institutional.
If clustered buys work almost everywhere, then insider coordination is a broad behavioural fact. If they work only where disclosure is messy, then the edge is more about data processing than about insiders per se. Both findings would be useful. One is a finance result, the other a market structure result. Investors can make money from either, provided they know which business they are in.