アノマリー研究におけるpハッキング

「Replicating Anomalies」というNBER論文が上がっている（H/T Francis Diebold。ungated版（SSRN））。著者はKewei Hou（オハイオ州立大）、Chen Xue（シンシナティ大）、Lu Zhang（オハイオ州立大）。
以下はその要旨。

The anomalies literature is infested with widespread p-hacking. We replicate the entire anomalies literature in finance and accounting by compiling a largest-to-date data library that contains 447 anomaly variables. With microcaps alleviated via New York Stock Exchange breakpoints and value-weighted returns, 286 anomalies (64%) including 95 out of 102 liquidity variables (93%) are insignificant at the conventional 5% level. Imposing the cutoff t-value of three raises the number of insignificance to 380 (85%). Even for the 161 significant anomalies, their magnitudes are often much lower than originally reported. Out of the 161, the q-factor model leaves 115 alphas insignificant (150 with t < 3). In all, capital markets are more efficient than previously recognized.
（拙訳）
アノマリーの研究はpハッキングが幅広く蔓延している。我々は、447のアノマリー変数を収録した、現時点で最大のデータライブラリを構築し、ファイナンスと会計の全アノマリー研究を再現した。ニューヨーク証券取引所の区切りと時価加重リターンの使用で極小時価総額銘柄の問題を緩和したことにより*1、102の流動性変数のうちの95変数（93%）を含め、286のアノマリー変数（64%）が通常の5%水準で非有意となった。3というt値の閾値を課すことにより、非有意の数は380（85%）にまで増加した。161の有意なアノマリーについても、その大きさは元々の報告値よりも小さくなることが多かった。161のうち、qファクターモデル*2は115のアルファを非有意とした（t<3となったのは150）。全般に、資本市場はこれまで思われていたよりも効率的であった。

*1:ungated版の本文には以下のような記述がある：
Some studies exclude stocks with prices per share lower than $1 or $5. We do not impose such a sample screen. Many studies also equal-weight portfolio returns. We instead use value-weights.
We do so for several reasons. First, value-weights more accurately reflect the wealth effect experienced by investors, as emphasized by Fama (1998). Second, Fama and French (2008) document that microcaps are influential in equal-weighted returns. Microcaps are stocks with the market equity below the 20th percentile of NYSE stocks. Microcaps are on average only 3% of the market value of the NYSE-Amex-NASDAQ universe, but account for about 60% of the total number of stocks....
...With NYSE-Amex-NASDAQ breakpoints, microcaps typically account for more than 60% of the stocks in extreme deciles. These microcaps can greatly inflate the anomalies, especially when combined with equal-weights. In contrast, using NYSE breakpoints assigns a fair number of small and big stocks into extreme deciles, alleviating the impact of microcaps.
（拙訳）
研究の中には1株当たり価格が1ドルないし5ドル以下の株式を除外するものもある。我々はそのようなサンプルのスクリーニングを行わなかった。多くの研究はまた、等加重のポートフォリオリターンを用いる。我々は時価加重を用いた。
そのようにした理由は幾つかある。第一に、ファーマ（1998）（訳注：これ）が強調したように、時価加重は投資家が経験する資産効果をより正確に反映する。第二に、ファーマ＝フレンチ（2008）（訳注：これ）は、等加重リターンで極小時価総額銘柄の影響が大きくなると述べている。極小時価総額銘柄とは、NYSE株式において20番目のパーセンタイルより時価総額が小さい銘柄である。極小時価総額銘柄は平均してNYSE-Amex-NASDAQユニバースの時価総額の3%に過ぎないが、全銘柄数の約60%を占める。・・・
・・・NYSE-Amex-NASDAQの区切りを使うと、極小時価総額銘柄は、10分位の端のポートフォリオにおいて株式の60%以上を占めるのが通例である。こうした極小時価総額銘柄は、特に等加重と組み合わせた場合には、アノマリーを大きく拡大する。対照的に、NYSEの区切りを用いると10分位の端のポートフォリオに大小の株式を適度な数だけ割り当て、極小時価総額銘柄の影響を緩和する。

*2:今回の論文と同じ著者たちが2015年の論文（WP）で打ち出したモデル。cf. 関連日本語ブログ記事、関連日本語論文。