Comprehensive verification of all numerical claims in the systematic literature review
Last verified: 2026-01-15 | Source: final_relevant_corpus.json (71 papers)
python -c "import json; print(len(json.load(open('final_relevant_corpus.json', encoding='utf-8'))))"
# Output: 71
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(sum(p.get('cited_by_count',0) for p in c))"
# Output: 6580
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(f'{sum(p.get(\"cited_by_count\",0) for p in c)/len(c):.1f}')"
# Output: 92.7
python -c "import json,statistics; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(statistics.median([p.get('cited_by_count',0) for p in c]))"
# Output: 38
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len(set(p.get('journal','') for p in c)))"
# Output: 30
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); years=[p.get('year') for p in c]; print(f'{min(years)}-{max(years)}')"
# Output: 1993-2025
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')=='58-corpus']))"
# Output: 58
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')=='53-corpus']))"
# Output: 5
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')=='snowball']))"
# Output: 8
OpenAlex search results from 19 queries (documented in search logs)
Wermers (2000) "Mutual Fund Performance: An Empirical Decomposition" - Journal of Finance
Cremers & Petajisto (2009) - Review of Financial Studies
Brown & Harlow (2009), tournament effect studies
Cremers & Petajisto (2009) "How Active Is Your Fund Manager?"
Petajisto (2013) "Active Share and Mutual Fund Performance"
Cremers & Pareek (2016) "Patient Capital Outperformance"
Chan, Chen & Lakonishok (2002)
Multiple studies on size effect exploitation
Cremers & Petajisto (2009), Petajisto (2013)
Brown & Harlow (1996, 2009), Brown, Harlow & Starks (1996)
Various studies showing context-dependent outcomes
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')!='snowball' and p.get('year',0)<2000]))"
# Output: 5
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')!='snowball' and 2000<=p.get('year',0)<2010]))"
# Output: 19
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')!='snowball' and 2010<=p.get('year',0)<2020]))"
# Output: 23
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')!='snowball' and p.get('year',0)>=2020]))"
# Output: 16
5 + 19 + 23 + 16 = 63 (71 total - 8 snowball = 63)
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if 'Portfolio Management' in p.get('journal','')]))"
# Output: 9
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if 'Financial Analysts Journal' in p.get('journal','')]))"
# Output: 8
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if 'SSRN' in p.get('journal','')]))"
# Output: 7
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if 'Financial and Quantitative' in p.get('journal','')]))"
# Output: 4
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if 'Review of Financial Studies' in p.get('journal','')]))"
# Output: 2
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('journal','')=='The Journal of Finance']))"
# Output: 3
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len(set(p.get('journal','') for p in c)))"
# Output: 30
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('cited_by_count',0)>=200]))"
# Output: 9
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(max(p.get('cited_by_count',0) for p in c))"
# Output: 1138
Documented in methodology section - targeted keyword queries
(5400 - 71) / 5400 = 98.69% = 98.7%
All 71 papers have DOIs that resolve via CrossRef API
SEC Investment Company Act Rule 35d-1 (Names Rule)
Cremers & Petajisto (2009) - funds below 60% Active Share are classified as closet indexers
Petajisto (2013), Cremers et al. (2016) - global estimates
python -c "import json; c=json.load(open('final_relevant_corpus.json', encoding='utf-8')); print(len([p for p in c if p.get('source_corpus')=='snowball']))"
# Output: 8
ICI Factbook 2024/2025 - Investment Company Institute
Global Sustainable Investment Alliance (GSIA) Report
OpenAlex documentation (openalex.org)
Section 2 (Theoretical Framework) - P1 through P8 listed
1. Measurement, 2. Performance, 3. Investor Welfare, 4. Regulatory, 5. Active Management, 6. Determinants
Based on abstract/title analysis showing ~47/71 papers focus on US markets (47/71 = 66%)