DATA
Full dataset — CSV download
61 markets × 22 fields per market, plus labor economics, historical time series, and per-market history. Everything behind the dashboard, exactly as it drives the calculations. Each CSV includes a citation header with sources and generation date.
Data vintage: FY 2024 / FY 2025 · Next refresh: Q3 2026 (mid-FY26 10-Ks)
OPEN-SOURCE PROJECT — HELP US MAKE IT BETTER
Help us make this data canonical
This dataset will only be as good as the people who scrutinize it. We compile from 10-K filings, Economic Census, government datasets, and industry trackers — but every figure has uncertainty, and some markets rely on private-firm estimates we'd love to ground in better sources. Find an error? Disagree with a methodology choice? Want us to track a new market? Tell us.
Every correction or suggestion is logged publicly on GitHub. We respond, we credit contributors, and we update the data with sourcing.
Don't have a GitHub account? Email aggregate@logosfund.com and we'll log your feedback. The CSV files above are released under a permissive license — fork the repo, propose changes, send pull requests. Citations and attributions are non-negotiable.
Downloads
↓
markets.csv
Full 62-market concentration dataset: category, denominator, geo, market size, leaders, S1/CR3/HHI, NAICS, primary + validation sources, calculation note. Citation header included.
61 rows · 23 cols
↓
labor.csv
Labor economics overlay: leader rev/employee, SG&A %, industry-average rev/employee, AI-compressibility score (1-5). Sourced from SEC 10-K filings.
61 rows · 13 cols
↓
sga.csv
Top-3 SG&A research per market: leader, p2, p3 with name, SG&A %, revenue $B, fiscal year, parent-level flag, source citation, source URL. Top-3 revenue-weighted and simple-average SG&A% computed per market. Coverage 0-3 indicating data completeness.
61 markets · 32 cols
↓
historical.csv
Composite category-level CR3 and S1 from 1960–2035 (2030+ projected). 13 datapoints across all 5 categories (incl. 3 manufacturing sub-types). Citation header included.
13 rows · 19 cols
↓
market-timeseries.csv
Per-market historical S1 + CR3 in long format. 25 deep-dive markets with year-by-year history, includes leader and contextual note per data point.
129 rows · 8 cols
↓
markets.json
Legacy JSON format — matches the original v2 dashboard dataset structure. Simpler schema, fewer fields. Use markets.csv for the full record.
JSON
Schema
id slug, e.g. "internet-search"
cat category (Internet / Software / Cognitive / Physical / Mfg)
sub manufacturing sub-type or "—"
mkt market name
denom denominator label (e.g. "Query Share")
denom_detail full denominator explanation
geo "US" or "Global"
year data vintage
size_b market size in $B (0 = non-revenue)
size_source source of the size figure
leader, p2, p3 top 3 players
s1 leader share (%)
cr3 top 3 combined (%)
hhi Herfindahl-Hirschman Index
firms approximate number of firms
naics NAICS code (if applicable)
primary_source primary source with quality tier
validation array of validation sources with tiers
calc_note 2-3 sentences explaining the calculation
License & use
The compiled dataset is released for non-commercial research and journalism use. The underlying sources (Economic Census, SEC filings, StatCounter, Synergy Research quarterly summaries, SIPRI, USGS, EIA, trade association data) have their own licenses — consult those sources directly for commercial redistribution rights. Paywalled trackers (IDC, Gartner, IBISWorld) are cited but not reproduced.
Source Quality Hierarchy
Every market has a primary source plus 1–3 validation sources. Revenue numerators come from SEC 10-K filings; denominators validated against the Economic Census where available.
TIER 1Authoritative anchorsGovernment data + SEC filings
US Economic Census (data.census.gov) · SEC EDGAR / Company 10-Ks · BEA Industry Value-Added (FRED) · BLS QCEW · USGS Mineral Commodity Summaries · EIA Refinery Capacity Report · FDA device listings
TIER 2Leading commercial trackersIndustry-standard paid trackers
IDC Semiannual Software Tracker · Gartner Market Share & Magic Quadrant · IBISWorld NAICS reports · StatCounter Global Stats · eMarketer / Insider Intelligence · Synergy Research Group
TIER 3Category specialistsBest-in-class niche authorities
SIPRI (aerospace & defense) · IQVIA (pharma) · TrendForce (semiconductors) · CIMdata (CAD/PLM) · Nilson Report (payments) · Am Law 100 (legal) · ENR Top 500 (A&E) · AM Best (insurance) · SIA (staffing) · Ad Age (agencies) · Barron's / Cerulli (wealth) · Evaluate MedTech · Nielsen Gauge (streaming) · Gridwise (ride-hail) · Bloomberg Second Measure (delivery) · RC Top 100 (roofing) · SDM Top 100 (fire/safety) · Big 4 annual reports · ALM Intelligence (consulting)
Data vintage: Q2 2025 research compilation. Labels updated 2026-Q2. Tier 1 sources refreshed from FY2025 SEC filings where available. For full methodology see the Methodology page.