Corpus statistics
Live counts across the 21-corpus terminology stack, recomputed at each ISR revalidation from the Postgres source of truth.
28,692 canonical terms
21,123 with prose definitions
28,145 Voyage AI embeddings (1024-dim)
32,501 labels (pref + alt)
11,114 ontology relations
1,718 plain-language supplements
26 declared sources
208 app DB mappings
Per-source breakdown
| Source | Terms | With definitions | Alt labels | Supplements | License |
|---|---|---|---|---|---|
| XBRL US-GAAP Taxonomy (gaap) | 17,182 | 14,760 (86%) | 1,701 | 0 | FAF royalty-free |
| FIBO (fibo) | 3,005 | 2,662 (89%) | 98 | 1,714 | MIT |
| NAICS 2022 (naics) | 2,122 | 0 (0%) | 2,125 | 0 | US public domain |
| Tryton account_us, US-GAAP CoA (tryton) | 1,201 | 0 (0%) | 15 | 0 | GPLv3+ (names + types treated as facts; descriptions paraphrased) |
| U.S. Code, Title 26 (tax) | 1,143 | 1,143 (100%) | 0 | 0 | US public domain |
| schema.org (business) | 1,007 | 933 (93%) | 51 | 4 | CC-BY-SA 3.0 |
| St. Louis Fed FRED Glossary (fred) | 669 | 0 (0%) | 31 | 0 | US public domain |
| Bureau of Labor Statistics Glossary (bls) | 580 | 580 (100%) | 41 | 0 | US public domain |
| Skiptrace vendor code tables (intentcore) | 473 | 18 (4%) | 4 | 0 | Vendor data dictionary; SKOS modeling MIT |
| Wikidata (wikidata) | 347 | 296 (85%) | 295 | 0 | CC0 1.0 |
| LKIF-Core (legal) | 209 | 199 (95%) | 0 | 0 | CC-BY 4.0 |
| Bureau of Economic Analysis Glossary (bea) | 200 | 200 (100%) | 0 | 0 | US public domain |
| Lexicon extension (ext) | 122 | 94 (77%) | 197 | 0 | MIT |
| Plaid PFC Taxonomy (plaid) | 120 | 104 (87%) | 239 | 0 | Published spec |
| GnuCash US chart templates (gnucash) | 100 | 68 (68%) | 279 | 0 | GPLv2+ (names treated as facts; definitions paraphrased) |
| ERPNext standard charts (erpnext) | 95 | 0 (0%) | 200 | 0 | GPLv3 (paraphrased) |
| W3C PROV-O (prov) | 49 | 22 (45%) | 14 | 0 | W3C Document License |
| BLS Standard Occupational Classification 2018 (soc) | 23 | 0 (0%) | 0 | 0 | US public domain (17 USC 105) |
| IRS Publications (irs) | 21 | 21 (100%) | 0 | 0 | US public domain |
| OFX 2.x Specification (ofx) | 17 | 17 (100%) | 30 | 0 | Published spec |
| OMB Statistical Policy Directive 15 (omb) | 7 | 6 (86%) | 0 | 0 | US public domain (17 USC 105) |
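The percentages in the With definitions column are consistent with dividing the definition count by the term count and rounding to the nearest whole percent. The page does not state the rounding rule, so the sketch below assumes round-half-up; `coverage_pct` is a hypothetical helper name, not something taken from the site's code.

```python
def coverage_pct(with_definitions: int, terms: int) -> int:
    """Share of terms that carry a prose definition, as a whole percent.

    Assumption: round half up. Integer arithmetic avoids float rounding quirks.
    """
    if terms == 0:
        return 0
    return (200 * with_definitions + terms) // (2 * terms)


# (with definitions, terms) pairs copied from the per-source table.
rows = {
    "gaap": (14760, 17182),
    "fibo": (2662, 3005),
    "prov": (22, 49),
    "omb": (6, 7),
}

for key, (defs, terms) in rows.items():
    print(f"{key}: {coverage_pct(defs, terms)}%")
```

For the four rows shown, this reproduces the published figures (86%, 89%, 45%, 86%).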
Documented but not ingested
Relation-predicate breakdown
| Predicate | Edges |
|---|---|
| rdfs:subClassOf | 10,533 |
| skos:closeMatch | 209 |
| skos:broader | 172 |
| owl:equivalentClass | 163 |
| skos:broadMatch | 27 |
| skos:exactMatch | 10 |
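As a quick consistency check, the six predicate counts add up to the 11,114 ontology relations reported in the headline figures. The snippet below only re-adds the numbers shown in the table; the dictionary name is illustrative.

```python
# Edge counts copied from the relation-predicate table.
predicate_edges = {
    "rdfs:subClassOf": 10533,
    "skos:closeMatch": 209,
    "skos:broader": 172,
    "owl:equivalentClass": 163,
    "skos:broadMatch": 27,
    "skos:exactMatch": 10,
}

total = sum(predicate_edges.values())
subclass_share = predicate_edges["rdfs:subClassOf"] / total

print(total)                    # 11114
print(f"{subclass_share:.1%}")  # share of edges that are rdfs:subClassOf
```

Nearly all edges are rdfs:subClassOf; the SKOS and OWL mapping predicates together account for only a few hundred.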
Most-referenced terms
- Countries (Countries): 243 inbound edges
- registered security (RegisteredSecurity): 221 inbound edges
- bank (Q22687): 159 inbound edges
- preferred share with fixed rate dividend (PreferredShareWithFixedRateDividend): 112 inbound edges
- extendable preferred share (ExtendablePreferredShare): 84 inbound edges
- exchangeable preferred share (ExchangeablePreferredShare): 84 inbound edges
- CreativeWork (CreativeWork): 74 inbound edges
- non-cumulative preferred share (NonCumulativePreferredShare): 72 inbound edges
- savings bank (Q157963): 68 inbound edges
- Intangible (Intangible): 63 inbound edges
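
A ranking by inbound edges can be sketched as counting how often each term appears as the object of a relation. This is a minimal illustration, not the site's actual query: it assumes edges are available as (subject, predicate, object) triples and that every predicate counts toward the total, neither of which the page confirms. The `ex:` identifiers and the `most_referenced` function are made up for the example.

```python
from collections import Counter


def most_referenced(edges, top_n=10):
    """Return (term, inbound_count) pairs, highest count first.

    Assumption: an inbound edge is any triple whose object is the term.
    """
    inbound = Counter(obj for _subj, _pred, obj in edges)
    return inbound.most_common(top_n)


# Hypothetical sample triples, not taken from the corpus.
edges = [
    ("ex:SavingsBank", "rdfs:subClassOf", "ex:Bank"),
    ("ex:CommercialBank", "rdfs:subClassOf", "ex:Bank"),
    ("ex:CreditUnion", "skos:broader", "ex:Bank"),
    ("ex:Bank", "rdfs:subClassOf", "ex:FinancialInstitution"),
    ("ex:PreferredShare", "rdfs:subClassOf", "ex:Share"),
    ("ex:CommonShare", "rdfs:subClassOf", "ex:Share"),
]

print(most_referenced(edges, top_n=2))
# [('ex:Bank', 3), ('ex:Share', 2)]
```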