Every methodology in this library rests on data, and a reader is entitled to know where it comes from. The rule that governs it is one line: every quantitative claim about the world carries its provenance. Either it is sourced, backed by a named public reference a reader can check, or it is curated interpretation, a Robotnik estimate marked plainly as such with its basis named. There is no third category and there are no unattributed numbers. Which of the two a figure is matters as much as the figure, so it is stated rather than left for the reader to guess.
Sources and Provenance: How Robotnik Grounds Its Claims
Every quantitative claim Robotnik makes about the world is either sourced to a named public reference or marked as its own curated interpretation. This is where those sources are named, and where that rule is set out.
The two kinds of claim
A sourced claim rests on a named public source, cited so it can be followed back. A concentration figure by mining stage, a trade flow, a company's segment revenue: where a credible public reference carries it, the claim is sourced to that reference.
Curated interpretation is Robotnik's own reading, and it is never dressed as a sourced figure. A demand-share band, a criticality tier, a weighting judgment: these are analytical choices, anchored against public research (the USGS summaries, the IEA data, the US Department of Energy's Critical Materials Assessment, and material-specific literature) but not lifted from a single citable number. They are marked as curated, with the research they rest on named, so that a reader knows exactly what kind of claim is being made.
Keeping the two apart is the discipline. A curated estimate presented as though it were sourced would borrow a certainty it has not earned; a sourced figure buried among estimates would lose the weight it has. The line between them is drawn on the page, so that a reader sees exactly what is grounded and what is Robotnik's own judgment.
The sources
Where a claim is sourced, it draws on one of a small set of public references, each used for what it is authoritative on.
| Source | What it grounds | Terms |
|---|---|---|
| USGS Mineral Commodity Summaries | Mineral production and processing concentration, by stage | Public domain |
| UN Comtrade, with the WTO–ADB Trade in Critical Minerals dataset | Bilateral trade-flow and dependency evidence | Per their terms |
| IEA critical-minerals data and topic reports | Clean-energy-technology mineral demand and supply outlooks | CC BY 4.0, attributed |
| GLEIF | Entity identity and corporate-ownership normalisation | CC0 |
| OFAC SDN list and the US BIS Entity List, via the trade.gov Consolidated Screening List | Export-control and policy-exposure context | Public domain |
| SEC filings, via EDGAR | Company financials: revenue composition and segment shares | Public domain |
| US International Trade Commission | Trade-policy price and tariff-impact evidence | Public domain |
| White House 100-Day Supply-Chain Review (EO 14017, 2021) | Government supply-chain assessment figures, e.g. advanced-photoresist concentration | Public domain |
Each is used only within its scope. The IEA data covers the energy-transition minerals and is not applied to the semiconductor stack; SEC filings ground company financials and not minerals, trade, or policy; the screening lists ground policy exposure and nothing else. A source is not stretched beyond what it actually measures.
Drawing the line on licences
A public source is only usable if its licence permits the use. Robotnik draws only on references whose terms allow it, and attributes each as those terms require: the IEA data under CC BY 4.0, the rest public domain or CC0, and the two trade datasets per their own terms. A source whose data licence forbids commercial use is not cited, however useful its numbers might be. Getting this right is part of the same discipline as getting the numbers right.
Where open data runs out
Some figures are simply not available in open, structured data, and the honest response is to say so rather than to invent a number that looks sourced.
The concentration of advanced-packaging capacity, and of the specialist substrates that feed it, is the clearest case: the figures are either held behind analyst paywalls or asserted in prose without a citation. High-purity quartz is the documented example. The USGS publishes it and names its use in the fused-quartz crucibles that grow the silicon wafers semiconductor manufacturing depends on, but reports world mine production as not available; its US output figures are rounded to the nearest hundred thousand tonnes to avoid disclosing company data, and the trade is mixed with other sand and quartz in the customs codes, so no reliable figure can be isolated. The concentration metric is, in other words, structurally unobtainable from open data.
Where that is the case, Robotnik does not manufacture a sourced-looking figure. It marks the point as curated interpretation, names what the reading rests on, and, where the absence is itself on the public record, cites that absence. A documented gap is evidence too.
The verification-pending mark
Some curated estimates carry a further mark: verification pending. These are figures Robotnik tracks and uses but has not yet confirmed to its own standard, and they are never presented as sourced or as settled. The estimates at the chokepoints, where open data runs out, are the main case, and the reference pages that touch them name them as tracked rather than asserted.
What needs a source, and what does not
Not every number is a claim about the world. The base values, caps, weights, and constituent counts inside an index are parameters of the method: set by Robotnik, and documented in that index's own methodology. They describe how the index is built, not a fact about the outside world, so they carry no external source. The methodology page is their provenance, and that is the right place for them.
This is why the distinction is worth drawing carefully. An index level of 1,000 at a base date is not a measurement that could be right or wrong against the world; a statement that one country refines most of the world's supply of a given metal is. The first is documented in the method; the second is sourced or marked curated. Treating them the same would blur exactly the thing this rule exists to keep clear.
How it fits the library
This is the discipline behind the "Sources and provenance" note that the sourced pieces carry, and behind every figure marked as curated interpretation across the library. The thesis, the control-points methodology, and the commodities index carry the most external figures and apply the rule most visibly; the other pieces apply it too, in proportion to the claims they make.
It also explains a pattern a reader will notice. Some methodology pages carry a sources section and some do not. A page whose numbers are all construction constants, or a framework that is Robotnik's own analytical scheme, makes no external claim to source, so it carries no sources section. That is not an omission. It is the same rule working: provenance is attached where there is a claim about the world to attach it to, and nowhere else.
Sources and provenance
This piece follows the rule it sets out. Its one sourced claim, that the USGS reports high-purity quartz world mine production as not available, is drawn from the US Geological Survey, Mineral Commodity Summaries, February 2026, "Quartz (High-Purity and Industrial Cultured Crystal)" (public domain). The IEA data named in the registry is used under CC BY 4.0, attributed to the IEA. The other references are named with their terms in the table above; describing a source is not itself a claim that needs its own citation. Everything else here is a description of Robotnik's method, which is its own provenance.
Frequently asked questions
What is Robotnik's provenance rule?
Every quantitative claim about the world is either sourced to a named public reference that a reader can check, or marked as Robotnik's own curated interpretation with its basis named. There is no third category and there are no unattributed numbers.
What is the difference between a sourced claim and curated interpretation?
A sourced claim rests on a named public source and is cited to it. Curated interpretation is Robotnik's own analytical reading, anchored against public research but not lifted from a single citable figure, and it is marked as such rather than presented as though it were sourced.
What public sources does Robotnik use?
A small set, each within its scope: the USGS Mineral Commodity Summaries for mineral concentration, UN Comtrade with the WTO–ADB critical-minerals dataset for trade flows, the IEA Critical Minerals Dataset for energy-transition demand and supply, GLEIF for entity identity, the OFAC and BIS lists via the trade.gov screening list for policy exposure, SEC filings via EDGAR for company financials, and the US International Trade Commission for trade-policy price evidence. Each is used only for what it authoritatively measures.
Why are some figures curated rather than sourced?
Because some numbers are not available in open, structured data at all. The concentration of advanced-packaging capacity is one example; high-purity quartz production, which the USGS itself reports as not available, is another. Where a figure cannot be sourced, Robotnik marks it as curated interpretation rather than inventing one that looks sourced.
Do the numbers inside an index need a source?
No. An index's base value, caps, weights, and constituent counts are parameters of the method, not claims about the world. They are set by Robotnik and documented in that index's own methodology, which is their provenance; they carry no external source.
Why do some methodology pages have no sources section?
Because they make no external claim to source. A page whose numbers are all construction constants, or a framework that is Robotnik's own scheme, has nothing about the outside world to cite. The absence of a sources section is the rule working, not a gap in it.