Field notes
2026 · Field notesAbout 2 min read
Reading stats and predictions: methodology beats vibes
How to ask what a metric means, what window it covers, and what it cannot say about the future.
Statistics are not destiny. They are summaries of what happened under some measurement window. When vendors promise “AI picks” or “predicted outcomes,” ask what labels trained the model and how drift is monitored. When you read public dashboards, ask whether they are sample-biased or seasonally biased. Correlation is not causation, and a hot streak is not a guarantee.
Privacy matters too. Aggregated reporting is often safer for public dashboards; identifiable data should stay behind authentication and role-based access. When you integrate analytics across tools, separate consent: interest in a topic is not the same as marketing email permission unless the user opts in with clear copy.
Practical checks
Write down the question you actually need answered. “Who won last night?” is different from “who will win next season?” The first is historical; the second is predictive. Use the right tool for each. If you cannot explain the model in a paragraph, you are not ready to bet a process on it.
Communicating uncertainty
When you publish analysis, include confidence ranges or caveats where appropriate. Audiences forgive mistakes less when you sounded absolute. Transparency about uncertainty is not weakness; it is respect for the audience’s intelligence.
Sampling and survivorship
Survivorship bias hides failures. If you only study winners, you learn the wrong lessons. Historical performance of a strategy or model may include periods that no longer apply. Regime changes—rule changes, new competitors, or shifting consumer behavior—can invalidate older patterns.
Sample size matters. A streak of five events is not the same as five hundred. Confidence intervals widen with sparse data. When someone shows a chart without defining the population, ask what is missing.
Replication is the heart of science and engineering. If a claim cannot be reproduced with the same inputs and methodology, treat it as hypothesis. If methodology is proprietary, treat outputs as marketing until proven otherwise.
Ethical use of data includes consent and minimization. Collect what you need, retain what you must, and delete what you can. Your audience may not read privacy policies, but regulators and partners will.