There is no private or proprietary data behind the Atlas. Every person on the map comes from a free, public U.S. government health survey. The whole pipeline — download, combine, score, check — is scripted, so anyone can run it and get the same map.
The map is built on NHANES, the CDC’s National Health and Nutrition Examination Survey — one of the most trusted pictures of American health. It is already de-identified and released to the public. We use the 2013–2014 cycle, the most recent one that pairs lab work, a week of wearable-measured activity, and symptom questionnaires on the same people.
Our working sample is the 3,919 adults who have complete data across all three kinds of measurement — body, behaviour, and how they feel.
Everything is joined per person on the NHANES respondent ID (SEQN):
| Kind | What it is |
|---|---|
| Who | Demographics — age, sex, race/ethnicity, and the survey weights (file DEMO_H). |
| Body | Standard blood, metabolic, and inflammation labs, plus body measurements (BMX_H). |
| Behaviour | About a week of physical-activity-monitor data from a wrist device. |
| Feelings | Validated questionnaires on mood and self-rated health. |
| Conditions | Self-reported doctor-diagnosed medical conditions (MCQ_H) — used only to check the map, never to build it. |
The raw files are downloaded straight from the CDC: the public NHANES 2013–2014 data page.
The download-to-map pipeline is committed as a small, self-contained project with its scripts, a data manifest, and the validation output. Because the inputs are public and the steps are code, the map is not a claim you have to take on faith — it is something anyone with the scripts can rebuild from scratch.