We are pleased to announce the release of an updated version of the Simulacrum containing two additional years of diagnostic data (2013–2017 diagnoses) and treatment follow-up to March 2018.

This new version of the Simulacrum contains almost twice as many synthetic patients as the previous version and over 900,000 new tumours. The vital status of each synthetic patient is now simulated up to February 2019.

The data dictionary for this release (version 1.2.0) is the same as that for the previous release. To download the new version, go to the download page.

Measure                                       Simulacrum v1.2.0  Simulacrum v2.1.0
Diagnosis years                               2013-2017          2016-2019
Synthetic patients                            2,200,626          1,871,605
Synthetic tumours                             2,371,281          1,995,570
Synthetic patients with SACT                  366,266            352,372
Synthetic patients with RTDS                  --                 413,169
Synthetic patients with genomic testing data  --                 94,908
Years of SACT data                            2012 onwards       2012-2022
Years of RTDS data                            --                 2012-2022
Years of genomics data                        --                 2016-2019
SACT regimens                                 730,472            781,389
SACT cycles                                   2,442,037          2,741,674
SACT drug administrations                     6,385,828          7,662,030
RTDS episodes                                 --                 656,560
RTDS prescriptions                            --                 657,648
RTDS exposures                                --                 13,201,531
Genomic tests                                 --                 255,728
Non-melanoma skin cancer diagnoses (C44)      607,619            514,517
Breast cancer diagnoses (C50)                 226,406            187,204
Prostate cancer diagnoses (C61)               201,785            179,478
Lung cancer diagnoses (C34)                   169,118            156,927
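
For readers who want to compare the two columns, the relative change between versions can be worked out directly from the figures in the table above. The following is a minimal sketch, not part of the Simulacrum release: the dictionary `counts` and the helper `percent_change` are hypothetical names introduced here for illustration, and only rows with a value in both columns are included. Note that the two versions cover diagnosis windows of different lengths (2013-2017 versus 2016-2019), so these changes are not like-for-like comparisons.

```python
# Counts copied from the table above as (v1.2.0, v2.1.0) pairs.
# Rows marked "--" for v1.2.0 are omitted because no change can be computed.
counts = {
    "Synthetic patients": (2_200_626, 1_871_605),
    "Synthetic tumours": (2_371_281, 1_995_570),
    "Synthetic patients with SACT": (366_266, 352_372),
    "SACT regimens": (730_472, 781_389),
    "SACT cycles": (2_442_037, 2_741_674),
    "SACT drug administrations": (6_385_828, 7_662_030),
}


def percent_change(old: int, new: int) -> float:
    """Return the change from old to new as a percentage of old."""
    return (new - old) / old * 100.0


# Hypothetical helper output: one percentage per table row.
changes = {name: percent_change(old, new) for name, (old, new) in counts.items()}

for name, change in changes.items():
    # Left-align the label and show the change with an explicit sign.
    print(f"{name:<30} {change:+6.1f}%")
```

Running the sketch prints one line per row, with negative values where v2.1.0 holds fewer records than v1.2.0.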