Repository to hold the data and materials for the Wonderful Wednesday webinar series https://www.psiweb.org/sigs-special-interest-groups/visualisation/welcome-to-wonderful-wednesdays
Clinical trial fraud involving fabricated patients is a well-documented problem in research integrity. Statistical methods, including data visualisation, can be a first line of detection, flagging sites for deeper investigation. A simulated dataset is provided for a randomised, controlled multi-centre study studying an active treatment versus placebo in blood pressure reduction. One site contains data that is fraudulent.
Fabricated data often has the following properties:
A standard ADVS data format is provided, including the following key variables:
| Variable Name | Variable Label |
|---|---|
| USUBJID | Subject ID |
| SITEID | Site ID (important for identification) |
| TRT01P(N) | Treatment group |
| ADT | Visit date |
| AVISIT(N) | Visit ID |
| PARAMCD | DIABP or SYSBP (Blood pressure) |
| AVAL | Result (measured value) |
| BASE | Baseline value |
| CHG | Change from Baseline |
| PCHG | % Change from Baseline |
``` {r, eval=FALSE}
ADVS <- read_csv(“https://raw.githubusercontent.com/VIS-SIG/Wonderful-Wednesdays/master/data/2026/2026-05-13/ADVS.csv”)
```