Identifying and visualising inequities of access to clinical trials: a proof-of-concept using public transportation data
Clinical trials (CTs) are essential to the NHS in Scotland. They permit the patient population early access to potentially improved methods of prevention, diagnosis and treatment; inform strategic/logistic decision-making within the NHS with regards to healthcare delivery and may even lead to improved standards of care in participating hospitals more generally. The Scottish Government’s Health and Social Care Delivery Plan recognises the importance of research in high-performance health systems, with increased access to CTs an explicit aim.
It is widely recognised that participation in CTs is not uniform across gender, ethnicity or socioeconomic status. As such, health inequalities that may already exist between patient groups may be further exacerbated by disparities in CT access. Public Health Scotland (PHS) was launched in April 2020 specifically in order to tackle the significant and persistent health inequalities in Scotland. Therefore, any action that PHS takes in order to help meet the objective of increasing access to CTs should actively consider equity of access.
One barrier to healthcare access, and by extension CT participation, is transportation. Transport is critical to the timely delivery of appropriate diagnosis or treatment of disease and barriers to transportation will inhibit recruitment, retention and/or attendance of CT participants. This project proposes the collection, analysis and visualisation of local transport data in order to understand equity of transportation access to Scottish hospitals. There are three phases to this project:
- Generate synthetic trial datasets, capturing transport and attendance statistics
- Interactive visualisation of these data using R Shiny
- Building statistical models of retention and/or attendance
The overarching aim is to define a workflow for incorporating geospatial information into our CT pipeline to characterise sites of interest and to present this data in an interactive way, using transport data as a proof-of-concept. The longer term intention is that this workflow would be used as standard in our pipeline to evaluate whether geospatial data (e.g., deprivation indices) impacts on patient recruitment, retention and/or attendance.
The ultimate aim is to use these these data and the resulting modelling techniques to ask the question which patients are at high risk of dropping out? If trials units can predict this with confidence then we may be able to intervene early and maintain participant retention.
Completed as part of the Data Science Accelerator Programme 2020.
If you want to access Scottish public transport information via the Bulk Journey Planner (methodology in 02a_obtain_public_data.R), then you will need to create an account for yourself at Traveline Scotland's Bulk Journey Planner website. Once registered, you will receive an API key that will allow you to query the server.
In this pipeline, my API key is saved as a variable API_KEY
in the file dat/API_KEY.Rdat
,
for example:
API_KEY="randomlistofnumbersandletters"
You can save your API_KEY in the same way (this allows you to keep it private when uploading to Github), or you can include it as a variable in the script where you access the API.