eribul / nh_luxation_infektion

Prediction models for infection and dislocation

nh_luxation_infektion's Introduction

Prediction of Early Periprosthetic Joint Infection after Total Hip Arthroplasty

Prediction models for periprosthetic joint infection (PJI) within 90 days of total hip arthroplasty (THA).

The file structure follows the "ProjectTemplate" structure described here: http://projecttemplate.net/index.html

data/ defines what data to use for the project (no actual patient data are included due to GDPR, as well as Swedish and Danish laws and regulations).
../linkage.R defines the relevant variables from the Swedish registers (requires an active connection to an internal SQL database which is not shared). Note that the different co-morbidity measures (Charlson, Elixhauser and Rx Risk V) have been pre-calculated and are already available from the SQL database.
../categorization.xlsx defines our grouping of co-morbidities based on individual conditions from the Charlson, Elixhauser and Rx Risk V classifications.
munge/ Data munging steps performed on the "raw data sets"
../00-filter.R Applies the inclusion criteria (and produces a flowchart "on the fly").
../01-outcome.R Identifies patients with PJI within 90 days. This is based on relevant ICD-10 and NOMESCO codes recorded at hospital visits during the 90 days after surgery, or on a reoperation due to PJI recorded in SHAR.
../02-survdata.R The filename is misleading, but this is the place for additional variable transformations. BMI is categorized according to the WHO classification, individual diagnoses are grouped, factor levels are translated from Swedish to English, etc.
../03-compositvariabler.R Uses the data/categorization.xlsx file mentioned above to construct the co-morbidity variables (as well as a table to present those groups).
../04-remove_empty_variables.R Identifies variables (including dummy variables) with fewer than 10 observations for either outcome (PJI or no PJI within 90 days). Conditions that are too rare are dropped and not included as potential predictors.
../05-BRLasso.R Performs variable selection, which requires some additional helper functions etc. from the lib folder.
../06-variables_differnet_BRLasso_models.R Extracts the selected variables to be used in the "main" and "reduced" models.
../07-compare_models.R Estimates AUC values and data for ROC curves etc., both for the derived models and for simpler comparisons.
src/ contains scripts to make either figures (exported to the graphs-folder) or tables (not included) for later use in the manuscript.
/reports contains files for the submitted manuscript.

The config/ folder, the .gitignore file and the .Rproj file contain process configurations for ProjectTemplate, Git and RStudio.
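
The rare-variable rule described for munge/04-remove_empty_variables.R can be sketched as below. This is only an illustration on made-up data, not the repository's script: the data frame df, the columns pji, c_common and c_rare, and the helper enough_obs() are hypothetical, and the rule is read here as requiring at least 10 patients with the condition in both outcome groups.

```r
# Hypothetical toy data: `pji` is the outcome, c_* are binary candidate predictors
df <- data.frame(
  pji      = rep(c(TRUE, FALSE), times = c(40, 160)),
  c_common = rep(c(1, 0, 1, 0), times = c(20, 20, 80, 80)),
  c_rare   = c(rep(1, 3), rep(0, 197))
)

# TRUE if the condition is present in at least `min_n` patients with the
# outcome AND in at least `min_n` patients without it
enough_obs <- function(x, outcome, min_n = 10) {
  sum(x == 1 & outcome) >= min_n && sum(x == 1 & !outcome) >= min_n
}

predictors <- setdiff(names(df), "pji")
keep       <- vapply(df[predictors], enough_obs, logical(1), outcome = df$pji)

# Keep the outcome and only the sufficiently common predictors
df_reduced <- df[c("pji", predictors[keep])]
names(df_reduced)
```

In this toy example c_rare occurs in only three patients and is dropped, while c_common is kept.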

nh_luxation_infektion's Issues

Re-run the calculations

There was previously an error in the NOMESCO coding, which meant that too few cases of PJI were identified.
The scripts have now been prepared but need to be re-run.
I have held off in case we want to make several changes at the same time.

Normal weight as baseline

I currently use overweight as baseline, since this seemed relevant from mortality studies etc. It seems less relevant in this case however so I think we should change this back.
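
Switching the reference category is a one-line change in base R. A small sketch, assuming a hypothetical factor bmi_cat (the level names are made up here and need not match those used in the project):

```r
# Hypothetical BMI factor with "overweight" as the current reference level
bmi_cat <- factor(
  c("overweight", "normal weight", "class I obesity", "underweight"),
  levels = c("overweight", "underweight", "normal weight", "class I obesity")
)

# The first level is the reference category in model formulas;
# relevel() moves the chosen level to the front
bmi_cat <- relevel(bmi_cat, ref = "normal weight")
levels(bmi_cat)
```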

female as baseline

Gender: Since all other variables are associated with an increase in risk, I wonder whether we should set female as the reference category. Male then gives a risk increase, instead of the current presentation where female is associated with a risk reduction. Absolutely! Good idea!

Combine with Danish data

Template code from Project 29 that can hopefully be reused.

title: "External validation"
author: "Erik Bulow"
date: '2019-11-13'
output:
  html_document:
    toc: true
    toc_float: true
    df_print: paged
    code_folding: show

knitr::opts_chunk$set(echo = TRUE, message = FALSE, warning = FALSE, cache = TRUE)

Start

Install and attach some useful packages

pkgs <- c("tidyverse", "doParallel", "pROC", "rms", "givitiR")

# Install
pkgsinst <- setdiff(pkgs, rownames(installed.packages()))
if (length(pkgsinst)) install.packages(pkgsinst)

# Attach
purrr::walk(pkgs, ~suppressPackageStartupMessages(library(., character.only = TRUE)))

Load the exported model to validate (the R object previously sent, probably with another path).

# Load the exported model object
load("../cache/fit_export.RData")

Set random seed for reproducibility:

set.seed(123)

Prepare data

Inclusion/exclusion

Those were the inclusions/exclusions from Sweden. It might not be necessary to filter on BMI, hospital and education, however. Those variables are not used in the model.

Additional filter to ages 35-99 years to match the Swedish cohort.

knitr::include_graphics("../graphs/flowchart.png")

Variables

Outcome

The outcome variable is boolean (or numeric), indicating whether the patient died of any cause within 90 days after THA (TRUE/1) or not (FALSE/0). We did not have any censoring in Sweden. I guess we can simply drop cases where status is unknown?

Predictors

The data to evaluate should look like this:

head(fit_export$data)

Baseline variables

P_Gender: Kvinna/Man = Female/Male
P_ASA: level 1-3
P_Age: 35 - 99
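
A minimal sketch of how the baseline variables could be coded to match this description. The input vectors are made up; only the variable names and levels come from the list above.

```r
# Made-up raw values
gender_se <- c("Kvinna", "Man", "Man", "Kvinna")
asa_raw   <- c(1, 3, 2, 1)
age_raw   <- c(67, 35, 99, 80)

# Translate the Swedish factor levels to English; the first level
# ("Female") becomes the reference category in a model
P_Gender <- factor(gender_se, levels = c("Kvinna", "Man"), labels = c("Female", "Male"))
P_ASA    <- factor(asa_raw, levels = 1:3)
P_Age    <- age_raw

# Check that all ages are within the range used for the Swedish cohort
all(P_Age >= 35 & P_Age <= 99)
```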

Comorbidities

Comorbidities are identified by the Elixhauser and Charlson comorbidity indices during one year before surgery. This is done by ICD-10 codes in Sweden but other versions exist. If ICD-10 codes are used, they might be identified by the regular expressions in the table below. Those are based on codes like "C123", hence no punctuation (not "C12.3"). The regular expressions might be modified to allow other patterns. The regular expressions below are just combined from different groups (separated by "|"). They could be rewritten for clarity. If you don't see any regular expressions, click the small arrow in the upper right corner of the table. The grepl function (base R) can be used to identify ICD-10 codes based on the regular expressions.
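
As a small illustration of the grepl approach: the pattern below is hypothetical and only mimics the form of the combined expressions (groups separated by "|"); it is not one of the actual definitions. Punctuation is stripped first so that "E11.9" and "E119" are treated alike.

```r
# Made-up ICD-10 codes, some with punctuation or lower case
icd <- c("E11.9", "E119", "I10", "e110", "C123")

# Normalise: remove dots and use upper case
icd_clean <- toupper(gsub(".", "", icd, fixed = TRUE))

# Hypothetical combined regular expression (two groups separated by "|")
pattern <- "(^E1[01])|(^C12)"

has_condition <- grepl(pattern, icd_clean)
data.frame(icd, icd_clean, has_condition)
```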

load("../cache/categorization.RData")
comorbidities <- 
  categorization %>% 
  map_df(as.character) %>% 
  filter(new %in% names(fit_export$data)[grepl("c_", names(fit_export$data))]) %>% 
  separate(old, letters, sep = "\\|", fill = "right") %>% 
  pivot_longer(all_of(letters), values_drop_na = TRUE) %>% 
  mutate(value = trimws(value)) %>% 
  separate(value, c("index", "name"), extra = "merge") %>% 
  mutate(name = gsub("_", " ", name))
  
CCI <- 
  comorbidities %>% 
  filter(index == "CCI") %>% 
  left_join(coder::charlson_icd10, c(name = "group"))

ECI <- 
  comorbidities %>% 
  filter(index == "ECI") %>% 
  left_join(coder::elix_icd10, c(name = "group"))

comorb_defs <- 
  bind_rows(CCI, ECI) %>% 
  select(new, regex) %>% 
  mutate(regex = sprintf("(%s)", gsub("^", "", regex, fixed = TRUE))) %>% 
  group_by(new) %>% 
  summarise(regex = paste(regex, collapse = "|"))

comorb_defs

Example data

Let's assume we now have some data (I will use the Swedish data just as an example).

y <- fit_export$y
X <- fit_export$data

Validation of model as is

# Tibble with observed and predicted outcome
obspred <- 
  tibble(
    obs  = y, 
    pred = predict(fit_export, X, type = "response")
  )

# ROC curve
ROC <- pROC::roc(obspred, "obs", "pred", direction = "<")

# Estimate CI for AUC based on bootstrapping
# Use parallel processing to speed up the process
doParallel::registerDoParallel()
AUCci <- 
  pROC::ci.auc(
    ROC, 
    method          = "bootstrap", 
    boot.stratified = FALSE, 
    parallel        = TRUE
  )

# Check calibration. Note that devel should actually be "internal" for this example but I use
# "external", since that's what you will use for the UK data. 
calibration <- 
  givitiR::givitiCalibrationBelt(
    obspred$obs, 
    obspred$pred, 
    devel = "external"
  )
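
For readers who want to see what the AUC and its bootstrap interval amount to, here is a base R sketch on simulated data. It is not a replacement for pROC: the helper auc_rank() is hypothetical, the interval is a plain percentile interval from unstratified resampling, and the simulated obs and pred are made up.

```r
# AUC via the rank-sum (Mann-Whitney) formula
auc_rank <- function(obs, pred) {
  n1 <- sum(obs == 1)
  n0 <- sum(obs == 0)
  (sum(rank(pred)[obs == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
}

set.seed(123)
n    <- 500
pred <- runif(n, 0, 0.5)      # made-up predicted probabilities
obs  <- rbinom(n, 1, pred)    # outcomes generated from those probabilities

auc <- auc_rank(obs, pred)

# Unstratified bootstrap: resample patients with replacement
boot <- replicate(1000, {
  i <- sample(n, replace = TRUE)
  auc_rank(obs[i], pred[i])
})
ci <- quantile(boot, c(0.025, 0.975))

auc
ci
```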

Results

For this example we had AUC:

AUCci
plot(ROC)

A calibration belt plot might be illustrated as:

plot(calibration, xlim = c(0, 0.03), ylim = c(0, 0.06))

Re-calibrated intercept

Method 2 from table 1 in Steyerberg 2004.

# Linear predictor Z = a + Xb on the logit scale (not the predicted probability)
Z <- predict(fit_export, X, type = "link")
  
# Refit the intercept using Z = a + Xb from above as offset
fit2 <- glm(y ~ 1, offset = Z, family = binomial())

# Same calibration and validation as above
obspred2     <- tibble(obs  = y, pred = predict(fit2, type = "response"))
ROC2         <- pROC::roc(obspred2, "obs", "pred", direction = "<")
AUCci2       <- pROC::ci.auc(ROC2, method = "bootstrap", boot.stratified = FALSE, parallel = TRUE)
calibration2 <- givitiR::givitiCalibrationBelt(obspred2$obs, obspred2$pred, devel = "external")
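
The idea of re-calibrating only the intercept can be illustrated on simulated data. This is a sketch under stated assumptions, not the project's analysis: the linear predictor lp stands in for the output of an existing model, and the outcome is generated to be more common than that model expects.

```r
set.seed(123)
n  <- 2000
x  <- rnorm(n)
lp <- -3 + 0.8 * x                     # hypothetical linear predictor from an existing model
y  <- rbinom(n, 1, plogis(lp + 0.7))   # outcome is more common in the new setting

# Keep the slope fixed (offset) and re-estimate only the intercept
recal <- glm(y ~ 1, offset = lp, family = binomial())

p_old <- plogis(lp)      # original predictions
p_new <- fitted(recal)   # predictions with updated intercept

coef(recal)
c(observed = mean(y), original = mean(p_old), recalibrated = mean(p_new))
```

After the update, the mean predicted probability equals the observed event rate, which is the purpose of this step.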

Results

AUCci2
plot(ROC2)
plot(calibration2, xlim = c(0, 0.03), ylim = c(0, 0.06))

Re-calibration of intercept and calibration slope

Method 3 from table 1 in Steyerberg 2004.

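# Z is the linear predictor a + Xb on the logit scale
Z            <- predict(fit_export, X, type = "link")
# Logistic re-calibration of both intercept and slope
fit3         <- glm(y ~ 1 + Z, family = binomial())
obspred3     <- tibble(obs  = y, pred = predict(fit3, type = "response"))
ROC3         <- pROC::roc(obspred3, "obs", "pred", direction = "<")
AUCci3       <- pROC::ci.auc(ROC3, method = "bootstrap", boot.stratified = FALSE, parallel = TRUE)
calibration3 <- givitiR::givitiCalibrationBelt(obspred3$obs, obspred3$pred, devel = "external")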
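
A simulated illustration of re-calibrating both intercept and slope. The numbers are made up: lp plays the role of a linear predictor whose effects are too extreme for the new data, so the estimated calibration slope should end up well below 1.

```r
set.seed(42)
n  <- 5000
x  <- rnorm(n)
lp <- -3 + 1.0 * x                            # hypothetical, too extreme linear predictor
y  <- rbinom(n, 1, plogis(-2.5 + 0.5 * x))    # how outcomes actually arise in the new data

# Re-estimate intercept and calibration slope on the logit scale
recal <- glm(y ~ lp, family = binomial())
p_new <- fitted(recal)

# By construction logit(p) = -1 + 0.5 * lp, so the slope should be near 0.5
coef(recal)
```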

Results

AUCci3
plot(ROC3)
plot(calibration3, xlim = c(0, 0.03), ylim = c(0, 0.06))

Export data to Sweden

ROC

If it is OK to export the coordinates for the ROC plots, I recommend this code to extract only the minimal data needed:


roc_plot_coords <- 
  data.frame(
    specificities = ROC$specificities, 
    sensitivities = ROC$sensitivities
  )

roc3_plot_coords <- 
  data.frame(
    specificities = ROC3$specificities, 
    sensitivities = ROC3$sensitivities
  )

AUC with CI

The text output from AUCci and AUCci3 should be enough. Hence, the same character string that gets printed above (but now stored in an object).

AUCci_print  <- capture.output(AUCci)
AUCci3_print <- capture.output(AUCci3)

Export objects

Save objects above to file export.RData (in the current working directory).

save(
  roc_plot_coords,
  roc3_plot_coords,
  AUCci_print,
  AUCci3_print,
  file = "export.RData"
)

Calibration plots

Help function

This is a simple helper function to make a clean calibration belt plot and save it as TIFF (in the current working directory):

makeplot <- function(x, file_name = deparse(substitute(x))) {
  tiff(
    paste0(file_name, ".tiff"), 
    1024, 1024, pointsize = 36, 
    compression = "lzw"
  )
  
  tcks <- seq(.0, .1, .01)
  
  plot(
    x,
    xlim             = c(0, .06),
    ylim             = c(0, .08),
    xlab             = "Predicted probabilities [%]",
    ylab             = "Observed  probabilities [%]",
    main             = NULL,
    table            = FALSE,
    polynomialString = FALSE,
    pvalueString     = FALSE,
    nString          = FALSE,
    mar              = c(5, 4, 0, 0) + 0.1,
    xaxt             = "n",
    yaxt             = "n"
  )
  abline(v = .03, lty = "dashed", col = "darkgreen", lwd = 3)
  axis(1, at = tcks, lab = sprintf("%.0f", tcks * 100), las = TRUE)
  axis(2, at = tcks, lab = sprintf("%.0f", tcks * 100), las = TRUE)
  
  dev.off()
}

Make and save figures

makeplot(calibration)
makeplot(calibration3)

ROC plots

If it is not possible to export data for the ROC plot, here is some code to make a figure and save it as TIFF (in the current working directory) instead:


roc_plot_coords %>% 
  ggplot(aes(1 - specificities, sensitivities)) +
  geom_path(size = 2) +
  geom_abline(intercept = 0, slope = 1, color = "grey", linetype = 2) +
  theme_minimal() +
  theme(
    legend.position = c(1, 0),
    legend.justification = c(1, 0),
    legend.title = element_blank()
  )

ggsave(
  "roc.tiff", 
  height = 10, 
  width = 10, 
  unit = "cm", 
  dpi = 900, 
  compression = "lzw"
)

Write cover letter

mention the mortality article?
Link to the web calculator.
Mention NARA

Comments from NH

derivation with socioeconomic factors: "did we have income also?"
defs of comorb and outcomes: "refer to ICD and NOMESCO codes, additional table suggested"
Clinical usage: "here it would be nice to illustrate two extreme examples with low and high risk, respectively."

Include NH's draft text for the article

Predict_PJI.docx

Generate an Rmarkdown file and sync it with my steps!

Submit to BMJ

https://mc.manuscriptcentral.com/bmj
log in with vgregion.se

Unsure about the combination of ICD/KVÅ codes.

Email to NH/OR:
See https://github.com/eribul/NH_luxation_infektion/blob/master/diagnostics/fels%C3%B6k%20infektionskoder.R

Hi!

I see that Viktor's selection seems somewhat stricter (prescription of antibiotics for at least four weeks, and thereafter identified via set criteria based on a review of medical records).
However, I started double-checking the codes we base the infections on. 
Below is an approximate table of the ICD-10 codes that identify cases which were not also reoperated and/or for which there is no accompanying KVÅ code.
The same patient can have more than one code, so the table is not perfect. But it shows roughly what the picture looks like for the cases identified only via ICD codes in the NPR.
(p = percentage of the listed codes, not of all cases)

   code      n     p desc                                                                                     
 1 T814    116    42 Infection following surgical and medical procedures, not elsewhere classified            
 2 T845F    89    32 Infection and inflammatory reaction due to internal joint prosthesis in hip joint/thigh  
 3 T845     35    13 Infection and inflammatory reaction due to internal joint prosthesis                     
 4 M009      7     3 Pyogenic arthritis, unspecified                                                          
 5 M009F     6     2 Pyogenic arthritis NOS in hip joint                                                      
 6 M000F     5     2 Septic arthritis (staphylococcal) in hip joint                                           
 7 M000      3     1 Staphylococcal arthritis and polyarthritis                                               
 8 M002F     3     1 Septic arthritis (streptococcal) in hip joint                                            
 9 T847F     3     1 Infection and inflammatory reaction due to other internal orthopaedic prostheses, implan~
10 M008      2     1 Arthritis and polyarthritis due to other specified bacteria                              
11 T847      2     1 Infection and inflammatory reaction due to other internal orthopaedic prostheses, implan~
12 M861F     1     0 Other acute osteomyelitis in hip joint/femur                                             
13 M866      1     0 Other specified chronic osteomyelitis                                                    
14 M866F     1     0 Chronic osteomyelitis in hip joint/femur
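
The p column can be reproduced from the n column as each code's share of the listed codes. A small check in R, using only the counts from the table above:

```r
# Counts (n) per code, in the same order as the table
n <- c(T814 = 116, T845F = 89, T845 = 35, M009 = 7, M009F = 6, M000F = 5,
       M000 = 3, M002F = 3, T847F = 3, M008 = 2, T847 = 2,
       M861F = 1, M866 = 1, M866F = 1)

total <- sum(n)               # number of listed codes (not patients)
p     <- round(100 * n / total)

total
p
```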

In connection with this, I realise that I may have misunderstood the handling of the KVÅ codes NFA12, TNF05 and TNF10. 
The original Excel file stated that these codes apply "only if combined with an ICD code above".
I interpreted this as meaning that these codes do not count on their own, but only if they occur at the same visit as an ICD-10 code from that list. 
However, since there was no similar condition for the ICD codes, I interpreted it as the ICD codes alone being sufficient to identify an infection, i.e. that the ICD code with or without a KVÅ code indicates infection. 
Or, put differently, that these KVÅ codes thereby became implicitly redundant.
Looking now at the meaning of some of the included ICD-10 codes above, I see that they do not in themselves exclude another location. 

Is it the case, for example, that T814 must be combined with NFA12/TNF05/TNF10 to indicate an infection of interest?
If so, does this also apply to T845F (where there is admittedly already an implicit link to the hip via the code itself)?

If all cases above are excluded (since they do not occur in combination with NFA12/TNF05/TNF10), our estimate thus drops from about 2.35 % to 2 %.
If only those without a trailing "F" are excluded, we get about 2.17 %.
However, I do not know how likely it is that a relatively healthy patient gets a T814 entirely unrelated to the arthroplasty at almost the same time?
Perhaps it is just as likely that a KVÅ code that should have been recorded was missed? 
(I.e. that these cases should be included even if a smaller share of these infections are probably unrelated to the hip arthroplasty.)
I think it should be OK to continue as it is but to mention this as a limitation. Or do we need to recalculate?

Kind regards
Erik

Notes from Zoom meeting 2021-01-29

Check how often the NOMESCO codes NFW59 and NFW69 occur in our data
Wait for an updated code list from Alma for the codes that define the outcome. Then implement it and re-run.
look only at the model for 90 days (remove 2 years)
New model to be sent to DK
They will see whether they can validate the model. They may need to exclude variables for which they have fewer than 10 observations. Perhaps this should only apply if the model is re-calibrated, though? Otherwise it should not matter
They will also develop an even more simplified model, which we then evaluate with our data.
I will also check the model on random samples from our material of the same size as their data set will have.

Added value from predicted model Harrell

# https://www.fharrell.com/post/addvalue/

require(rms)
getHdata(acath)
acath <- subset(acath, !is.na(choleste))
acath$sex <- factor(acath$sex, 0:1, c('male', 'female'))

# Simple model (PRE)
f <- lrm(sigdz ~ rcs(age,4) * sex, data=acath)

# More advanced model (POST)
g <- lrm(sigdz ~ rcs(age,4) * sex + rcs(choleste,4) + rcs(age,4) %ia%
           rcs(choleste,4), data=acath)

lra <- f$stats['Model L.R.']
lrb <- g$stats['Model L.R.']
ra  <- f$stats['R2']
rb  <- g$stats['R2']

# What we want:
1 - ra / rb

Updated version from NH

#24 got its own issue but the other changes are here
Predict_PJI_hai1.docx

Update the formula

Correct as written, but easier with:
$\hat p = 1/(1 + e^{-\hat \alpha - \mathbf{\hat \beta X}})$
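
A quick numerical check of this form of the formula in R. The coefficient values and the variable names in beta_hat are made up for illustration and are not the fitted model.

```r
alpha_hat <- -4.0                                   # hypothetical intercept
beta_hat  <- c(P_Gender_Male = 0.3, P_ASA_3 = 0.6)  # hypothetical coefficients
x         <- c(P_Gender_Male = 1,   P_ASA_3 = 0)    # one example patient

# Linear predictor and predicted probability
lp    <- alpha_hat + sum(beta_hat * x)
p_hat <- 1 / (1 + exp(-lp))

# plogis() is the same inverse logit function in base R
c(formula = p_hat, plogis = plogis(lp))
```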

Redo everything without RxRiskV to check if the result is similar

It might be difficult to include this data and there might be differences between countries.
At least we start without it but might include it later if possible.

Categorise BMI

BMI: A non-linear association between BMI and PJI has been seen previously, where both low and high BMI increase the risk. In our model we have treated BMI as a continuous parameter and find a risk increase of 9% per BMI unit. I wonder whether we might be missing the risk increase at very low BMI, and whether we could explore BMI as defined by WHO, i.e. BMI divided into four categories.

Skip c_obesity

skip c_obesity.
Poorly reported and hard to interpret, and incorrect when cross-tabulated against P_BMI.
It is nevertheless correlated enough with BMI that variable selection becomes tricky if both are included.

Check the definitions of the diagnosis groups

Marie might be able to help with this.
Alternatively, cross-tabulate existing data.
(In that case a new extract from the linkage database is needed.)

Baseline levels in the coefficient table

Also include the baseline levels with OR = 1 etc.
For BMI and diagnosis.

Push out a first draft

Add a model with only age and sex as a comparison
Check whether I have a point estimate for the AUC from Ute. I think it should be possible to get it from the .RData file?
Incorporate this into the manuscript + figures and all discussion
the figure with the coefficients in the appendix, with a relevant description in the text
share via Box so that everyone can edit

Adapt to BMJ

https://drive.google.com/file/d/1lh9zzpFUUoOuLVszt4kB8jMv097nfJTn/view

Branch off the merged version and work with infections only

will then have to start over for dislocations

BMI grouping

NH:

Weight is categorised according to WHO in 6 categories:
< 18.5 as underweight, 18.5–24.9 as normal weight, 25–29.9 as overweight, 30–34.9 as class I obesity, 35.0–39.9 as class II obesity, and ≥ 40 as class III obesity.

However, I think we have too few patients with underweight and class III obesity, so I still suggest 4 groups.

Check the counts and send them as an FYI!
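
The WHO cut-offs quoted above map directly onto cut() in base R. The sketch below first creates the six WHO categories and then collapses them to four groups. Which categories to merge is an assumption made here for illustration (underweight with normal weight, and obesity class II with class III), and the bmi values are made up.

```r
bmi <- c(17.0, 18.5, 24.9, 25.0, 29.9, 30.0, 34.9, 35.0, 39.9, 40.0)

# Six WHO categories; right = FALSE gives intervals of the form [lower, upper)
who6 <- cut(
  bmi,
  breaks = c(-Inf, 18.5, 25, 30, 35, 40, Inf),
  right  = FALSE,
  labels = c("underweight", "normal weight", "overweight",
             "class I obesity", "class II obesity", "class III obesity")
)

# Collapse to four groups (assumed grouping, see above)
who4 <- who6
levels(who4) <- list(
  "under/normal weight"  = c("underweight", "normal weight"),
  "overweight"           = "overweight",
  "class I obesity"      = "class I obesity",
  "class II-III obesity" = c("class II obesity", "class III obesity")
)

table(who6)
table(who4)
```

Tabulating both versions shows how many patients end up in the sparse outer categories before and after merging.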

Exclude those who died within two years

Look up the registration number (diarienummer)

Eamma is asking for it.

Re-run after corrected RxRiskV

I have found an error in this classification, which needs to be fixed, after which the linkage database is updated, after which the model is recomputed.
Since much of this happens outside the project itself (it only affects the input data), it can be done fairly independently of the other project work.

Web calculator

Incorporate with the existing mortality calculator and break both out into a separate repo.

Do not exclude patients who died within two years of THA

Those patients were in fact excluded in a previous version, as correctly pointed out by @AlmaBP and @inatrol.
I have now re-run the analysis without this exclusion. The result is essentially similar but:

the number of included patients increased from 86,415 to 88,830
Some AUC-values changed in the second decimal
The selected model changed slightly.

I will push those changes in a minute!

Comments to NH

The abstract mentions "The unadjusted cumulative incidence of PJI was XXX within 90 days and XXX within two years". I guess that you mean the CIF from a survival model with death as a competing risk. One problem, however, is that we do not have exact dates for PJI in the same way as for death. In addition, we have a selection criterion such that only patients who live at least two years after surgery are included. I will therefore start by only reporting some kind of "crude rate" instead.

Check Danish data

Erik's fit comparison code ran smoothly with the new data and model, and the results look good on first glance

Find attached the compiled Rmarkdown document from Erik's cross-validation of the model with Danish data (DK_external_validation.pdf), and a zip file DK_exports.zip containing figures as tiff, and R objects as text files.
You can load the data as usual with R function load.

I still have some difficulties with "trusting" the model. Fitting the same model to the Danish data set, which is considerably smaller, we get quite different coefficients for some of the variables, see the attached DK_refit_model.pdf.

My first guess is, that this could have something to do with the smaller sample size of the Danish data. It could also have something to do with different habits of diagnosing - is it right that the Swedish data are also older than the Danish ones? - or other practical reasons.

To investigate the first hypothesis, it would be helpful if you, Erik, could fit the final model to random subsamples of the Swedish data. I've attached some short R code, subsample_SE_data.R, that I tried out on data simulated from your fitted model - if you agree, you could just run that one with the real data, and I would be happy to look at the results together :-)
You could also cross-validate the results from your model fitted to Danish data on the Swedish data. I think it would be interesting to see if this makes much difference - maybe it won't. The model fitted to Danish data is in file model_dk_superlean.txt. I removed even more data to keep the file size at an OK level, but it should still be usable for predicting values.
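
The subsampling idea can be sketched on simulated data as follows. This is not the attached subsample_SE_data.R, which is not shown here: the data frame full, the predictors x1 and x2, the sample sizes and the helper fit_sub() are all hypothetical.

```r
set.seed(2021)
n_full <- 20000   # size of the large (development) data set, made up
n_sub  <- 2000    # size of the smaller data set to mimic, made up
n_rep  <- 50      # number of random subsamples

# Simulated "full" data with a rare binary outcome
full   <- data.frame(x1 = rbinom(n_full, 1, 0.3), x2 = rnorm(n_full))
full$y <- rbinom(n_full, 1, plogis(-3 + 0.5 * full$x1 + 0.3 * full$x2))

# Refit the same model to one random subsample and return the coefficients
fit_sub <- function(size) {
  idx <- sample(nrow(full), size)
  coef(glm(y ~ x1 + x2, family = binomial(), data = full[idx, ]))
}

coefs <- t(replicate(n_rep, fit_sub(n_sub)))

# Spread of each coefficient across subsamples
apply(coefs, 2, quantile, probs = c(0.025, 0.5, 0.975))
```

If the spread across subsamples is of the same order as the difference between the two countries' coefficients, sample size alone could explain much of that difference.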

DK_exports.zip.zip
DK_external_validationDK.pdf
DK_refit_model.pdf

COI

Everyone fills in http://icmje.org/disclosure-of-interest/ and sends it to EB, who keeps the forms and chooses one of:

No competing interests: “All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.”

Grant funding for research but no other competing interest: “All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: all authors had financial support from ABC Company for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.”

Mixed competing interests: “All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; AB has received research grants and honorariums from XYZ company, BF has been paid for developing and delivering educational presentations for BBB foundation, DF does consultancy for HHH and VVV companies; no other relationships or activities that could appear to have influenced the submitted work.”

Differences between countries

We should mention somewhere that the Swedish and Danish populations are different, that the preferences for surgical and other medical procedures differ between the two countries, and that the coding of diagnoses probably differs as well.

Updated outcome codes

ICD-10

M000=purulent arthritis,
M000F=hip,
M001=Arthritis in infectious and parasitic diseases classified elsewhere,
M002=Reactive arthritides,
M002F=hip,
M008=Arthritis in children,
M008F=hip,
M009=Arthritis in children in diseases classified elsewhere,
M009F=hip,
M860, M861, M862, M863, M854, M865, M866, M868, M869 Osteomyelitis (incl. different subgroups)
T813=postoperative wound rupture, not elsewhere classified
T814=Infection following a procedure, not elsewhere classified,
T845=Infection or inflammation around joint prosthesis,
T845F, T845X,
T846F=Infection or inflammation around internal fixation material, hip,
T847=Infection or inflammation around other orthopaedic prosthesis, implant or graft, T847F
NFW59 = Reoperation for superficial infection after surgery on hip or thigh
NFW69 = Reoperation for deep infection after surgery on hip or thigh

NOMESCO

NFS09 (not existing in DK), NFS19, NFS29, NFS39 (Not existing in DK), NFS49, NFS59, NFS99

Share model_reduced_lean.RData

Regroup the diagnosis groups based on NH's new list

Awaiting word from NH

Transparency declaration

NPH, as the lead author, affirms that this manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.

Additional models

Attachment
DK_auc_simpler_models_etc.pdf
includes the values of the AUC that were missing. I also fitted the Danish data to the simpler models you have in Figure 3, the same models with Age and Sex added, and a model without diagnoses.

Qualitatively, the results are the same as in Sweden, although the comorbidity indices partly switch order. Adding Age and Sex improves the AUC in all three cases by a few percent (Rx, Charlson, Elixhauser), but the AUC is still far from Erik’s model.

A mere “man on the street” model with BMI, Age and Sex is better than the models based on diagnosis indices, and reaches AUC = 0.639 (in the same data as used for fitting).

The improvement that we get if we refit Erik’s model to the Danish data is not overly impressive; AUC raises from 0.664 to 0.672 (measured on the same data that were used for fitting). So, overall the Swedes seem to be able to describe Danes quite well, albeit not 100% as well as themselves (AUC 0.68).

Refer to "caputnekros" as "AVN"

The best English term for caputnekros is "AVN" (avascular necrosis of the femoral head). We should use this in text, tables and graphs.

SUMMARY BOXES

Please produce a box offering a thumbnail sketch of what your article adds to the literature. The box should be divided into two short sections, each with 1—3 short sentences.

Section 1: What is already known on this topic
In two or three single sentence bullet points, please summarise the state of scientific knowledge on this topic before you did your study, and why this study needed to be done. Be clear and specific, not vague.

Section 2: What this study adds
In one or two single sentence bullet points, give a simple answer to the question “What do we now know as a result of this study that we did not know before?” Be brief, succinct, specific, and accurate. For example: “Our study suggests that tea drinking has no overall benefit in depression.” You might use the last sentence to summarise any implications for practice, research, policy, or public health.

Double check which model was exported

It has a lot of covariates, so it might be the main model? It should be the reduced one, however.

Skip the age variable in the comparison models for infection

Under 9.2 AUC, "Age" is still included, either as RCS, as a main effect, or as a simple model in combination with sex: could it be a leftover from the mortality work? Age is not included as a parameter in our models for PJI. Quite right! This will be fixed!

Patient involvement

request that authors provide a Patient and Public Involvement statement in the methods section of their papers.

brief response to the following questions, tailored as appropriate for the study design reported:
• At what stage in the research process were patients/public first involved in the research and how?
• How were the research question(s) and outcome measures developed and informed by their priorities, experience, and preferences?
• How were patients/public involved in the design of this study?
• How were they involved in the recruitment to and conduct of the study?
• Were they asked to assess the burden of the intervention and time required to participate in the research?

Merge NH's and EB's versions of the manuscript

Merge NH's and EB's versions of the manuscript
Predict_PJI_hai2.docx

PJI within two years in Table 1

Add columns for PJI within 2 years in Table 1.
They may have to be removed later if it gets too messy together with the external validation, but we start like this for now.

Include references and instructions to the coder package

It is probably a good idea to use this package and to include some instructions on how to use it.
It might be, however, that they have a national version of the ICD-10 codes that works better with the Danish data.
Perhaps this version could then be used for the Swedish data as well, as a sensitivity analysis.

predictive power or discriminative ability

Erik: We (I ...) sometimes write "predictive power" and sometimes "discriminative ability". Not entirely consistent; which should we choose?

How are underweight patients handled?

Nils:
Fully agree on merging obesity class II and III. Underweight and normal weight are trickier, however; it is precisely the underweight patients who are suggested to have an increased risk, and this phenomenon might be drowned out if the groups are merged? See a recent reference below that Ola co-authored.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6534237/

Related to #8

Tasks from Zoom meeting

will add the ICD-10/NOMESCO codes identifying PJI to the description.
Double check if any of those need to appear in tandem.
See how many patients had only those codes (no corresponding re-operation recorded in SHAR). (Nice to know.)

To be further discussed in a meeting with Nils:

Exclusion of everyone who died within two years?
Contributions to the manuscript?
There might be additional variables available in Denmark which might increase model performance. Might be interesting to look at as well.

combine calibration plots

Waiting for the ggplot2 object from Ute. Can then combine those and adopt the design

Rewrite the methods section

Draft started, based on the mortality article

We used data from two national quality registers 2008-2015: The Swedish hip arthroplasty register (SHAR), with a completeness of 96-98%,[@Karrholm2018] for model development and internal validation, and XXX for external validation.

SHAR was linked using Swedish personal identity numbers[@Ludvigsson2009] to enhance the variable set with education and civil status from Statistics Sweden,[@Ludvigsson2019] comorbidity and adverse events from the national patient register,[@Ludvigsson2011] and prescribed medications from the medical prescription register.[@Cnudde2016]
We excluded patients with hip arthroplasty due to fractures, tumours, unspecified, or unknown reasons. We included patients with either unilateral THA, or with their second staged bilateral hip.[@Bulow2020] Patients younger than 18 or older than 100 years were excluded, as were patients with a body mass index (BMI) above 50, or with missing data on BMI, ASA class, education, type of hospital, and cementation (Figure @ref(fig:flowchart)).

Comorbidity, during one year before surgery, was based on the Swedish version of the 10th revision of the international classification of diseases (ICD-10-SE), as recorded at any hospital physician appointment prior to surgery. Individual codes were then combined using Charlson and Elixhauser comorbidity groups.[@quan2005] A similar procedure was performed for codes of the anatomical therapeutic chemical (ATC) classification, grouped by Rx Risk V.[@Pratt2018] Similar conditions were then combined to a broader classification scheme based on clinical relevance, and those categories were used as possible predictors (Table @ref(tab:tabcategorization)).

Infections within 90 days and 2 years respectively, were identified by relevant ICD-10-SE codes, or by codes from the Classification of Surgical Procedures (NCSP) from the Nordic Medico-Statistical Committee (NOMESCO) recorded in the national patient register. An infection was also identified if causing a reoperation recorded to SHAR.
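
The outcome definition above can be sketched in base R as follows. Everything here is illustrative: the data frames visits and patients, their column names, and the short pji_codes vector (a small subset of codes, chosen only for the example) are assumptions and not the project's actual data structures.

```r
# Made-up hospital visits after surgery, one row per recorded code
visits <- data.frame(
  id                 = c(1, 1, 2, 3, 3),
  days_after_surgery = c(10, 200, 30, 95, 40),
  code               = c("T845F", "T814", "Z471", "T814", "M170")
)

# Made-up reoperations due to infection, as recorded in the arthroplasty register
patients <- data.frame(id = 1:4, reop_pji = c(FALSE, FALSE, FALSE, TRUE))

# Illustrative subset of infection codes
pji_codes <- c("T814", "T845", "T845F")

# A visit counts if it carries an infection code within 90 days of surgery
hit <- visits$days_after_surgery >= 0 &
  visits$days_after_surgery <= 90 &
  visits$code %in% pji_codes

# Outcome: infection code within 90 days OR reoperation due to infection
patients$pji90 <- patients$id %in% visits$id[hit] | patients$reop_pji
patients
```

Patient 3 shows why the time window matters: the infection code is recorded on day 95 and therefore does not count towards the 90-day outcome.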
