‘Caveat emptor’: the cautionary tale of endocarditis and the potential pitfalls of clinical coding data—an electronic health records study

Abstract Background Diagnostic codes from electronic health records are widely used to assess patterns of disease. Infective endocarditis is an uncommon but serious infection, with objective diagnostic criteria. Electronic health records have been used to explore the impact of changing guidance on antibiotic prophylaxis for dental procedures on incidence, but limited data on the accuracy of the diagnostic codes exists. Endocarditis was used as a clinically relevant case study to investigate the relationship between clinical cases and diagnostic codes, to understand discrepancies and to improve design of future studies. Methods Electronic health record data from two UK tertiary care centres were linked with data from a prospectively collected clinical endocarditis service database (Leeds Teaching Hospital) or retrospective clinical audit and microbiology laboratory blood culture results (Oxford University Hospitals Trust). The relationship between diagnostic codes for endocarditis and confirmed clinical cases according to the objective Duke criteria was assessed, and impact on estimations of disease incidence and trends. Results In Leeds 2006–2016, 738/1681(44%) admissions containing any endocarditis code represented a definite/possible case, whilst 263/1001(24%) definite/possible endocarditis cases had no endocarditis code assigned. In Oxford 2010–2016, 307/552(56%) reviewed endocarditis-coded admissions represented a clinical case. Diagnostic codes used by most endocarditis studies had good positive predictive value (PPV) but low sensitivity (e.g. I33-primary 82% and 43% respectively); one (I38-secondary) had PPV under 6%. Estimating endocarditis incidence using raw admission data overestimated incidence trends twofold. Removing records with non-specific codes, very short stays and readmissions improved predictive ability. Estimating incidence of streptococcal endocarditis using secondary codes also overestimated increases in incidence over time. Reasons for discrepancies included changes in coding behaviour over time, and coding guidance allowing assignment of a code mentioning ‘endocarditis’ where endocarditis was never mentioned in the clinical notes. Conclusions Commonly used diagnostic codes in studies of endocarditis had good predictive ability. Other apparently plausible codes were poorly predictive. Use of diagnostic codes without examining sensitivity and predictive ability can give inaccurate estimations of incidence and trends. Similar considerations may apply to other diseases. Health record studies require validation of diagnostic codes and careful data curation to minimise risk of serious errors.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.6084/m9.figshare.c.4653179
PID https://www.doi.org/10.6084/m9.figshare.c.4653179.v1
URL http://dx.doi.org/10.6084/m9.figshare.c.4653179.v1
URL http://dx.doi.org/10.6084/m9.figshare.c.4653179
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right not available
Attribution

Description: Authorships and contributors

Field Value
Author Fawcett, Nicola
Author Young, Bernadette
Author Peto, Leon
Author T. Quan
Author Gillott, Richard
Author Jianhua Wu
Author Middlemass, Chris
Author Weston, Sheila
Author Crook, Derrick
Author Peto, Tim
Author Muller-Pebody, Berit
Author Johnson, Alan
Author A. Walker
Author Sandoe, Jonathan
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Datacite
Hosted By figshare
Publication Date 2019-01-01
Publisher figshare
Additional Info
Field Value
Language UNKNOWN
Resource Type Collection
keyword FOS: Chemical sciences
keyword FOS: Biological sciences
keyword FOS: Computer and information sciences
keyword FOS: Health sciences
system:type other
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/other?orpId=dedup_wf_001::b2e6e4577e6f5771851939918a6c62ca
Author jsonws_user
Last Updated 18 December 2020, 19:05 (CET)
Created 18 December 2020, 19:05 (CET)