Link trial conditions to Burden of Disease data
#36
Closed
opened 3 years ago by youainti
·
16 comments
Loading…
Reference in New Issue
There is no content yet.
Delete Branch '%!s(<nil>)'
Deleting a branch is permanent. It CANNOT be undone. Continue?
Use the WHO burden of disease data to approximate population sizes of different diseases.
Steps include:
https://meshb.nlm.nih.gov/record/ui?name=Global%20Burden%20of%20Disease
It looks like there is already some sort of linkage between GBD and MeSH
downloaded data and wrote scripts to put in in the db.
To quote snowmed about smdct <-> idc10 mappings
So it looks like I should map GBD to IDC-10 and also map the conditions from
browse_conditionsto idc-10icd-10 background info
https://www.cdc.gov/nchs/icd/index.htm
https://www.cms.gov/Medicare/Coding/ICD10/2019-ICD-10-CM
the 2019 icd-10-cm data which should match that used to link the global burdens of disease data as it was last released for 2019.
The icd-10 version from WHO: https://icd.who.int/browse10/2019/en
This does not seem to match the downloadable version of ICD-10-CM from CMS
https://www.cms.gov/Medicare/Coding/ICD10/2019-ICD-10-CM. In particular, there are missing codes in the CMS version that I find in the WHO version: e.g.
F02.4 can be found in the WHO version but not the CMS version search here
I checked the 2020 CMS version of ICD-10-CM for F02.4 as well and it is missing.
WHO does provide API access to get the ICD-10:2019 access. https://icd.who.int/icdapi
Details on python requests here:
https://github.com/ICD-API/Python-samples/blob/master/sample.py
Downloaded the WHO ICD-10 (2019) categories by copying and pasting from the nav bar of https://icd.who.int/browse10/2019/en
So it turns out that the ICD-10 codes used in the GBD data source is not consistent between WHO ICD-109 and CMS ICD-10-CM.
Examples:
As of
470dfc2611I got a working merge of the WHO and CMS versions as well as code to generate it.Current data state:
What I need to do:
I currently have a flask app that will simplify manual matching of the data I need.
I still have a couple of steps left:
match-status?I adjusted the schema for a simpler workflow. Everything is working fine. I will get the data together and then export a backup.
277b5b9Need to download the full data set so I can start setting things up for matching trials to causes.
Have the linking script in the repo as of
9a718f7.Used the estimated values and matched each trial to the most specific population measure they had.