Match drugs to alternatives to get count of competitors.
#40
Open
opened 3 years ago by youainti
·
22 comments
Loading…
Reference in New Issue
There is no content yet.
Delete Branch '%!s(<nil>)'
Deleting a branch is permanent. It CANNOT be undone. Continue?
Increase the number of drugs by matching using WHO ATC codes.
This data exists in the RxNav set.
Notes on using ATC:
I need to choose a level to describe as "alternatives", but that isn't necessarily consistent. For example, in (J01), level 3 splits into various classes of antibacterials, by what appears to be common lineage (pharmacological grouping?). In contrast, for (S01), splits are between general areas of interests (Theraputic grouping?) and level 4 splits between antibiotics and antivirals.
This makes groupings for my purposes (finding competing drugs) difficult.
USP classifications seem to make more sense
Human readable discussion of drug classes: https://www.verywellhealth.com/drug-classes-1123991
Therapeutic categories.
https://www.fda.gov/regulatory-information/fdaaa-implementation-chart/usp-therapeutic-categories-model-guidelines
I think this might be a decent alternative. In particular, the Pharmacological Classes (2nd level) would probably be useful.
I just confirmed that USP is one of the properties held in rxnorm and the list of 3600+ properties can be found using
in my migrated data.
Match drugs to others by WHO ATC codesto Match drugs to alternatives to get count of competitors. 3 years agoUpdated name to represent the goal better.
https://www.nlm.nih.gov/research/umls/rxnorm/sourcereleasedocs/usp.html
https://www.nlm.nih.gov/pubs/techbull/so18/brief/so18_rxnorm_usp.html
Based on this, it appears that only the nomenclature is included in RxNorm, while the classifications (including the pharmaceutical/therapeutic classifications) are found in the USP Drug Classification system (USP-DC)
The main issue here is that RxNorm doesn't have the actual hierarchy/classification in it. Instead it has just the mappings from RXCUIs to USP compendial nomenclature codes. I need a mapping of USP CN codes to pharmacological class. Here are the steps I need to take:
Another option is to use the Vetrans Affairs formulary and classes.
https://www.pbm.va.gov/nationalformulary.asp
As I've been thinking about how to set this up, I've been struck by the thought that using formularies to measure market competition is probably appropriate because those are the "approved competitors", as formularies determine what insurance will cover.
Some formularies or pharmaceutical classes I can probably get a hold of.
I am not sure how to include those in my model, but they are important.
Probably as a set of correlated coefficients.
I should probably start with the VA formulary.
I have requested the USP DC info.
Got the USP DC data from 2022 and 2023.
Also have the VA national formulary for 2023.
There was no discussion of licensing for USP or VA. I assume they are publicly available at this point.
So in the RxNorm aligned file from USP DC 2023, all drugs have USP categories, and most have USP Classes (but not all). I think I will match on both Class and Category.
To put this into the database, I will create three tables.
Two will represent the categories and classes.
The third will link RxCUIs to those categories and classes.
Then I will create views that:
For the VA data, the national drug file extract with ndc has both VA class code and the va class name. This appears to correspond with the class code list I have.
I have the following options in front of me:
After looking at these two, I think I want to do the former.
USP and VA data uploaded in
1c3d749.A final dataset for formularies is the USP MMG, which is a base from which Medicare part D providers can start.
I will need to figure out a way to combine these formularies into a single measure.
That max operator will have a big impact if one of them is matched poorly though.
If one of the formulary classes is usually poorly matched, then that should generally come to be disregarded as non-informative.
That would be an issue in and of itself because it affects inference.
I think the best option is the second one (include each formulary separately).
Potential issues
A potential issue with the matching is if a drug belongs to multiple classes, then it is possible we would pull in non-competitors.
A potential issue is that a trial compound doesn't show up in any of the formularies (because formularies are not exhaustive of drugs used).
Added the USP MMG data in
6a931b3. Main file of interest is the alignment file.Note that this is version 8, which is current up to 2023-09-15, 8 days from now. That is when version 9 should be published.
Just imported USP DC, USP MMG (V8), and VA data into the database. This was done using the import function from DBeaver community edition.
Now I need to focus on linking drugs from trials to the different classes.
One identified issue. Based on:
some categories hold drugs that don't fall into other classes.
I am not sure how to handle those drugs.
My first inclination is to decide that if a drug in a category is in one of the "No USP Class, $CATEGORY" classes, then it should not be treated as having no competitors.
I don't know what needs to be true in order to justify this.