You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ClinicalTrialsDataProcessing/ raw_data/ICD- 10_(2019 version)/WHO
youainti 2aba3469d3 adding WHO ICD-10 Data 3 years ago
..
README.txt adding WHO ICD-10 Data 3 years ago
icd10-2019.txt adding WHO ICD-10 Data 3 years ago
icd10-2019_categories_only.psv adding WHO ICD-10 Data 3 years ago
icd10-2019_categories_only.txt adding WHO ICD-10 Data 3 years ago

README.txt

This data was obtained by opening each nested portion on the
left navigation bar at 
    https://icd.who.int/browse10/2019/en
and then copying and pasting the data into a text file (icd10-2019.txt). 


This text file was then adjusted to get the pipe-separated values version 
which 
has the following columns
- code: the icd-10 code in a normalized format
- description: The basic description given
- source: This just says WHO so that it is possible to merge it with other 
sources.


The adjustments were as follows (parenthases include vim search and replace 
commands used):
- delete tabls (:%s/\t//g)
- delete leading spaces (:%s/^\s//)
- remove excess newlines (:%s/^\n//)
- remove periods in codes (:%s/\.//)
- Convert to Pipe-separated values file (:%s/\s/ | /)
- add column of sources (:%s/\s*$/ | WHO)
- Type in column headers