icd9WideToLong: convert ICD data from wide to long format
Description
This is different enough to dcast in reshape2 that
it needs writing again specifically for ICD codes. This function packages
the core reshape function. Empty strings and NA values will be
dropped, and everything else kept. No validation of the ICD codes is done.
data.frame in wide format, i.e. one row per patient, and
multiple columns containing ICD codes, empty strings or NA.
visitId
The name of the column in the data frame which contains the
patient or visit identifier. Typically this is the visit identifier, since
patients come leave and enter hospital with different ICD-9 codes. It is a
character vector of length one. Defaults to "visitId"
icdLabels
vector of column names in which codes are found. If NULL,
all columns matching icd or ICD will be included.
icdName
character vector length one containing the new column name
for the ICD codes, defaults to "icdCode"
Value
data frame with visitId column named the same as input, and a column
named by icd.name containing all the non-NA and non-empty codes
found in the wide input data.