incadata

The goal of incadata is to provide basic functionality to handle data from INCA and the Regional cancer centers in Sweden.

Installation

You can install the released version of incadata from CRAN with:

install.packages("incadata")

And the development version from Bitbucket with:

# install.packages("remotes")
remotes::install_bitbucket("cancercentrum/incadata")

Standardised data sets

The function as.incadata standardizes data from INCA and Rockan:

  • All date formats used by Rockan are recognized as dates and coerced to such (for example: 1985-05-04, "", 19850504, 19850500, 19850000 and 8513).

  • Boolean values are numeric vectors in INCA: c(0, 1, 0, 1, 0, 0), but are coerced to character when exported: c(NA, "True", NA, "True", NA, NA). The package recognizes this peculiarity and coerces such vectors to Boolean.

  • Personal identity numbers are recognized even if they end with X or similar (as used in Rockan).
  • Standard numerical codes from Rockan are decoded (using the decoder package).
  • Column names are always coerced to lower case, since these are generally easier to work with.
  • Data frames are coerced to tibbles.
  • An id column is always added to data frames, so that an identification variable is always at hand (regardless of whether the data has none or one of PERSNR, PNR or PAT_ID).
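
Two of the conversions listed above can be illustrated in base R. This is only a minimal sketch of the idea, not the package's own implementation: the helper name inca_lgl_sketch and the example data frame are hypothetical, and the sketch assumes that a missing value in an exported Boolean column stands for FALSE, as the example vectors above suggest.

```r
# Hypothetical helper, not part of incadata: recode Booleans exported from INCA.
inca_lgl_sketch <- function(x) {
  # Assumption: exported values are either "True" or NA, and NA means FALSE.
  !is.na(x) & x == "True"
}

exported <- c(NA, "True", NA, "True", NA, NA)
flags <- inca_lgl_sketch(exported)

# Lower-case column names, similar in spirit to what the package does.
df <- data.frame(PAT_ID = 1:3, DIAGNOS = c("a", "b", "c"))
names(df) <- tolower(names(df))
```

In practice as.incadata is meant to handle these steps for you. The sketch only shows what kind of transformation is involved.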

Register documentation

The package also provides functionality for easier access to, and archiving of, register documentation (see the vignette 'incadoc' and the function documents).

Additional functionality

The package also lets you:

  • cache data between work sessions to speed up the data loading and munging process
  • use a single data reading/munging function regardless of whether you work on INCA or locally
  • interactively engage in the coercing process of variable formats. This is handy for example if a variable is almost a date but has some additional entries that are not recognized as such.
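
The general idea behind caching data between sessions can be sketched with base R's saveRDS and readRDS. This is an assumption-laden illustration, not how incadata implements its cache: cached_read_sketch, slow_loader and the temporary cache file are all hypothetical names introduced here.

```r
# Hypothetical helper (not an incadata function):
# load data once, then reuse the cached copy on later calls.
cached_read_sketch <- function(cache_file, loader) {
  if (file.exists(cache_file)) {
    return(readRDS(cache_file))
  }
  data <- loader()
  saveRDS(data, cache_file)
  data
}

# Stand-in for a slow data reading/munging step.
slow_loader <- function() {
  data.frame(id = 1:3, value = c(2.5, 3.1, 4.7))
}

cache_file <- tempfile(fileext = ".rds")
first <- cached_read_sketch(cache_file, slow_loader)   # runs the loader and writes the cache
second <- cached_read_sketch(cache_file, slow_loader)  # reads the cached copy instead
```

A real cache would also need a way to invalidate stale copies when the underlying data changes. The sketch leaves that out.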

Version: 0.9.1
License: GPL-2
Maintainer: Erik Bulow
Last published: April 9th, 2020

Functions in incadata (0.9.1)

  • as_numeric: Test object for, or coerce to, numeric
  • best_match: Tries to correct misspelling of character string
  • cut.integer: Convert integer vector to factor
  • documents: Download and possibly open INCA documentation
  • find_links: Find links from web page
  • find_register: Find register by name
  • id: Add id variables to data frame
  • ex_data: Synthetic example data from INCA
  • exceed_threshold: Check if transformation/coercing of a vector is good enough
  • is.inca: Check if R is running from INCA
  • width: Calculate the width of the range of x
  • specify_missing: Specify missing values for a vector
  • use_incadata: Use incadata from file or dataframe df
  • create_s3_method: Template functions to generate basic S3 methods for new classes
  • is.wholenumber: Test if a numeric vector consists of whole numbers
  • psum: Parallel sum
  • lownames: Make all names in data.frame lower case
  • lt: Lead time from one date to another
  • clean_text: Clean/standardize text
  • exportr: Dump script together with functions from required packages
  • find_documents: List URLs to documents for a register
  • as.incadata: Identify data formats used by INCA and Rockan
  • is.scalar_in: Test if scalar is in interval
  • as.Dates: Converting potential date to Date vector
  • is.incalogical: Coerce to logical if value is logical according to INCA