- year
Integer. Year(s) of the data. Required.
- month
Integer. Month(s) of the data (1-12). If NULL (default),
downloads all 12 months. Example: 1 (January), 1:6 (first semester).
- vars
Character vector. Variables to keep. If NULL (default),
returns all available variables. Use sih_variables() to see
available variables.
- uf
Character. Two-letter state abbreviation(s) to download.
If NULL (default), downloads all 27 states.
Example: "SP", c("SP", "RJ").
- diagnosis
Character. CID-10 code pattern(s) to filter by principal
diagnosis (DIAG_PRINC). Supports partial matching (prefix).
If NULL (default), returns all diagnoses.
Example: "I21" (acute myocardial infarction), "J" (respiratory).
- parse
Logical. If TRUE (default), converts columns to
appropriate types (integer, double, Date) based on the variable
metadata. Use sih_variables() to see the target type for each
variable. Set to FALSE for backward-compatible all-character output.
- col_types
Named list. Override the default type for specific
columns. Names are column names, values are type strings:
"character", "integer", "double",
"date_dmy", "date_ymd", "date_ym", "date".
Example: list(VAL_TOT = "character") to keep VAL_TOT as character.
- cache
Logical. If TRUE (default), caches downloaded data for
faster future access.
- cache_dir
Character. Directory for caching. Default:
tools::R_user_dir("healthbR", "cache").
- lazy
Logical. If TRUE, returns a lazy query object instead of a
tibble. Requires the arrow package. The lazy object supports
dplyr verbs (filter, select, mutate, etc.) which are pushed down
to the query engine before collecting into memory. Call
dplyr::collect() to materialize the result. Default: FALSE.
- backend
Character. Backend for lazy evaluation: "arrow"
(default) or "duckdb". Only used when lazy = TRUE.
DuckDB backend requires the duckdb package.