Learn R Programming

RAZIAD (version 0.0.1)

DebTrivedi: DebTrivedi Dataset

Description

The DebTrivedi dataset contains data on health care utilization and insurance coverage.

Usage

data("DebTrivedi")

Arguments

Format

A data frame with 4406 observations on the following 19 variables.

ofp

a numeric vector indicating physicians office visits

ofnp

a numeric vector

opp

a numeric vector

opnp

a numeric vector

emer

a numeric vector

hosp

a numeric vector indicating number of hospital stays

health

a factor with levels poor average excellent indicating self-perceived health status

numchron

a numeric vector indicating number of chronic conditions

adldiff

a factor with levels no yes

region

a factor with levels midwest noreast other west

age

a numeric vector

black

a factor with levels no yes

gender

a factor with levels female male

married

a factor with levels no yes

school

a numeric vector indicating number of years of education

faminc

a numeric vector

employed

a factor with levels no yes

privins

a factor with levels no yes indicating private insurance indicator

medicaid

a factor with levels no yes

Details

Deb and Trivedi (1997) conducted an analysis on data comprising 4406 individuals aged 66 and above, enrolled in Medicare, a public insurance program. These data were initially sourced from the US National Medical Expenditure Survey (NMES) for the years 1987/88. They are accessible through the data archive of the Journal of Applied Econometrics at http://qed.econ.queensu.ca/jae/1997-v12.3/deb-trivedi/. Additionally, the dataset was compiled for an R package associated with Kleiber and Zeileis (2008) and is also retrievable as DebTrivedi.rda from the Journal of Statistical Software, alongside Zeileis (2006). The primary aim is to develop models for medical care demand, encompassing physician/non-physician office and hospital outpatient visits, based on available patient covariates.

References

Zeileis, A. and Kleiber, C. and Jackma, S. (2008). "Regression Models for Count Data in R". JSS 27, 8, 1–25.

Examples

Run this code
data(DebTrivedi)
plot(table(DebTrivedi$ofp), main = "physicians office visit (N=4406)", ylab="frequency")
#the response variable “physicians office visits” is highly right skewed

Run the code above in your browser using DataLab