rrcovNA (version 0.4-15)

ces: Consumer Expenditure Survey Data

Description

This data set has been derived from the Quarterly Interview Survey of the Consumer Expenditure Survey (CES) undertaken by the U.S. Department of Labor, Bureau of Labor Statistics and is available at https://www.bls.gov/cex/ where also more details about this survey can be found. The original data set comprises 869 households in 34 variables of which one is unique ID, five characterize the size of the household, further 6 variables contain other characteristics of the household like age, education ethnicity, etc. and 22 variables represent the household expenditures. We will consider a reduced set of only 8 expendature variables. This reduced data set was analyzed by Hubert at al. (2009)in the context of PCA and the first step of the analysis showed that all variables are highly skewed. They applied the robust PCA method of Serneels and Verdonck based on the EM algorithm, since some of the data are incomplete.

Usage

data(ces)

Arguments

Format

A data frame with 869 observations on the following 8 variables:

EXP

Total household expenditure

FDHO

Food and nonalcoholic beverages consumed at home

FDAW

Food and nonalcoholic beverages consumed away from home

SHEL

Housing expenditure

TELE

Telephone services

CLOT

Clothing

HEAL

Health care

ENT

Entertainment

References

Hubert, M, Rousseeuw, P.J. and Verdonck, T., (2009). Robust PCA for skewed data and its outlier map, Computational Statistics & Data Analysis, 53, 6, pp. 2264-2274

Examples

Run this code
data(ces)
summary(ces)
plot(ces)

Run the code above in your browser using DataCamp Workspace