hdm (version 0.3.1)

rlassologitEffects: rigorous Lasso for Logistic Models: Inference

Description

The function estimates (low-dimensional) target coefficients in a high-dimensional logistic model.

Usage

rlassologitEffects(x, ...)

# S3 method for default rlassologitEffects(x, y, index = c(1:ncol(x)), I3 = NULL, post = TRUE, ...)

# S3 method for formula rlassologitEffects(formula, data, I, included = NULL, post = TRUE, ...)

rlassologitEffect(x, y, d, I3 = NULL, post = TRUE)

Value

The function returns an object of class rlassologitEffects with the following entries:

coefficients

estimated value of the coefficients

se

standard errors

t

t-statistics

pval

p-values

samplesize

sample size of the data set

I

index of variables of the union of the lasso regressions

Arguments

x

matrix of regressor variables serving as controls and potential treatments. For rlassologitEffect it contains only controls, for rlassologitEffects both controls and potential treatments. For rlassologitEffects it must have at least two columns.

...

additional parameters

y

outcome variable

index

vector of integers, logical or names indicating the position (column) or name of variables of x which should be used as treatment variables.

I3

logical vector with same length as the number of controls; indicates if variables (TRUE) should be included in any case.

post

logical. If TRUE, post-Lasso estimation is conducted.

formula

An element of class formula specifying the linear model.

data

an optional data frame, list or environment (or object coercible by as.data.frame to a data frame) containing the variables in the model. If not found in data, the variables are taken from environment(formula), typically the environment from which the function is called.

I

An one-sided formula specifying the variables for which inference is conducted.

included

One-sided formula of variables which should be included in any case.

d

variable for which inference is conducted (treatment variable)

Details

The functions estimates (low-dimensional) target coefficients in a high-dimensional logistic model. An application is e.g. estimation of a treatment effect \(\alpha_0\) in a setting of high-dimensional controls. The function is a wrap function for rlassologitEffect which does inference for only one variable (d).

References

A. Belloni, V. Chernozhukov, Y. Wei (2013). Honest confidence regions for a regression parameter in logistic regression with a loarge number of controls. cemmap working paper CWP67/13.

Examples

Run this code
if (FALSE) {
library(hdm)
## DGP
set.seed(2)
n <- 250
p <- 100
px <- 10
X <- matrix(rnorm(n*p), ncol=p)
colnames(X) = paste("V", 1:p, sep="")
beta <- c(rep(2,px), rep(0,p-px))
intercept <- 1
P <- exp(intercept + X %*% beta)/(1+exp(intercept + X %*% beta))
y <- rbinom(n, size=1, prob=P)
xd <- X[,2:50]
d <- X[,1]
logit.effect <- rlassologitEffect(x=xd, d=d, y=y)
logit.effects <- rlassologitEffects(X,y, index=c(1,2,40))
logit.effects.f <- rlassologitEffects(y ~ X, I = ~ V1 + V2)
}

Run the code above in your browser using DataCamp Workspace