This data set contains observations from a randomized response survey conducted at a university to investigate drug use. The sample was drawn by cluster sampling with probabilities proportional to size. The randomized response technique used is the Horvitz-UB model (Chaudhuri, 2011) with parameters \(p_1=0.6\) and \(p_2=0.7\).
data(HorvitzUBData)
A data frame containing a sample of 188 observations from a population of \(N=802\) students divided into four clusters. The variables are:
ID: Survey ID of student respondent
CL: Cluster ID
I: The first randomized response to the question: Have you ever used drugs?
J: The second randomized response to the question: Have you ever used drugs?
Pi: First-order inclusion probabilities
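The two responses I and J are usually combined into a single transformed value before estimation. The sketch below assumes the standard unrelated-question design: with true sensitive value \(y\) and unrelated value \(x\) (both 0/1), \(E(I)=p_1 y+(1-p_1)x\) and \(E(J)=p_2 y+(1-p_2)x\), so \(r=\frac{(1-p_2)I-(1-p_1)J}{p_1-p_2}\) has expectation \(y\). The data frame `toy` and its values are hypothetical and stand in for the real data; the weighting by `1/Pi` is an ordinary Horvitz-Thompson total and is not necessarily how any particular package computes its estimates.

```r
# Design parameters stated in the description of this data set
p1 <- 0.6
p2 <- 0.7

# Hypothetical toy rows with the same column names as the data set
# (illustrative values only, NOT taken from HorvitzUBData)
toy <- data.frame(
  I  = c(1, 0, 0, 1),
  J  = c(1, 0, 1, 1),
  Pi = c(0.25, 0.25, 0.20, 0.20)
)

# Transformed response: unbiased for the 0/1 sensitive value
# under the assumed unrelated-question model
toy$r <- ((1 - p2) * toy$I - (1 - p1) * toy$J) / (p1 - p2)

# Horvitz-Thompson style estimate of the population total,
# weighting each transformed response by 1 / inclusion probability
total_hat <- sum(toy$r / toy$Pi)

print(toy)
print(total_hat)
```

Individual values of `r` can fall outside the interval [0, 1]. That is expected, because only their design-weighted average is meant to be interpreted.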
Chaudhuri, A. (2011). Randomized response and indirect questioning techniques in surveys. Boca Raton: Chapman and Hall, CRC Press.
Greenberg, B.G., Abul-Ela, A.L., Simmons, W.R., Horvitz, D.G. (1969). The unrelated question RR model: Theoretical framework. Journal of the American Statistical Association, 64, 520-539.
Horvitz, D.G., Shah, B.V., Simmons, W.R. (1967). The unrelated question RR model. Proceedings of the Social Statistics Section of the American Statistical Association. 65-72. Alexandria, VA: ASA.
# NOT RUN {
data(HorvitzUBData)
# }