Skip to content

Publication

Variance estimation under multiple imputation

Abstract

"When investigating unemployment data, one may be interested in estimating the totals of unemployed in subpopulations, e.g. by regional or by contentional differentiation. For the estimation of a population total typically Horvitz-Thompson type estimates are used. However, often the data are prone to item-nonresponse. To achieve valid results for these estimates from a randomization-based perspective, in general, variance correction methods are needed. In this paper we discuss imputation issues in large-scale data sets with different scaled variables laying special emphasis on binary variables. Since fitting a multivariate imputation model can be cumbersome, univariate specifications are proposed which are much easier to perform. The regression-switching or chained equations Gibbs sampler is proposed and possible theoretical shortcomings of this approach are addressed as well as data problems. A simulation study is done based on the data of the German Microcensus, which is often used to analyse unemployment. Multiple imputation, raking, and calibration techniques are compared for estimating the number of unemployed in different settings. We find that the logistic multiple imputation routine for binary variable, in some settings, may lead to poor point as well as variance estimates. To overcome possible shortcomings of the logistic regression imputation, we derive a multiple imputation matching algorithm which turns out to work well." (Author's abstract, IAB-Doku) ((en))

Cite article

Münnich, R. & Rässler, S. (2004): Variance estimation under multiple imputation. In: Statistisches Bundesamt (Hrsg.) (2004): European conference on quality and methodology in official statistics (Q2004), Mainz, Germany, Castle of the Prince Elector, 24-26 May 2004 : proceedings, p. 1-19.

Download

Free Access