Document Type : Research Paper

Authors

1 Allameh Tabatabaei University

2 professor

Abstract

Nonresponse is an inevitable challenge to large-scale studies and can result in wasting money, time and human resource involved in data collection and can also prevent the studies from obtaining their objects especially scores distribution. Imputation methods have thus been invented to estimate item nonresponses in order to make inference from a completed data set. Using a simulation study on a real data set in the form of a multivariate experimental design, this paper evaluates the accuracy of three models including cumulative logit model, graded response model and explanatory item response model. The results show that the imputed values of all three models are acceptable under random nonresponse mechanism although the imputed values of the explanatory item response model are always more accurate than those of the other models. If nonrandom nonresponses are occurred, explanatory item response model has acceptable imputed values only at 5% nonresponse rate and the other models are not accurate at all. The results also show that it is more accurate to impute individual item nonresponses and then compute the total score instead of directly imputing the total score.

Keywords

Azar, B. (2002). Finding a solution for missing data. Monitor on Psychology, 33(7), 70.
Bodner, T. E. (2006). Missing data: Prevalence and reporting practices. Psychological Reports, 99, 675–680.
De Boeck, P., & Wilson, M. (2004). Explanatory item response models. Springer New York.
Enders, C. K. (2010). Applied missing data analysis. Guilford Press.
Fowler Jr, F. J. (2013). Survey research methods. Sage publications.
Groves, R. M., Fowler Jr, F. J., Couper, M. P., Lepkowski, J. M., Singer, E., & Tourangeau, R. (2011). Survey methodology (Vol. 561). John Wiley & Sons.
Hox, J., de Leeuw, E., Couper, M. P., Groves, R. M., De Heer, W., Kuusela, V., & Martin, J. (2002). The influence of interviewers’ attitude and behavior on household survey.
Krishnamoorthy, K., Mallick, A., & Mathew, T. (2009). Model-based imputation approach for data analysis in the presence of non-detects. Annals of Occupational Hygiene, 53(3), 249-263.
Molenberghs, G., Fitzmaurice, G., Kenward, M. G., Tsiatis, A., & Verbeke, G. (Eds.). (2014). Handbook of missing data methodology. CRC Press.
O'Connell, A. A. (2006). Logistic regression models for ordinal response variables (No. 146). Sage.
Peugh, J. L., & Enders, C. K. (2004). Missing data in educational research: A review of reporting practices and suggestions for improvement. Review of educational research, 74(4), 525-556.
Raghunathan, T. (2016). Missing Data Analysis in Practice. CHAPMAN & HALL/CRC.
Roth, P. L. (1994). Missing data: A conceptual review for applied psychologists. Personnel psychology, 47(3), 537-560.
Samejima, F. (2008). Graded response model based on the logistic positive exponent family of models for dichotomous responses. Psychometrika, 73(4), 561-578.
Seaman, S., Galati, J., Jackson, D., & Carlin, J. (2013). What Is Meant by" Missing at Random"?. Statistical Science, 257-268.
Schlomer, G. L., Bauman, S., & Card, N. A. (2010). Best practices for missing data management in counseling psychology. Journal of Counseling Psychology, 57(1), 1-10.
Tang, G., Little, R. J., & Raghunathan, T. E. (2003). Analysis of multivariate missing data with nonignorable nonresponse. Biometrika, 747-764.
Tourangeau, R., Rips, L. J., & Rasinski, K. (2000). The psychology of survey response. Cambridge University Press.
van der Heijden, G. J., Donders, A. R. T., Stijnen, T., & Moons, K. G. (2006). Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example. Journal of clinical epidemiology, 59(10), 1102-1109.
Wilkinson, L. (1999). Task Force on Statistical Inference, American Psychological Association, Science Directorate (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54(8), 594-604.
Zhang, Z. (2016). Missing data imputation: focusing on single imputation. Annals of translational medicine, 4(1), 1-8.