Combining multiple imputation and control function methods to deal with missing data and endogeneity in discrete-choice models

Gopalakrishnan, Raja; Guevara, C. Angelo; Ben-Akiva, Moshe

Abstract

While collecting data for estimating discrete-choice models, researchers often encounter missing information in observations. In addition, endogeneity can occur whenever the error term is not independent of the observed variables. Both problems result in inconsistent estimators of the model parameters. The problems of missing information and endogeneity may occur in the same variable in the data, if, e.g., partially missing price information is correlated with another omitted variable. Extant approaches to correct for endogeneity in discrete choice models, such as the control function method, do not address the problem when the error term is correlated with a variable having missing information. Likewise, approaches to address missing information, such as the multiple imputation method, cannot handle endogeneity problems. To address this challenge, we propose a novel hybrid algorithm by combining the methods of multiple imputation and the control function. We validate the algorithm in a Monte-Carlo experiment and apply it to real data of heavy commercial vehicle parking from Singapore. In this case study, we were able to substantially correct for price endogeneity in the presence of missing price information. (c) 2020 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )

Más información

Título según WOS: Combining multiple imputation and control function methods to deal with missing data and endogeneity in discrete-choice models
Título de la Revista: TRANSPORTATION RESEARCH PART B-METHODOLOGICAL
Volumen: 142
Editorial: PERGAMON-ELSEVIER SCIENCE LTD
Fecha de publicación: 2020
Página de inicio: 45
Página final: 57
DOI:

10.1016/j.trb.2020.10.002

Notas: ISI