NSF-Census Research Network - imputation https://www.ncrn.info/tags/imputation en MixedDataImpute: Missing Data Imputation for Continuous and Categorical Data using Nonparametric Bayesian Joint Models https://www.ncrn.info/software/mixeddataimpute-missing-data-imputation-continuous-and-categorical-data-using-nonparametric <div class="field field-name-field-author field-type-taxonomy-term-reference field-label-above"><div class="field-label">Author(s):&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/authors/jared-s-murray" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Jared S. Murray</a></div></div></div><div class="field field-name-field-summary field-type-text-long field-label-hidden"><div class="field-items"><div class="field-item even"><p>Many datasets include a mix of continuous and categorical variables with missing values. In a paper published in the Journal of the American Statistical Association, we developed a joint model for such mixed data that can be used for multiple imputation. The approach uses a nonparametric Bayesian mixture model as the imputation engine. The mixture model comprises one set of mixture components with multivariate normal kernels for the continuous variables, and a separate set of mixture components with products of independent multinomial kernels for the categorical variables. The model induces dependence between the continuous and categorical variables in two ways, namely (i) by allowing the means of the multivariate normal distributions to depend on the categorical variables, and (ii) by using a tensor factorization prior that links the two sets of membership components.</p> </div></div></div><div class="field field-name-field-external-link field-type-link-field field-label-above"><div class="field-label">External Link:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="http://arxiv.org/abs/1410.0438" target="_blank">Paper</a></div><div class="field-item odd"><a href="https://CRAN.R-project.org/package=MixedDataImpute" target="_blank">CRAN</a></div></div></div><div class="field field-name-field-nodes field-type-entityreference field-label-above"><div class="field-label">Nodes:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/node/duke-university-national-institute-statistical-sciences-niss">Duke University / National Institute of Statistical Sciences (NISS)</a></div></div></div><div class="field field-name-field-tags field-type-taxonomy-term-reference field-label-above"><div class="field-label">Tags:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/tags/r" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">R</a></div><div class="field-item odd"><a href="/tags/missing-data" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">missing data</a></div><div class="field-item even"><a href="/tags/imputation" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">imputation</a></div></div></div> Mon, 10 Apr 2017 15:55:20 +0000 Vilhuber 2499 at https://www.ncrn.info EditImputeCont: Simultaneous Edit-Imputation for Continuous Microdata https://www.ncrn.info/software/editimputecont-simultaneous-edit-imputation-continuous-microdata <div class="field field-name-field-author field-type-taxonomy-term-reference field-label-above"><div class="field-label">Author(s):&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/authors/quanli-wang" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Quanli Wang</a></div><div class="field-item odd"><a href="/authors/hang-j-kim" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Hang J. Kim</a></div><div class="field-item even"><a href="/authors/jerome-p-reiter" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Jerome P. Reiter</a></div><div class="field-item odd"><a href="/authors/lawrence-h-cox" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Lawrence H. Cox</a></div><div class="field-item even"><a href="/authors/alan-f-karr" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Alan F. Karr</a></div></div></div><div class="field field-name-field-summary field-type-text-long field-label-hidden"><div class="field-items"><div class="field-item even"><p>Many statistical organizations collect data that are expected to satisfy linear constraints; as examples, component variables should sum to total variables, and ratios of pairs of variables should be bounded by expert-specified constants. When reported data violate constraints, organizations identify and replace values potentially in error in a process known as edit-imputation.  In a paper published in the <em>Journal of the American Statistical Association,</em> we developed an approach that fully integrates editing and imputation for continuous microdata under linear constraints. The approach relies on a Bayesian hierarchical model that includes (i) a flexible joint probability model for the underlying true values of the data with support only on the set of values that satisfy all editing constraints, (ii) a model for latent indicators of the variables that are in error, and (iii) a model for the reported responses for variables in error.  This R package implements a version of the model that uses mixtures of multivariate normal distributions for the underlying true values and uniform distributions for measurement errors.  The <a href="http://cran.r-project.org/web/packages/EditImputeCont/index.html">package is available on CRAN</a>.</p> </div></div></div><div class="field field-name-field-external-link field-type-link-field field-label-above"><div class="field-label">External Link:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="https://CRAN.R-project.org/package=EditImputeCont" target="_blank">Package on CRAN</a></div><div class="field-item odd"><a href="http://sites.duke.edu/tcrn/research-projects/downloadable-software/" target="_blank">Description at TCRN</a></div><div class="field-item even"><a href="https://doi.org/10.1080/01621459.2015.1040881" target="_blank">Paper</a></div><div class="field-item odd"><a href="https://github.com/QuanliWang/EditImputeCont" target="_blank">Github</a></div></div></div><div class="field field-name-field-nodes field-type-entityreference field-label-above"><div class="field-label">Nodes:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/node/duke-university-national-institute-statistical-sciences-niss">Duke University / National Institute of Statistical Sciences (NISS)</a></div></div></div><div class="field field-name-field-tags field-type-taxonomy-term-reference field-label-above"><div class="field-label">Tags:&nbsp;</div><div class="field-items"><div class="field-item even"><a href="/tags/bayesian-hierarchical-model" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Bayesian hierarchical model</a></div><div class="field-item odd"><a href="/tags/edit-procedures" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">edit procedures</a></div><div class="field-item even"><a href="/tags/imputation" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">imputation</a></div><div class="field-item odd"><a href="/tags/constraints" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">constraints</a></div></div></div> Fri, 12 Jun 2015 14:09:24 +0000 Vilhuber 2022 at https://www.ncrn.info