Imputation

355 papers with code • 4 benchmarks • 12 datasets

Substituting missing data with values according to some criteria.

Libraries

Use these libraries to find Imputation models and implementations
15 papers
763
5 papers
1,162
4 papers
278

Most implemented papers

Input Convex Neural Networks

locuslab/icnn ICML 2017

We show that many existing neural network architectures can be made input-convex with a minor modification, and develop specialized optimization algorithms tailored to this setting.

MIDA: Multiple Imputation using Denoising Autoencoders

harry24k/mida-pytorch 8 May 2017

Missing data is a significant problem impacting all domains.

Variational Autoencoder with Arbitrary Conditioning

tigvarts/ucm ICLR 2019

We propose a single neural probabilistic model based on variational autoencoder that can be conditioned on an arbitrary subset of observed features and then sample the remaining features in "one shot".

On the consistency of supervised learning with missing values

nprost/supervised_missing 19 Feb 2019

A striking result is that the widely-used method of imputing with a constant, such as the mean prior to learning is consistent when missing values are not informative.

Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

boschresearch/continuous-recurrent-units 17 May 2019

In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors.

GP-VAE: Deep Probabilistic Time Series Imputation

ratschlab/GP-VAE 9 Jul 2019

Multivariate time series with missing values are common in areas such as healthcare and finance, and have grown in number and complexity over the years.

Bayesian Temporal Factorization for Multidimensional Time Series Prediction

xinychen/transdim 14 Oct 2019

In this paper, we propose a Bayesian temporal factorization (BTF) framework for modeling multidimensional time series -- in particular spatiotemporal data -- in the presence of missing values.

Lung Segmentation from Chest X-rays using Variational Data Imputation

raghavian/lungVAE 20 May 2020

Pulmonary opacification is the inflammation in the lungs caused by many respiratory ailments, including the novel corona virus disease 2019 (COVID-19).

A Review of Deep Learning Methods for Irregularly Sampled Medical Time Series Data

SunChenxiSCX/ISMTS-Review 23 Oct 2020

Developing deep learning methods on EHRs data is critical for personalized treatment, precise diagnosis and medical management.

Geometry- and Accuracy-Preserving Random Forest Proximities

kevinmoonlab/rf-gap 29 Jan 2022

Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning.