Missing value estimation methods for DNA microarrays
Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., and Altman, R. B. (2001). Missing value estimation methods for dna microarrays. Bioinformatics, 17(6):520–525. Permanent link to paper
SVD, KNN and row average imputation are evaluated with different parameter settings on real data sets with regard to robustness, sensitivity and accuracy.
3 Study outcomes
3.1 Outcome O1
Rank of performance: KNN, SVD, row average, zero filling.
3.2 Outcome O2
"KNN is relatively insensitive to .. K within the range of k=10-20" (Figure 1)
3.3 Outcome O3
SVD "is sensitive to the type of data" and "is ideally suited .. in terms of .. constituent patterns"
4 Study design and evidence level
Just 4 imputation algorithms (SVD,KNN) are evaluated from which 2 are singular value substitutions (average,zero).
Analysis is performed over a broad range of hyperparameters (KNN: k=[1,1000], SVD: Eigengenes=[5,30]).
The imputation methods are only analyzed on data with less than 20%.