Missing value estimation methods for DNA microarrays


1 Citation

Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., and Altman, R. B. (2001). Missing value estimation methods for dna microarrays. Bioinformatics, 17(6):520–525. Permanent link to paper

2 Summary

SVD, KNN and row average imputation are evaluated with different parameter settings on real data sets with regard to robustness, sensitivity and accuracy.

3 Study outcomes

3.1 Outcome O1

Rank of performance: KNN, SVD, row average, zero filling.

3.2 Outcome O2

"KNN is relatively insensitive to .. K within the range of k=10-20" (Figure 1)

3.3 Outcome O3

SVD "is sensitive to the type of data" and "is ideally suited .. in terms of .. constituent patterns"

4 Study design and evidence level

Just 4 imputation algorithms (SVD,KNN) are evaluated from which 2 are singular value substitutions (average,zero).

Analysis is performed over a broad range of hyperparameters (KNN: k=[1,1000], SVD: Eigengenes=[5,30]).

The imputation methods are only analyzed on data with less than 20%.

5 References