DMRcaller: a versatile R/Bioconductor package for detection and visualization of differentially methylated regions in CpG and non-CpG contexts
1 DMRcaller: a versatile R/Bioconductor package for detection and visualization of differentially methylated regions in CpG and non-CpG contexts
Catoni M, Tsang JM, Greco AP, Zabet NR DMRcaller: a versatile R/Bioconductor package for detection and visualization of differentially methylated regions in CpG and non-CpG contexts. Nucleic Acids Res. 2018 Nov 2;46(19):e114
The paper considers identification of differentially methylated regions (DMRs) from bisulfite sequencing data (BSSEQ). A new package (DMRcaller) is introduced. The DMRcaller package allows choosing between three conceptional approaches for merging information from individuals cytosines (analysis of individual cytosines, of pooled tiling-intervals, or of smoothed data). Three test statistics can be chosen (Fisher's Exact test, score-test, beta regression). Differential positions/bins/regions are selected based on several requirements e.g. significance<threshold, methylation difference>threshold, coverage>threshold.
Publicly available data from Arabidopsis thaliana (GSM2384978, GSM2384980, [20,26]), rice (GSM560563, GSM560562, ), and human IMR90 and H1 cell lines  were analyzed.
The performance of the DMRcaller was assessed by comparing with 2-4 other methods (methylKit, methylSig and partly with methylPipe and DSS).
1.2 Study outcomes
1.2.1 Outcome O1
For Arabidopsis and CpGs, the performances of DMR-caller-B (B denotes "binning") and DMR-caller-NF (NF denotes "noise-filter"=smoothing) are found as superior to methlySig and methylKit in terms of (maximal) genome coverage of the predicted DMRs. DMR-caller-NF exhibits the best overall performance.
Outcome O1 is presented as Figure 2 in the original publication.
1.2.2 Outcome O2
For the comparison of CpG methylation of two human cell lines, the performances of DMRcaller-NF and DMR-caller-B were found as superior to methylSig and methylKit.
Outcome O2 is presented as Figure 3 in the original publication.
1.2.3 Outcome O3
For non-CpG contexts, the following results were obtained:
- For CpHpG, DMRcaller-NF and DMR-caller-B are superior to methylSig and methylKit
- For CpHpH, DMRcaller-NF has a very weak performance. In this context, methylSig or DMRcaller-B have best performances (depending on the window size).
Outcomes O3 is presented as Figure 5 in the original publication.
1.2.4 Outcome O4
As validation, BSSEQ data from rice comparing endosperm to embryo was used to evaluate the DMRs predicted by DMCcaller-B with gene expression differences. DMCcaller-B outperformed methylKit and methylSig in terms of total number of predicted DMRs and overlap of DMRs with regulated genes.
Outcomes O4 are presented as Figure 6 and Supplementary Figure S8 in the original publication.
1.2.5 Outcome O5
Outcomes O1-O4 where obtained without replicate measurements.
- For Arabidopsis and CpGs DMR-caller-NF and DMR-caller-BR show superior performances in presence of a duplicate measurements in both groups.
Outcomes O5 are presented as Figure 7 in the original publication.
1.2.6 Further outcomes
- Boxplots for the run-times of DMRcaller-B, DMRcaller-NF, MethylKit, MethylSig, MethylPipe are provided. MithylKit and MethylSig are fastest.
- It was observed that the difference between the best method (DMR-caller-NF) and other methods is mainly the length of the predicted DMRs, i.e. in most cases the same regions were found but DMR-caller-NF predicts larger intervals as DMRs.
DMRs which are only predicted by DMRcaller-NF are short indicating enhanced abilities of DMRcaller-NF to identify short DMRs.
- For the comparison H1 vs. IMR90, the overlap between the DMR predictions of the different methods was between 62% and 78% (Supplementary Figure S7A).
- methylPipe could not be applied partly because methyPipe "cannot call DMRs that contain less than five differentially methylated CpGs" and could therefore not be applied for scrambled data. The overlap of methylPipe with other methods is rather small and shown as Supplementary Figure S7.
1.3 Study design and evidence level
1.3.1 General aspects
The paper presents a new approach (DMRcaller) and at the same times provides several analyses for comparing the performance of the new approach with existing algorithms. Such a study setting is very frequent in pratice but has a high risk for biased outcomes. One reason for such a bias might be that typically application examples are selected which nicely demonstrates performance benefits. Moreover, new approaches are often established if existing methods have minor performance in a new application setup. In such settings, it remains rather unclear how performance assessment translates to new application settings.
The predicted DMRs and the performance of the individual approaches depend on the choice of thresholds and configuration parameters. These dependencies are not evaluated. The following configuration parameters for DMCcaller are mentioned in the paper:
- the minimum difference between the methylation proportions
- the significance level for the statistical test
- the minimal average coverage level
- the minimum lenght of DMRs (smaller DMRs are removed)
- the minimum number of cytosines (DMRS with less are removed)
It seems that the choices for these thresholds are not stated in the paper.
The study designs for describing specific outcome (O1-O3) are listed in the following subsections:
1.3.2 Design for Outcome O1
- Wildtype was compared with methyltransferase knockouts and it was assumed that this leads to methylation differences.
The authors conclude that all predicted DMRs for CpG context are therefore "true positives" which seems a very questionable assumption because unmethylated regions remain unmethylated.
- The authors assess performance in terms of genome coverage of the predicted DMRs, i.e. in terms of number and size of the predicted DMRs.
- Because the authors assume that there are no regions with the same methylation level, the performance does not assess false-positives. The more DMRs, the better the performance.
Therefore an increasing number of predicted DMRs always increases the performance.
- The dependency of the bin size or of the width of smoothing windows is evaluated by plotting coverage over window-/bin size.
1.3.3 Design for Outcome O2
- A distince data set as for outcome O1 and O3 has been analyzed, namely ...
- To evaluate occurance of false-positives, random permutations of the genome positions of the individual CpGs was used which prevents occurance of DMRs (if the overall occurance of differential methylation in the original data set is small enough).
A "scrambled data set" was generated by random permutation of the genomic positions of a The difference between of the DMR genome coverage calculated for real and scrambled data has been used for assessing the accuracy.
1.3.4 Design for Outcome O3
- The results were obtained for the comparison of Arabidopsis thaliana wildtype and a quadruple mutant
termed ddcc(drm1 drm2 cmt3 cmt2) which leads to complete loss of methylation in CpG, CpHpG and CpHpH contexts.
- The analysis was done for CpHpG (sometimes also termed CHG) and CpHpH (sometimes also called CHH) contexts.
1.3.5 Design for Outcome O4
- The level of overlap of CpHpG DMRs with 165 genes that were upregulated in endosperm and 153 genes are upregulated in embryo according to the publication where the data comes from was used to assess performance.
1.3.6 Design for Outcome O5
- Duplicates of Arabidopsis thaliana wildtype and met1-3 mutants are analyzed
- One replicate was taken from  and the second biological replicate from 
- Only chromosome 1 was analyzed because of huge computation times
- For this analysis, no "scrambled" data was used to account for the false-positives (like for O1)
- Only CpG methylation was considered.
- An additional method (DSS) was used for comparison in this analysis. This method, however, exhibited minor performance.
1.4 Further comments and aspects
The choice for using methylSig and methylKit as reference is that only these tools (and methylPipe) can handle non-CpG sequence contexts.
 Feng,H., Conneely,K.N. and Wu,H. (2014) A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data. Nucleic Acids Res., 42, e69
 Lister,R., Pelizzola,M., Dowen,R.H., Hawkins,R.D., Hon,G., Tonti-Filippini,J., Nery,J.R., Lee,L., Ye,Z., Ngo,Q.-M. et al. (2009) Human DNA methylomes at base resolution show widespread epigenomic differences. Nature, 462, 315–322.
 Stroud,H., Greenberg,M., Feng,S., Bernatavichute,Y. and Jacobsen,S. (2013) Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell, 152, 352–364.
 Catoni,M., Griffths,J., Becker,C., Zabet,N.R., Bayon,C., Dapp,M., Lieberman-Lazarovich,M., Weigel,D. and Paszkowski,J. (2017) DNA sequence properties that determine susceptibility to epiallelic switching. EMBO J., 36, 617–628
 Zemach,A., Kim,M.Y., Silva,P., Rodrigues,J.A., Dotson,B., Brooks,M.D. and Zilberman,D. (2010) Local DNA hypomethylation activates genes in rice endosperm. Proc. Natl. Acad. Sci. U.S.A., 107, 18729–18734.