SMITE: an R/Bioconductor package that identifies network modules by integrating genomic and epigenomic information.

Citation
Wijetunga, A., et al. “SMITE: An R/Bioconductor Package That Identifies Network Modules by Integrating Genomic and Epigenomic Information.” BMC Bioinformatics, vol. 18, no. 1, 2017, p. 41.
Center
Albert Einstein College of Medicine
Author
Ari Wijetunga, Andrew D Johnston, Ryo Maekawa, Fabien Delahaye, Netha Ulahannan, Kami Kim, John M Greally
Keywords
bioinformatics, epigenetic, gene expression, Genomic, Interaction network, Modules
Abstract

BACKGROUND: The molecular assays that test gene expression, transcriptional, and epigenetic regulation are increasingly diverse and numerous. The information generated by each type of assay individually gives an insight into the state of the cells tested. What should be possible is to add the information derived from separate, complementary assays to gain higher-confidence insights into cellular states. At present, the analysis of multi-dimensional, massive genome-wide data requires an initial pruning step to create manageable subsets of observations that are then used for integration, which decreases the sizes of the intersecting data sets and the potential for biological insights. Our Significance-based Modules Integrating the Transcriptome and Epigenome (SMITE) approach was developed to integrate transcriptional and epigenetic regulatory data without a loss of resolution.

RESULTS: SMITE combines p-values by accounting for the correlation between non-independent values within data sets, allowing genes and gene modules in an interaction network to be assigned significance values. The contribution of each type of genomic data can be weighted, permitting integration of individually under-powered data sets, increasing the overall ability to detect effects within modules of genes. We apply SMITE to a complex genomic data set including the epigenomic and transcriptomic effects of Toxoplasma gondii infection on human host cells and demonstrate that SMITE is able to identify novel subnetworks of dysregulated genes. Additionally, we show that SMITE outperforms Functional Epigenetic Modules (FEM), the current paradigm of using the spin-glass algorithm to integrate gene expression and epigenetic data.
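The abstract does not give the formula used to combine p-values, so the following is only an illustrative sketch of the general idea (weighted combination with a correction for correlated inputs), not SMITE's actual method or API. It assumes a weighted Stouffer Z-score combination in which the variance of the weighted sum is adjusted by a correlation matrix; the function name `combine_pvalues` and its parameters are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def combine_pvalues(pvalues, weights, corr=None):
    """Combine one-sided p-values with a weighted Stouffer Z-score method.

    Illustrative assumption, not the SMITE implementation.
    pvalues: p-values strictly between 0 and 1, one per data type.
    weights: non-negative weights, one per p-value (not all zero).
    corr:    optional correlation matrix between the underlying Z-scores;
             when omitted, the inputs are treated as independent.
    """
    n = len(pvalues)
    if len(weights) != n:
        raise ValueError("pvalues and weights must have the same length")
    if corr is None:
        # Identity matrix: independent inputs.
        corr = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

    # Convert each p-value to a Z-score (small p gives large positive Z).
    z_scores = [_STD_NORMAL.inv_cdf(1.0 - p) for p in pvalues]

    # Weighted sum of Z-scores.
    numerator = sum(w * z for w, z in zip(weights, z_scores))

    # Variance of the weighted sum, including the correlation terms.
    variance = sum(
        weights[i] * weights[j] * corr[i][j]
        for i in range(n)
        for j in range(n)
    )

    combined_z = numerator / sqrt(variance)
    return 1.0 - _STD_NORMAL.cdf(combined_z)
```

Under these assumptions, two individually modest p-values from independent data types combine into a smaller p-value, while positive correlation between them inflates the variance term and makes the combined result less extreme. This mirrors the abstract's point that individually under-powered data sets can be integrated without overstating the evidence from non-independent values.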

CONCLUSIONS: SMITE represents a flexible, scalable tool that allows integration of transcriptional and epigenetic regulatory data from genome-wide assays to boost confidence in finding gene modules reflecting altered cellular states.

Year of Publication
2017
Journal
BMC Bioinformatics
Volume
18
Issue
1
Number of Pages
41
Date Published
01/2017
ISSN Number
1471-2105
DOI
10.1186/s12859-017-1477-3
Alternate Journal
BMC Bioinformatics
PMID
28100166
PMCID
PMC5242055