Predicting stimulation-dependent 1 enhancer-promoter interactions from 2 ChIP-Seq time course data

Research output: Contribution to journalArticle

  • External authors:
  • Tomasz Dzida
  • Mudassar Iqbal
  • Iryna Charapitsa
  • Henk Stunnenberg
  • Filomena Matarese
  • Antti Honkela

Abstract

We have developed a machine learning approach to predict stimulation-dependent enhancer-promoter interactions using evidence from changes in genomic protein occupancy over time. The occupancy of estrogen receptor alpha (ERa), RNA polymerase (Pol II) and histone marks H2AZ and H3K4me3 were measured over time using ChIP-Seq experiments in MCF7 cells stimulated with estrogen. A Bayesian classifier was developed which uses the correlation of temporal binding patterns at enhancers and promoters and genomic proximity as features to predict interactions. This method was trained using experimentally determined interactions from the same system and was shown to achieve much higher precision than predictions based on the genomic proximity of nearest ERa binding. We use the method to identify a genome-wide confident set of ERa target genes and their regulatory enhancers genome-wide. Validation with publicly available GRO-Seq data demonstrates that our predicted targets are much more likely to show early nascent transcription than predictions based on genomic ERa binding proximity alone.

Bibliographical metadata

Original languageEnglish
JournalPeerJ
StateAccepted/In press - 7 Aug 2017