Table of Contents
Fetching ...

A Multi-Label Temporal Convolutional Framework for Transcription Factor Binding Characterization

Pietro Demurtas, Ferdinando Zanchetta, Giovanni Perini, Rita Fioresi

Abstract

Transcription factors (TFs) regulate gene expression through complex and co-operative mechanisms. While many TFs act together, the logic underlying TFs binding and their interactions is not fully understood yet. Most current approaches for TF binding site prediction focus on individual TFs and binary classification tasks, without a full analysis of the possible interactions among various TFs. In this paper we investigate DNA TF binding site recognition as a multi-label classification problem, achieving reliable predictions for multiple TFs on DNA sequences retrieved in public repositories. Our deep learning models are based on Temporal Convolutional Networks (TCNs), which are able to predict multiple TF binding profiles, capturing correlations among TFs andtheir cooperative regulatory mechanisms. Our results suggest that multi-label learning leading to reliable predictive performances can reveal biologically meaningful motifs and co-binding patterns consistent with known TF interactions, while also suggesting novel relationships and cooperation among TFs.

A Multi-Label Temporal Convolutional Framework for Transcription Factor Binding Characterization

Abstract

Transcription factors (TFs) regulate gene expression through complex and co-operative mechanisms. While many TFs act together, the logic underlying TFs binding and their interactions is not fully understood yet. Most current approaches for TF binding site prediction focus on individual TFs and binary classification tasks, without a full analysis of the possible interactions among various TFs. In this paper we investigate DNA TF binding site recognition as a multi-label classification problem, achieving reliable predictions for multiple TFs on DNA sequences retrieved in public repositories. Our deep learning models are based on Temporal Convolutional Networks (TCNs), which are able to predict multiple TF binding profiles, capturing correlations among TFs andtheir cooperative regulatory mechanisms. Our results suggest that multi-label learning leading to reliable predictive performances can reveal biologically meaningful motifs and co-binding patterns consistent with known TF interactions, while also suggesting novel relationships and cooperation among TFs.
Paper Structure (15 sections, 5 equations, 10 figures, 7 tables)

This paper contains 15 sections, 5 equations, 10 figures, 7 tables.

Figures (10)

  • Figure 1: E2F4-DP2-DNA complex zheng1999structural
  • Figure 2: Multiple TFs binding to DNA
  • Figure 3: Causal convolutions by the use of padding; on the left 1D convolution with "valid" padding, on the right 1D convolution with left padding enforcing causality kondratyuk2021movinets.
  • Figure 4: A residual block
  • Figure 5: Temporal Convolutional Networks as proposed by Bai et al.bai2018empirical.
  • ...and 5 more figures