Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factors

J Mol Biol. 2001 May 25;309(1):99-120. doi: 10.1006/jmbi.2001.4650.

Abstract

The processes that take place during development and differentiation are directed through coordinated regulation of expression of a large number of genes. One such gene regulatory network provides cell cycle control in eukaryotic organisms. In this work, we have studied the structural features of the 5' regulatory regions of cell cycle-related genes. We developed a new method for identifying composite substructures (modules) in regulatory regions of genes consisting of a binding site for a key transcription factor and additional contextual motifs: potential targets for other transcription factors that may synergistically regulate gene transcription. Applying this method to cell cycle-related promoters, we created a program for context-specific identification of binding sites for transcription factors of the E2F family which are key regulators of the cell cycle. We found that E2F composite modules are found at a high frequency and in close proximity to the start of transcription in cell cycle-related promoters in comparison with other promoters. Using this information, we then searched for E2F sites in genomic sequences with the goal of identifying new genes which play important roles in controlling cell proliferation, differentiation and apoptosis. Using a chromatin immunoprecipitation assay, we then experimentally verified the binding of E2F in vivo to the promoters predicted by the computer-assisted methods. Our identification of new E2F target genes provides new insight into gene regulatory networks and provides a framework for continued analysis of the role of contextual promoter features in transcriptional regulation. The tools described are available at http://compel.bionet.nsc.ru/FunSite/SiteScan.html.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Binding Sites
  • Cell Cycle / genetics*
  • Cell Cycle Proteins*
  • Chromatin / genetics
  • Chromatin / metabolism
  • Computational Biology / methods*
  • Cross-Linking Reagents
  • DNA-Binding Proteins*
  • Databases as Topic
  • E2F Transcription Factors
  • Formaldehyde
  • Gene Expression Regulation*
  • Gene Frequency
  • Genes, cdc*
  • Humans
  • Internet
  • Nucleolin
  • Phosphoproteins / genetics
  • Phylogeny
  • Precipitin Tests
  • Promoter Regions, Genetic / genetics
  • RNA-Binding Proteins / genetics
  • Reproducibility of Results
  • Response Elements / genetics*
  • Sensitivity and Specificity
  • Software
  • Transcription Factors / metabolism*
  • Transcription, Genetic / genetics

Substances

  • Cell Cycle Proteins
  • Chromatin
  • Cross-Linking Reagents
  • DNA-Binding Proteins
  • E2F Transcription Factors
  • Phosphoproteins
  • RNA-Binding Proteins
  • Transcription Factors
  • Formaldehyde