The distribution of SNPs in human gene regulatory regions

BMC Genomics. 2005 Oct 6:6:140. doi: 10.1186/1471-2164-6-140.

Abstract

Background: As a result of high-throughput genotyping methods, millions of human genetic variants have been reported in recent years. To efficiently identify those with significant biological functions, a practical strategy is to concentrate on variants located in important sequence regions such as gene regulatory regions.

Results: Analysis of the most common type of variant, single nucleotide polymorphisms (SNPs), shows that in gene promoter regions more SNPs occur in close proximity to transcriptional start sites than in regions further upstream, and a disproportionate number of those SNPs represent nucleotide transversions. Additionally, the number of SNPs found in the predicted transcription factor binding sites is higher than in non-binding site sequences.

Conclusion: Current information about transcription factor binding site sequence patterns may not be exhaustive, and SNPs may be actively involved in influencing gene expression by affecting the transcription factor binding sites.

MeSH terms

  • Binding Sites
  • Contig Mapping
  • Databases, Genetic
  • Gene Expression Regulation
  • Genomics / methods*
  • Humans
  • Models, Genetic*
  • Polymorphism, Genetic
  • Polymorphism, Single Nucleotide*
  • Promoter Regions, Genetic
  • Regulatory Sequences, Nucleic Acid
  • Sequence Analysis, DNA
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Transcription, Genetic

Substances

  • Transcription Factors