PUBLICATION

Metrics of sequence constraint overlook regulatory sequences in an exhaustive analysis at phox2b

Authors
McGaughey, D.M., Vinton, R.M., Huynh, J., Al-Saif, A., Beer, M.A., and McCallion, A.S.
ID
ZDB-PUB-071219-7
Date
2008
Source
Genome research   18(2): 252-260 (Journal)
Registered Authors
McCallion, Andy
Keywords
none
MeSH Terms
  • Animals
  • Base Composition
  • Base Sequence
  • Computational Biology/methods*
  • Evolution, Molecular*
  • Gene Components
  • Homeodomain Proteins/genetics*
  • Homeodomain Proteins/metabolism
  • In Situ Hybridization
  • Molecular Sequence Data
  • Neurons/metabolism
  • Regulatory Sequences, Nucleic Acid/genetics*
  • Sequence Analysis, DNA
  • Transcription Factors/genetics*
  • Transcription Factors/metabolism
  • Zebrafish/genetics*
PubMed
18071029 Full text @ Genome Res.
Abstract
Despite its recognized utility, the extent to which evolutionary sequence conservation-based approaches may systematically overlook functional noncoding sequences remains unclear. We have tiled across sequence encompassing the zebrafish phox2b gene, ultimately evaluating 48 amplicons corresponding to all noncoding sequences therein for enhancer activity in zebrafish. Post hoc analyses of this interval utilizing five commonly used measures of evolutionary constraint (AVID, MLAGAN, SLAGAN, phastCons, WebMCS) demonstrate that each systematically overlooks regulatory sequences. These established algorithms detected only 29%-61% of our identified regulatory elements, consistent with the suggestion that many regulatory sequences may not be readily detected by metrics of sequence constraint. However, we were able to discriminate functional from nonfunctional sequences based upon GC composition and identified position weight matrices (PWM), demonstrating that, in at least one case, deleting sequences containing a subset of these PWMs from one identified regulatory element abrogated its regulatory function. Collectively, these data demonstrate that the noncoding functional component of vertebrate genomes may far exceed estimates predicated on evolutionary constraint.
Genes / Markers
Figures
Show all Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping