Читайте также: |
|
The most surprising discovery about the human genome was that the majority of the functional sequence does not encode proteins. These features had been missed by decades of molecular biology, because scientists had no clue where to look.
Comparison of the human and mouse genomes showed a substantial excess of conserved sequence, relative to the neutral rate in ancestral repeat elements4. The excess implied that at least 6% of the human genome was under purifying selection over the past 100 million years and thus biologically functional. Protein-coding sequences, which comprise only ~1.5% of the genome, are thus dwarfed by functional conserved non-coding elements (CNEs). Subsequent comparison with the rat and dog genomes confirmed these findings5, 6.
Although the initial analysis provided a bulk estimate of the amount of conserved sequence, it could only pinpoint the most highly conserved elements. Among them are nearly 500 ultraconserved elements (200 bases or more perfectly conserved across human, mouse and rat), most of which neither overlap protein-coding exons nor show evidence of being transcribed22. On the basis of statistical measures of constraint, tens of thousands of additional highly conserved non-coding elements (HCNEs) were identified5, 23. In many cases the evolutionary origins of these HCNEs could be traced back to the common ancestor of human and fish. HCNEs preferentially reside in the gene deserts that often flank genes with key functions in embryonic development5, 22, 23. Large-scale screens of these sequences in transgenic mice revealed that they are highly enriched in tissue-specific transcriptional enhancers active during embryonic development24, revealing a stunning complexity of the gene regulatory architecture active in early development24 (Fig. 1).
Figure 1: Evolutionary conservation maps.
Comparison among the human, mouse, rat and dog genomes helps identify functional elements in the genome. The figure shows the density of protein-coding sequences (red) and the most highly conserved non-coding sequences (blue) along chromosome 3. Highly conserved non-coding sequences are enriched in gene-poor regions, each of which contained a gene involved in early development (such as SATB1, shown). Images courtesy of iStock Photo.
Sequencing additional genomes has gradually increased our power to pinpoint the less stringently conserved CNEs. Recent comparison with 29 mammalian genomes has identified millions of additional conserved elements, comprising about two-thirds of the total conserved sequence.
Evolutionary analyses of CNEs have also enabled the discovery of distinct types of functional elements, including regulatory motifs present in the promoters and untranslated regions of co-regulated genes, insulators that constrain domains of gene expression, and families of conserved secondary structures in RNAs. Nonetheless, the function of most CNEs remains to be discovered.
Дата добавления: 2015-10-24; просмотров: 146 | Нарушение авторских прав
<== предыдущая страница | | | следующая страница ==> |
Genome sequencing | | | Організація обліку виконання бюджету у фінансових органах |