Biochemical constraints on genes involved in early embryonic development.

Sandeep Venkataram is a graduate student in the Petrov lab.

Post author Sandeep Venkataram is a graduate student in the Petrov lab.

Chemical reactions form the foundation of life, yet such elementary activities are rarely considered when trying to understand higher-level processes, such as embryonic development. Nevertheless, as recently shown by Artieri and Fraser (MBE 2014), limitations on the kinetics of gene expression strongly constrain the length of highly expressed transcripts during early embryonic development of fruit flies. Furthermore, this phenomenon appears to be a general feature of fruit fly development as it is evolutionarily conserved across a number of species.

The long and short of mRNA transcription

It has long been known that only a portion of the mRNA molecules are used to produce functional proteins – multicellular species contain many long ‘introns’, which must first be transcribed, then spliced out before translation can occur. Introns can be very long, causing transcription of some mRNA molecules to take significant amounts of time: for example, one 2.3 million bp transcript in humans takes over half a day to be produced. This creates a problem as incompletely transcribed mRNA molecules are degraded when DNA is replicated at the beginning of cell division, and the process must begin anew once division is completed. Together, this implies that cell divisions need to be spaced out long enough apart from each other to produce all of the transcripts necessary for the growth of the cell before the next division occurs.

Studies of fruit fly development have shown that zygotes undergo “syncytial division” at the beginning of development, where the DNA within the zygotic nuclei divide every ~10 minutes for 9 cycles, followed by 4 additional progressively lengthening divisions. While most mRNA in the cell at this time are supplied by the mother (maternal mRNA), this also represents the phase during which the zygote begins producing its own mRNA. The extremely rapid cell divisions led Artieri and Fraser to hypothesize that long mRNA molecules transcribed from the zygotic genome may be underrepresented during these early stages of development. Maternal mRNAs, on the other hand, would be unaffected as they are already present in the cell and do not have to be transcribed.

Transcript length vs. developmental timing

The authors classified embryonically expressed genes as “maternal” or “zygotic” depending on whether or not the gene was present as maternal mRNA in unfertilized embryos using published data. They then obtained multiple developmental mRNA expression timecourses and found that long zygotically expressed genes took longer to reach maximal expression levels than short genes – consistent with their inability to be fully transcribed during early development (Figure 1). Furthermore, they were able to use total RNA expression data to detect the presence of incomplete transcripts, indicating that delay was not due to later transcriptional activation, but rather the incomplete production of transcripts.

Modified from Artieri and Fraser 2014 Figure 2B . Long zygotic genes are underexpressed early in the syncytial division phase relative to short genes, but catch up in expression by the end of the syncytial phase while maternally derived transcripts show no such changes.

Figure 1. Long zygotic genes are underexpressed early in the syncytial division phase relative to short genes, but catch up in expression by the end of the syncytial phase while maternally derived transcripts show no such changes. [Modified from Artieri and Fraser 2014 Figure 2B . ]

Using a published set of developmental mRNA expression timecourses from additional Drosophila species, Artieri and Fraser show that these patterns are consistent across all species examined. Finally, they also observed that the introns present in highly expressed zygotic genes appear to be highly evolutionarily constrained in terms of their lengths when compared to either genes maternally deposited or zygotically expressed during later timepoints. This suggests that natural selection has played a role in limiting the expansion of introns in early expressed zygotic genes, allowing them to escape ‘intron delay’.


In summary, Artieri and Fraser have found evidence that a significant fraction of zygotically expressed transcripts in fruit flies are delayed from reaching their maximal levels of expression due to the rapid cell cycles taking place at the beginning of development. This suggests a simple mechanism for developmental timing of zygotic gene expression: genes that are required early must be short, while genes whose expression is needed at a later time can delay their expression via the presence of long introns. While some evidence for the use of intron length as a regulatory mechanism has recently emerged (Takashima et al. 2011), future experiments will be required to determine how widespread is the effect of selection to maintain long lengths and delayed expression.


Carlo G. Artieri and Hunter B. Fraser Transcript Length Mediates Developmental Timing of Gene Expression Across Drosophila. (2014) Molecular Biology and Evolution doi:10.1093/molbev/msu226

Takashima Y, Ohtsuka T, González A, Miyachi H, Kageyama R. Intronic delay is essential for oscillatory expression in the segmentation clock. Proc Natl Acad Sci U S A. 2011;108:3300-3305.

Paper author Carlo Artieri is a postdoctoral fellow in the Fraser lab.

Paper author Carlo Artieri is a postdoctoral fellow in the Fraser lab.


Taking studies of regulatory evolution to the next level: translation

Carlo Artieri, a postdoc in the group of Hunter Fraser, wrote this blog post. The paper is written by Carlo and Hunter.

Carlo Artieri, a postdoc in the group of Hunter Fraser, wrote this blog post. The paper is written by Carlo and Hunter.

Carlo Artieri writes about his new paper: Evolution at two levels of gene expression in yeast which is in press in Genome Research.

Understanding the molecular basis of regulatory variation within and between species has become a major focus of modern genetics. For instance, the majority of identified human disease-risk alleles lie in non-coding regions of the genome, suggesting that they affect gene regulation (Epstein 2009). Furthermore, it has been argued that regulatory changes have played a dominant role in explaining uniquely human attributes (King and Wilson 1975). However, our knowledge of gene regulatory evolution is based almost entirely on studies of mRNA levels, despite both the greater functional importance of protein abundance, and evidence that post-transcriptional regulation is pervasive. The availability of high-throughput methods for measuring mRNA abundance, coupled to the lack of comparable methods at the protein level have contributed to this focus; however, a new method known as ribosome profiling, or ‘riboprofiling’ (Ingolia et al. 2009), has enabled us to study the evolution of translation in much greater detail than was possible before. This method involves the construction of two RNA-seq libraries: one measuring mRNA abundance (the ‘mRNA’ fraction), and the second capturing the portion of the transcriptome that is actively being translated by ribosomes (the ‘Ribo’ fraction). On average, the abundance of genes within the Ribo fraction should be proportional to that of the mRNA fraction. Genes with increased translational efficiency are identified when Ribo fraction abundance is higher than that of the mRNA fraction, whereas reduced translational efficiency is inferred when the opposite is observed.

Riboprofiling of yeast hybrids

We performed riboprofiling on hybrids of two closely related species of budding yeast, Saccharomyces cerevisiae and S. paradoxus, (~5 million years diverged). In hybrids, the parental alleles at a locus share the same trans cellular environment; therefore in the absence of cis-regulatory divergence in transcription, both alleles should be expressed at equal levels. Conversely, cis-regulatory divergence will produce unequal expression of alleles (termed allele-specific expression, or ‘ASE’). Cis-regulatory divergence at the translational level is detected when ASE in the mRNA fraction does not equal that measured in the Ribo fraction, indicating independent divergence across levels. We also performed riboprofiling on the two parental strains, as differences in the expression of orthologs between parental species that cannot be explained by the allelic differences in the hybrids can be attributed to trans divergence. Therefore, by measuring differences in the magnitudes of ASE between the two riboprofiling fractions in the hybrids and the parents, we identified independent cis and trans regulatory changes in both mRNA abundance and translational efficiency.


We found that both cis and trans regulatory divergence in translational efficiency is widespread, and of comparable magnitude to divergence at the mRNA level – indicating that we miss much regulatory evolution by focusing on mRNA in isolation. Moreover, we observed an overwhelming bias towards divergence in opposing parental directions, indicating that while many orthologs had higher mRNA abundance in one parent, they often showed increased translational efficiency in the other parent. This suggests that stabilizing selection acts to maintain more similar protein levels between species than would be expected by comparing mRNA abundances alone.

Translational divergence not associated with TATA boxes

Interestingly, while we confirmed the results of previous studies indicating that both cis and trans regulatory divergence at the mRNA level are associated with the presence of TATA boxes and nucleosome free regions in promoters, no such relationship was found for translational divergence, indicating that these regulatory systems have different underlying architectures.

Evidence for polygenic selection at two levels

We also searched for evidence of polygenic selection in and between both regulatory levels by applying a recently developed modification of Orr’s sign test (Orr 1998; Fraser et al. 2010; Bullard et al. 2010). Under neutral divergence, no pattern is expected with regards to the parental direction of up or down-regulating alleles among orthologs within a functional group (e.g., a pathway or multi-gene complex). However, a significant bias towards one parental lineage is evidence of lineage-specific selection. This analysis uncovered evidence of polygenic selection at both regulatory levels in a number of functional groups. In particular, genes involved in tolerance to heavy metals were enriched for reinforcing divergence in mRNA abundance and translation favoring S. cerevisiae. Increased tolerance to these metals has been observed in S. cerevisiae (Warringer et al. 2011), suggesting that domesticated yeasts have experienced a history of polygenic adaptation across regulatory levels allowing them to grow on metals such as copper.

Finally, using data from the Ribo fraction, we also uncovered multiple instances of conserved stop-codon readthrough, a mechanism via which the ribosome ‘ignores’ the canonical stop codon and produces a C-terminally extended peptide. Only two cases of C-terminal extensions have previously been observed in yeast, though in one such case, PDE2, extension of the canonical protein plays a functional role in regulating cAMP levels (Namy et al. 2002). Our data suggests that this mechanism may occur in dozens of genes, highlighting yet another post-transcriptional mechanism leading to increased proteomic diversity.


By applying a novel approach to a long-standing question, our analysis has revealed that post-transcriptional regulation is abundant, and likely as important as transcriptional regulation. We argue that partitioning the search for the locus of selection into the binary categories of ‘coding’ vs. ‘regulatory’ overlooks the many opportunities for selection to act at multiple regulatory levels along the path from genotype to phenotype.


Artieri CG, Fraser HB. 2013. Evolution at two levels of gene expression in yeast. Genome Research (in press).
Preprint on the arXiv. 

Bullard JH, Mostovoy Y, Dudoit S, Brem RB. 2010. Polygenic and directional regulatory evolution across pathways in Saccharomyces. Proc Natl Acad Sci USA 107: 5058-5063.

Epstein DJ. 2009. Cis-regulatory mutations in human disease. Brief Funct Genomic Proteomic 8: 310–316.

Fraser HB, Moses AM, Schadt EE. 2010. Evidence for widespread adaptive evolution of gene expression in budding yeast. Proc Natl Acad Sci USA 107: 2977-2982.

Ingolia NT, Ghaemmaghami S, Newman JR, Weissman JS. 2009. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324:218-223.

King MC, Wilson AC. 1975. Evolution at two levels in humans and chimpanzees. Science 188: 107-116.

Namy O, Duchateau-Nguyen G, Rousset JP. 2002. Translational readthrough of the PDE2 stop codon modulates cAMP levels in Saccharomyces cerevisiae. Mol Microbiol 43: 641-652.

Orr HA. 1998. Testing natural selection vs. genetic drift in phenotypic evolution using quantitative trait locus data. Genetics 149: 2099-2104.

Warringer J, Zörgö E, Cubillos FA, Zia A, Gjuvsland A, Simpson JT, Forsmark A, Durbin R, Omholt SW, Louis EJ, Liti G, Moses A, Blomberg A. 2011. Trait variation in yeast is defined by population history. PLoS Genet 7 :e1002111.