Journal names have been intentionally excluded. Lab member names are in bold. Links are provided to the published article and when possible an open-access version (pre- or post-print) of the paper. Click on a triangle for the article abstract.
What is the genetic architecture of local adaptation and what is the geographic scale that it operates over? We investigated patterns of local and convergent adaptation in five sympatric population pairs of traditionally cultivated maize and its wild relative teosinte (Zea mays subsp. parviglumis). We found that signatures of local adaptation based on the inference of adaptive fixations and selective sweeps are frequently exclusive to individual populations, more so in teosinte compared to maize. However, for both maize and teosinte, selective sweeps are frequently shared by several populations, and often between the subspecies. We were further able to infer that selective sweeps were shared among populations most often via migration, though sharing via standing variation was also common. Our analyses suggest that teosinte has been a continued source of beneficial alleles for maize, post domestication, and that maize populations have facilitated adaptation in teosinte by moving beneficial alleles across the landscape. Taken together, out results suggest local adaptation in maize and teosinte has an intermediate geographic scale, one that is larger than individual populations, but smaller than the species range.
Inbreeding depression is the reduction in fitness and vigor resulting from mating of close relatives observed in many plant and animal species. The extent to which the genetic load of mutations contributing to inbreeding depression is due to rare large-effect variation versus potentially more common variants with very small individual effects is unknown and may be affected by population history. We compared the effects of outcrossing and self-fertilization on 18 traits in a landrace population of maize, which underwent a population bottleneck during domestication, and a neighboring population of its wild relative teosinte. Inbreeding depression was greater in maize than teosinte for 15 of 18 traits, congruent with the greater segregating genetic load predicted from sequence data in the maize population. For many traits - and more commonly in maize - genetic variation among self-fertilized families was less than expected based on additive and dominance variance estimated in outcrossed families, suggesting that a negative covariance between additive and homozygous dominance effects limits the variation available to selection under partial inbreeding. We identified quantitative trait loci (QTL) representing large-effect rare variants carried by only a single parent, which were more important in teosinte than maize. Teosinte also carried more putative juvenile-acting lethal variants identified by segregation distortion. These results suggest a mixture of mostly polygenic, small-effect recessive variation underlying inbreeding depression, with an additional contribution from rare larger-effect variants that was more important in teosinte but depleted in maize following to the domestication bottleneck. Purging associated with the maize domestication bottleneck may have selected against large effect variants, but polygenic load is harder to purge and segregating mutational burden increased in maize compared to teosinte.
Recognition of the important role of transposable elements (TEs) in eukaryotic genomes quickly led to a burgeoning literature modeling and estimating the effects of selection on TEs. Much of the empirical work on selection has focused on analyzing the site frequency spectrum (SFS) of TEs. But TEs differ from standard evolutionary models in a number of ways that can impact the power and interpretation of the SFS. For example, rather than mutating under a clock-like model, transposition often occurs in bursts which can inflate particular frequency categories compared to expectations under a standard neutral model. If a TE burst has been recent, the excess of low frequency polymorphisms can mimic the effect of purifying selection. Here, we investigate how transposition bursts affect the frequency distribution of TEs and the correlation between age and allele frequency. Using information on the TE age distribution, we propose an age-adjusted site frequency spectrum to compare TEs and neutral polymorphisms to more effectively evaluate whether TEs are under selective constraints. We show that our approach can minimize instances of false inference of selective constraint, but also allows for a correct identification of even weak selection affecting TEs which experienced a transposition burst and is robust to at least simple demographic changes. The results presented here will help researchers working on TEs to more reliably identify the effects of selection on TEs without having to rely on the assumption of a constant transposition rate.
Genotype by environment interactions are a significant challenge for crop breeding as well as being important for understanding the genetic basis of environmental adaptation. In this study, we analyzed genotype by environment interaction in a maize multi-parent advanced generation intercross population grown across five environments. We found that genotype by environment interactions contributed as much as genotypic effects to the variation in some agronomically important traits. In order to understand how genetic correlations between traits change across environments, we estimated the genetic variance-covariance matrix in each environment. Changes in genetic covariances between traits across environments were common, even among traits that show low genotype by environment variance. We also performed a genome-wide association study to identify markers associated with genotype by environment interactions but found only a small number of significantly associated markers, possibly due to the highly polygenic nature of genotype by environment interactions in this population.
The search for quantitative trait loci (QTL) that explain complex traits such as yield and flowering time has been ongoing in all crops. Methods such as bi-parental QTL mapping and genome-wide association studies (GWAS) each have their own advantages and limitations. Multi-parent advanced generation intercross (MAGIC) populations contain more recombination events and genetic diversity than bi-parental mapping populations and reduce the confounding effect of population structure that is an issue in association mapping populations. Here we discuss the results of using a MAGIC population of doubled haploid (DH) maize lines created from 16 diverse founders to perform QTL mapping. We compare three models that assume bi-allelic, founder, and ancestral haplotype allelic states for QTL. The three methods have different power to detect QTL for a variety of agronomic traits. Although the founder approach finds the most QTL, there are also QTL unique to each method, suggesting that each model has advantages for traits with different genetic architectures. A closer look at a well-characterized flowering time QTL, qDTA8, which contains vgt1, suggests a potential epistatic interaction and highlights the strengths and weaknesses of each method. Overall, our results reinforce the importance of considering different approaches to analyzing genotypic datasets, and show the limitations of binary SNP data for identifying multi-allelic QTL.
While abiotic environments consistently shape local adaptation, the strength of local adaptation to biotic interactions may vary more. One theory, COCO (CO-evolutionary Outcomes across Conditionality), predicts it may be strongest where species experience greater stress, because stress increases fitness impacts of species interactions. For example, in plant interactions with rhizosphere biota, positive outcomes increase with stress from low soil fertility, drought and cold.
To investigate the influence of abiotic stress gradients on adaptation between plants and rhizosphere biota, we used a greenhouse common garden experiment recombining teosinte, Zea mays ssp. mexicana (wild relative of maize), and rhizosphere biota, collected across a stress gradient (elevational variation in temperature, precipitation, and nutrients).
We found stronger local adaptation between teosinte and rhizosphere biota from colder, more stressful sites, as expected by COCO. However, biota from less stressful, warmer sites provided greater average benefits across teosinte populations. Links between plant traits and 20-element profiles of plant leaves explained fitness variation, persisted in the field, were influenced by both plants and biota, and largely reflected patterns of local adaptation.In sum, we uncovered greater local adaptation to biotic interactions in colder sites, and that both plants and rhizosphere biota affect the expression of plant phenotypes.
Maize is a globally valuable commodity and one of the most extensively studied genetic model organisms. However, we know surprisingly little about the extent and potential utility of the genetic variation found in the wild relatives of maize. Here, we characterize a high-density genomic variation map from ~700 genomes encompassing maize and all wild taxa of the genus Zea, identifying over 65 million single nucleotide polymorphisms (SNPs), 8 million Insertion/Deletion (InDel) polymorphisms, and over one thousand novel inversions. The variation map reveals evidence of selection within taxa displaying novel adaptations such as perenniality and regrowth. We focus in detail on evidence of convergent adaptation in highland teosinte and temperate maize. This study not only indicates the key role of hormone related pathways in highland adaptation and flowering time related pathways in high latitude adaptation, but also identifies significant overlap in the genes underlying adaptations to both environments. To show how this data can identify useful genetic variants, we generated and characterized novel mutant alleles for two flowering time candidate genes. This work provides the most extensive sampling to date of the genetic diversity inherent in the genus Zea, resolving questions on evolution and identifying adaptive variants for direct use in modern breeding.
The genomic signature of wild-to-crop introgression during the domestiation of scarlet runner bean (Phaseolus coccineus L.) [preprint]
Guerra-Garcia A, Rojas-Barrera IC, Ross-Ibarra J, Papa R, Piñero D.
The scarlet runner bean is an open-pollinated legume from the highlands of Mesoamerica that is cultivated in small-scale agriculture for its dry seeds and immature pods. Demographic bottlenecks associated with domestication might reduce genetic diversity and facilitate the accumulation of deleterious mutations. Conversely, introgression from wild relatives could be a source of variation. Using Genotyping by Sequencing data (79,286 SNVs) from 237 cultivated and wild samples, we evaluated the demographic history of traditional varieties from different regions of Mexico and looked for evidence of introgression between sympatric wild and cultivated populations. Traditional varieties have high levels of diversity, even though there is evidence of a severe initial genetic bottleneck, followed by a population expansion. Introgression from wild to domesticated populations was detected, but not in the opposite direction. This asymmetric introgression might contribute to the recovery of genetic variation and it has occurred at different times: constantly in the center of Mexico; recently in the North West; and anciently in the South. Several factors are acting together to increase and maintain genetic diversity in P. coccineus cultivars, such as demographic expansion and introgression. Wild relatives represent a valuable genetic resource and have played a key role in scarlet runner bean evolution via introgression into traditional varieties.
Teosinte introgression modulates phosphatidylcholine levels and induces early maize flowering time [preprint]
Rodríguez-Zapata F, Barnes AC, Blöcher-Juárez KA, Gates DJ …[16 authors]… Ross-Ibarra J, Hufford M, Sawers RJH, Rellán-Álvarez R.
After domestication from lowland teosinte parviglumis (Zea mays ssp. parviglumis) in the warm Mexican southwest, maize (Zea mays ssp. mays) colonized the highlands of Mexico and South America. In the highlands, maize was exposed to lower temperatures that imposed strong selection on flowering time. Previous work in maize and other has linked variation in phospholipid metabolism to low temperature stress as well as changes in flowering time. Here, we combined linkage mapping analysis with genome scans to identify High PhosphatidylCholine 1 (HPC1), a gene which encodes a phospholipase A1 enzyme, as a major driver of phospholipid variation in highland maize. Common garden experiments demonstrated strong genotype-by-environment interactions associated with variation at HPC1, with the highland HPC1 allele leading to higher fitness in highlands, possibly by hastening flowering. The HPC1 variant we identified in highland maize results in impaired function of the encoded protein due to a polymorphism in a highly conserved sequence. A meta-analysis indicated a strong association between the identity of the amino acid at this position in a prokaryotic protein harboring this conserved sequence and optimal growth temperature of the organism. Mutagenesis of HPC1 via genome editing validated its role in regulating phospholipid metabolism. Finally, we show that the highland HPC1 allele entered cultivated maize by introgression from the wild highland teosinte Zea mays ssp. mexicana and has been maintained in maize breeding lines from Northern US, Canada and Europe. Thus, HPC1 introgressed from teosinte mexicana underlies a large metabolic QTL that modulates phosphatidylcholine levels and has an adaptive effect at least in part via induction of early flowering time.
When two populations or species hybridize, their offspring often experience reductions in fitness relative to either parental population. The production of low fitness hybrids may be prevented by the evolution of increased prezygotic isolation; a process known as reinforcement. Theoretical challenges to the evolution of reinforcement are generally cast as a coordination problem — e.g., linkage disequilibrium between trait and preference loci is difficult to maintain in the face of recombination. However, the evolution of reinforcement also poses a potential conflict between mates. For example, the opportunity costs to hybridization may differ between the sexes or species. This is particularly likely for postmating prezygotic isolation, as the ability to fertilize both conspecific and heterospecific eggs is beneficial to male gametes, but heterospecific mating may incur a cost for female gametes. Motivated by this problem, we develop a population genetic model of interspecific conflict over reinforcement, inspired by `gametophytic factors’, which act as postmating prezygotic barriers among Zea mays subspecies. We demonstrate that this conflict results in the transient evolution of reinforcement –– after female preference for a conspecific gamete trait rises to high frequency, male traits adaptively introgress into the other population. Ultimately the male gamete trait fixes in both species, and prezygotic isolation returns to pre-reinforcement levels. We interpret geographic patterns of isolation among Z. mays subspecies in light of these findings, and suggest when and how this conflict can be mediated. Our results suggest that sexual conflict may pose an understudied obstacle to the evolution of reinforcement via postmating prezygotic isolation.
We present a high-resolution genome-wide association analysis to identify loci exhibiting adaptive patterns in a large panel of more than 4500 traditional maize landraces representing the breadth of genetic diversity of maize in Mexico. We evaluate associations between genotype and plant performance in 13 common gardens across a range of environments, identifying hundreds of candidate genes underlying genotype by environment interaction. We further identify genetic associations with environment across Mexico and show that such loci are associated with variation in yield and flowering time in our field trials and predict performance in independent drought trials. Our results indicate that the variation necessary to adapt crops to changing climate exists in traditional landraces that have been subject to ongoing environmental adaptation and can be identified by both phenotypic and environmental association.
While often deleterious, hybridization can also be a key source of genetic variation and pre-adapted haplotypes, enabling rapid evolution and niche expansion. Here we evaluate these opposing selection forces on introgressed ancestry between maize (Zea mays ssp. mays), and its wild teosinte relative. Introgression from ecologically diverse teosinte may have facilitated maize’s global range expansion, in particular to challenging high elevation regions (> 1500 m). We generated low-coverage genome sequencing data for 348 maize and mexicana individuals to evaluate patterns of introgression in 14 sympatric population pairs, spanning the elevational range of Zea mays ssp. mexicana, a teosinte endemic to the mountains of Mexico. While recent hybrids are commonly observed in sympatric populations and mexicana demonstrates fine-scale local adaptation, we find that the majority of mexicana ancestry tracts introgressed >1000 generations ago. This mexicana ancestry seems to have maintained much of its diversity and likely came from a common ancestral source, rather than contemporary sympatric populations, resulting in relatively low Fst between mexicana ancestry tracts sampled from geographically distant maize populations. Introgressed mexicana ancestry is reduced in lower-recombination rate quintiles of the genome and around domestication genes, consistent with pervasive selection against introgression. However, we also find mexicana ancestry increases across the sampled elevational gradient and that high introgression peaks are most commonly shared among high-elevation maize populations, consistent with introgression from mexicana facilitating adaptation to the highland environment. In the other direction, we find patterns consistent with adaptive and clinal introgression of maize ancestry into sympatric mexicana at many loci across the genome, suggesting that maize also contributes to adaptation in mexicana, especially at the lower end of its elevational range. In sympatric maize, in addition to high introgression regions we find many genomic regions where selection for local adaptation maintains steep gradients in introgressed mexicana ancestry across elevation, including at least two inversions: the well-characterized Inv4m and a new 3 Mb inversion Inv9f surrounding the macrohairless1 locus on chromosome 9. The bulk of our ancestry selection outliers show no signals of sweeps or local sourcing from sympatric populations and so likely represent ancestral introgression sorted by selection, resulting in correlated but distinct outcomes of introgression in different contemporary maize landrace populations.
Transposable elements (TEs) constitute the majority of flowering plant DNA, reflecting their tremendous success in subverting, avoiding, and surviving the defenses of their host genomes to ensure their selfish replication. More than 85% of the sequence of the maize genome can be ascribed to past transposition, providing a major contribution to the structure of the genome. Evidence from individual loci has informed our understanding of how transposition has shaped the genome, and a number of individual TE insertions have been causally linked to dramatic phenotypic changes. But genome-wide analyses in maize and other taxa have frequently represented TEs as a relatively homogeneous class of fragmentary relics of past transposition, obscuring their evolutionary history and interaction with their host genome. Using an updated annotation of structurally intact TEs in the maize reference genome, we investigate the family-level ecological and evolutionary dynamics of TEs in maize. Integrating a variety of data, from descriptors of individual TEs like coding capacity, expression, and methylation, as well as similar features of the sequence they inserted into, we model the relationship between these attributes of the genomic environment and the survival of TE copies and families. Our analyses reveal a diversity of ecological strategies of TE families, each representing the evolution of a distinct ecological niche allowing survival of the TE family. In contrast to the wholesale relegation of all TEs to a single category of junk DNA, these differences generate a rich ecology of the genome, suggesting families of TEs that coexist in time and space compete and cooperate with each other. We conclude that while the impact of transposition is highly family- and context-dependent, a family-level understanding of the ecology of TEs in the genome can refine our ability to predict the role of TEs in generating genetic and phenotypic diversity.
We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The data indicate that the number of pan-genes exceeds 103,000 and that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres further reveal the locations and internal structures of major cytological landmarks. We show that combining structural variation with SNPs can improve the power of quantitative mapping studies. Finally, we document variation at the level of DNA methylation, and demonstrate that unmethylated regions are enriched for cis-regulatory elements that overlap QTL and contribute to changes in gene expression.
DNA sequencing technology has advanced so quickly, identifying key functional regions using evolutionary approaches is required to understand how those genomes work. This research develops a sensitive sequence alignment approach to identify functional constrained non-coding sequences in the Andropogoneae tribe. The grass tribe Andropogoneae contains several crop species descended from a common ancestor ~18 million years ago. Despite broadly similar phenotypes, they have tremendous genomic diversity with a broad range of ploidy levels and transposons. These features make Andropogoneae a powerful system for studying conserved non-coding sequence (CNS), here we used it to understand the function of CNS in maize. We find that 86% of CNS comprise known genomic elements e.g., cis-regulatory elements, chromosome interactions, introns, several transposable element superfamilies, and are linked to genomic regions related to DNA replication initiation, DNA methylation and histone modification. In maize, we show that CNSs regulate gene expression and variants in CNS are associated with phenotypic variance, and rare CNS absence contributes to loss of gene expression. Furthermore, we find the evolution of CNS is associated with the functional diversification of duplicated genes in the context of the maize subgenomes. Our results provide a quantitative understanding of constrained non-coding elements and identify functional non-coding variation in maize.
Convergent phenotypic evolution provides some of the strongest evidence for adaptation. However, the extent to which recurrent phenotypic adaptation has arisen via parallelism at the molecular level remains unresolved, as does the evolutionary origin of alleles underlying such adaptation. Here, we investigate genetic mechanisms of convergent highland adaptation in maize landrace populations and evaluate the genetic sources of recurrently selected alleles. Population branch excess statistics reveal strong evidence of parallel adaptation at the level of individual SNPs, genes and pathways in four independent highland maize populations, even though most SNPs show unique patterns of local adaptation. The majority of selected SNPs originated via migration from a single population, most likely in the Mesoamerican highlands. Polygenic adaptation analyses of quantitative traits reveal that alleles affecting flowering time are significantly associated with elevation, indicating the flowering time pathway was targeted by highland adaptation. In addition, repeatedly selected genes were significantly enriched in the flowering time pathway, indicating their significance in adapting to highland conditions. Overall, our study system represents a promising model to study convergent evolution in plants with potential applications to crop adaptation across environmental gradients.
In plants, mammals and insects, some genes are DNA methylated in the CG dinucleotide context, a phenomenon called gene body methylation. It has long been controversial whether this phenomenon has any functional role. Here, we took advantage of the availability of 868 leaf methylomes in Arabidopsis thaliana to characterize the population frequency of gene body methylation at the gene level. We used two outgroups to infer the ancestral methylation state of orthologs. Using the A. thaliana gene body methylation site frequency spectrum and a population genetics model specifically designed for epigenetic data, we find that genes with ancestral gene body methylation are under significant selection to remain methylated. Conversely, ancestrally unmethylated genes are under selection to remain unmethylated. The estimated selection coefficients are small, on the order of magnitude of selection acting on codon usage. We also find that A. thaliana is losing gene body methylation more often than gaining it, which could be due to a recent reduction in the efficacy of selection after a switch to selfing. We investigated the potential function of gene body methylation through its link with gene expression level. We show that, within genes with polymorphic methylation states in A. thaliana wild populations, the gene body methylated state is consistently associated with the highest gene expression level. Our work suggests that gene body methylation has a small effect on fitness, but substantial enough for natural selection to act on it.
Sorghum and maize share a close evolutionary history that can be explored through comparative genomics. To perform a large-scale comparison of the genomic variation between these two species, we analyzed 13M variants identified from whole genome resequencing (WGS) of 468 sorghum lines together with 25M variants previously identified in 1,218 maize lines. Deleterious mutations in both species were prevalent in pericentromeric regions, enriched in non-syntenic genes, and present at low allele frequencies. A comparison of deleterious burden between sorghum and maize revealed that sorghum, in contrast to maize, departed from the “domestication cost” hypothesis that predicts a higher deleterious burden among domesticates compared to wild lines. Additionally, sorghum and maize population genetic summary statistics were used to predict a gene deleterious index with an accuracy higher than 0.5. This research represents a key step towards understanding the evolutionary dynamics of deleterious variants in sorghum and provides a comparative genomics framework to start prioritizing them for removal through genome editing and breeding.
DNA methylation is a ubiquitous chromatin feature – in maize, more than 25% of cytosines in the genome are methylated. Recently, major progress has been made in describing the molecular mechanisms driving methylation, yet variation and evolution of the methylation landscape during maize domestication remain largely unknown. Here we leveraged whole-genome sequencing (WGS) and whole-genome bisulfite sequencing (WGBS) on populations of modern maize, landrace, and teosinte (Zea mays ssp. parviglumis) to investigate the adaptive and phenotypic consequences of methylation variations in maize. By using a novel estimation approach, we inferred the methylome site frequency spectrum (mSFS) to estimate forward and backward methylation mutation rates and selection coefficients. We only found weak evidence for direct selections on methylations in any context, but thousands of differentially methylated regions (DMRs) were identified in population-wide that are correlated with recent selections. Further investigation revealed that DMRs are enriched in 5’ untranslated regions, and that maize hypomethylated DMRs likely helped rewire distal gene regulation. For two trait-associated DMRs, vgt1-DMR and tb1-DMR, our HiChIP data indicated that the interactive loops between DMRs and respective downstream genes were present in B73, a modern maize line, but absent in teosinte. And functional analyses suggested that these DMRs likely served as cis-acting elements that modulated gene regulation after domestication. Our results enable a better understanding of the evolutionary forces acting on patterns of DNA methylation and suggest a role of methylation variation in adaptive evolution.
The genetics of domestication has been extensively studied ever since the rediscovery of Mendel’s law of inheritance and much has been learned about the genetic control of trait differences between crops and their ancestors. Here, we ask how domestication has altered genetic architecture by comparing the genetic architecture of 18 domestication traits in maize and its ancestor teosinte using matched populations. We observed a strongly reduced number of QTL for domestication traits in maize relative to teosinte, which is consistent with the previously reported depletion of additive variance by selection during domestication. We also observed more dominance in maize than teosinte, likely a consequence of selective removal of additive variants. We observed that large effect QTL have low minor allele frequency (MAF) in both maize and teosinte. Regions of the genome that are strongly differentiated between teosinte and maize (high F ST ) explain less quantitative variation in maize than teosinte, suggesting that, in these regions, allelic variants were brought to (or near) fixation during domestication. We also observed that genomic regions of high recombination explain a disproportionately large proportion of heritable variance both before and after domestication. Finally, we observed that about 75% of the additive variance in both teosinte and maize is “missing” in the sense that it cannot be ascribed to detectable QTL and only 25% of variance maps to specific QTL. This latter result suggests that morphological evolution during domestication is largely attributable to very large numbers of QTL of very small effect.
Maize landraces are well adapted to their local environments and present valuable sources of genetic diversity for breeding and conservation. But the maintenance of open-pollinated landraces in programs is challenging, as regeneration of seed can often lead to inbreeding depression and the loss of diversity due to genetic drift. Recent reports suggest that the production of doubled-haploid (DH) lines from landraces may serve as a convenient means to preserve useful genetic diversity in a homozygous form that is immediately useful for modern breeding. The production of doubled-haploid (DH) lines presents an extreme case of inbreeding which results in instantaneous homozygosity genome-wide. Here we analyzed the effect of DH production on genetic diversity, using genome-wide SNP data from hundreds of individuals of five European landraces and their related DH lines. In contrast to previous findings, we observe a dramatic loss of diversity at both the haplotype level and that of individual SNPs. We identify thousands of SNPs that exhibit allele frequency differences larger than expected under models of neutral genetic drift, and document patterns of heterozygosity and polymorphism at conserved sites that suggest an important role for deleterious recessive load in determining diversity differences between landrace and DH populations. Although we were unable to uncover more details about the mode of selection, we conclude that while landrace DH lines may be a valuable tool for the introduction of variation into maize breeding programs they come at the cost of decreased genetic diversity and increased genetic load.
Modern crop breeding has made a profound impact on food production to sustain population growth during the past century; however, systematic analyses of the genomic basis underlying the dramatic increase in crop yields during modern breeding remain limited in scope. Here we report an analysis of the genomic and phenotypic changes associated with modern maize breeding across a chronological sampling of 350 elite inbred lines representing multiple eras of both Chinese and U.S. germplasm. We document a number of convergent phenotypic changes in both countries. Using genome-wide association and selection scan methods, we identify 160 loci underlying adaptive agronomic phenotypes and more than 1,800 genomic regions representing the targets of selection during modern breeding. We functionally validate two candidate genes underpinning variation in ear height and tassel branch number using CRISPR knockout mutants. In sum, this work demonstrates the utility of breeding-era approach for identifying breeding signatures and lays the foundation for future genomics-enabled maize breeding."
Crop domestication is a fascinating area of study, as evidencedby a multitude of recent reviews. Coupled with the increasing availability of genomic and phenomic resources in numerous crop species, insights from evolutionary biology will enable a deeper understanding of the genetic architecture and short-term evolution of complex traits, which can be used to inform selection strategies. Future advances in crop improvement will rely on the integration of population genetics with plant breeding methodology, and the development of community resources to support research in a variety of crop life histories and reproductive strategies. We highlight recent advances in the role of selective sweeps and demographic history in shaping genetic architecture, how these breakthroughs can inform selection strategies, and the application of precision gene editing to leverage these connections.
We review studies on short and long term adaptation, both natural and artificial, in maize and teosinte.
We use simulations to show that the effects of nonequilibrium population dynamics on linked selection differ qualitatively from predictions under equilibrium models, calling into question inferences about the efficacy of selection made from summary statistics.
Rushworth, C.A., R.S. Baucom, B.K. Blackman, M. Neiman, M.E. Orive, A. Sethuraman, J. Ware, and D.R. Matute. 2021. Who are we now? A demographic assessment of the evolution societies. In press, Evolution. Early View: https://onlinelibrary.wiley.com/doi/10.1111/evo.14168. Rushworth, C.A., and T. Mitchell-Olds. 2021. The evolution of sex is tempered by costly hybridization in Boechera (rock cress). Journal of Heredity Early View: https://doi.org/10.1093/jhered/esaa041. Rushworth, C.A., Y. J. Brandvain, and T. Mitchell-Olds. 2020. Identifying the fitness consequences of sex in complex natural environments. Evolution Letters 4-6: 516-529. https://dx.doi.org/10.1002/evl3.194 Jordon-Thaden, I., J. Beck, C.A. Rushworth, M.D. Windham, N. Diaz, J. Cantley, C. Martine, and C.J. Rothfels. 2020. RADseq for any lab: a basic two-enzyme protocol performs well in herbarium and silica-dried tissues across four genera. Applications in Plant Sciences 8(4): e11344.
Tittes, Silas. "rdmc: an open source R package implementing convergent adaptation models of Lee and Coop (2017)." G3: Genes, Genomes, Genetics 10.9 (2020): 3041-3046. O'Hara, N. B., Franks, S. J., Kane, N. C., Tittes, S., & Rest, J. S. (2020). Evolution of pathogen response genes associated with increased disease susceptibility during adaptation to an extreme drought in a Brassica rapa plant population. Wooliver, R., Tittes, S. B., & Sheth, S. N. (2020). A resurrection study reveals limited evolution of thermal performance in response to recent climate change across the geographic range of the scarlet monkeyflower. Evolution, 74(8), 1699-1710. Comeault, A. A., Wang, J., Tittes, S., Isbell, K., Ingley, S., Hurlbert, A. H., & Matute, D. R. (2020). Genetic diversity and thermal performance in invasive and native populations of African fig flies. Molecular biology and evolution, 37(7), 1893-1906.
We develop an approach to estiamte family-level expression in transposable elements. While a relatively small proportion of TE families are transcribed, expression is highly dynamic with most families exhibiting tissue-specific expression. We also assay expression among different maize genotypes and use a mapping population to identify what proportion of copies in a family contribute to expression."
We identify hundreds of thousands of variable transposable elements among a small set of maize genome assemblies. We find evidence of a large number of genes impacted by TEs, including TEs in genes and genes in TEs, and evidence of recent transposition as well.""
We describe a phenomenon we name ‘hybrid decay’ triggered by backcrossing between domesticated maize and a specific teosinte population, resulting in genome instability, activation of transposable elements, and altered epigenetics ."
We show that rhizosphere biota impact adaptive divergence of phenotypic traits in teosinte along an environmental cline. This work suggests an important role for biotic interactions in determining the outcome of local adaptation.
We look for introgression from teosinte in the genomes of two highland landraces and use QTL mapping to test for functional relevance of introgressed regions.
We develop methods to detect polygenic adaptation on phenotypes from common gardens and GWAS panels.
Perkins, M. Taylor, Tetyana Zhebentyayeva, Paul H. Sisco, and J. Hill Craddock. “Genome-wide sequence-based genotyping supports a nonhybrid origin of Castanea alabamensis.” bioRxiv (2019): 680371. Craddock, J. Hill, and M. Taylor Perkins. “Chestnut (Castanea spp. Miller) Breeding.” In Advances in Plant Breeding Strategies, vol. 4. (J.M. Al-Khayri et al., eds.)(in press), 55 pages. Springer Nature Switzerland AG, Basel.
Roessler, Kyria, Aline Muyle, Concepcion M. Diez, Garren RJ Gaut, Alexandros Bousios, Michelle C. Stitzer, Danelle K. Seymour, John F. Doebley, Qingpo Liu, and Brandon S. Gaut. "The genome-wide dynamics of purging during selfing in maize." Nature plants (2019): 1-11. Noshay, Jaclyn M., Sarah N. Anderson, Peng Zhou, Lexiang Ji, William Ricci, Zefu Lu, Michelle C. Stitzer et al. "Monitoring the interplay between transposable element families and DNA methylation in maize." PLoS genetics 15, no. 9 (2019): e1008291. Marcela K. Tello-Ruiz, Cristina F. Marco, Fei-Man Hsu, Rajdeep S. Khangura, Pengfei Qiao, Sirjan Sapkota, Michelle C. Stitzer, Rachael Wasikowski, Hao Wu, Junpeng Zhan, Kapeel Chougule, Lindsay C. Barone, Cornel Ghiban, Demitri Muna, Andrew C. Olson, Liya C. Wang, Doreen C. Ware, David A. Micklos bioRxiv 654848; doi: https://doi.org/10.1101/654848 Stitzer, Michelle C., and Philipp Brand. "Digest: Hybrid incompatibilities and introgression in wild monkeyflowers." Evolution 72, no. 11 (2018): 2565-2566.
We use simulations to study the population genetics of selection on a quantitative trait and learn which parameters drive quantitative trait evollution.
We present the CoCoA hypothesis which describes how patterns of biotic coadpation change across varying abiotic environments
We review progress thus far in genomic research of maize domestication and adaptation. We discuss the insights genomics has shed on our understanding of these processes and conclude with a future outlook for how genomics might be further applied.
We review genetic work looking at the role of dominance, epistasis, and pleiotropy during maize domestication.
We identify a small gene family Kindr as the causal locus for meiotic drive in maize.
We show that genome size can be analyzed as a quantitative trait and that selection on flowering time in maize has likely driven adaptive changes in genome size along multiple altitudinal clines.
We develop the functional space hypothesis in which we posit that adaptation in large genomes uses more noncoding variation and is more likely to proceed via soft sweeps.
Discusses the challenges of changing climates pose for society and the environment along the Mexican-US border.
Wenbin Mei and Michelle Stitzer
Springer, Nathan M., et al. "The maize W22 genome provides a foundation for functional genomics and transposon biology." Nature genetics (2018): 1.
Joshi, Dinesh C., et al. "From zero to hero: the past, present and future of grain amaranth breeding." Theoretical and Applied Genetics (2018): 1-17.
Josephs, Emily B. "Determining the evolutionary forces shaping G× E." New Phytologist (2018). Uzunović, Jasmina, et al. "Transposable elements are important contributors to standing variation in gene expression in Capsella grandiflora." bioRxiv (2018): 289173.
Kremling, Karl AG, et al. "Dysregulation of expression correlates with rare-allele burden and fitness loss in maize." Nature 555.7697 (2018): 520.
Gates, Daniel J., Diana Pilson, and Stacey D. Smith. "Filtering of target sequence capture individuals facilitates species tree construction in the plant subtribe Iochrominae (Solanaceae)." Molecular phylogenetics and evolution 123 (2018): 26-34. Gates, Daniel J., et al. "A novel R3 MYB transcriptional repressor associated with the loss of floral pigmentation in Iochroma." New Phytologist 217.3 (2018): 1346-1356.
Riaz, Summaira, et al. "Genetic diversity analysis of cultivated and wild grapevine (Vitis vinifera L.) accessions around the Mediterranean basin and Central Asia." BMC plant biology 18.1 (2018): 137.
We present a SNP dataset of 80M variants from a sample of 1200 maize and teosinte genomes.
We show how population bottlenecks and gene flow have shaped patterns of genetic load across the maize genome and among populatons.
We combine phenotypic data from crosses between multiple maize inbred lines and genome-wide characterization of deleterious alleles to show that a simple model of incomplete dominance may help explain hybrid vigor.
Gene expression data from maize and teosinte grown in mid-Holocene environments suggests that for some genes plastic changes in expression may have preceded genetic changes during domestication.
A perspective that seeks to predict what factors influence the success of domestication, how many genes contributed to the process, where these genes originated and the implications for de novo domestication.
We show that the two subgenomes of maize differentially contribute to functional variation in phenotypes.
We show that short read sequence data allows discovery of novel tandem repeats in grass species but, unlike many other species, the most abundant repeat is frequently not centromeric.
Aradhya, Mallikarjuna, et al. "Genetic and ecological insights into glacial refugia of walnut (Juglans regia L.)." PloS one 12.10 (2017): e0185974.
Josephs, Emily B., et al. "The relationship between selection, network connectivity, and regulatory variation within a population of Capsella grandiflora." Genome biology and evolution 9.4 (2017): 1099-1109. Josephs, Emily B., John R. Stinchcombe, and Stephen I. Wright. "What can genome‐wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits?." New Phytologist 214.1 (2017): 21-33.
Mei, Wenbin, et al. "A comprehensive analysis of alternative splicing in paleopolyploid maize." Frontiers in plant science 8 (2017): 694. Liu, Xiaoxian, et al. "Detecting alternatively spliced transcript isoforms from single‐molecule long‐read sequences without a reference genome." Molecular ecology resources 17.6 (2017): 1243-1256. Gault, Christine M., et al. "Aberrant splicing in maize rough endosperm3 reveals a conserved role for U12 splicing in eukaryotic multicellular development." Proceedings of the National Academy of Sciences (2017): 201616173. Feng, Guanqiao, et al. "Evolution of the 3R-MYB gene family in plants." Genome biology and evolution 9.4 (2017): 1013-1029. Mei, Wenbin, et al. "Evolutionarily conserved alternative splicing across monocots." Genetics (2017): genetics-300189.
Genome sequence of a 5,000 year old maize cob reveals it was partially domesticated and helps identify the timing of adaptation during domestication.
Resequencing data from a number of peach and almond varieties show differences in population size and mating system, but little evidence for convergent selection during domestication.
We provide a set of scripts and configuration files to facilitate easy popgen analysis of short-read sequencing data.
We find little evidence for widespread hard sweeps in maize or teosinte and show that differences in demography during maize domestication and subsequent expansion have have changed the impacts of linked selection on surrounding diversity in the maize genome.
Migicovsky, Zoë, et al. "Genomic ancestry estimation quantifies use of wild species in grape breeding." BMC genomics 17.1 (2016): 478.
Josephs, Emily B., and Stephen I. Wright. "On the Trail of Linked Selection." PLoS genetics 12.8 (2016): e1006240.
Morota, Gota, Timothy M. Beissinger, and Francisco Peñagaricano. "MeSH-informed enrichment analysis and MeSH-guided semantic similarity among functional terms and gene products in chicken." G3: Genes, Genomes, Genetics (2016): g3-116.
We show how modern hybrid breeding approahces impact diversity across the genome. Using the Iowa Stiff Stalk synthetic population as an example, we track diversity across 16 generations of selection and compare with simulations. We show the vast majority of the changes in diversity observed can be explained by drift, but that fixation of large haplotypes, likley including considerable linked load, leads to selection for complementation in the recirpical populations. This work provided some of the first population genetic evidence supporting the complenmentaton model of hybrid vigor.
We find that maize adaptation to the highlands in Mexico and South America is largely independent and has made considerable use of standing genetic variation.
We find that variation in the domestication locus tb1 does not explain a substantial proportion of phenotypic variation for tillering in natural populations of teosinte.
We use sequence data from ancient maize cobs to reveal the timing and geography of maize introduction into the Southwest US. One of the most read papers from the last four years of the journal! UC Davis Press Release Science World Report Archaeology SiNC, Science magazine Nature Plants commentary The Scientist
We show that transposable elements can act as regulatory sequences that up- or down-regulate genes in maize in response to abiotic stress.
Aradhya, Mallikarjuna, J. E. Preece, and Dianne Velasco. "Multivariate analysis of molecular and morphological diversity in fig (Ficus carica L.)." V International Symposium on Fig 1173. 2015.
Leung, Wilson, et al. "Drosophila Muller F elements maintain a distinct set of genomic properties over 40 million years of evolution." G3: Genes, Genomes, Genetics 5.5 (2015): 719-740.
Persson, Tomas, et al. "Candidatus Frankia datiscae Dg1, the actinobacterial microsymbiont of Datisca glomerata, expresses the canonical nod genes nodABC in symbiosis with its host plant." PLoS One 10.5 (2015): e0127630.
Le, Tu N., et al. "Epigenetic regulation of intragenic transposable elements impacts gene transcription in Arabidopsis thaliana." Nucleic acids research 43.8 (2015): 3911-3921.
We discuss some of the outstanding questions and key challenges facing the use of population genetics to study local adaptation.
We show that deleterious mutations are abundant in modern maize breeding material and that genes with deleterious mutations are likely important contributors to phenotypic variation.
Aradhya, Mallikarjuna, et al. "Genetic diversity, structure, and patterns of differentiation in the genus Vitis." Plant systematics and evolution 299.2 (2013): 317-330. Zdunić, Goran, et al. "Genetic diversity and differentiation within and between cultivated (Vitis vinifera L. ssp. sativa) and wild (Vitis vinifera L. ssp. sylvestris) grapes." Vitis 52.1 (2013): 29-32.
We identify and study the evolutionary genetics of a 50Mb inversion in natural populations of teosinte
Krutovsky, Konstantin V., et al. "Gene flow, spatial structure, local adaptation, and assisted migration in trees." Genomics of tree crops. Springer, New York, NY, 2012. 71-116. Pyhäjärvi, Tanja, Esa Aalto, and Outi Savolainen. "Time scales of divergence and speciation among natural populations and subspecies of Arabidopsis lyrata (Brassicaceae)." American journal of botany 99.8 (2012): 1314-1322.
Hufford, Matthew B., et al. "Inferences from the historical distribution of wild and domesticated maize provide ecological and evolutionary insight." PLoS One 7.11 (2012): e47659.
We resolve the conflict between genetic, ecological, and achaeological data regarding the geographic location of maize domestication.
Savolainen, Outi, et al. "Adaptive potential of northernmost tree populations to climate change, with emphasis on Scots pine (Pinus sylvestris L.)." Journal of Heredity 102.5 (2011): 526-536. Pyhäjärvi, Tanja, Sonja T. Kujala, and Outi Savolainen. "Revisiting protein heterozygosity in plants—nucleotide diversity in allozyme coding genes of conifer Pinus sylvestris." Tree genetics & genomes 7.2 (2011): 385-397.
We show that mating system differences in heterozygosity mitigate the effects of insertion and deletion polymorphisms on nucleotide diversity.
The need to protect crop genetic resources has sparked a growing interest in the genetic diversity maintained in traditional farming systems worldwide. Although traditional seed management has been proposed as an important determinant of genetic diversity and structure in crops, no models exist that can adequately describe the genetic effects of seed management. We present a metapopulation model that accounts for several features unique to managed crop populations. Using traditional maize agriculture as an example, we develop a coalescence-based model of a crop metapopulation undergoing pollen and seed flow as well as seed replacement. In contrast to metapopulation work on natural systems, we model seed migration as episodic and originating from a single source per population rather than as a constant immigration from the entire metapopulation. We find that the correlated origin of migrants leads to surprising results, including a loss of invariance of within-deme diversity and a parabolic relationship between F(ST) and migration quantity. In contrast, the effects of migration frequency on diversity and structure are more similar to classical predictions, suggesting that seed migration in managed crop populations cannot be described by a single parameter. In addition to migration, we investigate the effects of deme size and extinction rates on genetic structure, and show that high levels of pollen migration may mask the effects of seed management on structure. Our results highlight the importance of analytically evaluating the effects of deviations from classical metapopulation models, especially in systems for which data are available to estimate specific model parameters.
Epanchin-Niell, Rebecca S., et al. "Controlling invasive species in complex social landscapes." Frontiers in Ecology and the Environment 8.4 (2010): 210-216.
We discover a 13,000 year old oak clone in the hills of Southern California
We show that differences in demographic history among populations of have changed the abundance and freqeuncy of transposable elements in the A lyrata genome.
A rebuttal showing that genetic data can indeed successfully recover multiple independent domestications.
Discusses some of the evolutionary forces responsible for shaping genome size across plants.
Presents plant domestication as an ideal system in which to study the genetics of adaptation.
Here I test two different hypotheses about the how recombination should relate to domestication: Does domestication itself select for increased recombination (yes), or respecies with higher recombination rates more likely to be domesticated (no).
I use genetic data to study the origin of a leafy shrub domesticated by the Maya