The ancestral recombination graph (ARG) is a structure that represents the history of coalescent and recombination events connecting a set of sequences (Hudson RR. In: Futuyma D, Antonovics J, editors. Gene genealogies and the coalescent process. In: Oxford Surveys in Evolutionary Biology; 1991. p. 1 to 44.). The full ARG can be represented as a set of genealogical trees at every...
Zona pellucida (ZP) modules mediate extracellular protein–protein interactions and contribute to important biological processes including syngamy and cellular morphogenesis. Although some biomedically relevant ZP modules are well studied, little is known about the protein family’s broad-scale diversity and evolution. The increasing availability of sequenced genomes from “nonmodel...
Many regulatory networks appear to involve partially redundant enhancers. Traditionally, such enhancers have been hypothesized to originate mainly by sequence duplication. An alternative model postulates that they arise independently, through convergent evolution. This mechanism appears to be counterintuitive to natural selection: Redundant sequences are expected to either...
Xanthomonas oryzae pv. oryzae (Xoo) is a serious pathogen of rice which displays tremendous interstrain variation. The emergence of highly-virulent strains of Xoo is a major threat to rice cultivation. Evolutionary insights into genome dynamics of highly virulent strains as compared with the less-virulent ones are crucial for understanding the molecular basis of exceptional...
Though large multilocus genomic data sets have led to overall improvements in phylogenetic inference, they have posed the new challenge of addressing conflicting signals across the genome. In particular, ancestral population structure, which has been uncovered in a number of diverse species, can skew gene tree frequencies, thereby hindering the performance of species tree...
Our ability to correctly reconstruct a phylogenetic tree is strongly affected by both systematic errors and the amount of phylogenetic signal in the data. Current approaches to tackle tree reconstruction artifacts, such as the use of parameter-rich models, do not translate readily to single-gene alignments. This, coupled with the limited amount of phylogenetic information...
The conservation of orthologs of most subunits of the origin recognition complex (ORC) has served to propose that the whole complex is common to all eukaryotes. However, various uncertainties have arisen concerning ORC subunit composition in a variety of lineages. Also, it is unclear whether the ancestral diversification of ORC in eukaryotes was accompanied by the...
We previously introduced a numerical quantity called the stability (Ps) of an inferred tree and showed that for the tree to be reliable this stability as well as the reliability of the tree, which is usually computed as the bootstrap probability (Pb), must be high. However, if genome duplication occurs in a species, a gene family of the genome also duplicates, and for this reason...
The olive fruit fly Bactrocera oleae is a major pest of olives worldwide and houses a specialized gut microbiota dominated by the obligate symbiont “Candidatus Erwinia dacicola.” Candidatus Erwinia dacicola is thought to supplement dietary nitrogen to the host, with only indirect evidence for this hypothesis so far. Here, we sought to investigate the contribution of the symbiosis...
Extrachromosomal circular DNA (eccDNA) elements of chromosomal origin are known to be common in a number of eukaryotic species. However, it remains to be addressed whether genomic features such as genome size, the load of repetitive elements within a genome, and/or animal physiology affect the number of eccDNAs. Here, we investigate the distribution and numbers of eccDNAs in a...
Toll-like receptors (TLRs) are a complex family of innate immune genes that are well characterized in mammals and birds but less well understood in nonavian sauropsids (reptiles). The advent of highly contiguous draft genomes of nonmodel organisms enables study of such gene families through analysis of synteny and sequence identity. Here, we analyze TLR genes from the genomes of...
Long-term balancing selection results in a build-up of alleles at similar frequencies and a deficit of substitutions when compared with an outgroup at a locus. The previously published β(1) statistics detect balancing selection using only polymorphism data. We now propose the β(2) statistic which detects balancing selection using both polymorphism and substitution data. In...
Repeated and independent emergence of trait divergence that matches habitat differences is a sign of parallel evolution by natural selection. Yet, the molecular underpinnings that are targeted by adaptive evolution often remain elusive. We investigate this question by combining genome-wide analyses of copy number variants (CNVs), single nucleotide polymorphisms (SNPs), and gene...
Typical avian eyes are phenotypically engineered for photopic vision (daylight). In contrast, the highly derived eyes of the barn owl (Tyto alba) are adapted for scotopic vision (dim light). The dramatic modifications distinguishing barn owl eyes from other birds include: 1) shifts in frontal orientation to improve binocularity, 2) rod-dominated retina, and 3) enlarged corneas...
Dicyemids, previously called “mesozoans” (intermediates between unicellular protozoans and multicellular metazoans), are an enigmatic animal group. They have a highly simplified adult body, comprising only ∼30 cells, and they have a unique parasitic lifestyle. Recently, dicyemids were shown to be spiralians, with affinities to the Platyhelminthes. In order to understand molecular...
Transposable elements (TEs) play major roles in the evolution of genome structure and function. However, because of their repetitive nature, they are difficult to annotate and discovering the specific roles they may play in a lineage can be a daunting task. Heliconiine butterflies are models for the study of multiple evolutionary processes including phenotype evolution and...
The interface between populations and evolving young species continues to generate much contemporary debate in systematics depending on the species concept(s) applied but which ultimately reduces to the fundamental question of “when do nondiscrete entities become distinct, mutually exclusive evolutionary units”? Species are perceived as critical biological entities, and the...
The relationships of crustaceans and hexapods (Pancrustacea) have been much discussed and partially elucidated following the emergence of phylogenomic data sets. However, major uncertainties still remain regarding the position of iconic taxa such as Branchiopoda, Copepoda, Remipedia, and Cephalocarida, and the sister group relationship of hexapods. We assembled the most taxon...
The piranha enjoys notoriety due to its infamous predatory behavior but much is still not understood about its evolutionary origins and the underlying molecular mechanisms for its unusual feeding biology. We sequenced and assembled the red-bellied piranha (Pygocentrus nattereri) genome to aid future phenotypic and genetic investigations. The assembled draft genome is similar to...
High-elevation organisms experience shared environmental challenges that include low oxygen availability, cold temperatures, and intense ultraviolet radiation. Consequently, repeated evolution of the same genetic mechanisms may occur across high-elevation taxa. To test this prediction, we investigated the extent to which the same biochemical pathways, genes, or sites were subject...
Meiotic recombination is a highly conserved process that has profound effects on genome evolution. At a fine-scale, recombination rates can vary drastically across genomes, often localized into small recombination “hotspots” with highly elevated rates, surrounded by regions with little recombination. In most species studied, the location of hotspots within genomes is highly...
Insertions and deletions (INDELs) remain understudied, despite being the most common form of genetic variation after single nucleotide polymorphisms. This stems partly from the challenge of correctly identifying the ancestral state of an INDEL and thus identifying it as an insertion or a deletion. Erroneously assigned ancestral states can skew the site frequency spectrum, leading...
The metabolic conversion of dietary omega-3 and omega-6 18 carbon (18C) to long chain (>20 carbon) polyunsaturated fatty acids (LC-PUFAs) is vital for human life. The rate-limiting steps of this process are catalyzed by fatty acid desaturase (FADS) 1 and 2. Therefore, understanding the evolutionary history of the FADS genes is essential to our understanding of hominin evolution...