Gene expression phylogenies and ancestral transcriptome reconstruction resolves major transitions in the origins of pregnancy
Abstract
Structural and physiological changes in the female reproductive system underlie the origins of pregnancy in multiple vertebrate lineages. In mammals, the glandular portion of the lower reproductive tract has transformed into a structure specialized for supporting fetal development. These specializations range from relatively simple maternal nutrient provisioning in egg-laying monotremes to an elaborate suite of traits that support intimate maternal-fetal interactions in Eutherians. Among these traits are the maternal decidua and fetal component of the placenta, but there is considerable uncertainty about how these structures evolved. Previously, we showed that changes in uterine gene expression contribute to several evolutionary innovations during the origins of pregnancy (Mika et al., 2021b). Here, we reconstruct the evolution of entire transcriptomes (‘ancestral transcriptome reconstruction’) and show that maternal gene expression profiles are correlated with degree of placental invasion. These results indicate that an epitheliochorial-like placenta evolved early in the mammalian stem-lineage and that the ancestor of Eutherians had a hemochorial placenta, and suggest maternal control of placental invasiveness. These data resolve major transitions in the evolution of pregnancy and indicate that ancestral transcriptome reconstruction can be used to study the function of ancestral cell, tissue, and organ systems.
Editor's evaluation
Mika and colleagues reconstruct the evolution of uterine endometrial transcriptomes during pregnancy from 23 diverse species of mammals that differ with respect to their degree of placental invasiveness. Through this analysis, the authors infer that the eutherian mammal ancestor had an invasive mode of placentation and that the degree of invasiveness of placentation is reflected on uterine endometrial gene expression during pregnancy. Thus, phylogenetic analysis of gene expression profiles of different mammals groups them on the basis of the degree of placental invasiveness, a quite striking finding.
https://doi.org/10.7554/eLife.74297.sa0
Introduction
Studies of the fossil record have revealed in fine detail major stages in the origin and diversification of evolutionary novelties (structures) and innovations (functions) such as the vertebrate limb and skull (Abzhanov, 2015; Hirasawa and Kuratani, 2015), the turtle shell (Lyson and Bever, 2020), feathers (Chen et al., 2015), and flowers (Chanderbali et al., 2016). Many features of soft tissues and macromolecular structures, however, are lost during the fossilization process and thus leave little to no trace in the fossil record. To reconstruct the history of these characters, including DNA and amino acid sequences, morphology, physiology, and behavior, among others, evolutionary studies have traditionally relied on comparative (phylogenetic) methods such as parsimony or model-based maximum likelihood or Bayesian inference (Lewis and Olmstead, 2001). These methods infer ancestral characters from the distribution of character states among extant species, the latter of which also allows increased model complexity including unequal character state transitions (Lewis and Olmstead, 2001; Pagel and Cunningham, 1999), variable rates among sites and branches (Galtier, 2001; Yang, 1994; Yang, 1993), and character-dependent diversification (Maddison et al., 2007).
Extant mammals span crucial transitions in the origins of pregnancy (Figure 1A) and are an excellent system in which to explore the origins of evolutionary novelties. The platypus and echidna (monotremes), for example, are oviparous (egg-laying) but retain the egg in the glandular portion of the uterus for about two weeks. During this period, the developing embryo is nourished by uterine secretions (matrotrophy) delivered through a simple yolk sac placenta (Hughes and Hall, 1998; Renfree and Shaw, 2013). Viviparity (live-birth) evolved in the stem-lineage of therian mammals, but marsupials and eutherians have very different reproductive strategies, particularly in the ontogenetic origins of the definitive placenta and the arrangement of the maternal-fetal interface (Freyer and Renfree, 2009; Renfree and Shaw, 2013; Renfree, 2010). In most marsupials, the embryonic portion of the placenta is derived from the yolk sac, which may come into direct contact with but does not invade the maternal endometrium. While the yolk sac has essential functions during early pregnancy in eutherians, the definitive placenta is derived from chorion and allantois (chorioallantois) and varies in its degree of endometrial invasion (Swanson and Skinner, 2018; Figure 1B). Thus, matrotrophy and a yolk-sac-derived placenta were present in the mammalian stem-lineage and transitioned to a chorioallantoic placenta in eutherians.
Unfortunately, most characters related to pregnancy and the nature of the maternal-fetal interface leave little to no trace in the fossil record. Thus, studies exploring the evolution of pregnancy have relied on comparing morphological differences between extant mammals to reconstruct the steps in the origins of pregnancy. These comparative analyses have used multiple methods to reconstruct the arrangement of the mammalian maternal-fetal interface and reached contradictory conclusions about the degree of placental invasiveness in the eutherian ancestor, a debate which has persisted since at least 1876 (Table 1). Here, we use comparative transcriptomics and maximum likelihood to infer ancestral gene expression states (ancestral transcriptome reconstruction) from 23 amniotes with different parity modes and degrees of placental invasion to identify the evolutionary history of the maternal-fetal interface. We found strong evidence that the last common ancestor of eutherian mammals had an invasive hemochorial placenta, as well as convergence in gene expression profiles during the independent evolution of non-invasive epitheliochorial placentas in marsupials and some eutherian lineages. These data indicate that the degree of placental invasion can be inferred from endometrial gene expression profiles and suggest that placental invasiveness is regulated by gene expression profiles in the maternal endometrium rather than the fetal portion of the placenta.
Results
Endometrial gene expression profiling
We previously assembled a collection of transcriptomes from the pregnant or gravid uterine endometrium of therian mammals with varying degrees of placental invasiveness, a monotreme (platypus), a bird, and viviparous, oviparous, and reproductively bi-modal lizards (Marinić et al., 2021). The complete dataset includes expression information for 21,750 genes from 23 species (Figure 2—source data 1). Principal Component Analysis (PCA) based on gene expression levels in transcripts per million (TPM), using a variant of PCA that extends PCA to binary data, generally grouped species randomly (Figure 2A), consistent with noise in gene expression data overwhelming phylogenetic signal. Therefore, we transformed quantitative gene expression values in TPM into discrete character states: genes with TPM ≥2.0 were coded as expressed (state = 1), genes with TPM <2.0 were coded as not expressed (state = 0), and genes without data in specific species were coded as missing (?) (Marinić et al., 2021; Mika et al., 2021a). In contrast to PCA based on TPM values, PCA of the binary encoded endometrial transcriptome dataset grouped species by phylogenetic relatedness (Figure 2B), indicating significant noise reduction in the binary encoded dataset.
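The binary encoding rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than the pipeline used in the study: the function names, the species labels, and the TPM values are hypothetical, and only the TPM ≥2.0 threshold and the 1/0/? coding come from the text.

```python
from typing import Dict, List, Optional

TPM_THRESHOLD = 2.0  # threshold stated in the text (TPM >= 2.0 is "expressed")


def encode_expression(tpm: Optional[float], threshold: float = TPM_THRESHOLD) -> str:
    """Return '1' (expressed), '0' (not expressed), or '?' (no data)."""
    if tpm is None:
        return "?"
    return "1" if tpm >= threshold else "0"


def encode_species(tpms: List[Optional[float]]) -> str:
    """Encode one species' TPM values as a string of discrete character states."""
    return "".join(encode_expression(value) for value in tpms)


# Hypothetical TPM values for four genes in two species (illustration only).
example: Dict[str, List[Optional[float]]] = {
    "species_A": [12.5, 0.3, None, 2.0],
    "species_B": [1.9, 45.0, 7.1, None],
}

encoded = {name: encode_species(values) for name, values in example.items()}
for name, states in encoded.items():
    print(name, states)
```

Each resulting string is one row of a binary character matrix, with `?` marking genes that lack data in that species.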
Phylogenetic analyses of endometrial transcriptomes
We used IQ-TREE to infer the best fitting model of character evolution and the maximum-likelihood (ML) phylogeny. The best-fit model of character evolution was a general time reversible model for binary data (GTR2) with character state frequencies optimized by maximum-likelihood (FO) and rate heterogeneity across sites accommodated with the FreeRate model that had three rate categories (R3). Next, we assessed the tree topology and branch support metrics for the ML phylogeny inferred using the binary encoded endometrial gene expression dataset and the GTR2+FO+R3 model. The ML phylogeny (Figure 3A and ) generally followed taxonomic relationships (Figure 3B); however, four particularly discordant relationships within therian mammals were inferred with high support: (1) Rather than grouping marsupials into a monophyletic sister-clade to the eutherians, opossum and wallaby were placed as sister-species within the Boreoeutheria; (2) Armadillo groups within the Euarchontoglires rather than as sister to all other eutherians; (3) Bat groups within the Euarchontoglires rather than within Laurasiatheria; and (4) Dog groups within the Euarchontoglires rather than with Laurasiatheria (Figure 3A/B). We also used multiple non-parametric topology tests to directly compare the ML tree to alternative trees with the correct phylogenetic placement of these four discordant lineages, all of which rejected alternative ‘corrected’ trees in favor of the ML tree (Figure 3C).
Remarkably, while these discordant relationships are incorrect with respect to the species phylogeny, they are correct with respect to placenta-type, that is, species form well-supported clades based on their degree of placental invasiveness: wallaby and opossum, which have epitheliochorial placentas similar to ungulates, form a clade with ungulates; armadillo, which has an invasive placenta (Carter and Enders, 2004; Chavan and Wagner, 2016), forms a clade with the other species with hemochorial placentas; and dog, which has an invasive endotheliochorial placenta, forms a clade with Euarchontoglires that have invasive hemochorial placentas.
Ancestral transcriptome reconstruction and fuzzy C-Means clustering
We also used IQ-TREE and the species phylogeny (Figure 3B) to reconstruct ancestral gene expression states for each gene (ancestral transcriptome reconstruction). To explore the similarity of extant and reconstructed transcriptomes we used Fuzzy C-Means (FCM) clustering, a ‘soft’ clustering method that allows each sample to have membership in multiple clusters and assigns samples to clusters based on their degree of cluster membership. FCM with two to four clusters (K=2–4) had a clear biological interpretation (Figure 4): (1) K=2 clustered eutherians and non-eutherians; (2) K=3 clustered most therians with non-invasive (epitheliochorial) placentas, eutherians with invasive (endotheliochorial or hemochorial) placentas, platypus and sauropsids (i.e. viviparous and oviparous lizards, and birds); (3) K=4 clustered eutherians with invasive placentas, eutherians with epitheliochorial placentas, opossum/wallaby, and sauropsids. A notable exception with K=3–4 is the cluster membership of dunnart, which is discussed in greater detail below. FCM clusters with K=5 and K=6 were similar to K=4, but divided Eutherians with hemochorial placentas into two clusters, and clustered dog, dunnart, and the viviparous skink Chalcides ocellatus (Figure 4); beyond K=6, clusters had no clear biological interpretation.
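To make the idea of ‘soft’ cluster membership concrete, the following is a minimal pure-Python sketch of the standard Fuzzy C-Means update rules applied to binary expression profiles. It is not the implementation used in the study: the fuzzifier m=2, the squared Euclidean distance, the deterministic farthest-point initialization, and the six toy profiles are all assumptions made for illustration.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_point_init(samples, k):
    """Deterministic start (an assumption): first sample, then farthest samples."""
    centers = [list(samples[0])]
    while len(centers) < k:
        nxt = max(samples, key=lambda s: min(squared_distance(s, c) for c in centers))
        centers.append(list(nxt))
    return centers


def compute_memberships(samples, centers, m):
    """u[s][i] = 1 / sum_j (d_si / d_sj)^(2/(m-1)), written with squared distances."""
    power = 1.0 / (m - 1.0)
    rows = []
    for s in samples:
        d2 = [max(squared_distance(s, c), 1e-12) for c in centers]  # avoid division by zero
        rows.append([1.0 / sum((d2[i] / d2[j]) ** power for j in range(len(d2)))
                     for i in range(len(d2))])
    return rows


def update_centers(samples, u, m):
    """Each center is the mean of all samples weighted by membership^m."""
    centers = []
    for i in range(len(u[0])):
        weights = [row[i] ** m for row in u]
        total = sum(weights)
        centers.append([sum(w * s[d] for w, s in zip(weights, samples)) / total
                        for d in range(len(samples[0]))])
    return centers


def fuzzy_c_means(samples, k, m=2.0, max_iter=200, tol=1e-9):
    """Alternate membership and center updates until memberships stop changing."""
    centers = farthest_point_init(samples, k)
    u = compute_memberships(samples, centers, m)
    for _ in range(max_iter):
        centers = update_centers(samples, u, m)
        new_u = compute_memberships(samples, centers, m)
        shift = max(abs(a - b) for r1, r2 in zip(u, new_u) for a, b in zip(r1, r2))
        u = new_u
        if shift < tol:
            break
    return u, centers


# Toy binary profiles (hypothetical): three samples from each of two groups.
profiles = [
    [1, 1, 1, 0, 0, 0], [1, 1, 0, 0, 0, 0], [1, 1, 1, 1, 0, 0],
    [0, 0, 0, 1, 1, 1], [0, 0, 1, 1, 1, 1], [0, 0, 0, 0, 1, 1],
]
membership, centers = fuzzy_c_means(profiles, k=2)
hard_labels = [row.index(max(row)) for row in membership]
for row, label in zip(membership, hard_labels):
    print([round(v, 3) for v in row], "-> cluster", label)
```

Each row of `membership` sums to one, so a sample with mixed coefficients (as reported for dog, dunnart, and C. ocellatus at low K) would show up as a row without a dominant entry, whereas the hard label is simply the cluster with the largest coefficient.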
Ancestral transcriptome reconstructions generally clustered with extant species having similar parity modes and degrees of placenta invasiveness (Figure 4). For example, FCM with K=2 grouped extant eutherians and their ancestral lineages as well as extant non-eutherians and their ancestral lineages. Similarly, FCM with K=4 grouped extant eutherians with invasive placentas and their ancestral lineages with extant therians with non-invasive placentas and their ancestral lineages. FCM with K=3–5 clustered the ancestral eutherian (AncEutheria) transcriptome with extant species that have invasive hemochorial placentas and clustered the ancestral therian (AncTheria) and mammalian (AncMammalia) transcriptomes with extant mammals that have non-invasive epitheliochorial placentas. These data suggest that FCM clustering of extant and ancestral reconstructions of endometrial transcriptomes can predict ancestral placenta invasiveness, implying that an epitheliochorial placenta evolved early in the development of mammalian pregnancy and that a hemochorial placenta is ancestral for eutherians.
While FCM clustering generally groups extant and ancestral transcriptomes by phylogenetic relatedness and degree of placental invasiveness, a notable exception is the marsupial fat-tailed dunnart (Sminthopsis crassicaudata). FCM clusters with K=2–5 grouped dunnart with non-therians while FCM K=6 clustered dog, dunnart, and the skink, C. ocellatus (Figure 4). FCM cluster membership coefficients of all three species with K=2–5 were mixed, with significant membership across clusters. In contrast, dog, dunnart, and C. ocellatus formed a distinct cluster from all other species at K=6 with nearly 100% FCM cluster membership in group 6 (Figure 4). Remarkably, both dog and dunnart have endotheliochorial placentas whereas the Chalcides maternal-fetal interface has been described as either endotheliochorial or epitheliochorial with extensive vascularization and interdigitating folds of hypertrophied uterine and chorioallantoic tissue (Blackburn, 1993; Blackburn and Callard, 1997; Corso et al., 2000). These data suggest that species with endotheliochorial placentas have a gene expression profile that is intermediate between epitheliochorial and hemochorial placentas, yet is also distinct, and that gene expression at the Chalcides maternal-fetal interface is converging on a therian-like endotheliochorial pattern.
Convergent loss of RORA in species with epitheliochorial placentas
Among the genes with discordant species- and gene-expression phylogenies is RAR-related orphan receptor alpha (RORA), an orphan nuclear hormone receptor that regulates the development and function of type 2 and type 3 innate lymphoid cells (ILC2 and ILC3), which negatively regulates the immune system in local microenvironments especially during inflammation (Haim-Vilmovsky et al., 2019), the establishment of tolerance to intestinal microbiota (Lyu et al., 2022), and the regulation of inflammatory responses and vascular remodeling during placentation and pregnancy (Balmas et al., 2018; Mendes et al., 2020; Miller et al., 2018). Remarkably, RORA independently lost endometrial expression at least four times, each loss coincident with the independent evolution of non-invasive epitheliochorial placentas (Figure 5A). The restricted expression of RORA to immune cells in the human first trimester decidua (Vento-Tormo et al., 2018; Figure 4B) suggests that the repeated loss of RORA expression reflects changes in the composition of immune cells at the maternal-fetal interface in species with epitheliochorial placentas. To explore this possibility, we used CIBERSORT to deconvolve endometrial bulk RNA-Seq datasets from each species into cell-type abundance estimates with a signature gene expression matrix composed of cell-types from the human first trimester maternal-fetal interface. We found that CIBERSORT inferred that nearly all species that lacked RORA expression also lacked ILC2/3 cells at the maternal-fetal interface with the exception of armadillo and chicken which had RORA expression but were inferred to lack endometrial ILC2/3 cells, and the viviparous Saiphos population which has an epitheliochorial placenta and RORA expression and was inferred to have a population of ILC2/3 cells. 
Thus, we conclude that there is convergent loss of RORA in species with epitheliochorial placentas which in many species correlates with the convergent loss of ILC2/3 cells at the maternal-fetal interface.
Discussion
One of the central goals of DevoEvo is a mechanistic explanation for the origin and evolution of evolutionary novelties (Amundson, 2005; Brigandt and Love, 2010; Lynch, 2022; Wagner, 2001; Wagner, 2000; Wagner and Larsson, 2003). A major challenge for reconstructing the origins of evolutionary novelties, however, is a lack of transitional forms among living species. For example, while feathers and the turtle’s shell are excellent examples of morphological novelties, there are no living species with transitional forms between scale and feather (Chen et al., 2015) or with protoshells (Lyson and Bever, 2020). Despite the lack of transitional forms among extant taxa, an abundance of fossil data has reconstructed the major steps in the evolution of these structures that, when combined with molecular studies of their development, provides a rich explanation for their developmental evolution. Unfortunately, many features of soft tissues leave little to no trace in the fossil record, thus the steps in their evolution have remained elusive. Here, we used ancestral transcriptome reconstruction to trace gene expression changes during the origins of mammalian pregnancy, in which extant species preserve intermediate stages in the evolution of both pregnancy and a diversity of placenta types, and to reconstruct the evolution of placental invasiveness of ancestral species.
Binary encoding uncovers hidden biological signal
While transforming gene expression count data into binary categories likely loses information about evolutionarily relevant variation in expression levels, it may not be possible to infer meaningful gene expression levels from bulk RNA-Seq. For example, rather than evolutionary differences between individuals or species, variation in transcript abundance between samples can result from various sources of experimental noise such as technical variation in library preparation, sequencing, or batch effects (Gilad and Mizrahi-Man, 2015; Tung et al., 2017), variation in cell-type composition of a tissue (Price et al., 2022), sampling different timepoints in development or only a few individuals that do not capture the variance properties of gene expression levels within a population or species (Pal et al., 2020; Thompson et al., 2020). Thus, by transforming gene expression data into not/expressed states we may reduce the potential for these and other biases to influence our ancestral transcriptome reconstructions, revealing biological signal. Our binary encoded endometrial gene expression dataset, for example, clusters species with similar placenta types and parity mode (Figure 2B), indicating that binary encoding preserves functional signal in gene expression data that is otherwise masked by variation in gene expression levels (Figure 2A).
Phylogenetic analyses of endometrial transcriptomes
Previous studies of molecular phylogenies have shown that gene-species tree discordance can result from convergent or parallel amino acid changes in lineages that independently evolved morphological traits. For example, parallel amino acid changes in prestin (SLC26A5), a motor protein expressed in outer hair cells in the cochlea that is essential for hearing, have occurred in different lineages of echolocating mammals, including multiple lineages of bats and whales, leading to strongly supported monophyletic clades in gene trees that do not reflect taxonomic relationships (Li et al., 2008; Li et al., 2010; Liu et al., 2010; Teeling, 2009). Gene-species tree discordance is also associated with coadaptation of amino acid substitutions in Na+K+–ATPase of frogs that prey on toxic toads (Mohammadi et al., 2021), the evolution of C4 phosphoenolpyruvate carboxykinases (PCK) in grasses (Christin et al., 2009), and between snake and agamid lizard mitochondrial genomes (Castoe et al., 2009). These examples highlight how phylogenetic discordance can be a signal of convergent evolution, rather than just a sign of biased phylogenetic inference that can arise from processes like long branch attraction (Bergsten, 2005; Felsenstein, 1978), incomplete lineage sorting and introgression (Guerrero and Hahn, 2018; Hibbins et al., 2020; Maddison and Knowles, 2006), biased character state frequencies and gene conversion (Figuet et al., 2014; Kostka et al., 2012; Lartillot, 2013; Romiguier et al., 2013), and heterotachy (Kolaczkowski and Thornton, 2008; Philippe et al., 2005), among others.
Similar to the signals of convergence from gene-species tree discordance, our transcriptomic data showed significant phylogenetic support for uterine transcriptomes grouping by parity mode and degree of placental invasiveness rather than phylogenetic relationships. This transcriptome-species tree discordance is particularly striking for opossum and wallaby, which are deeply nested within eutherians with epitheliochorial placentas based on uterine transcriptome data, because the eutherian and marsupial placentas are derived from different extra-embryonic tissues – the chorion and allantois in eutherians and the yolk-sac in marsupials. Thus, there must be significant convergence in endometrial gene expression profiles between eutherians with chorioallantois-derived epitheliochorial placentas and marsupials with yolk-sac-derived epitheliochorial placentas because our transcriptome phylogeny is based on gene expression in the endometrium during pregnancy rather than gene expression in the placenta. This convergence in gene expression between species with non-invasive epitheliochorial placentas may be related to the expression of genes that limit the ability of the trophoblast to invade into maternal tissues (see below).
Reconstruction of ancestral (endometrial) transcriptomes
Previous studies have explored transcriptome evolution with the goal of developing methods of ancestral transcriptome inference (Price et al., 2022), characterizing the general tempo and mode of gene expression evolution (Bauernfeind et al., 2021), implicating transcriptome evolution in the origin and evolution of morphological traits (Church et al., 2021; Mika et al., 2021b; Lynch et al., 2015; Marinić et al., 2021; Munro et al., 2021), and identifying specific genes with derived expression levels (Brawand et al., 2011; Necsulea et al., 2014), usually in the context of characterizing genes whose expression levels have changed because of the action of positive selection, that is, directional selection on transcript abundance (Gu, 2004; Price et al., 2022; Yang et al., 2020). In contrast, only a few studies have treated the expression of individual genes as a discrete character, transforming quantitative gene expression levels into binary not/expressed states, and used ancestral state reconstruction to determine the expression state of all genes in an ancestral transcriptome rather than changes in the expression levels of specific genes (Mika et al., 2021b; Kin et al., 2015; Lynch et al., 2015; Marinić et al., 2021). Our ancestral transcriptome reconstructions of gene expression state indicate that an epitheliochorial-like placenta evolved early in the mammalian stem-lineage, before the loss of the egg-shell in Therians, and that the ancestor of Eutherians had a hemochorial placenta, which resolves a longstanding debate about the nature of ancestral mammalian placentas. These data suggest that ancestral transcriptome reconstruction can be used to infer the function of ancestral cell, tissue, and organ systems which leave little to no trace in the fossil record even if soft tissues might by chance fossilize.
Convergent loss of RORA in species with epitheliochorial placentas
We observed that RORA expression has been lost multiple times in species with non-invasive epitheliochorial placentas, coincident with the inferred absence of ILC2/ILC3 cells from the endometrium during at least one stage in pregnancy. This result suggests a mechanistic connection between absence of decidual RORA expression, the absence of decidual ILC2/ILC3 cells, and the loss of placental invasion. Indeed, the major decidual ILCs produce signaling factors such as GM-CSF, XCL1, MIP1α, and MIP1β, whose receptors are expressed by extravillous trophoblasts (EVT) and which likely regulate placental invasion (Huhn et al., 2020). ILC2 also contributes to the maintenance of a type-2 anti-inflammatory immune environment in the uterus during pregnancy (Balmas et al., 2018), which may be particularly important in species in which the endotheliochorial or hemochorial placenta invades maternal tissues. ILC2s, for example, are increased in the decidua basalis of women with spontaneous preterm labor compared to those who delivered preterm without labor (Mendes et al., 2021; Xu et al., 2018), suggesting that ILCs may participate in the chronic inflammatory process that occurs during pregnancy. Thus, loss of placental invasion may be associated with a reduced need to limit local inflammation by ILCs, leading to an evolutionary loss of ILCs in the endometrium during pregnancy. However, while our data are consistent with this role for decidual ILC2 and its association with placental invasion, more detailed studies specifically aimed at characterizing cell-type composition differences across species are necessary to determine if there are correlations between cell-types in the endometrium during pregnancy and placenta type.
Ideas and speculation: Maternal control of placental invasion
Our observation that endometrial gene expression patterns are correlated with degree of placental invasiveness might seem surprising; however, rather than acting as a passive substrate into which the trophoblast invades, the endometrium directly controls trophoblast invasion (Cui et al., 2012; Graham and Lala, 1991). For example, the trophoblast of mammals with hemochorial placentas, such as humans and rodents, is only permissive to invasion when the ‘window of implantation’ is opened by the endometrium. Similarly, while the trophoblast of mammals with non-invasive endotheliochorial and epitheliochorial placentas, such as cats, dogs, horses, cows, pigs, and sheep, cannot invade into the endometrium, it can invade into ectopic sites (reviewed in Corpa, 2006). The trophoblasts of guinea pig (Loeb, 1914), mouse (Billington, 1965), rat (Jollie, 1961), and pig (Samuel and Perry, 1972) also invade ectopic sites in experimentally induced ectopic pregnancy. While there is no similar ectopic pregnancy data for marsupials, some marsupial lineages, including dunnart, have evolved invasive placentation. In contrast, there is no invasion of maternal tissues during ectopic pregnancy in viviparous reptiles with epitheliochorial placentation such as Pseudemoia entrecasteauxii (Griffith et al., 2013). Thus, the invasive ability of trophoblasts most likely either evolved in the stem-lineage of therian mammals or multiple times, including in the stem-lineage of eutherians and some lineages of marsupials. Regardless, ancestral maternal control of placental invasion likely allowed us to infer ancestral placental invasiveness from ancestral endometrial transcriptomes.
Ideas and speculation: Evolution of placental invasion and cancer metastasis
Numerous authors have noted the similarity between placental invasion and cancer metastasis (Costanzo et al., 2018; Ferretti et al., 2007; Kozlov, 2022; Lala et al., 2021; Manzo, 2019; Murray and Lessey, 2008; Perry et al., 2009; Piechowski, 2019), which was first proposed in 1902 by Scottish embryologist John Beard (1858–1924) who hypothesized that ectopic trophoblasts gave rise to cancer (Gurchot, 1975; Ross, 2015). While Beard’s hypothesis is incorrect, at least for cancers not derived from the placenta such as choriocarcinoma, there are numerous mechanistic similarities between implantation, placental invasion, and tumor progression to malignancy (Nordor et al., 2017; Wagner et al., 2022). These data suggest that the evolution of maternal mechanisms that prevent endometrial invasion through expression gain and loss of genes that restrain and promote, respectively, trophoblast invasion, may be related to resistance to metastasis in eutherian lineages with non-invasive placentas (Boddy et al., 2020; Afzal et al., 2019; Wagner et al., 2020). For example, pleiotropy can lead to correlated patterns of gene expression between the transcriptomes of different tissue and organ systems (Liang et al., 2018). Thus, the evolution of a gene regulatory module that restricts placental invasion into the endometrium can be coopted to restrict implantation and spread of cancer cells into metastatic locations.
Caveats and limitations
A limitation of this study not directly addressed thus far is that we have only sampled a small number of species. For example, we lack pregnant endometrial samples from most mammals, particularly those with endotheliochorial placentas, as well as a diversity of oviparous and viviparous squamates; there are at least 115 origins of viviparity in squamates (Blackburn, 2015; Blackburn and Brandley, 2015; Blackburn and Starck, 2015). Thus, our inferences from phylogenetic, ancestral reconstruction, and clustering analyses may be biased by small sample sizes and non-random sampling. We also assume that models of evolution designed for phylogenetic inference and ancestral reconstruction of morphological and molecular data are appropriate for gene expression data or binary encoded gene expression data, which may affect our results; for example, we have not directly accommodated incomplete lineage sorting, which can mislead phylogenetic inference (Guerrero and Hahn, 2018; Hibbins et al., 2020). Similarly, while Fuzzy C-Means clustering is conceptually similar to topic (‘grade of membership’) models used in population genetics, its underlying assumptions may be violated for gene expression and binary encoded gene expression data. More detailed studies are necessary to determine if our results are robust to potential sources of error such as model mis-specification, small sample sizes, and non-random taxon sampling, as well as incomplete lineage sorting, the latter of which we were unable to directly test.
Conclusions
Previous critiques of statistical methods to infer ancestral states, particularly in the context of parity mode evolution in squamates, have suggested that ancestral state reconstructions of morphological characters must be supported by additional kinds of biological support such as anatomical, physiological, and ecological evidence, to be persuasive (Griffith et al., 2015). Here we explored the evolution of parity mode and placental invasiveness in amniotes utilizing comparative gene expression data. While our study also relies on statistical methods to infer ancestral (gene expression) states, this approach is orthogonal to traditional methods that infer ancestral states from morphological characters among extant species. Indeed, gene expression ultimately underlies the development, evolution, and function of anatomical systems. Thus, by reconstructing the evolution of entire transcriptomes we may be able to infer function of ancestral cell, tissue, and organ systems. Our results resolve several evolutionary transformations during the origins of pregnancy, including the early evolution of an epitheliochorial-like placenta in the mammalian stem-lineage, a hemochorial placenta in the ancestor of eutherians, multiple reversions to non-invasive epitheliochorial placentas within some eutherian lineages, convergent evolution of gene expression profiles among species with different ontogenetic origins of epitheliochorial placentas, and maternal control of placental invasiveness.
Materials and methods
Endometrial gene expression profiling
We previously published a dataset of uterine endometrial transcriptomes, which is also used in this study. Interested readers are referred to Mika et al., 2021b for specific details. Briefly, we searched the NCBI BioSample, Short Read Archive (SRA), and Gene Expression Omnibus (GEO) databases using the search terms ‘uterus’, ‘endometrium’, ‘decidua’, ‘oviduct’, and ‘shell gland’. These anatomical terms refer to the glandular portion of the female reproductive tract, which is specialized for maternal-fetal interactions or shell formation. We then manually curated transcriptomes and excluded those that did not indicate whether tissue samples were from pregnant or gravid tissues and datasets composed of pooled tissues. Gene expression data were analyzed with Kallisto (Bray et al., 2016) version 0.42.4 to pseudo-align the raw RNA-Seq reads to reference transcriptomes and to generate transcript abundance estimates (see Figure 2—source data 1 for accession numbers and reference genome assemblies); Kallisto was run using default parameters, bias correction, and 100 bootstrap replicates.
Gene expression phylogeny and ancestral transcriptome reconstruction
We used the binary encoded endometrial transcriptome dataset for phylogenetic analyses and to reconstruct ancestral gene expression states. Gene expression phylogenies were inferred with IQ-TREE2 (Nguyen et al., 2015) using the best-fitting model of character evolution determined by ModelFinder (Kalyaanamoorthy et al., 2017). The best-fitting model was inferred to be the General Time Reversible model for binary data (GTR2), with character state frequencies optimized by maximum likelihood (FO), and a FreeRate model of among-site rate heterogeneity with three categories (R3) (Soubrier et al., 2012). The rate at which characters evolve may vary over time, with the same character evolving rapidly or slowly in different lineages. This phenomenon, known as heterotachy, can bias phylogenetic trees inferred using models of evolution that assume the rates of character evolution are constant over time, such as the GTR+FO+R3 model. Therefore, we compared the GTR2+FO+R3 model to the General Heterogeneous evolution On a Single Topology (GHOST) model. The GHOST model accommodates heterotachy by combining features of mixed substitution rate models (Foster, 2004; Lartillot, 2013; Pagel and Meade, 2004), whereby each class of characters has its own substitution rate matrix, and mixed branch length models (Kolaczkowski and Thornton, 2008; Pagel and Meade, 2004), whereby each class of characters has its own set of branch lengths.
We found that the best-fit GHOST model, GTR2+FO*H4 (AICc = 215681.71), which accommodates heterotachy with four categories, described the binary encoded gene expression dataset better than the standard GTR2+FO+R3 model (AICc = 220930.65), indicating that there is extensive variation in the rate of character evolution over time. However, while there was a significant likelihood difference between GTR2+FO+R3 and GTR2+FO*H4 (AICc difference = 5248.94), as well as heterotachy models with fewer or more than four categories, the topology of the trees was the same. Thus, while accommodating heterotachy improves estimation of the parameters of the substitution model, it had no effect on the tree topology, and we therefore used the computationally simpler GTR2+FO+R3 model for downstream analyses. This is particularly important for non-parametric topology tests and ancestral state reconstructions (discussed below), which, as currently implemented in IQ-TREE2, cannot accommodate heterotachy models with unlinked model parameters.
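The model comparison above reduces to simple arithmetic on the two reported AICc values. The short sketch below recomputes the difference and picks the lower-AICc model. The `aicc` helper is the generic textbook formula, included only for illustration; its inputs (log-likelihood, number of free parameters, number of characters) are hypothetical, since those quantities are not reported in this section.

```python
def aicc(log_likelihood, k, n):
    """Corrected Akaike information criterion (generic textbook form).

    log_likelihood: maximized log-likelihood of the model
    k: number of free parameters
    n: number of observations (here, characters); requires n > k + 1
    """
    aic = 2 * k - 2 * log_likelihood
    return aic + (2 * k * (k + 1)) / (n - k - 1)


# AICc values as reported in the text.
AICC_GHOST = 215681.71      # GTR2+FO*H4
AICC_STANDARD = 220930.65   # GTR2+FO+R3

# Lower AICc indicates the better-fitting model.
delta_aicc = round(AICC_STANDARD - AICC_GHOST, 2)
better_model = "GTR2+FO*H4" if AICC_GHOST < AICC_STANDARD else "GTR2+FO+R3"
print(delta_aicc, better_model)
```

The difference matches the value quoted in the text, and the lower AICc belongs to the GHOST model, consistent with the conclusion that it describes the data better even though the topology is unchanged.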
Ancestral gene expression states for each gene were inferred using the empirical Bayesian method implemented in IQ-TREE2, the GTR2+FO+R3 model of character evolution, and the species phylogeny as a constraint tree (Figure 3B). Branch support was assessed using the standard (StdBoot) and ultrafast (UFBoot) bootstraps, which assess the effects of sampling bias on branch support (Hoang et al., 2018; Minh et al., 2020). We also used several single branch tests, including the SH-like aLRT and the parametric aLRT (Anisimova et al., 2006; Guindon et al., 2010), aBayes (Anisimova et al., 2011), and the local bootstrap (LBoot) test (Minh et al., 2020). Single branch tests assess whether a branch provides a significant likelihood improvement compared to a null hypothesis that collapses the branch to a polytomy but leaves the rest of the tree topology unaltered. We considered a clade to be highly supported if its StdBoot support was ≥80%, UFBoot ≥95%, SH-aLRT ≥80%, aBayes ≥0.90, parametric aLRT ≥0.95, and LBoot ≥90% (Anisimova et al., 2011).
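The "highly supported" criterion is a conjunction of six cutoffs, which can be written as a small check. This is an illustrative sketch only: the dictionary keys, the function name `is_highly_supported`, and the example support values are hypothetical, while the thresholds themselves are those stated above.

```python
# Thresholds from the text. Bootstrap-style measures and SH-aLRT are
# percentages (0-100); aBayes and parametric aLRT are on a 0-1 scale.
THRESHOLDS = {
    "StdBoot": 80.0,
    "UFBoot": 95.0,
    "SH-aLRT": 80.0,
    "aBayes": 0.90,
    "aLRT": 0.95,
    "LBoot": 90.0,
}


def is_highly_supported(support):
    """Return True only if every support measure meets its threshold."""
    return all(support[name] >= cutoff for name, cutoff in THRESHOLDS.items())


# Hypothetical support values for two clades (not taken from the paper).
strong_clade = {"StdBoot": 92, "UFBoot": 99, "SH-aLRT": 95,
                "aBayes": 0.99, "aLRT": 0.98, "LBoot": 94}
weak_clade = dict(strong_clade, UFBoot=90)  # fails only the UFBoot cutoff

print(is_highly_supported(strong_clade), is_highly_supported(weak_clade))
```

Requiring all measures to pass is the strict reading of the sentence above; a clade that fails any single cutoff is not counted as highly supported.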
The bootstrap and single branch tests assess the robustness of individual branch bipartitions and cannot directly compare complex alternate tree topologies. Therefore, we used non-parametric topology tests to directly compare the inferred ML tree to alternative trees with the correct phylogenetic placement of platypus, armadillo, dog, marsupials (opossum and wallaby), and bat, as well as the correct species phylogeny (Figure 2B); tests included the BP-RELL, KH-test (Kishino et al., 1990; Kishino and Hasegawa, 1989), SH-test (Anisimova et al., 2011; Guindon et al., 2010), c-ELW (Strimmer and Rambaut, 2002), weighted KH- and SH-tests, and the AU-test (Shimodaira and Goldman, 2002). We note that the KH-test compares two a priori defined trees rather than the ML and alternative trees (Goldman et al., 2000) and does not correct for multiple hypothesis tests; it is included solely for comparison to other methods and previous studies. The SH-test can be used to compare the ML tree to multiple alternative trees selected a priori (i.e. is dataset independent) and corrects for multiple hypothesis tests, but is too conservative when many trees are tested. The AU-test, in contrast, resolves the conservative nature of the SH-test and thus is the preferred test. All tests performed 100,000 resamplings using the RELL method.
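The RELL (resampling of estimated log-likelihoods) idea behind the BP-RELL values can be sketched in a few lines: per-character log-likelihoods are resampled with replacement, summed for each candidate tree, and the fraction of resamples in which each tree scores highest is reported. This is a minimal illustration, not the IQ-TREE2 implementation. The function name `bp_rell`, its input layout, and the toy log-likelihoods are hypothetical, and the default of 1,000 resamples is chosen only to keep the example fast (the analyses above used 100,000).

```python
import random


def bp_rell(site_lnl, n_resamples=1000, seed=0):
    """Bootstrap proportions by resampling per-site log-likelihoods.

    site_lnl: dict mapping tree name -> list of per-site log-likelihoods;
              all lists must have the same length and site order.
    Returns a dict mapping tree name -> fraction of resamples won.
    """
    rng = random.Random(seed)
    names = list(site_lnl)
    n_sites = len(site_lnl[names[0]])
    wins = {name: 0 for name in names}
    for _ in range(n_resamples):
        # Draw one bootstrap pseudo-replicate of site indices.
        idx = [rng.randrange(n_sites) for _ in range(n_sites)]
        totals = {name: sum(site_lnl[name][i] for i in idx) for name in names}
        # Ties go to the first tree listed.
        best = max(names, key=lambda name: totals[name])
        wins[best] += 1
    return {name: wins[name] / n_resamples for name in names}


# Toy data: the "ml" tree fits every site better than "alt".
toy = {
    "ml": [-1.0, -1.2, -0.8, -1.1],
    "alt": [-2.0, -2.2, -1.8, -2.1],
}
proportions = bp_rell(toy, n_resamples=200, seed=1)
print(proportions)
```

Because no tree search is repeated for each replicate, the resampling is cheap, which is what makes very large numbers of replicates practical.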
Clustering methods
We evaluated multiple methods to summarize and visualize the binary encoded extant and ancestral reconstructed transcriptomes, including: (1) Logistic Principal Component Analysis (LPCA), a version of principal component analysis for dimensionality reduction of binary data (https://cran.r-project.org/web/packages/logisticPCA/vignettes/logisticPCA.html); (2) classical Multi-Dimensional Scaling (MDS); (3) Uniform Manifold Approximation and Projection (UMAP); (4) tSNE; and (5) Fuzzy C-Means (FCM) clustering. All clustering analyses were conducted in R after removing columns (genes) with missing data (coded as ?/NA) or that were invariant (all 0 or all 1). LPCA was performed using the logisticPCA R package (Landgraf and Lee, 2015), which implements three methods: exponential family PCA applied to Bernoulli data, logistic PCA, and the convex relaxation of logistic PCA. For each of the methods, we fit the parameters assuming two-dimensional representation, returning four principal components (ks = 4), and selecting the best m value to approximate the saturated model for cross validation. MDS was performed using the vegan R package (Oksanen et al., 2008) with four reduced dimensions. UMAP was performed using the umap R package. tSNE was performed using the Rtsne R package.
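The column filtering step that precedes all clustering analyses is easy to state precisely in code. The analyses themselves were run in R; the Python sketch below only illustrates the rule (drop genes with any missing value, drop genes that are all 0 or all 1). The function name `filter_genes`, the use of `None` for missing data, and the toy matrix are assumptions made for this example.

```python
def filter_genes(matrix):
    """Drop gene columns that contain missing data or are invariant.

    matrix: list of rows (one per sample), each a list of 0, 1, or None,
            where None stands for a missing value.
    Returns (indices of kept columns, filtered matrix).
    """
    n_cols = len(matrix[0])
    keep = []
    for j in range(n_cols):
        column = [row[j] for row in matrix]
        if any(value is None for value in column):
            continue  # gene has missing data in at least one sample
        if len(set(column)) == 1:
            continue  # gene is all 0 or all 1, so it carries no signal
        keep.append(j)
    filtered = [[row[j] for j in keep] for row in matrix]
    return keep, filtered


# Toy matrix: 3 samples x 4 genes (hypothetical values).
toy_matrix = [
    [1, 0, None, 1],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
]
kept_columns, filtered_matrix = filter_genes(toy_matrix)
print(kept_columns, filtered_matrix)
```

Only the first gene survives here: the second and fourth are invariant, and the third has a missing value.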
To explore the data in greater detail we focused on FCM clustering because the results were qualitatively similar to the other methods, and it has several desirable properties including providing a statistically sound way to identify clusters rather than an ad hoc approach that might be applied to the other methods. FCM also allows each sample to have membership in multiple clusters and is conceptually similar to topic (‘grade of membership’) models used in population genetics to visualize private and shared genetic structure across populations. FCM membership coefficients can thereby account for multiple sources of similarity including noise, phylogenetic signal, and convergence of gene expression. FCM was performed in R using the R package, using Manhattan distances (cluster membership was not altered by using other distance metrics), and an estimated fuzzifier (m=1.034978). FCM clustering requires a priori knowledge of the number of clusters (K) to include; therefore, we evaluated FCM with K=2–9 following the suggestions given in https://www.r-bloggers.com/2019/01/10-tips-for-choosing-the-optimal-number-of-clusters/. First, we used the “elbow” method, in which the sum of squares of each cluster number is calculated and graphed and the optimal number of clusters estimated by a change of slope from steep to shallow (the elbow). We also assessed the optimal number of clusters using the clustree R package, which assesses the optimal number of clusters by considering how samples change groupings as the number of clusters increases; clustree is useful for estimating which clusters are distinct and which are unstable but cannot determine the optimal number of clusters (K).
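To make the role of the fuzzifier concrete, the sketch below computes FCM membership coefficients for a single sample given fixed cluster centers, using the standard membership formula with Manhattan distances substituted for the distance term. It is an illustration under stated assumptions rather than the R implementation used in the study: the function names, the toy sample, and the toy centers are hypothetical, and only the membership step (not the iterative center update) is shown.

```python
import math


def manhattan(a, b):
    """Manhattan (city-block) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def fcm_memberships(sample, centers, m=1.034978):
    """Membership of one sample in each cluster (standard FCM formula).

    u_i = d_i^(-p) / sum_j d_j^(-p), with p = 2 / (m - 1), which is
    algebraically the same as 1 / sum_j (d_i / d_j)^p. Computed in log
    space because p is large when the fuzzifier m is close to 1.
    """
    dists = [manhattan(sample, center) for center in centers]
    zeros = [i for i, d in enumerate(dists) if d == 0]
    if zeros:
        # Sample coincides with one or more centers: share full membership.
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(centers))]
    power = 2.0 / (m - 1.0)
    logs = [-power * math.log(d) for d in dists]
    top = max(logs)
    weights = [math.exp(value - top) for value in logs]
    total = sum(weights)
    return [w / total for w in weights]


# Toy binary profiles (hypothetical): one sample, two cluster centers.
sample = [1, 0, 1, 1]
centers = [[1, 0, 1, 0], [0, 1, 0, 0]]
memberships = fcm_memberships(sample, centers)
print(memberships)
```

With a fuzzifier this close to 1 the exponent is large (about 57), so memberships are nearly hard assignments unless a sample is almost equidistant from two centers; larger values of m would spread membership more evenly.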
CIBERSORT analyses
We inferred the proportion of different cell-types in the endometrium during pregnancy across our comparative gene expression datasets using CIBERSORT (Newman et al., 2019), which takes as input a file of gene expression levels from a mixed cell population and a gene expression signature file with expression levels of marker genes in specific cell-types. CIBERSORT was run on the bulk RNA-Seq data from each species using a signature gene expression file based on the Vento-Tormo et al. scRNA-Seq dataset. The signature gene expression file included gene expression data (TPM-like) for genes with an expression level greater than or equal to the expression threshold that also have at least fivefold higher expression levels in a particular cell-type compared to all other cell-types (Jain and Tuteja, 2021).
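The rule used to pick signature genes (expression at or above a threshold, and at least fivefold higher in one cell-type than in every other) can be expressed as a simple filter. The sketch below is an illustration of that rule only, not the procedure of Jain and Tuteja (2021) or CIBERSORT itself. The function name, the gene and cell-type labels, the expression values, and the threshold of 1.0 are all hypothetical, since the threshold value is not given in this section.

```python
def signature_genes(expr, min_expression, fold=5.0):
    """Select marker genes for a signature file.

    expr: dict mapping gene -> dict mapping cell type -> expression level.
    A gene is a marker for a cell type if its level there is at least
    `min_expression` and at least `fold` times its level in every other
    cell type. Returns a dict mapping gene -> marker cell type.
    """
    markers = {}
    for gene, by_type in expr.items():
        for cell_type, level in by_type.items():
            others = [v for t, v in by_type.items() if t != cell_type]
            if level >= min_expression and all(level >= fold * v for v in others):
                markers[gene] = cell_type
                break
    return markers


# Hypothetical TPM-like values for three genes across three cell types.
toy_expr = {
    "gene_1": {"type_A": 50.0, "type_B": 5.0, "type_C": 2.0},   # marker of A
    "gene_2": {"type_A": 10.0, "type_B": 8.0, "type_C": 9.0},   # not specific
    "gene_3": {"type_A": 0.5, "type_B": 0.01, "type_C": 0.02},  # too low
}
selected = signature_genes(toy_expr, min_expression=1.0)
print(selected)
```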
Data availability
All data generated or analysed during this study are included in the manuscript and supporting file; Source Data files have been provided for Figures 2, 3 and 5.
References
- Evolution of placental invasion and cancer metastasis are causally linked. Nature Ecology & Evolution 3:1743–1753. https://doi.org/10.1038/s41559-019-1046-4
- Book: The Changing Role of the Embryo in Evolutionary Thought: Roots of Evo-Devo. Cambridge University Press. https://doi.org/10.1017/CBO9781139164856
- A review of long-branch attraction. Cladistics 21:163–193. https://doi.org/10.1111/j.1096-0031.2005.00059.x
- The invasiveness of transplanted mouse trophoblast and the influence of immunological factors. Journal of Reproduction and Fertility 10:343–352. https://doi.org/10.1530/jrf.0.0100343
- Viviparous placentotrophy in reptiles and the parent-offspring conflict. Journal of Experimental Zoology. Part B, Molecular and Developmental Evolution 324:532–548. https://doi.org/10.1002/jez.b.22624
- Lifetime cancer prevalence and life history traits in mammals. Evolution, Medicine, and Public Health 2020:187–195. https://doi.org/10.1093/emph/eoaa015
- Near-optimal probabilistic RNA-seq quantification. Nature Biotechnology 34:525–527. https://doi.org/10.1038/nbt.3519
- Evolutionary Novelty and the Evo-Devo Synthesis: Field Notes. Evolutionary Biology 37:93–99. https://doi.org/10.1007/s11692-010-9083-6
- Comparative aspects of trophoblast development and placentation. Reproductive Biology and Endocrinology 2:46. https://doi.org/10.1186/1477-7827-2-46
- Development, regeneration, and evolution of feathers. Annual Review of Animal Biosciences 3:169–195. https://doi.org/10.1146/annurev-animal-022513-114127
- Evolution of C(4) phosphoenolpyruvate carboxykinase in grasses, from genotype to phenotype. Molecular Biology and Evolution 26:357–365. https://doi.org/10.1093/molbev/msn255
- Ectopic pregnancy in animals and humans. Reproduction (Cambridge, England) 131:631–640. https://doi.org/10.1530/rep.1.00606
- Placental invasiveness mediates the evolution of hybrid inviability in mammals. The American Naturalist 168:114–120. https://doi.org/10.1086/505162
- Cases in which Parsimony or Compatibility Methods will be Positively Misleading. Systematic Biology 27:401–410. https://doi.org/10.1093/sysbio/27.4.401
- Biased gene conversion and GC-content evolution in the coding sequences of reptiles and vertebrates. Genome Biology and Evolution 7:240–250. https://doi.org/10.1093/gbe/evu277
- Modeling compositional heterogeneity. Systematic Biology 53:485–495. https://doi.org/10.1080/10635150490445779
- The mammalian yolk sac placenta. Journal of Experimental Zoology. Part B, Molecular and Developmental Evolution 312:545–554. https://doi.org/10.1002/jez.b.21239
- Maximum-likelihood phylogenetic analysis under a covarion-like model. Molecular Biology and Evolution 18:866–873. https://doi.org/10.1093/oxfordjournals.molbev.a003868
- Likelihood-based tests of topologies in phylogenetics. Systematic Biology 49:652–670. https://doi.org/10.1080/106351500750049752
- Mechanism of control of trophoblast invasion in situ. Journal of Cellular Physiology 148:228–234. https://doi.org/10.1002/jcp.1041480207
- Ancestral state reconstructions require biological evidence to test evolutionary hypotheses: A case study examining the evolution of reproductive mode in squamate reptiles. Journal of Experimental Zoology. Part B, Molecular and Developmental Evolution 324:493–503. https://doi.org/10.1002/jez.b.22614
- II: Croonian lecture - the developmental history of the primates. Philosophical Transactions of the Royal Society of London. Series B, Containing Papers of a Biological Character 221:45–178. https://doi.org/10.1098/rstb.1932.0002
- UFBoot2: Improving the Ultrafast Bootstrap Approximation. Molecular Biology and Evolution 35:518–522. https://doi.org/10.1093/molbev/msx281
- Early development and embryology of the platypus. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 353:1101–1114. https://doi.org/10.1098/rstb.1998.0269
- The incidence of experimentally produced abdominal implantations in the rat. The Anatomical Record 141:159–167. https://doi.org/10.1002/ar.1091410209
- Period of gestation and body weight in some placental mammals. Comparative Biochemistry and Physiology Part A 43:673–679. https://doi.org/10.1016/0300-9629(72)90254-X
- Maximum likelihood inference of protein phylogeny and the origin of chloroplasts. Journal of Molecular Evolution 31:151–160. https://doi.org/10.1007/BF02109483
- A mixed branch length model of heterotachy improves phylogenetic accuracy. Molecular Biology and Evolution 25:1054–1066. https://doi.org/10.1093/molbev/msn042
- The role of GC-biased gene conversion in shaping the fastest evolving regions of the human genome. Molecular Biology and Evolution 29:1047–1057. https://doi.org/10.1093/molbev/msr279
- Phylogenetic patterns of GC-biased gene conversion in placental mammals and the evolutionary dynamics of recombination landscapes. Molecular Biology and Evolution 30:489–502. https://doi.org/10.1093/molbev/mss239
- The hearing gene Prestin unites echolocating bats and whales. Current Biology 20:R55–R56. https://doi.org/10.1016/j.cub.2009.11.042
- Pervasive Correlated Evolution in Gene Expression Shapes Cell and Tissue Type Transcriptomes. Genome Biology and Evolution 10:538–552. https://doi.org/10.1093/gbe/evy016
- The experimental production of an early stage of extrauterine pregnancy. Experimental Biology and Medicine 11:103–106. https://doi.org/10.3181/00379727-11-64
- Comparative development and evolution of the placenta in primates. Contributions to Primatology 3:142–234.
- Book: A Multidisciplinary Approach. In: Patrick Luckett W, Frederick SS, editors. Phylogeny of the Primates. New York: Springer. pp. 157–182. https://doi.org/10.1007/978-1-4684-2166-8
- Cladistic relationships among primate higher categories: evidence of the fetal membranes and placenta. Folia Primatologica; International Journal of Primatology 25:245–276. https://doi.org/10.1159/000155719
- Evolution: Stress fans the flames of innovation. Current Biology 32:R158–R160. https://doi.org/10.1016/j.cub.2022.01.030
- Origin and Evolution of the Turtle Body Plan. Annual Review of Ecology, Evolution, and Systematics 51:143–166. https://doi.org/10.1146/annurev-ecolsys-110218-024746
- Inferring phylogeny despite incomplete lineage sorting. Systematic Biology 55:21–30. https://doi.org/10.1080/10635150500354928
- Estimating a binary character’s effect on speciation and extinction. Systematic Biology 56:701–710. https://doi.org/10.1080/10635150701607033
- Similarities Between Embryo Development and Cancer Process Suggest New Strategies for Research and Therapy of Tumors: A New Point of View. Frontiers in Cell and Developmental Biology 7:20. https://doi.org/10.3389/fcell.2019.00020
- The evolution of reproductive mechanisms in primates. Journal of Reproduction and Fertility pp. 49–66.
- Evolution of Placentation in Primates: Implications of Mammalian Phylogeny. Evolutionary Biology 35:125–145. https://doi.org/10.1007/s11692-008-9016-9
- The evolution of embryo implantation. The International Journal of Developmental Biology 58:155–161. https://doi.org/10.1387/ijdb.140020dw
- Innate Lymphoid Cells in Human Pregnancy. Frontiers in Immunology 11:551707. https://doi.org/10.3389/fimmu.2020.551707
- Evolutionary transformations of fetal membrane characters in Eutheria with special reference to Afrotheria. Journal of Experimental Zoology. Part B, Molecular and Developmental Evolution 306:140–163. https://doi.org/10.1002/jez.b.21079
- Innate Lymphoid Cells in the Maternal and Fetal Compartments. Frontiers in Immunology 9:2396. https://doi.org/10.3389/fimmu.2018.02396
- IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Molecular Biology and Evolution 37:1530–1534. https://doi.org/10.1093/molbev/msaa015
- Embryo Implantation and Tumor Metastasis: Common Pathways of Invasion and Angiogenesis. Seminars in Reproductive Medicine 17:275–290. https://doi.org/10.1055/s-2007-1016235
- IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution 32:268–274. https://doi.org/10.1093/molbev/msu300
- Regulation of invasive growth: similar epigenetic mechanisms underpin tumour progression and implantation in human pregnancy. Clinical Science (London, England) 118:451–457. https://doi.org/10.1042/CS20090503
- Heterotachy and long-branch attraction in phylogenetics. BMC Evolutionary Biology 5:50. https://doi.org/10.1186/1471-2148-5-50
- Plausibility of trophoblastic-like regulation of cancer tissue. Cancer Management and Research 11:5033–5046. https://doi.org/10.2147/CMAR.S190932
- Detecting signatures of selection on gene expression. Nature Ecology & Evolution 1:1–11. https://doi.org/10.1038/s41559-022-01761-8
- Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals. Molecular Biology and Evolution 30:2134–2144. https://doi.org/10.1093/molbev/mst116
- The trophoblast model of cancer. Nutrition and Cancer 67:61–67. https://doi.org/10.1080/01635581.2014.956257
- The ultrastructure of pig trophoblast transplanted to an ectopic site in the uterine wall. Journal of Anatomy 113:139–149.
- An Approximately Unbiased Test of Phylogenetic Tree Selection. Systematic Biology 51:492–508. https://doi.org/10.1080/10635150290069913
- The influence of rate heterogeneity among sites on the time dependence of molecular rates. Molecular Biology and Evolution 29:3345–3358. https://doi.org/10.1093/molbev/mss140
- Inferring confidence sets of possibly misspecified gene trees. Proceedings. Biological Sciences 269:137–142. https://doi.org/10.1098/rspb.2001.1862
- Hear, hear: the convergent evolution of echolocation in bats? Trends in Ecology & Evolution 24:351–354. https://doi.org/10.1016/j.tree.2009.02.012
- Some General Observations on the Placenta, with especial reference to the Theory of Evolution. Journal of Anatomy and Physiology 11:33–53.
- What is the promise of developmental evolution? Part II: A causal explanation of evolutionary innovations may be impossible. The Journal of Experimental Zoology 291:305–309. https://doi.org/10.1002/jez.1130
- What is the promise of developmental evolution? III. The crucible of developmental evolution. Journal of Experimental Zoology 300B:1–4. https://doi.org/10.1002/jez.b.41
- Comments on Boddy et al. 2020: Available data suggest positive relationship between placental invasion and malignancy. Evolution, Medicine, and Public Health 2020:211–214. https://doi.org/10.1093/emph/eoaa024
- The Coevolution of Placentation and Cancer. Annual Review of Animal Biosciences 10:259–279. https://doi.org/10.1146/annurev-animal-020420-031544
- Book: On the placentation of primates, with a consideration of the phylogeny of the placenta. Carnegie Institution.
- Innate lymphoid cells at the human maternal-fetal interface in spontaneous preterm labor. American Journal of Reproductive Immunology (New York, N.Y.) 79:e12820. https://doi.org/10.1111/aji.12820
- Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Molecular Biology and Evolution 10:1396–1401. https://doi.org/10.1093/oxfordjournals.molbev.a040082
- Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. Journal of Molecular Evolution 39:306–314. https://doi.org/10.1007/BF00160154
- Ancestral transcriptome inference based on RNA-Seq and ChIP-seq data. Methods (San Diego, Calif.) 176:99–105. https://doi.org/10.1016/j.ymeth.2018.11.010
Article and author information
Funding
March of Dimes Foundation
- Vincent J Lynch
Burroughs Wellcome Fund (1013760)
- Vincent J Lynch
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
The authors thank MB Thompson (University of Sydney) for constructive comments on this manuscript. This study was supported by a grant from the March of Dimes (March of Dimes Chicago-Northwestern-Duke Prematurity Research Center) and a Burroughs Wellcome Fund Preterm Birth Initiative grant (1013760) to principal investigator VJL. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Copyright
© 2022, Mika et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.