The Generalized Haldane (GH) model tracking population size changes and resolving paradoxes of genetic drift

Yongsen Ruan; Xiaopei Wang; Mei Hou; Liying Huang; Wenjie Diao; Miles Tracy; Shuhua Xu; Weiwei Zhai; Zhongqi Liufu; Haijun Wen; Chung-I Wu

doi:10.7554/eLife.99990.3

eLife Assessment

This study presents a useful model of genetic drift by incorporating variance in reproductive success, aiming to address several apparent paradoxes in molecular evolution. However, some of the apparent paradoxes only arise in the most basic version of standard models and have been reconciled in more advanced models. Nonetheless, this paper offers intuitive explanations for these apparent paradoxes, by adopting a new perspective and solid modeling and analysis. More broadly, the proposed model provides an alternative framework to address puzzling observations in molecular evolution, which will be of interest to evolutionary and population geneticists.

https://doi.org/10.7554/eLife.99990.3.sa4

Significance of findings

useful: Findings that have focused importance and scope

landmark
fundamental
important
valuable
useful

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Population genetic models, such as the Wright-Fisher (WF) model, track relative gene frequencies. The absolute gene copy number, or population size (N), is supplied externally for tracking genetic drift. JBS Haldane (1927) proposed an alternative model based on the branching process, whereby each gene copy is transmitted to K descendants with the mean and variance of E(K) and V(K). In this model, E(K) governs N, while V(K)/N governs genetic drift. Nevertheless, as the branching process allows N to drift unboundedly, a Generalized Haldane (GH) model that regulates N more tightly is proposed. The GH model can account for several paradoxes of molecular evolution. Notably, genetic drift may often become stronger as N becomes larger in the ecological setting, thus contradicting the general view. In particular, a very small population growing exponentially experiences little drift. Interestingly, when the population grows and N oscillates near the carrying capacity, the paradoxical trend is also observed in both field works and laboratory experiments. This paradox whereby population size in genetics (N_e) and ecology (N) could be negatively correlated is resolved by the GH model. Additional paradoxes include ii) The two sexes experiencing drift differently; iii) Genetic drift of advantageous mutations being independent of N; iv) Multi-copy gene systems (viruses, mitochondria, etc.) having no definable N_e (for effective N). In brief, the GH model defines genetic drift simply as V(K), or V(K)/N averaged over the population. It represents an attempt at integrating genetical and ecological analyses into one framework.

Introduction

Genetic drift is defined as the random changes in frequency of genetic variants (Crow and Kimura 1970; Hartl and Clark 1997; Lynch, et al. 2016). Much, or even most, of DNA sequence evolution is governed by the random drift of neutral mutations. Nevertheless, even non-neutral variants are subjected to genetic drift as advantageous mutations often have a small probability of being fixed. If the strength of genetic drift is under-estimated, random changes that are missed in the analysis would result in the over-estimation of other evolutionary forces of greater biological interest, including selection, mutation, migration, meiotic drive and so on. In particular, the conclusion of pervasive selection at the molecular level has been suspected to be due to the failure to fully account for genetic drift (Crow and Kimura 1970; Fu 1997; Li 1997; Charlesworth 2009; Lynch, et al. 2016; Chen, Yang, et al. 2022).

Hagedoorn and Hagedoorn (1921) may be the first to suggest that random forces (accidents, for example) could impact long-term evolution. In the same decade, models of genetic drift were developed along two lines. The one approach that became widely adopted is the Wright-Fisher (WF) model (Crow and Kimura 1970; Hartl and Clark 1997), formalized by Fisher (1930) and Wright (1931). They define genetic drift to be due to the random sampling of genes. Thus, in generation transition, the variance in the frequency of a variant allele, x, would be

where N is the population size. (Here, we present the haploid model while a factor of 2 should be added to the diploid models.) The WF model has been modified in many directions (Wright 1969; Gillespie 1975; Eldon and Wakeley 2006; Chen, et al. 2017; Sackman, et al. 2019) whereby the effective population size, N_e, is used to accommodate various deviations from the ideal WF population. Note that N, and hence N_e, are imposed externally on the model as in the later Moran model (2008).

At about the same time, JBS Haldane introduced the branching process to model genetic drift (Haldane 1927; Haldane 1932). In the Haldane model, each gene copy is independently transmitted to K descendants with the mean and variance of E(K) and V(K). Again, we present the haploid model, noting some added complexities with diploidy.

The branching process of the Haldane model can generate N internally by formulating N of the next generation as E(N′) = N×E(K). Since Haldane’s model (1927) resets N′ to a constant N, a Generalized Haldane (GH) model is proposed here that permits varying N under ecological regulation. Indeed, E(K) is often negatively correlated with N in actual populations (Smith and Slatkin 1973; Sibly and Hone 2002). The GH model thus tracks gene frequency changes as well as regulates N. In the Haldane model, E(K) would determine the trajectory of N changes and V(K) would determine the strength of genetic drift. Thus, there is no drift if V(K) = 0 irrespective of N.

The GH model is then applied to several paradoxes of genetic drift (see the overview below). It is possible that mathematical modifications of the conventional models (i.e., WF and Moran models) may resolve these paradoxes as well. Nevertheless, the key question is whether these modifications, often highly sophisticated, are biologically feasible (see Discussion). In comparison, the Haldane model is intuitively appealing and deserves to be considered as an alternative approach to genetic drift. As we study the ever more complex biological systems in the genomic era, such an alternative would seem desirable.

Results

Overview of the paradoxes of genetic drift

Among the paradoxes, a most curious one is genetic drift in relation to changing population size. The WF model dictates stronger drift when N is smaller. However, when N is very small (say, N = 10 in a bacteria or yeast population) and increases to 20, 40, 80 and so on, there is in fact little drift in this exponential phase. Drift will intensify when N grows to near the carrying capacity. This trend is the exact opposite of the standard view. The second paradox concerns the genetic drift of sex-linked genes in relation to sex-dependent breeding successes (Charlesworth and Charlesworth 2000; Bachtrog 2013; Cortez, et al. 2014; Wilson Sayres, et al. 2014; Makova, et al. 2024). A third paradox is about drift strength when selection is also at work. A new mutation with a selective advantage would always be fixed if there is no random drift. With drift, fixation is probabilistic, but the probability does not depend on N or N_e. This curious property echoes the view that N is a scaling factor of drift that is determined by V(K) (see Results).

The power of the Haldane model in resolving these paradoxes is the focus of this study. Furthermore, the companion study (Wang et al. 2024) addresses a fourth paradox – the evolution of multi-copy gene systems, whereby evolution proceeds in two stages - between as well as within individuals. Multi-copy gene systems including viruses, mitochondria, transposons and ribosomal genes do not have easily definable Ne. Broadly speaking, even diploidy is also multi-copy systems (Silver 1985; Wu, et al. 1988; Wu, et al. 1989; Lindholm, et al. 2016; Courret, et al. 2019).

On the Haldane model of genetic drift

In the original Haldane model, genetic drift is defined by the branching process (see Eq. (4) of Chen, et al. (2017)). Let K be the number of progenies receiving the gene copy of interest from a parent; K would follow the same distribution for all neutral variants. In haploids, K is equivalent to the progeny number of each parent. In diploids, K is the transmission success of each gene copy, which is separately tracked. By the Taylor approximations of the ratio of two variables (Kendall, et al. 2006), we obtain the approximation as

where x is the variant frequency (see Supplementary Information). This is the same approximation, when V(K) is not very large, as obtained by the WF model (Kimura and Crow 1964; see Chen et al. 2017 for details). For Eq. (2), we provide further extensive simulations on the accuracy in both the fixation probability and fixation time of neutral variants (see Supplementary Information). Note that N, constant or variable, is supplied externally to the WF model but it is tracked internally in the Haldane model (see the next section). The apparent equivalence of Eq. (2) makes a simple point that, between the WF and Haldane models, the mathematical results are often shared but the biological meanings may not be identical. (Similarly, the diffusion and coalescence theories are also distinct approaches to the same population genetic phenomena.)

In the Haldane model, there would be no genetic drift if V(K) = 0, regardless of the value of N. When E(K) = 1 and N stays constant, we obtain

Eq. (2) and Eq. (3) have been obtained for the WF model (Kimura and Crow 1963; Crow and Denniston 1988; Chen, et al. 2017). The issue will be highly significant when sex-dependent drift is analyzed.

In Table 1, the data of progeny number from the literature indeed show over-dispersion in all taxa surveyed, i.e., V(K) > E(K). In fact, V(K) > 10 E(K) can often be found, e.g., the ratio of V(K)/E(K) among males of the mandrill (Mandrillus sphinx), the great reed warbler (Acrocephalus arundinaceus), and the rhesus macaque (Macaca mulatta) at 19.5, 12.6 and 11.3, respectively (Hasselquist 1995; Setchell, et al. 2005; Dubuc, et al. 2014). We have also found that V(K) may be far larger than E(K) in other biological systems such as viruses (Ruan, Luo, et al. 2021; Ruan, Wen, et al. 2021; Hou, et al. 2023).

Field record of E(K) and V(K) across diverse taxas.

I. The first paradox – The paradox of changing N

1. Empirical demonstration and simulation

This “paradox of changing N” is that “genetic drift increases when N increases” in direct contradiction with Eq. (2). Fig. 1a shows the results in a cell culture in the exponential growth phase where each cell doubles every 13 hours. Hence, V(K) ∼ 0 with almost no drift when N is as small as < 50. Genetic drift is expected to increase as N approaches the carrying capacity.

The paradox of genetic drift when the population size (N) changes.
(a) In the laboratory cell culture, nearly all cells proliferate when N is very small (interpreted in the next panel). (b) The simulation of Haldane model shows little genetic drift in the exponential phase. As in (a), drift may increase due to the heightened competition as the population grows. (c) Simulation by the WF model shows a pattern of drift opposite of the Haldane model. (d) The patterns of drift at the low and high N are analyzed in the framework of the logistic growth model. (e-f) Measurements of genetic drift in laboratory yeast populations at low and high density as defined in (d). The progeny number of each cell, K, is counted over 4 or 5 intervals as shown by the dots, the sizes of which reflect the cell number. E(K) and V(K) are presented as well. In panel (f), the change of offspring number overtime, denoted as V(ΔK), is shown above the braces. (g) The variance of offspring number V(K) increases, observed in (e) and (f), as population size increases.

The paradox is exemplified through computer simulations in Fig. 1b-c. These panels show the drift pattern in the Haldane model and the standard WF model, respectively, when the populations are growing as the logistic growth model (see Methods). In the WF model, the drift is strong initially but weakened as N increases. The pattern is reversed in the Haldane model. The contrast between the two panels is the paradox.

To verify the simulations of Fig. 1b-c, we study the growth of yeast populations under laboratory conditions where cells appear to increase following the logistic growth curve. As shown in Fig. 1d, the lower portion of the S-curve portrays the exponential growth in a low-density population, while the upper curve indicates a slowdown in growth near the population’s carrying capacity. We directly recorded the individual output (K) of each yeast cell in the early (n = 25) and late (n = 65) stage of population growth under real-time high-content fluorescence imaging (Methods).

It’s evident that V(K) is decoupled from E(K). In the early stage, E(K) is 10.04 with V(K) = 0.60 over five one-hour intervals. In the high-density phase, E(K) decreases to 2.83 but V(K) increases to 1.09 in four intervals (Fig. 1e-f). Most interestingly, during the late stage of population growth, V(ΔK) (adjusted to the four-hour time span for comparison; see Methods) indeed increases as N increases (Supplementary Table 3). At the last time point, the population is closest to the carrying capacity and V(ΔK) experiences a substantial leap from around 1.12 to 1.82. Overall, the variance in progeny number V(K) in a high-density population is always greater that in a low-density population (Fig. 1g). Indeed, V(K) may exhibit an increase with the increase in N, sometimes outpaces the latter.

Measurements of these two growth phases demonstrate the “paradox of changing N” by showing i) there is almost no drift when N is very small; ii) as N increases, V(K), hence, genetic drift would also increase. Furthermore, V(K) may increase sharply when N is close to the carrying capacity. This trend appears to echo the field observations by Coltman, et al. (1999) who show that V(K) increases more than 5-fold when N increases only 2.5-fold in the United Kingdom (UK) population of Soay rams (Ovis aries).

2. The density-dependent Haldane (DDH) model – A first attempt at the Generalized Haldane model

To explain the paradox of changing N, the model has to track and regulate N. Ironically, although the branching process can track the changes, the process has been deemed unsuitable for population genetics. This is because N_t = N₀ E(K)^t would drive N to approach either zero or infinity as t becomes very large. Indeed, Haldane’s original model reset N′ to N in each generation. Any generalized Haldane (GH) model must take advantage of this feature by regulating N via E(K), as attempted in this section.

The “paradox of changing N” can be explained by N_e = N/V(K) if the numerator and denominator move in the same direction. Most interestingly, V(K) may increase more than N itself. This has been shown when V(K) is close to zero while N is growing exponentially. In addition, when N is very near the carrying capacity, even a small increase in N would intensify the competition and inflate V(K). This is shown in the yeast experiment of Fig. 1d and supported by Coltman, et al. (1999) field study of ram populations in UK. Thus, the relationship between N and V(K) may realistically lead to this paradox.

We now extend the Haldane model by incorporating density-dependent regulation of N in order to define the conditions of this paradox. The model developed here is on an ecological, rather than an evolutionary time scale. The simplest model of N regulation is the logistic equation:

with C_k being the carrying capacity. In the ecological time scale, a changing N would mean that the population is departing from C_k or moving toward it (Fig. 2a). (As will be addressed in Discussion, changing N in the WF model is depicted in Fig. 2b whereby N is at C_k. C_k may evolve too, albeit much more slowly in an evolutionary time scale.) Here, we consider changes in N in the ecological time scale. Examples include the exponential growth when N ≪ C_k, seasonal fluctuation in N (Frankham 1995; Charlesworth 2009) and competition modeled by the Lotka-Volterra equations (Bomze 1983).

The meaning of population size (N) changes in ecology vs. in population genetics.
(a) In ecology, changing N would generally mean a population approaching or departing the carrying capacity. (b) In population genetics, a population of size N is assumed to be at the carrying capacity, C_k. Thus, changes in N would mean an evolving C_k, likely the consequence of environmental changes. The arrows indicate the disparity in time scale between the two scenarios.

Details of the DDH model are given in the Supplementary Information. A synopsis is given here: We consider a non-overlapping haploid population with two neutral alleles. The population size at time t is N_t. We assume that expected growth rate E(K) is greater than 1 when N < C_k and less than 1 when N > C_k, as defined by Eq. (5) below:

The slope of E(K) vs. N (i.e., the sensitive of growth rate to changes in population size), as shown in Fig 3a, depends on z. To determine the variance V(K), we assume that K follows the negative binomial distribution whereby parents would suffer reproduction-arresting injury with a probability of p_t at each birthing (Supplementary Information). Accordingly, V(K) can then be expressed as

Genetic drift as a function of population size in the DDH model.
For all panels, the carrying capacity is C_k = 10,000 and the intrinsic growth rate is r = 2. (a) When N increases, E(K) decreases as modeled in Eq. (5). The z value of Eq. (5) (0.1, 1.5 and 3) determines the strength of N regulation, indicated by the slope of E(K) near C_k = 10,000. (b) Depending on the strength of N regulation near C_k, genetic drift can indeed decrease, increase or stay nearly constant as the population size increases. Thus, the conventional view of N_e being positively dependent on N is true only when the regulation of N is weak (the green line). At an intermediate strength (the red line), N_e is nearly independent of N. When the regulation becomes even stronger at z = 3, N_e becomes negatively dependent on N. (**c-e**) V(K)/E(K) of Eq. (6)) is shown as a function of N. The results of panel (b) are based on a constant V(K)/E(K) shown in panel (c). Interestingly, the results of panel (b) would not be perceptibly changed when V(K)/E(K) varies, as shown in panel (d) and (e).

By Eq. (6), the ratio of V(K)/E(K) could be constant, decrease or increase with the increase of population size. With E(K) and V(K) defined, we could obtain the effective population size by substituting Eq. (5) and Eq. (6) into Eq. (3).

Eq. (7) presents the relationship between effective population size (N_e) and the population size (N) as shown in Fig. 3. The density-dependent E(K) could regulate N with different strength (Fig. 3a). The steeper the slope in Fig. 3a, the stronger the regulation.

The main results of this study are depicted in Fig. 3b. First, with no or little regulation of N, N_e and N are strongly correlated. The green dashed lines portray the conventional view of decreasing drift with increasing N. Second, under a particular strength of N regulation, the red line shows no dependence of N_e on N, meaning that genetic drift is independent of N. Finally, as N becomes strongly regulated, N_e and N would be negatively correlated as the blue dashed line shows. This trend is the paradox of changing N.

In summary, genetic drift effect can indeed decrease, increase or stay nearly constant as the population size increases, depending on the strength of N regulation near C_k. We further note that the V(K)/E(K) ratio of Eq. (7) can be independent (Fig. 3c), positively dependent (Fig. 3d), or negatively dependent (Fig. 3e) on N. Interestingly, the results of Fig. 3b are nearly identical when the V(K)/E(K) ratio changes (Supplementary Figs 1-3). These results show that the strength of genetic drift depends on the ecology that governs E(K), V(K) and N.

The paradox of changing N may not be an exception but a common rule in natural populations. Note that the WF approximation yielding Eq. (2) assumes nearly constant N. The assumption would imply very strong N regulation near C_k, which is precisely the condition leading to the paradox of changing N (Fig. 3). Coltman et al. (1999)’s observations of the reproduction in rams is such a case.

II. The paradox of genetic drift in sex chromosomes

Since the relative numbers of Y, X and each autosome (A) in the human population are 1:3:4, the WF model would predict the genetic diversity (θ_Y, θ_X and θ_A, respectively) to be proportional to 1:3:4. In a survey of human and primate genetic diversity, θ_Y is almost always less than expected as has been commonly reported (Hammer, et al. 2001; Wang, et al. 2014; Wilson Sayres, et al. 2014; Makova, et al. 2024). The low θ_Y value has been used to suggest that human Y chromosomes are under either positive or purifying selection (Wang, et al. 2014; Wilson Sayres, et al. 2014; Makova, et al. 2024). Similarly, the reduced diversity X-linked genes was interpreted as a signature of positive selection (Pan, Liu, et al. 2022).

As pointed out above, under-estimation of genetic drift is a major cause of over-estimation of selective strength. Our goal is hence to see if genetic drift of different strength between sexes can account for the observed genetic diversities. Details will be presented in Wang et al. (2024). Below is a synopsis.

Let V_m and V_f be the V(K) for males and females respectively and α′ = V_m/V_f is our focus. (We use α′ for the male-to-female ratio in V(K) since α is commonly used for the ratio of mutation rate (Miyata, et al. 1987; Makova and Li 2002).) The three ratios, θ_Y /θ_X (denoted as R_YX), θ_Y /θ_A (R_YA) and θ_X /θ_A (R_XA), could be expressed as the functions of α′, which incorporate the relative mutation rates of autosomes, X and Y (Supplementary Information). These relative mutation rates can be obtained by interspecific comparisons as done by Makova and Li (2002).

We note that there are many measures for the within-species genetic diversity, θ = 4N_eμ. Under strict neutrality and demographic equilibrium, these measures should all converge. In the neutral equilibrium, the infinite site model dictates the frequency spectrum to be ξ_i = θ/i, where ξ_i is the number of sites with the variant occurring i times in n samples. Since every frequency bin is a measure of θ, different measures put different weights on the i-th bin (Fay and Wu 2000; Fu 2022). While π, the mean pairwise differences between sequences, is most commonly used in the literature, we use several statistics to minimize the possible influences of selection and demography (Wang et al. 2024). In this synopsis, we used the Watterson estimator (Watterson 1975) as the measure for θ_Y, θ_X and θ_A by counting the number of segregating sites.

Using any of the three ratios (θ_Y /θ_X, θ_Y /θ_A and θ_X /θ_A), we can obtain α′; for example, R_YA = θ_Y/θ_A and, hence,

where y is the mutation rate of Y-linked sequences relative to autosomal sequences. With rearrangement,

Similar formulae of α′ can be obtained for R_YX and R_XA but the accuracy for estimating α′ is highest by R_YA whereas R_XA is the least accurate for this purpose (Supplementary Information).

Table 2 presents the α′ estimates in chimpanzees and bonobos. This is part of a general survey in mammals with a strong emphasis on primates and humans (Wang et al. 2024). It is almost always true that α′ > 5 in primates. Sources of data used are also given in Table 2. As shown for chimpanzees, α′ is often far larger than 5, above which the resolution is very low as can be seen in Eq. (8). Note that, when R_YA is under-estimated and approaching y/8, α′ would increase rapidly when the denominator is close to 0. That is why α′ often becomes infinity (Supplementary Fig. 5). The estimated α′ (MSE) in Table 2 alleviates the problem somewhat by using all three ratios to calculate the mean square error (MSE) (Supplementary Information).

Estimation of V_m/V_f in chimpanzee and bonobo.

Among primates surveyed, bonobo is the only exception with α′ < 1. While chimpanzees and bonobos are each’s closest relatives, their sexual behaviors are very divergent (de Waal 1995; De Waal and Lanting 2023). With unusually strong matriarchs, bonobo society seems to stand out among primates in sexual dominance. The α′ value is important in behavioral studies and, particularly, in primatology and hominoid research. If 0.1% of males sire 100 children and the rest have the same K distribution as females, V_m/V_f would be ∼ 10. Such outlier contribution is the equivalent of super-spreaders in viral evolution which may easily be missed in field studies. In this brief exposition, we highlight again that V(K) or, more specifically, V_m and V_f are key to genetic drift. The differences in drift strength among X, Y and autosomes constitute a curious (but not immediately recognizable) paradox for the WF models as will be addressed in Discussion.

III. The paradox of genetic drift under selection

Genetic drift operates on neutral as well as non-neutral mutations. Let us assume a new mutation, M, with a frequency of 1/N is fixed in the population with the probability of P_f. Fisher (1930) first suggested that the fixation probability of an advantageous mutation should increase in growing populations, while decrease in shrinking populations. If there is no genetic drift, a beneficial mutation will always be fixed and P_f = 1. In the WF model, it is well known that P_f ∼ 2s for a new advantageous mutation, with fitness gain of s, while population size is large (Haldane 1932; Kimura 1962). This seems paradoxical that the determinant of genetic drift, 1/N_e, does not influence P_f (Otto and Whitlock 1997; Lanfear, et al. 2014).

In the Method section, we show that P_f under the Haldane model is given by

where M₀ and W₀ are the initial number of mutant allele (M allele) and wildtype allele (W allele), with N = M₀ + W₀. We note that u_M(t) and u_W(t), respectively, are the extinction probabilities of M allele and W allele by generation t with the initial number of alleles of 1. While obtaining the direct analytical solution of Eq. (9) may not be feasible, we could obtain its numerical solution due to its convergence as t increases (see Methods). The accuracy of the numerical solution from Eq. (9) is confirmed through simulation (Supplementary Fig. 4). Moreover, with the aid of the WF model (see Supplementary Information), we could obtain the approximation of Eq. (9) as follows.

We verify Eq. (10) by both the numerical solution from Eq. (9) and simulations based on the branching process (Supplementary Fig. 4). The fixation probabilities obtained by numerical solution vs. those inferred from Eq. (10) are shown in Fig. 4. The salient feature is that the fixation probability of 2s, as in the classical formula, would be a substantial over-estimate when V(K) is larger than E(K). Eq. (10) is sufficiently accurate as long as N≥50. When N is as small as 10, the theoretical result is biased. Indeed, at such a low N value, the population is prone to extinction. The DDH model presented above should rectify this deficiency.

Fixation probability of a new advantageous mutation in the Haldane model.
The fixation probabilities of a new advantageous mutation with the selective advantage of s = 0.1 are calculated based on approximate solution from Eq. (9) (i.e., 2s/V(K)) as well as numerical solution from Eq. (10). The numerical solution from Eq. (10) has been confirmed accurate by simulations (Supplementary Fig. 4). **(a-b)** When N < 50, the approximate fixation probability (the gray line) is lower than the simulated values (the color lines) due to population extinction. **(c-d)** By the Haldane model, the expected fixation probability of Eq. (9) is accurate when N reaches 100, as in most natural populations.

The main message of Eq. (10) is that genetic drift under positive selection is influenced by s and V(K) but not by N. This independence from N is explicable: when an advantageous mutation increases to a certain level still far below N (depending mainly on s), its fixation would be almost certain. This may be a most direct argument against equating genetic drift with N, or N_e (which is a function of changing N’s).

Discussion

Genetic drift should broadly include all random forces affecting evolutionary trajectories. Hence, when N is not regulated by the models themselves but supplied externally, some random forces are excluded from the modeling. The DDH model may be a first-generation GH model that incorporates N regulation. We note that the original Haldane model suppresses N fluctuation and is hence a special case of DDH with extremely strong N regulation near C_k.

We shall first clarify the first paradox, the “paradox of changing N’s”. This paradox is in the ecological time scale whereby N is either growing or oscillating around the carrying capacity (Fig. 2a). In this time scale, drift may often (but certainly not always) increase in strength as N increases. While N and N_e are often poorly correlated in WF models(Crow and Kimura 1970; Charlesworth 2009; Lynch, et al. 2016), there is no demonstration how N and N_e can be negatively correlated in the absence of N regulation. Nevertheless, the WF models do work well in the evolutionary time scale (Fig. 2b). For example, by the PSMC model (Li and Durbin 2011), both the European and Chinese populations experience a severe bottleneck 10-60 kyr ago. Presumably, various environmental forces may have reduced C_k drastically. Averaged over the long-time span, N should be at or near C_k. In short, the N value in the WF models is both N and C_k at the evolutionary time scale. It should also be mentioned that the DDH model is distinct from the concept of “genetic draft” that involves selection and hitchhiking (Gillespie 2000, 2001).

The second paradox of sex-dependent drift is about different V(K)’s between sexes (generally V_m > V_f) but the same E(K) between them. In the conventional models of sampling, it is not clear what sort of biological sampling scheme could yield V(K) ≠ E(K), let alone two separate V(K)’s with one single E(K). Mathematically, given separate K distributions for males and females, it is unlikely that E(K) for the whole population could be 1, hence, the population would either explode in size or decline to zero. In short, N regulation has to be built into the genetic drift model as the GH model does to avoid this paradox.

The third paradox of genetic drift is manifested in the fixation probability of an advantageous mutation, 2s/V(K). As explained above, the fixation probability is determined by the probability of reaching a low threshold that is independent of N itself. Hence, the key parameter of drift in the WF model, N (or N_e), is missing. This paradox supports the assertion that genetic drift is fundamentally about V(K) with N being a scaling factor. Note the absence of genetic drift at all N’s when V(K) = 0.

The fourth paradox is about the multi-copy gene systems such as viruses and rRNA genes covered in the companion study (Wang, et al. 2024). These systems evolve both within and between hosts. Given the small number of virions transmitted between hosts, drift is strong in both stages as shown by the Haldane model (Ruan, Luo, et al. 2021; Ruan, Wen, et al. 2021; Hou, et al. 2023). Therefore, it does not seem possible to have a single effective population size to account for the genetic drift in two stages with very different biological processes. The inability to deal with multi-copy gene systems may explain the difficulties in accounting for the SARS-CoV-2 evolution (Deng, et al. 2022; Pan, Liu, et al. 2022; Ruan, et al. 2022; Hou, et al. 2023; Ruan, et al. 2023).

As the domain of evolutionary biology expands, many new systems do not have definable populations that fit the criteria of WF populations (such as panmixia in mating or dispersal). Multi-copy gene systems are obvious examples. Others include domestications of animals and plants that are processes of rapid evolution (Diamond 2002; Larson and Fuller 2014; Purugganan 2019; Chen, Yang, et al. 2022; Pan, Zhang, et al. 2022; Wang, et al. 2022). Due to the very large V(K) in domestication, drift must have played a large role. Somatic cell evolution is another example with “undefinable” genetic drift (Wu, et al. 2016; Chen, et al. 2017; Chen, et al. 2019; Ruan, et al. 2020; Chen, Wu, et al. 2022). The Haldane model is an individual-output model (Chen, et al. 2017) whereby the collection of individuals constitute the “population”.

We understand that further modifications of the WF models may account for some or all of these paradoxes. However, such modifications have to be biologically feasible and, if possible, intuitively straightforward. Such possible elaborations of WF models are beyond the scope of this study. We are only suggesting that the Haldane model can be extensively generalized to be an alternative approach to genetic drift. The GH model attempts to integrate population genetics and ecology and, thus, can be applied to genetic systems far more complex than those studied before. The companion study is one such example.

Methods

Cell culture and image analysis

NIH3T3 cells, a fibroblast cell line that was isolated from a mouse NIH/Swiss embryo, were stably transfected with the fluorescent, ubiquitination-based cell cycle indicator (Fucci) (Sakaue-Sawano, et al. 2008) plasmid using Lipofectamine 3000 Transfection Reagent (Invitrogen) following the manufacturer’s specified instructions. The Fucci-labeled cells exhibited distinct fluorescent signals indicative of G1, S, and G2/M phases, represented by red, yellow, and green, respectively. NIH3T3-Fucci cell was derived from single-cell colony and cultured in DMEM supplemented with 10% Calf Bovine Serum and penicillin/streptomycin. All cells were maintained at 37°C with 5% CO₂. Subsequently, the cells underwent extended time-lapse imaging using high-content fluorescence microscopy (PerkinElmer Operetta CLS) equipped with a 10x objective lens. Images were captured hourly over a 100-hour period, and the analysis was conducted using ImageJ (Fiji) (Schneider, et al. 2012) to count the number of cells (Supplementary Table 1).

Yeast strain construction

Strains were constructed on the genetic background of Saccharomyces cerevisiae strain BY4741. A GFP or BFP fluorescent protein expression cassette, under the control of the TDH3 promoter, was inserted into the pseudogene locus YLL017W. Transformations followed a published protocol (Gietz and Schiestl 2007). Transformants were plated on synthetic complete medium without uracil (SC-Ura), and from these, single colonies were selected. Confirmation of replacements was achieved through PCR, and cassette verification was performed using fluorescence microscopy. Subsequently, the constructed strains were cultivated non-selectively in YPD medium (1% Yeast extract, 2% Peptone, 2% D-glucose) at 30°C on a rotary shaker.

Estimation of V(K) and E(K) in yeast cells

To discern division events, even under high concentration, we conducted co-cultures of the GFP-yeast and BFP-yeast at ratios of 1:1 and 1:25, with the initial cell concentration of 0.1% and 12.5%, respectively. Then the yeast cells were then continuously imaged under high-content fluorescence microscopy (PerkinElmer Operetta CLS) for 10 hours with 1-hour intervals to observe the individual offspring of GFP-yeast. Yeast cells with a distinct offspring number (K) within the initial 5 hours (with the high-density group limited to the first 4 hours) were documented (Supplementary Table 2). Subsequently, the mean and variance of K for each cell were calculated across various time intervals as follows.

According to the law of total variance,

where I_t is the offspring number at time t for an initial single cell (i.e., the total number of progeny cells for a single cell after t hours), as documented in Supplementary Table 2. And E_{t-1, t} represents the average of offspring number after single time interval (1 hour here) for each single cell from time t-1 to time t, while V_{t-1, t} is the corresponding variance. Utilizing the documented total cell count at time t (denoted by N_t), we could calculate the average of offspring number for a single yeast cell in a specific time interval, e.g., E_{t-1, t} = N_t / N_t-1 and E[I_t-1] = N_t /N₀. With some rearrangement,

Applying the aforementioned equations, we observed that both V_{t-1, t} (i.e., V(K) within a one-hour interval) and V(I_t) (i.e., the V(K) from 0 to time t) for yeast cells under high density (12.5%) are generally greater than the values under low density (Supplementary Table 3). This suggests that the variance of offspring number in high-density conditions could be larger than that in a low-density context. It’s noteworthy that the estimated value of V_{t-1, t} could be less than zero (Supplementary Table 3), implying a very small variance in the number of offspring. Furthermore, the occurrence of negative values for V_{t-1, t} is more frequent under low density than high density, indicating the higher variance of offspring number in high density as previously suggested.

To show the variance of offspring number overtime, we also calculated the change of offspring number within successive one-hour intervals, denoted as V(I_t - I_t-1). This value is intricately linked to the variance of offspring overtime. To facilitate the comparison with the total variance from 0h to 4h, we set V(ΔK) = 4V(I_t - I_t-1) to account for the four-hour time span.

Simulation of genetic drift in the Haldane model and the Wright-Fisher (WF) model

In both models, interactions between individuals are implicitly included through the dependency of the average number of offspring on population size, as defined by Eq. (5). This dependency leads to the logistic population growth, reflecting the density-dependent interactions. The initial population size is set to 4, with initial gene frequency (i.e., the frequency of mutant allele) of 0.5. And the carrying capacity is established at 100, 000, with r = 1 and z = 1. In WF model, the population size at next generation is N_t+1 = N_t ×E(K_t). Note if the calculated value of N_t+1 is not an integer, we round it to the nearest whole number. The number of mutant alleles follows a binomial distribution, allowing the simulation of gene frequency overtime in WF model. For Haldane model, the number of offspring number is assumed to follow a beta-binomial distribution. The mean of offspring number E(K_t) is obtained from Eq. (5). The variance V(K_t) is obtained from Eq. (6) with parameters a = 1/300 and b = 1.2. Given the mean and variance of the beta-binomial distribution, we simulated the number of mutant alleles and population size overtime and then traced the gene frequency overtime.

Fixation probability of a new mutation in Haldane model

Here we obtain the fixation probability of advantageous mutation in Haldane model governed by a branching process. The Haldane model considers a well-mixed population of N_t haploid parents with only two types of alleles (W for wildtype, M for mutant). There is no migration and new mutations in this model. Each individual is assumed to independently reproduce in discrete and non-overlapping generation, t = 0, 1, 2, …. Thus, the number of offspring per allele (also an individual in this haploid population) at any generation is represented by a set of identically and individually distributed random variables. In particular, the numbers of offspring of M and W alleles (denoted as K_M and K_W respectively) can be represented by following distributions.

For a particular distribution, we can obtain the average number of offspring as follows.

With the selection coefficient of s (0 for neutral mutation, positive for advantageous mutation, and negative for deleterious mutation) of M allele, average offspring number of M allele and W allele will follow the relationship.

Now, the evolution of the number of M allele (denoted as M_t) and the number of W allele (denoted as W_t) as time process is a branching process.

To compare Haldane model with WF model (the offspring number follows Poisson distribution with variance and mean equal to 1), we will set E(K_W) = 1, E(K_M) = 1 + s. And then let the offspring number follow a more general and realistic distribution (including negative binomial distribution (negBin), beta-binomial distribution (betaBin)), which can let the ratio of variance to mean range from 1 to a large number. (For simplicity, we let V(K_M)/E(K_M) = V(K_W)/E(K_W).) Based on the branching process, we obtained the fixation probability M alleles (see Supplementary Information for more details on the derivation).

where

Note u_M(t = 1) = P(K_M = 0) = i₀, u_W(t = 1) = P(K_W = 0) = j₀. And both {u_M(t), t = 0, 1, 2, …} and {u_W(t), t = 0, 1, 2, …} are a bounded monotonic sequence. Thus, although we cannot obtain the direct analytical solution for P_f, we could easily obtain their numerical solution by iteration to the case when both u_M(t) and u_W(t) converge.

Acknowledgements

We thank Xionglei He for helpful comments. The work was supported by the National Natural Science Foundation of China (32150006, 32200493, 32293193, 81972691), the Guangzhou Science and Technology Planning Project (2025A04J3499), the National Key Research and Development Projects of the Ministry of Science and Technology of China (2021YFC2301300, 2021YFC0863400).

Additional files

Supplementary Information

References

1. Bachtrog D.
2013Y-chromosome evolution: emerging insights into processes of Y-chromosome degenerationNat. Rev. Genet. 14:113–124
1. Bomze IM
1983Lotka-Volterra equation and replicator dynamics: A two-dimensional classificationBiol. Cybern. 48:201–211
1. Charlesworth B.
2009Fundamental concepts in genetics: effective population size and patterns of molecular evolution and variationNat. Rev. Genet. 10:195–205
1. Charlesworth B
2. Charlesworth D.
2000The degeneration of Y chromosomesPhilos Trans R Soc Lond B Biol Sci 355:1563–1572
1. Chen B
2. Shi Z
3. Chen Q
4. Shen X
5. Shibata D
6. Wen H
7. Wu CI
2019Tumorigenesis as the Paradigm of Quasi-neutral Molecular EvolutionMol. Biol. Evol. 36:1430–1441
1. Chen B
2. Wu X
3. Ruan Y
4. Zhang Y
5. Cai Q
6. Zapata L
7. Wu CI
8. Lan P
9. Wen H.
2022Very large hidden genetic diversity in one single tumor: evidence for tumors-in-tumorNatl. Sci. Rev. 9:nwac250
1. Chen Q
2. Yang H
3. Feng X
4. Chen Q
5. Shi S
6. Wu CI
7. He Z.
2022Two decades of suspect evidence for adaptive molecular evolution-negative selection confounding positive-selection signalsNatl. Sci. Rev. 9:nwab217
1. Chen Y
2. Tong D
3. Wu CI
2017A New Formulation of Random Genetic Drift and Its Application to the Evolution of Cell PopulationsMol. Biol. Evol. 34:2057–2064
1. Coltman DW
2. Smith JA
3. Bancroft DR
4. Pilkington J
5. MacColl AD
6. Clutton-Brock TH
7. Pemberton JM
1999Density-dependent variation in lifetime breeding success and natural and sexual selection in Soay ramsAm. Nat. 154:730–746
1. Cortez D
2. Marin R
3. Toledo-Flores D
4. Froidevaux L
5. Liechti A
6. Waters PD
7. Grutzner F
8. Kaessmann H.
2014Origins and functional evolution of Y chromosomes across mammalsNature 508:488–493
1. Courret C
2. Chang CH
3. Wei KH
4. Montchamp-Moreau C
5. Larracuente AM
2019Meiotic drive mechanisms: lessons from DrosophilaProc Biol Sci 286:20191430
1. Crow JF
2. Denniston C.
1988Inbreeding and Variance Effective Population NumbersEvolution 42:482–495
1. Crow JF
2. Kimura M.
1970An Introduction to Population Genetics Theory
1. de Waal FB
1995Bonobo sex and societySci. Am. 272:82–88
1. De Waal FB
2. Lanting F.
2023Bonobo: The forgotten apeUniv of California Press
1. Deng S
2. Xing K
3. He X.
2022Mutation signatures inform the natural host of SARS-CoV-2Natl. Sci. Rev. 9:nwab220
1. Diamond J.
2002Evolution, consequences and future of plant and animal domesticationNature 418:700–707
1. Dubuc C
2. Ruiz-Lambides A
3. Widdig A.
2014Variance in male lifetime reproductive success and estimation of the degree of polygyny in a primateBehav. Ecol. 25:878–889
1. Eldon B
2. Wakeley J.
2006Coalescent processes when the distribution of offspring number among individuals is highly skewedGenetics 172:2621–2633
1. Fay JC
2. Wu CI
2000Hitchhiking under positive Darwinian selectionGenetics 155:1405–1413
1. Fisher RA
1930The genetical theory of natural selectionOxford, England: Clarendon Press
1. Frankham R.
1995Effective population size/adult population size ratios in wildlife: a reviewGenetics Research 66:95–107
1. Fu YX
1997Statistical tests of neutrality of mutations against population growth, hitchhiking and background selectionGenetics 147:915–925
1. Fu YX
2022Variances and covariances of linear summary statistics of segregating sitesTheor. Popul. Biol. 145:95–108
1. Gietz RD
2. Schiestl RH
2007High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG methodNat Protoc 2:31–34
1. Gillespie JH
2000Genetic drift in an infinite population. The pseudohitchhiking modelGenetics 155:909–919
1. Gillespie JH
2001Is the population size of a species relevant to its evolution?Evolution 55:2161–2169
1. Gillespie JH
1975Natural selection for within-generation variance in offspring number II. Discrite haploid modelsGenetics 81:403–413
1. Hagedoorn AL
2. Hagedoorn AC
1921The relative value of the processes causing evolutionSpringer Dordrecht
1. Haldane JBS
1932The Causes of EvolutionLondon: Longmans, Green and Company
1. Haldane JBS
1927A mathematical theory of natural and artificial selection, part V: selection and mutationCambridge University Press
1. Hallast P
2. Maisano Delser P
3. Batini C
4. Zadik D
5. Rocchi M
6. Schempp W
7. Tyler-Smith C
8. Jobling MA
2016Great ape Y Chromosome and mitochondrial DNA phylogenies reflect subspecies structure and patterns of mating and dispersalGenome Res. 26:427–439
1. Hammer MF
2. Karafet TM
3. Redd AJ
4. Jarjanazi H
5. Santachiara-Benerecetti S
6. Soodyall H
7. Zegura SL
2001Hierarchical patterns of global human Y-chromosome diversityMol. Biol. Evol. 18:1189–1203
1. Hartl DL
2. Clark AG
1997Principles of Population GeneticsSunderland, Massachusetts: Sinauer Associates
1. Hasselquist D.
1995Demography and Lifetime Reproductive Success in the Polygynous Great Reed WarblerJapanese Journal of Ornithology 44:181–194
1. Herman DM
2. Colwell MA
2015Lifetime reproductive success of Snowy Plovers in coastal northern CaliforniaThe Condor 117:473–481
1. Hou M
2. Shi J
3. Gong Z
4. Wen H
5. Lan Y
6. Deng X
7. Fan Q
8. Li J
9. Jiang M
10. Tang X
11. et al.
2023Intra-vs. Interhost Evolution of SARS-CoV-2 Driven by Uncorrelated Selection-The Evolution ThwartedMol. Biol. Evol. 40
1. Kendall MG
2. Stuart A
3. Ord JK
4. Arnold SF
2006Kendall’s advanced theory of statistics.London: Edward Arnold London
1. Kimura M.
1962On the probability of fixation of mutant genes in a populationGenetics 47:713–719
1. Kimura M
2. Crow JF
1963The measurement of effective population numberEvolution :279–288
1. Krüger O
2. Lindström J.
2001Lifetime Reproductive Success in Common Buzzard, Buteo buteo: From Individual Variation to Population DemographyOikos 93:260–273
1. Kuester J
2. Paul A
3. Arnemann J.
1995Age-related and individual differences of reproductive success in male and female barbary macaques (Macaca sylvanus)Primates 36:461–476
1. Lanfear R
2. Kokko H
3. Eyre-Walker A.
2014Population size and the rate of evolutionTrends Ecol. Evol. 29:33–41
1. Larson G
2. Fuller DQ
2014The Evolution of Animal DomesticationAnnual Review of Ecology, Evolution, and Systematics 45:115–136
1. Li H
2. Durbin R.
2011Inference of human population history from individual whole-genome sequencesNature 475:493–496
1. Li W-H.
1997Molecular evolution.Sunderland: Sinauer Associates Incorporated
1. Lindholm AK
2. Dyer KA
3. Firman RC
4. Fishman L
5. Forstmeier W
6. Holman L
7. Johannesson H
8. Knief U
9. Kokko H
10. Larracuente AM
11. et al.
2016The Ecology and Evolutionary Dynamics of Meiotic DriveTrends Ecol. Evol. 31:315–326
1. Lynch M
2. Ackerman MS
3. Gout JF
4. Long H
5. Sung W
6. Thomas WK
7. Foster PL
2016Genetic drift, selection and the evolution of the mutation rateNat. Rev. Genet. 17:704–714
1. Makova KD
2. Li WH
2002Strong male-driven evolution of DNA sequences in humans and apesNature 416:624–626
1. Makova KD
2. Pickett BD
3. Harris RS
4. Hartley GA
5. Cechova M
6. Pal K
7. Nurk S
8. Yoo D
9. Li Q
10. Hebbar P
11. et al.
2024The complete sequence and comparative analysis of ape sex chromosomesNature 630:401–411
1. Miyata T
2. Hayashida H
3. Kuma K
4. Mitsuyasu K
5. Yasunaga T.
1987Male-driven molecular evolution: a model and nucleotide sequence analysisCold Spring Harb Symp Quant Biol 52:863–867
1. Nam K
2. Munch K
3. Hobolth A
4. Dutheil JY
5. Veeramah KR
6. Woerner AE
7. Hammer MF
8. Great Ape Genome Diversity P
9. Mailund T
10. Schierup MH
2015Extreme selective sweeps independently targeted the X chromosomes of the great apesProc Natl Acad Sci U S A 112:6413–6418
1. Oring LW
2. Colwell MA
3. Reed JM
1991Lifetime reproductive success in the spotted sandpiper (Actitis macularia) : sex differences and variance componentsBehav. Ecol. Sociobiol. 28:425–432
1. Otto SP
2. Whitlock MC
1997The probability of fixation in populations of changing sizeGenetics 146:723–733
1. Pan Y
2. Liu P
3. Wang F
4. Wu P
5. Cheng F
6. Jin X
7. Xu S.
2022Lineage-specific positive selection on ACE2 contributes to the genetic susceptibility of COVID-19Natl. Sci. Rev. 9:nwac118
1. Pan Y
2. Zhang C
3. Lu Y
4. Ning Z
5. Lu D
6. Gao Y
7. Zhao X
8. Yang Y
9. Guan Y
10. Mamatyusupu D
11. et al.
2022Genomic diversity and post-admixture adaptation in the UyghursNatl. Sci. Rev. 9:nwab124
1. Purugganan MD
2019Evolutionary Insights into the Nature of Plant DomesticationCurr. Biol. 29:R705–R714
1. Ribble DO
1992Lifetime Reproductive Success and its Correlates in the Monogamous Rodent, Peromyscus californicusThe Journal of Animal Ecology 61:457–468
1. Ruan Y
2. Luo Z
3. Tang X
4. Li G
5. Wen H
6. He X
7. Lu X
8. Lu J
9. Wu CI
2021On the founder effect in COVID-19 outbreaks: how many infected travelers may have started them all?Natl. Sci. Rev. 8:nwaa246
1. Ruan Y
2. Wang H
3. Chen B
4. Wen H
5. Wu CI
2020Mutations Beget More Mutations-Rapid Evolution of Mutation Rate in Response to the Risk of Runaway AccumulationMol. Biol. Evol. 37:1007–1019
1. Ruan Y
2. Wen H
3. He X
4. Wu CI
2021A theoretical exploration of the origin and early evolution of a pandemicSci Bull (Beijing) 66:1022–1029
1. Ruan Y
2. Wen H
3. Hou M
4. He Z
5. Lu X
6. Xue Y
7. He X
8. Zhang YP
9. Wu CI
2022The twin-beginnings of COVID-19 in Asia and Europe-one prevails quicklyNatl. Sci. Rev. 9:nwab223
1. Ruan Y
2. Wen H
3. Hou M
4. Zhai W
5. Xu S
6. Lu X.
2023On the epicenter of COVID-19 and the origin of the pandemic strainNatl. Sci. Rev. 10:nwac286
1. Sackman AM
2. Harris RB
3. Jensen JD
2019Inferring Demography and Selection in Organisms Characterized by Skewed Offspring DistributionsGenetics 211:1019–1028
1. Sakaue-Sawano A
2. Kurokawa H
3. Morimura T
4. Hanyu A
5. Hama H
6. Osawa H
7. Kashiwagi S
8. Fukami K
9. Miyata T
10. Miyoshi H
11. et al.
2008Visualizing spatiotemporal dynamics of multicellular cell-cycle progressionCell 132:487–498
1. Schneider CA
2. Rasband WS
3. Eliceiri KW
2012NIH Image to ImageJ: 25 years of image analysisNat Methods 9:671–675
1. Setchell JM
2. Charpentier M
3. Wickings EJ
2005Sexual selection and reproductive careers in mandrills (Mandrillus sphinx)Behav. Ecol. Sociobiol. 58:474–485
1. Sibly RM
2. Hone J.
2002Population growth rate and its determinants: an overviewPhilos Trans R Soc Lond B Biol Sci 357:1153–1170
1. Silver LM
1985Mouse t haplotypesAnnu. Rev. Genet. 19:179–208
1. Smith JM
2. Slatkin M.
1973The Stability of Predator_Prey SystemsEcology 54:384–391
1. Wang CC
2. Jin L
3. Li H.
2014Natural selection on human Y chromosomesJ Genet Genomics 41:47–52
1. Wang X
2. He Z
3. Guo Z
4. Yang M
5. Xu S
6. Chen Q
7. Shao S
8. Li S
9. Zhong C
10. Duke NC
11. et al.
2022Extensive gene flow in secondary sympatry after allopatric speciationNatl. Sci. Rev. 9:nwac280
1. Wang X
2. Ruan Y
3. Zhang L
4. Chen X
5. Shi Z
6. Wang H
7. Chen B
8. Tracy M
9. Wen H
10. Wu C-I.
2024The paradox of extremely fast evolution driven in multi-copy gene systems - A resolutioneLife 13:RP99992
1. Watterson GA
1975On the number of segregating sites in genetical models without recombinationTheor. Popul. Biol. 7:256–276
1. Wilson Sayres MA
2. Lohmueller KE
3. Nielsen R.
2014Natural selection reduced diversity on human y chromosomesPLoS Genet. 10:e1004064
1. Wright S.
1969Evolution and the Genetics of Populations, Volume 2: Theory of gene frequenciesUniversity of Chicago press
1. Wright S.
1931Evolution in Mendelian PopulationsGenetics 16:97–159
1. Wu CI
2. Lyttle TW
3. Wu ML
4. Lin GF
1988Association between a satellite DNA sequence and the Responder of Segregation Distorter in Dmelanogaster. Cell 54:179–189
1. Wu CI
2. True JR
3. Johnson N.
1989Fitness reduction associated with the deletion of a satellite DNA arrayNature 341:248–251
1. Wu CI
2. Wang HY
3. Ling S
4. Lu X.
2016The Ecology and Evolution of Cancer: The Ultra-Microevolutionary ProcessAnnu. Rev. Genet. 50:347–369
1. Wysocki D
2. Jankowiak L
3. Cholewa M
4. Zyskowski D
5. Griffin A.
2019Natal conditions, lifespan and lifetime reproductive success of European blackbirdsBehav. Ecol. 30:1707–1714

Article and author information

Author information

Yongsen Ruan
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
ORCID iD: 0000-0002-5573-4154
- For correspondence: ruanys3@mail.sysu.edu.cn
Xiaopei Wang
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Mei Hou
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Liying Huang
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Wenjie Diao
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Miles Tracy
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Shuhua Xu
Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, China
Weiwei Zhai
Institute of Zoology, Chinese Academy of Sciences, Beijing, China
Zhongqi Liufu
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Haijun Wen
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
- For correspondence: wenhj5@mail.sysu.edu.cn
Chung-I Wu
State Key Laboratory of Biocontrol, Innovation Center for Evolutionary Synthetic Biology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
- For correspondence: ciwu@uchicago.edu

Author Notes

Competing interests: No competing interests declared

Version history

Sent for peer review: June 10, 2024
Preprint posted: June 12, 2024
Reviewed Preprint version 1: July 31, 2024
Reviewed Preprint version 2: December 23, 2024
Reviewed Preprint version 3: March 5, 2025

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 550
downloads: 29
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

Introduction

Results

Overview of the paradoxes of genetic drift

On the Haldane model of genetic drift

Field record of E(K) and V(K) across diverse taxas.

I. The first paradox – The paradox of changing N

1. Empirical demonstration and simulation

The paradox of genetic drift when the population size (N) changes.

2. The density-dependent Haldane (DDH) model – A first attempt at the Generalized Haldane model

The meaning of population size (N) changes in ecology vs. in population genetics.

Genetic drift as a function of population size in the DDH model.

II. The paradox of genetic drift in sex chromosomes

Estimation of Vm/Vf in chimpanzee and bonobo.

III. The paradox of genetic drift under selection

Fixation probability of a new advantageous mutation in the Haldane model.

Discussion

Methods

Cell culture and image analysis

Yeast strain construction

Estimation of V(K) and E(K) in yeast cells

Simulation of genetic drift in the Haldane model and the Wright-Fisher (WF) model

Fixation probability of a new mutation in Haldane model

Acknowledgements

Additional files

References

Article and author information

Author information

Yongsen Ruan

Xiaopei Wang

Mei Hou

Liying Huang

Wenjie Diao

Miles Tracy

Shuhua Xu

Weiwei Zhai

Zhongqi Liufu

Haijun Wen

Chung-I Wu

Author Notes

Version history

Copyright

Metrics

Estimation of V_m/V_f in chimpanzee and bonobo.