separation of timescales

Martin G & Lambert A 2015 A simple, semi-deterministic approximation to the distribution of selective sweeps in large populations. Theor Popul Biol 101:40-46.

  • the main tool to obtain the distribution of allelic frequencies at at given time is by numerical solutions of the diffusion equation by perturbation analysis for small Nes (Kimura, 1957), or more recently, via spectral analysis of the diffusion operator (Song and Steinrücken, 2012)
  • we start from the Wright-Fisher diffusion approximation of the exact stochastic process of allelic frequency under selection and genetic drift
  • we approximate this diffusion itself, via a well-known separation of timescales
  • this separation of timescales is explicitly connected to the simpler Feller diffusion process
  • the dynamics of pt follow a stochastic differential equation (SDE) corresponding to the Wright-Fisher diffusion approximation
  • dpt = pt qt (sa qt + sb pt) dt + √(pt qt / (2Ne)) dBt ... (1)
  • sa = s (h + (1 − h) F)
  • sb = s (1 − (1 − F) h)
  • p0 = k / 2N
  • the haploid model is characterized by sa = sb = s*
  • conditional on fixation, pt is in fact only 'stochastic' while pt → 0 (early phase A: t → 0) or qt → 0 (late phase C: tτ), the intermediate phase B being approximately deterministic
  • the resulting early phase A is characterized by a simpler (linear) diffusion (Feller, 1951), akin to a branching process with independently growing types, as first noted by Haldane (1927)
  • the same goes for the late phase C
  • phase A : pt = o(1) : dptsa pt dt + √(pt / (2Ne)) dBt + O(pt2dt) ... (2)
  • phase B : pt = O(1) : dptpt qt (sa qt + sb pt) dt + O(1 / √(Ne) dt)
  • phase C : qt = o(1) : dpt = −dqtsb qt dt + √(qt / (2Ne)) dBt + O(qt2dt)
  • phase A in Eq. (2) is the SDE of a supercritical Feller diffusion (Feller, 1951) with 'drift' term sa > 0 and 'diffusion' term 1/2Ne
  • pt ~ Feller(sa, 1/2Ne)
  • qt ~ Feller(−sb, 1/2Ne)
  • conditional on non-extinction, a Feller diffusion starting at some p0 > 0, when properly rescaled by its expectation E(pt) = p0 esat, converges to some fixed distribution as time grows to infinity
  • more precisely, define the positive random variable limt→∞ ωt = pt / E(pt) (called a 'terminal value')
  • for a trajectory destined to fixation, the distribution of this variable converges, at large times, to an exponentially distributed random variable ωt = ω~ ~ Exp(P1)
  • P1 is the non-extinction probability of the Feller diffusion
  • any density independent branching process ends up being roughly deterministic and exponentially growing at a fixed rate, if avoiding initial extinction
  • the long term behavior of non-extinct processes thus appears as a distribution of exponentially growing processes, all with same rates (sa here) and variable 'starting points' (ω~p0 here)
  • applying this property to the phase A of our selective sweep model, starting from a single copy (p0 = 1/2N), yields the following long term behavior of phase A
  • limt→∞ pt esat = p~0 ~ Exp(2NP1)
  • the allele will have reached its asymptotic behavior by the time it enters phase B
  • the trajectories entering this phase are approximately distributed as p~0 esat, namely as if they had been deterministic from the start, but with variable initial frequency p~0
  • if an arbitrary number 1 ≤ kn of the k initial copies reach establishment, they will do so independently, yielding several independent Feller diffusion, all conditioned by fixation, during phase A
  • the allele then rises into phase B approximately as the sum of n processes with i.i.d. exponentially distributed initial states, yielding a gamma distribution for p~0
  • limt→∞ pt esat = p~0 ~ Γ(n, 1/(2NP1))