Toward the cellular-scale simulation of motor-driven cytoskeletal assemblies
Abstract
The cytoskeleton – a collection of polymeric filaments, molecular motors, and crosslinkers – is a foundational example of active matter, and in the cell assembles into organelles that guide basic biological functions. Simulation of cytoskeletal assemblies is an important tool for modeling cellular processes and understanding their surprising material properties. Here, we present aLENS (a Living Ensemble Simulator), a novel computational framework designed to surmount the limits of conventional simulation methods. We model molecular motors with crosslinking kinetics that adhere to a thermodynamic energy landscape, and integrate the system dynamics while efficiently and stably enforcing hard-body repulsion between filaments. Molecular potentials are entirely avoided in imposing steric constraints. Utilizing parallel computing, we simulate tens to hundreds of thousands of cytoskeletal filaments and crosslinking motors, recapitulating emergent phenomena such as bundle formation and buckling. This simulation framework can help elucidate how motor type, thermal fluctuations, internal stresses, and confinement determine the evolution of cytoskeletal active matter.
Editor's evaluation
This article presents a new method for simulating cytoskeletal dynamics inside cells. This is an important problem in the life sciences, and the numerical methods and derived results described in the paper seem very promising to facilitate computational modelling of cell dynamics. Although the user- friendliness of the software can still be improved, the method will be of interest to a broad community of biologists and biophysicists.
https://doi.org/10.7554/eLife.74160.sa0Introduction
Living systems are built hierarchically, where smaller structures assemble themselves into larger functional ones. Such organization is fundamental to life, where it is seen across scales from molecules to organelles to cells to tissues to organisms. An example is the cellular cytoskeleton, made up of polymer filaments (and other accessory proteins) crosslinked by motor proteins that exert forces by walking processively along filaments (Howard, 2001). Cytoskeletal assemblies such as the cortex, mitotic spindle, and cilia and flagella, underlie cell polarity, division, and movement (Bornens, 2008; Barnhart et al., 2015; McIntosh, 2016; Pollard and O’Shaughnessy, 2019). Cytoskeletal components have been reconstituted outside of cells to study self-organization (Nédélec et al., 1997; Foster et al., 2015) and to create new active materials (DeCamp et al., 2015). Understanding how cytoskeletal structures assemble from their molecular components remains challenging, in part because of the variety of motors and crosslinkers with different behavior. Improved understanding of the cytoskeleton would allow us to predict how molecular perturbations change cell behavior and to design new complex and adaptive materials (Li and Gundersen, 2008; Fletcher and Mullins, 2010; Needleman and Dogic, 2017).
Computational modeling of the cytoskeleton has elucidated principles of self-organization, suggested hypotheses for experimental test, and helped interpret results of experiments (Gao et al., 2015b; Rincon et al., 2017; Bun et al., 2018; Saintillan et al., 2018; Varghese et al., 2020). Several software packages for cytoskeletal modeling are currently available, including Cytosim (Nedelec and Foethke, 2007), MEDYAN (Popov et al., 2016), AFINES (Freedman et al., 2017), and CyLaKS (Fiorenza et al., 2021). A challenge for molecular simulation is the large size of cytoskeletal systems, typically 104–107 or more filaments (Petry, 2016). While current simulations may reach filaments (Belmonte et al., 2017; Strübing et al., 2020), molecular modeling has required significant compromises in treating steric interactions and motor-proteins.
Here, we describe aLENS, a framework of computational methods and software designed to more efficiently and accurately simulate large cytoskeletal systems (Figure 1). Since motor proteins must bind, crosslink, and unbind from filaments to evolve such systems, aLENS simulates motors as traversing a (well-defined) free energy landscape Lamson et al., 2021. This prevents artificial energy flux during crosslinking and maintains detailed balance in the passive limit. As motors crosslink filaments, the spacing between filaments is on the order of the length of motor proteins (10–100 ) (Figure 1A), comparable to the filament diameter. Therefore, steric interactions between filaments occur frequently and must be treated carefully to avoid unphysical filament overlap, stress, and deformation (Figure 1B). Most other cytoskeletal simulation methods implement a repulsive pairwise potential between filaments, but this requires a small timestep for hard potentials because of the instability of timestepping methods (Heyes and Melrose, 1993). Therefore, potential-based models limit simulations to short timescales. To circumvent this limitation, here we utilize our recently developed constraint method to enforce hard-core repulsion between particles (Anitescu et al., 1996; Yan et al., 2019). We further develop constraint-based modeling by introducing a related method to treat stiff spring forces due to crosslinking motors. Both steric interactions and crosslinking forces are incorporated in a unified implicit solver. This approach ensures numerical stability of the method and allows for timesteps two or more orders of magnitude larger than currently available. Additionally, aLENS is parallelized with OpenMP and MPI to reach length and timescales comparable to those of experiments (Figure 5 and 7).
As an illustration of aLENS, Figure 1C (and movie Figure 1—video 1) shows a simulation of 3200 microtubules within a spherical volume driven by 9600 motors that, when bound, walk to the microtubule minus-end (modeling the activity of dynein). Although the microtubules are initially unorganized (C1), the combination of motor crosslinking and walking causes the microtubule minus-ends to contract into the center of a large aster (C2). The motor-driven steric interactions between filaments, however, eventually fragment this into smaller asters and bottle-brush-like structures (C3,C4). This simulation displays the complex interplay between steric and crosslinking forces in determining the dynamics and steady state configurations of cytoskeletal materials.
Methodology
In this work, we model filaments as rigid spherocylinders. (While not presented here, flexible filaments can be modeled within our framework as segmented, jointed filaments; See Appendix H.) Crosslinking motors are modeled as Hookean spring tethers connecting two binding domains referred to as heads, with steric interactions between motors neglected.
As outlined below, our algorithm performs three tasks sequentially at every timestep: motor diffusion and stepping, motor binding and unbinding, and filament movement. The major computational challenges arise in task 2, computing binding and unbinding while maintaining realistic macroscopic statistics, and in task 3, updating filament position while overcoming stiffness constraints and maintaining steric exclusion. The timestep is determined by the shortest characteristic timescale in the simulated system (filament collision, motor binding/unbinding kinetics, and filament motion). All other degrees of freedom (e.g. internal conformational changes of motor binding heads) are assumed to occur on shorter timescales.
1. Crosslinking motor diffusion and stepping
Each unbound motor executes Brownian motion independently. Each bound motor updates information on the filament to which it is attached, following filament movement in the previous timestep. During the motor movement step, singly bound motors move and doubly bound motors move along the filaments. Here, is the motor stepping velocity that depends on force on the motor head (Gao et al., 2015a):
where is the projection of tether force along filament in the stepping direction. As typically found experimentally, this stepping model means that if is assisting stepping, the velocity saturates at vm; while for hindering stepping, stepping is halted when .
2. Crosslinker binding and unbinding
In filament networks, the spatial variation of unbound and bound motors is integral to network self-organization. For example, crosslinking proteins concentrate in volumes with high filament densities, producing ripening effects as passive crosslinkers are depleted from the bulk (Weirich et al., 2017) (e.g. see Figure 1C). Furthermore, if motors or crosslinkers bind, unbind, or diffuse at rates not set by free energy barriers, the system’s energy and/or entropy can be artificially elevated or lowered, changing the system dynamics and steady-state configuration. Entropic forces bundle and increase overlaps among crosslinked filaments (Lansky et al., 2015; Gaska et al., 2020), and free-energy-dependent binding kinetics contribute to organization of cortical microtubules (Allard et al., 2010) and induce actin bundling (Yang et al., 2006).
Ad-hoc models, like those that attach crosslinking motors to filaments at a fixed length or randomly sample a uniform distribution to set the binding length, are unlikely to recover the force or final configuration of bundled filaments. For example, if passive crosslinkers only bind in a non-stretched configuration, they will not generate entropic forces that drive bundle overlap, as seen experimentally (Lansky et al., 2015). Further, if crosslinkers are modeled as binding with a uniform length distribution and zero tether rest length, the contractile stress of networks will be overestimated, condensing filament networks with greater rapidity.
The assemblies of filaments/motors are assumed to explore an underlying free energy landscape, where all ‘fast’ degrees of freedom can be subsumed into an effective free energy that depends only on filament and crosslinking motor degrees of freedom. We require that our model correctly recapitulates the distribution and chemical kinetics of crosslinking proteins in the passive limit, that is, when for the bound velocity of motor heads. We achieve this with a kinetic Monte Carlo procedure in which motor protein binding and unbinding events are modeled as stochastic processes. Transition rates recover the correct limiting (equilibrium) distribution by imposing detailed balance (Appendix C). That is, we model binding and unbinding as passive processes, but it is in principle possible that certain such processes consume chemical energy.
To enforce the macroscopic thermodynamic statistics, including correct equilibrium bound-unbound concentrations and distributions (Appendix C) (Gao et al., 2015a; Lamson et al., 2019; Allard et al., 2010), we explicitly model each crosslinker as a Hookean spring connecting two binding heads labeled as or . Each crosslinker has four possible states: both heads unbound (), either or singly bound ( or ), or both heads (doubly) bound (). For each timestep , we first calculate the rates at which each head ( and ) transitions from their current state to a new binding state (i.e. for the transitions ). The transition probabilities are modeled as inhomogeneous Poisson processes with the cumulative probability function
The transitions do not stretch or compress the tether and so do not depend on tether deformation energy. However, the transitions do account for tether deformation energy (Table 1).
3. Filament dynamics
We sought to develop a stable, large-timestep method for updating the position of filaments, subject to spring forces from crosslinking motors, steric interactions, and Brownian motion. This requires addressing two stability restrictions on the timestep . The first arises in models that use a stiff repulsive pairwise potential to prevent filament overlaps. For example, the Lennard-Jones potential , where is the separation between filaments, is so steeply varying that it requires small for stability. As a result, soft alternatives such as a harmonic potential are often used (Nedelec and Foethke, 2007). These soft potentials allow partial filament overlaps, and may therefore lead to unphysical system dynamics and stresses (Heyes and Melrose, 1993).
The second stability restriction arises from the fast relaxation times of crosslinking motors. When crosslinkers connect two parallel filaments, the spring tether length relaxes according to , where is the preferred length and (Howard, 2001). Explicit timestepping schemes require , for some constant . For motors, tether stiffness , and slender body drag coefficient for -long microtubules in aqueous solvent, we have .
We overcome these difficulties with a novel, linearized implicit Euler timestepping scheme, which extends on our previous work on enforcing non-overlap conditions (Yan et al., 2019). This technique is inspired by constraint-based methods for granular flow (Tasora et al., 2013). When collisions occur between filaments, the minimal distance between them attains with collision force . If not colliding, and . This mutually exclusive condition is called a complementarity constraint, written as . If one crosslinking motor connects these two filaments, its length and force magnitude satisfy the Hookean spring model , which is an equality constraint.
We integrate the equation of motion such that these two types of constraints for all possible collisions and all crosslinking motors are satisfied. We briefly derive the method here, and all details can be found in Appendix C. Because the method is specific to rigid particles with arbitrary shape, we shall use ‘particle’ and ‘filament’ interchangeably.
Each particle is tracked by its center location in the lab frame and its orientation as a quaternion (Delong et al., 2015). are the scalar and vector parts of the quaternion, respectively. Using a quaternion to track the rotational kinematics of a rigid body is a standard computational approach due to its compact memory footprint (4 floating point numbers) and its singularity-free nature. The geometric configuration at time for all filaments can be written as a column vector with entries:
Similarly, we use the vectors to represent the translational & angular velocities, and forces & torques of all particles, respectively. We relate to via a mobility matrix , dependent only upon the geometry , and relate to via a geometric matrix :
Because the biological filaments we consider mostly have lengths on the to scales and inertial effects can be ignored. In the following, the subscript refers to constraints, which includes both unilateral (with subscript ) and bilateral (with subscript ) constraints. For our problem, unilateral constraints refer to collision constraints while bilateral constraints refer to crosslinking motor constraints. The subscript refers to non-constraint.
For unilateral constraints, we define the grand distance vector where each is the minimum distance between a pair of filaments. Similarly, for bilateral constraints we define the grand distance vector , containing the length of the doubly bound motor . There are in total possibly colliding pairs of filaments and crosslinking motors. The force magnitude corresponding to these constraints are also written as vectors, and . The two types of constraints can be summarized as:
Here, and satisfy the complementarity (collision) constraints, while and satisfy the Hookean spring law. Here, is a diagonal matrix consisting of all the stiffness constants, while represents the rest length of every crosslinking motor.
Equations 4 and 5 define a differential-variational-inequality (DVI). This is solvable when closed by a geometric relation mapping the force magnitude and to the force vectors and :
where and are sparse matrices containing the orientation norm vectors of all constraint forces (Anitescu et al., 1996; Yan et al., 2020 and Appendix D). Next, we discretize this DVI using the linearized implicit Euler timestepping scheme with at timestep :
The unknowns to be solved for at every timestep are the constraint (collision and motor tether) force magnitude . This is a nonlinear DVI because , are nonlinear functions of geometry , although is linearly dependent on and . For a small timestep (), this nonlinearity can be linearized by Taylor expansion, for example, . Then, this nonlinear DVI can be converted to a convex quadratic programming problem (Nocedal and Wright, 2006) (details in Appendix D):
Here, is a column vector, and
One way to understand the constraint optimization method is that the implicit temporal integration ‘jumps’ on a timescale that bypasses the relaxation timescales of unilateral and bilateral constraints (collisions and crosslinking motor springs). In the limit of motor tethers being infinitely stiff (), the quadratic term coefficient matrix is still symmetric-positive-semi-definite (SPSD) and the Equation 8 is still convex and can be efficiently solved. Physically speaking, in this case the bilateral constraints degenerate from deformable springs to non-compliant joints.
Instantiation in a massively parallel computing environment
Our methods naturally lend themselves to high-performance parallel computing architectures. We utilize both MPI and OpenMP and use standard spatial domain decomposition to balance the number of motors and filaments across MPI processors. The motor update step samples the vicinity of every motor, where we use a parallel near-neighbor detection algorithm and update all motors in parallel. The most expensive part of the method is finding the solution to Equation 8, because of its very large dimension, equal to the total number of close pairs of filaments plus the number of crosslinking proteins. We use a fully parallel Barzilai-Borwein Projected Gradient Descent (BBPGD) solver (Yan et al., 2019) because the gradient is efficiently computed by one parallel sparse matrix-vector multiplication operation.
aLENS is written in a modular design using standard object-oriented C ++ and is available on GitHub as discussed at the end of the Discussion section.
Verification and benchmarks
To validate and benchmark aLENS, we first note that its collision handling approach has already been benchmarked for the pure-filament phase, and shown to accurately reproduce the equation of state and the isotropic-nematic liquid crystal phase transition of densely packed rigid Brownian rods (Yan et al., 2019). This capacity to accurately compute the dense packing phase of fibers makes aLENS valuable to simulate many dense biological filament assemblies. The accurate treatment of steric interactions extends beyond other simulation methods and software, where steric interactions are often approximated by soft repulsive potentials or neglected.
We now further benchmark of aLENS by simulating mixtures of filaments and motors and directly comparing simulation results with experimental data. Although there are many parameters in our motor model, these comparisons don’t involve fitting of model parameters to experimental data. Instead, we chose motor parameters as measured from experimental data (Scharrel et al., 2014; Fürthauer et al., 2019) or estimate them based on similar motor proteins (Cross and McAinsh, 2014).
Directed transport of microtubules by mixed active and inactive motors
We begin by verifying our motor model by reproducing results from experiments on directed microtubule transport (Scharrel et al., 2014). As in the experimental system, the simulation begins with a fixed number of motors with one head attached to a fixed surface while the other head interacts with one microtubule. Some motor heads are active and can drive gliding of the microtubule, while other heads are inactive and behave as passive crosslinkers that hinder microtubule motion. Here, is the number of active motors and is the total number of motors (active and inactive). The microtubule velocity increases as increases from 0 to 1 in experiments (Scharrel et al., 2014) and in our simulations. As shown in Figure 2, our simulations quantitatively reproduce the experimental data. To achieve this agreement, we set the active motor velocity to , so the sliding velocity at matches experiment. Apart from this one experimentally constrained velocity, there are no fitting parameters in our simulation (further motor parameters are in Appendix B). In initial trial simulations, we found that changing the total motor number didn’t noticeably affect the microtubule transport velocity. Therefore, for the results shown here we fixed , similar to the experimental system. Since the transport trajectory is stable without stochastic noise, as shown in Figure 2, there is no need to perform ensemble average to determine the transport velocity. Therefore, we ran 1 simulation for 10 s for each ratio .
Self-straining state of actively crosslinked microtubule networks
As an additional verification, we compare aLENS with results of recent experiments of Fürthauer et al., 2019 in which many-microtubule assemblies are densely packed into a nematic bundle and crosslinked by a large number of motors. In this heavily crosslinked nematic regime, microtubules are found to be transported by motors along the nematic director direction at a constant velocity in a direction determined by individual microtubule polarity. Experimentally, microtubule velocity was found to be independent of the local average polarity of the ensemble, as has been observed in extract spindles (Needleman et al., 2010), and (over the range of experimental conditions) independent of motor density. This phenomenon of oppositely oriented, constant velocity microtubule fluxes was referred to as ‘self-straining motion’, with the system interpreted as being composed to two polar microtubule gels whose inter-connecting motors pulled them past one another.
We simulate this experiment using 3,000 model microtubules with . Initially the filaments are confined in a tube of diameter , randomly initialized with their orientations along the (pink) and (white) directions, and packed at about 30% volume fraction. The simulated system is periodic along the direction, with periodic tube length . There are approximately 25 motors per microtubule according to the experimental estimates, and in our simulations we vary the motor-to-microtubule number from 10 to 30. There is no accurate measurement for the XCTK2 motor in these experimental conditions. Therefore, we used experimental estimates of for the walking speed of NCD motors (Furuta and Toyoshima, 2008). To approximate the experimental measurement of velocity that used line photobleaching (Fürthauer et al., 2019), we sample the local polarity and straining velocity using virtual sampling planes, as shown in the left panel of Figure 3. As in Fürthauer et al., 2019, Figure 3 shows that the straining velocity is largely independent of the number of motors and the local average polarity over the range simulated.
Intuitively, the straining velocity is predominantly determined by the free walking velocity of the motors in limit of many cross-linkers. From our simulations, we find a straining velocity of approximately , close to the experimental measurement of .
Large-scale parallelization efficiency
Simulation of cellular-scale cytoskeletal assemblies requires methods that can reach large system sizes and timescales. Therefore, we developed aLENS to efficiently utilize modern high-performance computing resources. Millions of objects and constraints can be simulated with aLENS. Figure 4 shows detailed parallel efficiency measurements for one large-scale test case, similar to that in Figure 6, but more than 10 times larger. Here we track 1 million microtubules and 3 million motors for 100 timesteps. The performance is benchmarked on a cluster interconnected with infiniband and each node has two AMD EPYC 7742 CPUs, each having 64 cores at 2.5 GHz. We launched hybrid MPI + OpenMP jobs such that each MPI rank has 16 OpenMP threads. On average at each timestep the constraint optimization solver handles approximately 8 million collision and doubly bound motor constraints. The number of constraints changes at every timestep due to a variable number of collision pairs and to stochastic binding and unbinding of motors.
We achieve nearly ideal linear speed up as the number of cores increases (Figure 4). At 1536 cores, the efficiency remains at 93% and each timestep takes less than 1 s, making it possible to track such large systems on experimental timescales (a few seconds) within days or weeks of computing time. More importantly, the constraint optimization allows a that is one or two orders of magnitude larger than conventional pairwise potential methods. For the system simulated in Figure 4, aLENS can reach physical time per day, using a timestep size of .
Results
Here, we illustrate the ability to use aLENS to study the interplay between microscopic dynamics and macroscopic order in active cytoskeletal assemblies. The specific examples shown here are the formation and extension of a band of microtubule bundles, polarity sorting of short microtubules on a spherical shell, the development of asters with and without thermal fluctuations, and the effect of confinement on assembling microtubule-motor mixtures. For the results presented here, all simulations were conducted in solvent with viscosity at room temperature, using a fixed timestep unless otherwise stated.
Bundle formation and buckling in a filament band
Microtubules driven by crosslinking motors can bundle; sliding of microtubules within the bundles causes them to fracture dynamically (Sanchez et al., 2012; Foster et al., 2015; Roostalu et al., 2018). We study such phenomena through a large-scale simulation of 100,000 filaments modeling microtubules and 500,000 minus-end-directed motor proteins modeled after dynein (Figure 5). Motor crosslinking drives contraction of initially disordered, bundled filaments (Figure 5A and B). Aligning steric and crosslinking forces drive the system into a series of well-aligned bundles spanning several filament lengths (Figure 5C, see Figure 5—video 1 and Figure 5—video 2). The motors slide filaments parallel to each other, generating macroscopic extensile motion. Later, the extended network buckles and fractures (Figure 5C).
The macroscopic stresses and dynamics depend on the spatial organization of filaments and motor-driven sliding. To characterize this, we measure the joint probability distribution of the local nematic order parameter and the number of neighboring filaments crosslinked to a filament (Figure 5D). While the network contracts, the distribution of does not change significantly because the number of motors per filament and the maximum number of neighboring filaments within a densely packed structure remain roughly constant. As filaments align, they become near-perfectly nematic (), although less-ordered regions occur between aligned bundles of different orientations (Figure 5C1 and D2).
Inside the bundles, filament sliding by motors leads to transport along the local nematic director. Projecting filament trajectories onto the lab-frame -axis, we observe left- and right-moving filaments that speed up early in the simulation, and then maintain constant average velocities at later time ( in Figure 5E), as filaments align due to steric and motor forces (Figure 5F). Note that velocity and stresses plateau only when the nematic order saturates.
The filament motions created by motors cause the densely-packed filaments to collide often, creating a net extensile stress along the bundles’ axes (Figure 5F). However, the fixed simulation box size hinders the networks’ elongation, causing the bundles aligned with -axis to buckle due to the net extensile stress (Figure 5F, see Figure 5—video 1 and Figure 5—video 2). In contrast, bundles not aligned with the -axis are not constrained and so evolve into straight spikes. This misalignment of bundles is seen as a small net stress in the -directions for (Figure 5F).
Polarity sorting in a spherical shell
Crosslinking motors on antiparallel filaments drive polarity sorting, which transports filaments to regions of like polarity. This has been well-studied on a planar periodic geometry, e.g. (Gao et al., 2015b). Here, we use aLENS to examine the effect of confinement geometry on polarity sorting (Figure 6). The geometry is designed to explore the polarity sorting phenomena where initial filament alignment occurs in a spherical geometry and significantly affects the dynamics and steady state of the system. In this simulation, 100,000 filaments with aspect ratio are confined between two closely spaced concentric spherical shells at 40% volume fraction. The shell gap is , shorter than the filament length, with so filaments can move over each other in a restricted way. The filaments are initialized such that the nematic directors are along the meridians everywhere. 200,000 motors, modeled after kinesin-5 tetramers, drive relative filament sliding (Figure 6A). Brownian motion is modeled at room temperature and timestep is set to . Motors move toward minus ends of bound filaments at . Once they reach the minus ends, they immediately detach.
Motors walk along the filaments, driving sliding of antiparallel filaments (Figure 6B). This leads to polarity-sorted regions at the north and south ‘poles’ of the sphere, meaning that the filament orientation on average points toward the poles. Filaments with reversed initial polarity are transported to the equatorial region (Figure 6C1). In contrast to the planar geometry (Gao et al., 2015b), we did not observe the formation of polar lanes with boundaries between polarity-sorted regions approximately parallel to the polarity direction. Instead, on the sphere the boundaries between polarity-sorted regions are approximately orthogonal to the polarity directions, as more clearly illustrated by plotting the polarity divergence (Figure 6C1).
Motors also accumulate in some regions according to the filament polarity (Figure 6C1). These motor accumulation regions are actually regions where the divergence of filament polarity field is positive, meaning areas of overlap of filament minus-ends (Figure 6C2, C5 and G). This accumulation is illustrated by the positive correlation between motor density and at in Figure 6D. Furthermore, motor accumulation regions appear to show slightly lower filament volume fraction (Figure 6C2 and C4), as shown in Figure 6F. These correlations can be understood through the behavior of crosslinking motors near filament ends (Figure 6G). Once polarity sorted regions of filaments form, as the blue arrows represent, in regions where minus-ends meet minus-ends and vice versa in regions where plus-ends meet plus-ends. Minus-end directed motors accumulate in regions with , while plus-end motors accumulate in regions with . Once motors accumulate, they may attach to both minus ends and push them away such that the distance between minus ends is the length of motors. As a result, the volume fraction of filaments in that region is below average.
In contrast, if the motors stop walking but do not detach when they reach the minus ends (end-pausing, EP), the filament network contracts (Fig. 6E1-5) with volume fraction increases from 40% to 60% and eventually freezes at . We observe neither substantial polarity sorting nor motor accumulation. This indicates that the ability of motors to continuously walk, without end-pausing, is crucial to effective polarity sorting.
Aster formation in bulk
Aster formation is driven by motor pausing at ends of rigid filaments (end-pausing). Previous work has focused on how motor biophysics affects aster formation (Belmonte et al., 2017; Roostalu et al., 2018). An additional contributor to aster formation may be thermal fluctuations, which are difficult to tune experimentally but can be easily modulated in simulations (Figure 7). To examine this, we simulated 40,000 filaments and 80,000 processive, minus-end-directed, end-pausing motors starting from the same spatially uniform and orientationally isotropic random configuration (Figure 7A). In one version of the model, we included thermal fluctuations that drive filament motion (Figure 7D and movie Figure 7—video 1), while in the other thermal fluctuations of filaments were neglected (Figure 7E and movie Figure 7—video 2). The resulting structure of the system is significantly different in the absence of filament thermal motion, showing that thermal fluctuations influence the asters’ shape, structure, and ultimate spatial organization. With filament thermal motion, a number of dispersed, spherically symmetric, dense asters form. By contrast, in the absence of thermal motion the number of asters is larger and more regularly spaced, but their shape is more irregular and they contain fewer filaments (Figure 7D vs E).
These differences are clear in the radial distribution function of filament minus ends, which are clustered by motors paused at filament ends (Figure 7B). On large length scales, the radial distribution reflects larger and denser asters for the simulation with thermal fluctuations that drive filament movement. In simulations of both cases, two prominent peaks appear in the radial distribution funcation at small length scales and which correspond to scale on which filaments bind to or are crosslinked by motors, respectively (Figure 7B, D2 and E2). The relatively small peak between these two maxima correspond to filaments that are geometrically confined between two crosslinked filaments.
These differences arise from the fact that athermal filaments do not move unless driven by motors, which requires that two filaments are close enough to become crosslinked. This suggests that, at steady state, athermal aster centers are separated by twice the filament length. In contrast, with thermal motion filaments may diffuse ∼ in . This allows filaments to diffuse until they are captured in regions of high motor density, such as aster centers. Furthermore, with thermal fluctuations the asters themselves diffuse, which leads to aster coalescence (Figure 7D1). These observations and estimated lengthscale are quantitatively confirmed by analyzing the static structure factor of aster centers (details in Appendix F), which shows that the athermal simulation has approximately three times more asters than the thermal case (Figure 7D vs E).
The differences in the dynamics of aster formation are also reflected in stress measurements (Figure 7C), where the more crowded filament configurations of the thermal case produces a larger stress throughout the simulation. In both cases, the motor-induced stress initially increases quickly, reaching a peak at roughly , similar to the behavior during bundle contraction shown above (Figure 5F), before declining. The average time required for motors to walk to filament ends, , determines the initial contraction timescale. After reaching minus ends, motors pause and relax toward their equilibrium lengths. As a result, both the motor and collision stress grow in magnitude as more motors accumulate at minus ends.
Confined filament-motor protein assemblies
Confinement of cytoskeletal structures plays an important role in cells, where the cytoskeleton is spatially constrained by membranes, organelles, and other cellular structures. Although in the previous examples we studied open periodic geometry, here we show results of cylindrical confinement. The microtubule motor system is constrained inside a cylinder with periodic boundary conditions at the cylinder ends. The impermeable boundary of the cylinder surface to motors and filaments was implemented by our complementarity constraints.
Similar to the previous bulk cases, motors move filaments to create high-density crosslinked filament aggregates that coexist with a relatively low density vapor of non-crosslinked filaments. In bulk systems as shown above and in previous work, end-pausing motors drive aster formation because crosslinking motors pull filament ends together. A confining cylindrical boundary strongly modifies the conformation of these aggregated structures (Figure 8). These simulations used long filaments at a fixed packing fraction (), confined in two cylinders with diameters and .
For a small-diameter cylinder where one filament length can fit across the cylinder (, ), the cylinder is too narrow for asters to form. Instead, motor sliding and end-pausing drive the filaments into polarity-sorted bilayers (PSBs, Figure 8A and movie Figure 8—video 1). A single polarity-sorted bilayer contains a central interface of highly crosslinked filament minus-ends between two antiparallel polar layers of filaments (Figure 8). At steady state, the system consists of individual PSBs separated by low-density vapor regions containing few motors. As expected, the local nematic order parameter nearly reaches 1 within PSBs. Even the the vapor phase is close to nematic (Figure 8A4), due to the strong confinement effect.
Next we increased the diameter of the cylinder to () to weaken the confinement (Figure 8B and movie Figure 8—video 2). Here, the polarity-sorted bilayers are not present, because the larger cylinder diameter allows filaments to reorient and organize into bottle-brush-like aggregates (BBs). In the bottle brushes, filament plus ends are oriented radially outward from the cylinder axis, forming a hedgehog line defect capped by half asters (Figure 8B2). Motors become highly concentrated along the line defects at the center of the cylinder (Figure 8B3). The radial hedgehog structure of BBs is evidenced by a negative local nematic order parameter (Figure 8B4, blue line). The splayed nature of the BBs produces a lower relative packing fraction of times the vapor when compared to the PSBs (Figure 8B4, red line).
Discussion
We designed aLENS to (i) model crosslinking motor kinetics conforming to an underlying free energy landscape, (ii) circumvent the timescale limitation imposed by conventional explicit timestepping methods, and (iii) efficiently utilize modern parallel computing resources to allow simulation of cellular-scale systems. This efficient framework allows both modeling the individual building cytoskeletal building blocks (filaments, motors) and gathering mesoscale statistical information such as stress and order parameters from a large system. This multiscale capability will make it possible to directly compare simulations with experimental observations on mesoscopic and macroscopic scales over timescales from seconds to minutes.
The aLENS framework is not limited to a specific motor model. Because of the modular design of the motor code, the motor model can be extended to include additional physics such as force-dependent binding and unbinding rates, or even entirely replaced, say, with a passive crosslinker or other model. Dynamic instability and branching of cytoskeletal filaments can also be integrated with the constraint minimization problem, as we showed previously in modeling the division-driven growth of bacterial colonies (Yan et al., 2019). Long and flexible polymers can be simulated by chaining short and rigid segments together with flexible connections (Appendix H), even with nonlocal interactions mediated by hydrodynamics, electrostatics, or other fields (Shelley, 2016; Nazockdast et al., 2017; Maxian et al., 2021). For example, in ongoing work we have used aLENS to simulate chromatin in the nucleus as a bead-spring chain moving through the nucleoplasmic fluid, and confined by the nuclear envelope.
Recent years have seen considerable innovation in computational approaches to cytoskeletal modeling, implemented in powerful simulation packages including Cytosim (Nedelec and Foethke, 2007), MEDYAN (Popov et al., 2016), and AFINES (Freedman et al., 2017). These packages utilize a variety of coarse-grained representations of cytoskeletal elements and numerical simulation schemes, with the diversity of approaches in part reflecting the diversity of cytoskeletal systems and phenomena of interest. aLENS brings a powerful set of new capabilities to the table, significantly expanding the range of accessible time and length scales in simulations of systems in which excluded volume and crosslink-mediated interactions play an important role.
aLENS has been open-sourced on GitHub: https://github.com/flatironinstitute/aLENS (copy archived at swh:1:rev:f2dd484f82443735562ad7b480fe7ed9fc020fb0; Adam, 2022) and precompiled binary executable is available on DockerHub: https://hub.docker.com/r/wenyan4work/alens. Our GitHub documentation provides a clear roadmap for developing additional user-specific modules.
Appendix 1
Crosslinker and motor properties
Appendix 2
Crosslinker binding and unbinding
Kinetic Monte-Carlo: crosslinking protein-filament interactions
Our molecular model simulates distinct filaments and crosslinking proteins (crosslinking motor proteins, passive crosslinkers, etc.). This model includes fluctuations in bound protein number and binding kinetics that recovers the equilibrium distribution of static crosslinking proteins Gao et al., 2015a; Blackwell et al., 2017; Rincon et al., 2017; Lamson et al., 2019; Edelmaier et al., 2020. Modeled crosslinking proteins in solution bind to one filament and then crosslink two filaments (Appendix 2—figure 1). In dense filament networks, the spatial variation of unbound proteins play an important part in the network’s reorganization. To account for inhomogeneous concentrations, we explicitly model unbound crosslinkers and develop a method that reproduces one head bound and doubly bound distributions consistent with a mean-field model (Appendix C.2). All binding and unbinding rate calculations are summarized in Appendix 2—table 1.
Unbound crosslinking proteins rapidly diffuse in the surrounding fluid until a head binds to a filament. Heads of modeled crosslinking proteins in solution bind to filaments described by the reversible chemical reaction
where is a head and is a binding site on a filament. The association constant of the heads to binding sites is described by the equilibrium equation
where defines the concentration of substance . The association constant has units of inverse molarity, and relates the the on- and off-rate constants and .
Unbound crosslinkers are modeled as diffusing points with center of mass positions and diffusion constants du. The heads of a crosslinker have spatial- and time-dependent concentrations . The head binding rate is the volume integral over the product of the on-rate constant, binding site density, and crosslinker concentration
where and is the linear binding site density along filaments. The lab position along the ith filament is parameterized by .
The binding probability in a timestep is an in-homogeneous Poisson process with the cumulative probability function
We assume the tight binding limit and do not consider multiple binding and unbinding events of one crosslinker during a timestep . The average number of multiple events may be calculated from binding parameters and the timestep allowing one to set a probability threshold (Lamson et al., 2021). Heads of the same crosslinking protein are forbidden to be bound to the same filament at the same time.
To describe during a timestep, we first consider a crosslinking protein with two heads connected by a flexible but relatively stiff polymer tether with length . Because of the tether’s stiffness, the radius of gyration of an unbound protein is assumed to be . The binding heads at the tether’s ends move by the tether’s rotation and translation. Depending on the timestep’s length, either rotation or translation will dominate the evolution of the head distributions.
For most biological crosslinking proteins, the rotational diffusion is fast compared to translational diffusion. When the crosslinker’s center does not diffuse far from its position at the beginning of a timestep, that is, , rotational diffusion dominates and we approximate heads to be within a sphere of radius centered at . Realistically, the head distributions can vary within this volume but because is small compared to filament lengths, we approximate the head distribution as being uniform, that is, , where is the Heaviside step function. For larger crosslinking proteins, more detailed spatial distributions may be calculated using freely-jointed or worm-like chain models.
For uniformly distributed heads, the head binding rate is
where is the filament ’s length segment within . To account for cylindrical filaments with diameter , we augment the binding radius such that . Since this scenario exists within a regime where the crosslinker or motor does not diffuse far from its initial position in a timestep, we approximate , for .
However, it is uncommon that an unbound crosslinker or motor will diffuse less than rg in a timestep, and so we must account for the protein’s translational diffusion. The diffusion equation models the mean spatial distribution of a unbound crosslinking protein’s center
which has the solution
If the characteristic diffusion length , then equation (14) underestimates binding (Appendix 2—figure 2). The large diffusion distance also allows us to approximate the head distribution as matching the protein’s spatial distribution, . Substituting the binding rate equation (12) and the solution to the diffusion equation (16) into the integral of equation (13) gives
For straight, rigid filaments, we take the volume and time integrals while reparameterizing by the crosslinker’s perpendicular and parallel distances from a filament segment’s center. This gives the linear binding probability density for a filament
Integrating over si gives the binding probability of one crosslinker head to a single filament. The total binding probability is then
where is the number of filaments surrounding the crosslinker head.
Calculating the binding probability from this function and ensuring that the protein unbinds so that detailed-balance is satisfied is computationally prohibitive. Instead, setting to the root mean square diffusion distant and using the rate equation 14, we obtain a useful approximation to equation 19. This is computationally efficient and mitigates the low binding rates when diffusion or times steps are large (Appendix 2—figure 2). We note that the accuracy of this approximation is dependent on the length of filaments in the simulation with longer filaments reducing the error from edge effects. Future work will focus on developing methods to more accurately reproduce the above binding distribution.
Once bound, a crosslinking protein’s head unbinds with a constant rate
If implementing equation 14, the protein unbinds into a uniform sphere of radius . This ensures crosslinkers bind to and from regions in a way that satisfies detailed-balance.
With one head bound, a crosslinker’s tether deforms to bind its other head to adjacent filaments. Deforming a tether requires energy, implying crosslinking kinetics depend on tether deformation. For passive crosslinkers and motors with rapid kinetic rates compared to stepping rate, the ratio of binding and unbinding rates to and from a position on a filament is proportional to the Boltzmann factor of the binding free energy.
With one head bound to filament at position si, the binding rate constant for binding to a location sj on filament is
where is the unbinding-rate of crosslinking proteins when no force is applied, is a binding association constant similar to , and is the free energy contribution from the tether
Before crosslinking, the unbound motor head explores an effective volume centered around the bound motor head. Not considering steric interactions with filaments, this volume is the free head’s position weighted by the Boltzmann factor integrated over all space.
We impose an integration cutoff radius where the integrand becomes sufficiently small, making the factor consistent with a finite lookup table Lamson et al., 2021. The binding head’s positional distribution must also satisfy the Boltzmann factor. We recover the proper binding distribution through inverse transformation sampling of equation (21) Lamson et al., 2021.
Theory and experimental evidence suggests that binding rates depend not only on energy but also force Evans and Ritchie, 1997; Dudko et al., 2006; Walcott, 2008; Guo et al., 2019. This allows for catch-bond-like behavior where proteins remain crosslinked for longer if under tension and release quicker if compressed. We replicate this behavior with the function
where λ and xc are the energy factor and characteristic length specifying the behavior of the energy- and force-dependent binding/unbinding, respectively. For values of , you see catch-bond like behavior whereas values of exhibit slip bond behavior Walcott, 2008; Edelmaier et al., 2020. This formalism can also be used to add in angle dependence.
When we include an effective energy and/or force dependence, the unbinding rate becomes
This does not change the final stored energy of either bound state but does effect the frequency at which the motors will switch between having one head bound and crosslinking.
Mean-field theory for crosslinking proteins
We expand on our previous mean-field motor density model to include motors that have dissimilar heads, diffusion and walking in singly and doubly bound states, and a time-dependent homogeneous concentration of unbound crosslinking proteins Lamson et al., 2021. This last addition imposes the condition that the total number of proteins when all bound and unbound states are accounted for remains constant.
This requires a system of equations with crosslinking densities , singly bound densities and , and an unbound density to model all crosslinking proteins between filaments. By convention,
For heads and , crosslinking diffusion constants and , the drag speeds and , and walking speeds and have been shown to depend on the force exerted on the binding heads and thus the stretch of the tether . No tether force acts on singly bound motor proteins but the singly bound diffusion constants and and walking speeds and may depend on si through some other physical mechanism such as crowding or state of the filament’s lattice. The total number of crosslinking proteins of the system is
and is constant in time.
Appendix 3
Filament dynamics
Constraint quadratic programming
In the main text we discussed specifically filament. In fact, our method is applicable to rigid bodies of arbitrary shapes. Here we derive the detailed equations.
The configuration of each particle is tracked by its center location in the lab frame and its orientation as a quaternion Delong et al., 2015. This is the vector component of the quaternion, not the unit orientation vector. There are other choices to specify the orientation, such as Euler-angles and rotation matrices, but we prefer quaternions for simplicity. The geometric configuration for all filaments can be written as a column vector:
which is a function of time: . The translational and angular velocity of all filaments can also be written as a column vector:
Similarly we can write the force and torque applied on all filaments as a column vector:
The kinematic equation of motion 35 maps to , via a geometric matrix .
is a block diagonal matrix, with one and one block for each particle:
is the identity matrix, same for every particle. Each block simply corresponds to the translational motion of each particle . Each refers to the rotational motion of each particle, where for each :
Here is the Levi-Civita symbol for cross-product in 3D space.
The biological filaments we consider mostly have lengths on the nm to scales. At these scales, solvent viscosity dominates and inertia effects can be ignored, which is the so-called Stokes regime where the mobility matrix maps the force linearly to the velocity :
includes collision force between particle-particle and particle-container pairs, linker force between particle pairs generated by doubly bound crosslinkers, Brownian force on each particle generated by thermal fluctuations, and other externally applied forces through gravity and electrostatic fields.
In principal, Equation 35 together with Equation 38 can be integrated directly because both and are functions of the geometry and time only. However, this approach is usually impractical, because or is usually very stiff functions of the geometry. For example, the collision force is usually computed by assuming a very stiff pairwise potential between filaments, such as the Lennard-Jones or WCA potential. This stiffness poses severe limits on the stability of all explicit temporal integrators. We discussed this problem in detail for collision forces in our previous work on Brownian spherocylinders Yan et al., 2019 and rigid spheres in Stokes flow Yan et al., 2020. Instead of computing using repulsive potentials, we imposed non-overlapping constraints on the geometry while integrating Equation 35.
Equation of motion with geometric constraints
In the following, the subscript refers to constraints, which includes both unilateral (with subscript ) and bilateral (with subscript ) constraints. Unilateral constraints refer to those inequality constraints, i.e., constraints imposed from one side, while bilateral constraints refer to equality constraints. In our system, unilateral constraints come from collisions and bilateral constraints come from doubly bound crosslinkers. The subscript refers to non-constraint, i.e., physical components that are independent of the constraints.
For unilateral constraints, we define the grand distance function between every pair of particles as a column vector:
where each is the minimal distance between particles with indices and . Similarly, we define the grand distance function for bilateral constraints:
where each is the distance between two fixed points on particles and , respectively. Physically, is simply the length of each doubly bound crosslinker. With this definition, there are in total unilateral and bilateral constraints in the system. In other words, there are in total possibly colliding pairs of filaments and doubly bound crosslinkers. Both kinds of constraints are functions of the system geometry, so we shall write them as and in the following when necessary.
The force magnitude between all pairs of particles for unilateral and bilateral constraints can be written similarly as column vectors:
For each or , there is a corresponding force magnitude or , the (normalized) direction vector of this force, and the location and where this force is applied on the filament and respectively, as shown in Appendix 3—figure 1. With norm vectors defined in this way, or is positive when the force is repulsive between two filaments.
For unilateral constraints and satisfy this complementarity condition:
This condition means and are orthogonal to each other, and all components of and are non-negative Yan et al., 2019.
For bilateral constraints and satisfy this linear equality condition because they are modeled as Hookean springs:
is a diagonal matrix, with the stiffness constant κ for each spring on its diagonal . Obviously every constant is positive. and represent the current and free length of every spring.
Both unilateral and bilateral constraints change over time, as particles move and springs attach to and detach from particles.
All combined together, we reach the equation of motion with geometric constraints:
These equations are solvable when closed by a geometric relation, which maps the force magnitude and to the force vectors and :
where and are sparse matrices containing all orientation vectors of unilateral and bilateral forces, i.e., all vectors as shown in Appendix 3—figure 1. More details about the definition of can be found in the following.
Both and depend only on the geometry norm vectors and location of constraints , together with the particle indices , i.e., which particles appear within the vicinity of each other and which are bound to each other by springs.
Further, this constraint formulation is also applicable to the case where one constraint is not between a pair of particles but between one particle and one externally imposed confinement or boundary, for example, a flat substrate or a spherical shell. The only necessary modification in this case is to ignore one side of the collision geometry when constructing the matrix and . For example, if a particle collides with a fixed substrate, we only include and in , because this substrate does not appear in the mobility matrix .
Temporal discretization and convex quadratic programming
Equations 45 and 46 generate a differential variational inequality (DVI), which can be solved when equipped with a timestepping scheme. In this work we use the linearized implicit Euler timestepping scheme, similar to our previous work Yan et al., 2020; Yan et al., 2019, for three reasons:
It is straightforward to integrate with both the Brownian motion and the stochastic binding and unbinding of crosslinkers into an Euler scheme.
The scheme cannot be explicit. Otherwise is limited to be tiny by the temporal stiffness of collision and doubly bound crosslinkers.
The implicit scheme is linearized to avoid expensive large-scale non-linear problems.
With timestep , Equations 45 and 46 are discretized at timestep as:
The unknowns to be solved at every timesteps are the constraint force magnitude . Equations 47d and e are nonlinear because and are nonlinear functions of . Therefore, we linearize these two terms:
Here we have also rewritten the Equation 47 e into a equivalent form, similar to Equation 47 d. The right side, and should be understood in the component-wise sense. Equations 48 a and b are now closed and can be solved. We shall drop the superscript in the following derivations because we shall repeat this solution process at every timestep.
Then equation (48) can be written in the block-matrix form:
where the blocks are clear from equation (48)
Here we used the fact that:
The first relation has been well known in the problem of collision constraints Anitescu et al., 1996. In this work we extend this result to bilateral constraints. A proof of this is detailed in Section Symmetry of the geometrically constrained optimization problem.
This formulation means that the coefficient matrix is Symmetric-Positive-Semi-Definite (SPSD), because the mobility matrix is SPD and is positive & diagonal:
Because of this SPSD property, solving Equations 48 is equivalent to solving a constrained quadratic programming (CQP) due to the Karush-Kuhn-Tucker condition Nocedal and Wright, 2006:
Here is a column vector, and
This can be conveniently understood as following. represent the current values of the constraint functions plus the (linearized) changes due to non-constraint forces , such as Brownian fluctuations. represent the linearized relation between the unknown constraint force and the changes of the constraint functions .
Solving one global optimization problem at every timestep is usually expensive, because the dimension of this problem (53) can be very large in a system with many particles and constraints. However, this CQP. (53) is a class of well understood optimization problem and fast algorithms exist. We previously developed a fully parallel Barzilai-Borwein projected gradient descent (BBPGD) method Yan et al., 2020; Yan et al., 2019 to efficiently solve this problem for unilateral constraints only. In this work we found that the same BBPGD method also works very well for the current problem.
One way to understand the constraint optimization method is that the temporal integration ‘jumps’ on a timescale that the relaxation timescales of unilateral and bilateral constraints (collisions and crosslinker springs) are bypassed. As a special case, in the limit of infinitely stiff springs where the quadratic term matrix is still SPSD and the Equation 53 is still easily solved. Physically speaking, in this case the bilateral constraints degenerate from deformable springs to non-compliant joints.
Last but not least, due to the linearization in Equations 48 our geometric constraint method has some inevitable numerical errors in imposing both types of constraints for any finite timestep size . In other words, there may be some slight residual overlaps between filaments even if Equation 48 are exactly solved. Such residual overlaps converge to zero as the timestep size decreases to zero, which follows the typical first order numerical convergence. In principal, such residual overlaps due to linearization errors can be eliminated if the full nonlinear constraint problem is solved. However, the cost for a full nonlinear solution is prohibitive. Therefore, in our implementation we do not pursue the elimination of such residual overlaps. Instead, we focus on the stability of temporal integration, i.e., the temporal integration of trajectory is stable even if very large forces suddenly appear on some particles due to, for example, Brownian noise or a large number of doubly bound crosslinkers. We have also benchmarked our algorithm such that the average physical properties of the entire suspension converge to the reference values. For example, our method accurately captures the system stress, the equation of state and isotropic-nematic phase transition of rigid Brownian spherocylinders Yan et al., 2019.
Symmetry of the geometrically constrained optimization problem
We briefly prove the symmetry of Equation 51. The derivation in this section is applicable to rigid particles with arbitrary shapes.
The configuration of each particle is tracked by its center location in the lab frame and its orientation as a unit quaternion . For an arbitrary 3D vector which is attached to a particle and follows the particle’s motion, its image in the lab frame following the particle’s rotation is:
where denotes the rotation matrix generated by the unit quaternion .
For both unilateral and bilateral constraints, has a sparse column structure:
where are particle indices for the -th column. For example, for a system with 4 particles and two possible collision pairs and , the matrix for collision (unilateral) constraints is:
Because of this structure, to prove Equation 51 we only need to prove the equality for a pair of particles , as shown in Appendix 3—figure 1.
We consider two rigid particles centered at , each has a point fixed on the body (not necessarily on the surface). and are vectors in the lab frame from the particle centers to the points. The distance between these two points follows the rigid body motion of both particles:
where and are the well-known rotation matrices. and are locations of those two points in their intrinsic coordinate systems. is simply the distance between the two points, dependent on the motion of the two rigid particles.
According to our definition, maps the force magnitude γ between the two particles to force and torque vectors on each particle:
can also be explicitly written as follows:
Further, we notice the symmetry of P and Q in the above equations of and , we only need to prove the following equality for P:
In Equation 62 the only difference between unilateral and bilateral constraints are how the two points on particles P and Q are picked. For unilateral (collision) constraints, the two points are where the distance reaches the minimal distance between the two particles. For bilateral constraints, there is no such restriction and the two points are arbitrary. Obviously, we only need to prove this latter case, i.e., to prove Equation 62 when is an arbitrary vector.
The first row of Equation 62 is straightforward because
The second row can be proved as follows. We first derive some general results about quaternions and rotation matrices, dropping the subscript to simplify equations. When the particle rotates with an angular velocity , the motion of satisfies
can also be directly computed by applying the chain rule on Equation 55, because is intrinsic to the particle invariant over time:
The matrix bridges angular velocity and quaternion by definition:
We have
This must be valid for arbitrary , which is only possible when
Now for another arbitrary vector :
Using Equation 69 we can prove the second row of Equation 62. We first calculate the derivatives of Equation 62 using dummy indices:
Multiply the matrix on both sides:
Substitute the right side by Equation 69, we get:
This is exactly the right side of Equation 62 because by definition and . Therefore Equation 62 holds and the equality Equation 51 holds.
Implementation
As mentioned above, at each timestep we first update the crosslinkers and then the filaments. We implement the two steps in a fully parallelized C ++ codebase, utilizing MPI and OpenMP and scalable to hundreds of CPU cores.
In the crosslinker-update step, we have assumed that every crosslinker has binding-unbinding probabilities independent of other crosslinkers. Therefore, it is straightforward to parallelize this step, we only need to search the vicinity of each crosslinker to find the candidate filaments that this crosslinker may bind to. This can be conveniently accomplished by a standard near neighbor detection operation based on bounding volume hierarchy Iwasawa et al., 2016, where the search radius is determined by the maximum stretch of each crosslinker. Once the candidate filaments for each crosslinker have been found, we compute the k-MC probabilities using a precomputed lookup table with interpolation to speed up the numerical integration while maintaining accuracy. This step is also parallel on all CPU cores.
After the positions of crosslinkers have been updated, we update the set of bilateral constraints in the constraint solver. If one crosslinker has changed its status from doubly bound to singly bound, the corresponding constraint is removed from , and vice versa. The geometric matrix is also updated according to the current geometry, that is, those locations and norm vectors . Then, a near neighbor detection operation is performed for all filaments to determine the unilateral constraints and its geometry . If two filaments are far away from each other, there is no need to include this pair in the constraint solver because it is impossible for them to collide within this time step . Therefore, we include only close pairs whose minimal distance is below some threshold value . is controlled by system dynamics, that is, how far each filament may move within each timestep. Empirically, we take to be the diameter of each filament.
Once the constraint problem Equation 48 has been constructed, we run a fully parallel iterative Barzilai-Borwein Projected Gradient Descent (BBPGD) solver Yan et al., 2019 to solve for constraint forces and , together with the velocities and due to constraint forces and by solving the equivalent CQP 53. The cost of every BBPGD iteration scales as , that is, the total dimension of the linear constraint problem. The number of iterations needed depends on the complexity of the structure. For example, if all filaments are far from each other such that almost no collisions or no doubly bound crosslinkers exist, the solution converges almost immediately. If all filaments are densely packed and many doubly bound crosslinkers form between the filaments, many iterations may be necessary. Empirically, the solution of Equation 53 converges in a few hundred BBPGD iterative steps for common biological structures such as microtubule asters or bundles. However, each iteration of BBPGD is cheap because we only need to compute . This sparse matrix-vector multiplication spmv is a well optimized standard mathematical operation. The BBPGD solver is implemented using the Trilinos package for distributed linear algebra operations. Once and have been solved, the filament configuration is updated and then next timestep starts.
Appendix 4
Performance measurements
The bundle contraction-buckling simulation runs on 2 nodes connected by Infiniband, and each node has two AMD EPYC 7742 64-Core CPUs 2.25 GHz. Appendix 4—figure 1 shows the performance of the solver. Different from the aster formation case shown in Appendix 4—figure 2, computational time spent on crosslinkers is negligible. This is because as the fixed head of each dynein is permanently attached to the microtubule, we only need to update the status of the free head. Also, the free heads only experience the transition, which further reduces the computational cost. On the other hand, the collisions in this case is more difficult to resolve compared to the aster cases, because in nematic bundles more collisions happen and collisions may happen anywhere along the microtubule instead of only at the center of each aster. Similar to the aster case, we see that computational time for constraint solution is proportional to the number of BBPGD steps.
For the aster formation in bulk problem, each case runs on 1 node of dual Intel Xeon 14-core CPUs E5-2680 v4 2.40 GHz. Appendix 4—figure 2 shows the performance of the solver for simulations with and without thermal fluctuations. Updating the binding states of kinesin-5 motors requires roughly the same wall clock time per timestep for the entire simulation. However, the time required to solve the constraint problem grow quickly in the initial stage. The solver cost increases mostly due to the increased number of BBPGD steps (as shown in the right panels of Appendix 4—figure 2) even though the dimension of the constraint problem Equation 53 grows as more kinesin-5 motors become doubly bound and more collisions occur as the asters form. The increase in BBPGD steps dominates because while the dimension of increases, the dimension of remains constant since the number of microtubules does not change and the cost of each BBPGD step mainly depends on the cost of applying when calculating in solving Equation 53.
Appendix 5
Aster center analysis of asters formation in bulk
This section provides more details about the simulation in Section Confined filament-motor protein assemblies of main text.
To quantify the spatial aster center distribution, we identify aster centers for each snapshot of data. For cross validation, we use two different methods to identify the aster centers: ‘DBSCAN’ and ‘Graph’. The implementation details are discussed in the following. Once aster centers are identified, we compute the radial distribution function . Then, we compute structure factor based on as
because the structure of aster centers is isotropic and the orientation of does not matter.
Appendix 5—figure 1 summarizes the results for BMT and NBMT systems. Both ‘DBSCAN’ and ‘Graph’ methods generate similar results. According to , there is a clearly length scale for the NBMT case at . This reflects the spacing between individual asters at approximately . This length scale is straightforward to understand. Since we used microtubules of length , if two aster centers are smaller than , then the edge of two asters may touch or overlap, and are likely to be crosslinked by kinesin-5 motors and gradually merge into one bigger aster. With this length scale argument, we can estimate the total number of asters in the simulation box to be . This simple estimation agrees with our aster center identification results, which on average 1200 aster centers are found for each snapshot of steady state configuration.
The BMT case does not show such a significant special length scale in , but they do show larger spacing between asters according to , compared to the NBMT case. This agrees with the snapshots shown in Figure 7 in the main text, where asters are larger but more distributed in space. Both methods identified on average 280 aster centers for each snapshot of steady state configuration.
Identify aster centers by DBSCAN method
DBSCAN stands for Density-Based Spatial Clustering of Applications with Noise and is a method to identify clusters from points in space. With a given distance and a threshold of minimal number of points, DBSCAN searches all clusters such that each cluster has no less than points and no point in one cluster is more than distance separated from other points in the same cluster.
To apply DBSCAN, we first create a point cloud using the location of all microtubule minus ends in the system, and then run the algorithm using the function cluster.dbscan from the python package scikit-learn. Once clusters have been identified, we compute the aster centers by averaging the location of all points in each cluster.
We set , because according to Figure 7 in main text, the minus ends of microtubules are separated roughly . We also set .
Identify aster centers by Graph method
The entire microtubule-kinesin system can be abstracted as an undirected graph, where each microtubule is a node marked by their index and each doubly bound kinesin form an edge. Then, one aster is simply abstracted as a connected component of the graph. We use the connected_components() function in the python package networkx to find all such connected components, with minimal number of microtubules . We identify aster centers by computing the average location of minus ends of these connected components.
Appendix 6
Confined filament-motor protein assemblies
This section provides more details about the simulation in Section Confined filament-motor protein assemblies of main text. We simulate 9,216 microtubules and 27,648 crosslinking motor proteins in a cylindrical volume. Microtubules are modeled as rigid spherocylinders with length and diameter (aspect ratio of . Crosslinking motor proteins are modeled as Hookean springs. The cylinderical axis is oriented along the direction, with a periodic boundary condition. The radial direction has a hard confinement boundary. System temperature is fixed at and the simulation timestep is , with the system configuration recorded every 500 steps. Solvent viscosity is set at . Values for the cylinder diameter, are chosen to disrupt the self-assembly of an ideal aster. Initially, microtubules were aligned along the direction (cylinder axis) such that the initial nematic order parameter, , was 1. Here, θ is the angle between the microtubule orientation vector and the axis, and denotes an average over all microtubules. Equal numbers of microtubules are oriented in the and the direction such that the polar order parameter, .
Structural quantification
To measure the structure of our steady-states, we compute the local packing fraction , local nematic order, local crosslinker density, and pair distribution functions. For the first three quantities, we start by dividing the volume into cylindrical bins with their axis in the direction. The diameter of the bins is equal to , and the height is chosen as . The local packing fraction is computed by calculating the cumulative volume of all microtubules that fall inside each bin, and then dividing by the bin volume. For simplicity, we treat the filaments as cylinders (such that there as no hemispherical caps at their ends). For the local nematic order parameter , we find the total number of microtubules, , that pass through each bin at some location . For each microtubule in the bin, we compute it’s individual contribution to the local nematic order parameter, . We weight each by a factor that depends on the length of microtubule that falls inside the bin, normalized by the cumulative length of all other microtubules that traverse the bin. We calculate the local nematic order parameter as
Local crosslinker density, , is found by counting the number of center points of crosslinking motor proteins that fall in each bin, and then dividing by the bin volume. For this calculation, we only consider doubly-bound crosslinking motor proteins. Finally, we compute the pair distribution functions by finding the distance (using the nearest image convention in ) of all microtubules from a single reference microtubule. Repeating this for all microtubules as a reference, and averaging yields a pair distribution function. The useful dimensions here are and . Due to the non-periodic nature in ρ, this pair distirbution function does not decay to 1.
The simulation volume is a cylinder with height . We measure structural properties of the system over the course of the simulation. A kymograph of the local packing fraction is shown in Appendix 6—figure 1. The local nematic order (Appendix 6—figure 1B) shows that the polarity-sorted bilayers (PSBs) have a maximum order parameter equal to 1 The condensation of microtubules is mediated by the crosslinking motor proteins. In Appendix 6—figure 1C, we show a kymograph of the local density of the crosslinking motor proteins.
The microtubule pair distribution function at steady-state (Appendix 6—figure 1D) shows that plus-ends (top plot) are distributed in a ring. The ring radius is set by the length of a single crosslinking motor protein. There is negligible density away from the ring. In contrast to asters (that contain microtubules isotropically distributed around a core), microtubule centers (bottom plot) are distributed in vertically extended regions. Separation between these regions is determined by the sum of the microtubule length and the length of the crosslinking motor protein. The presence of three regions in this pair distribution plot is evidence for a pair of layers.
In this case, the simulation volume is a cylinder with height . Over time, microtubules condense into a bottlebrush-like (BB) state with a hedgehog line defect. This consists of microtubules having a degree of alignment in the radial direction. The ends of the BB state contain a half-aster. Crosslinking motor proteins are highly concentrated along the central axis of the BB. Here is a kymograph for the local microtubule packing fraction, (Appendix 6—figure 2A). We show the evolution of the local nematic order parameter, , in Appendix 6—figure 2B. A negative indicates a significant degree of radial alignment. Maximum radial alignment (the ideal bottlebrush state) is evidenced by a nematic order parameter value of The condensation of microtubules is mediated by crosslinking motor proteins. In Appendix 6—figure 2C, we show a kymograph of the local density of the crosslinking motor proteins. Appendix 6—figure 2D depicts the microtubule pair distribution function. While there is a ring clearly visible for microtubule plus-ends (top plot), showing that this state is aster-like, there is significant density present along the X axis. This indicates that there is an accumulation of plus-ends throughout the line defect. Microtubule centers (bottom plot) are distributed uniformly along while there is a decay in density along ρ. The absence of a ring indicates that this state is not aster-like. High density at suggests that microtubule centers tend to be stacked in .
Ideal bottle-brush state
The ideal bottle-brush state (BB) consists of microtubules aligned in the radial direction directed away from a central line defect. A schematic and different views are shown in Appendix 6—figure 3. Microtubule orientation is indicated by the color wheel. For such a state, the local nematic order parameter along has a value of −0.5 along the length of the BB.
Appendix 7
Bending rigidity
A flexible long fiber can be implemented by connecting short rigid segments into chains. The key is how to properly implement the force and torque induced by deformation at the rigid segment joints. There are two ways to implement this, which we shall detail in the following. The first method implements the deformation of each joint with two linear Hookean springs and requires no modification to the current codebase. The second method directly incorporates the bending rigidity as a new set of constraints in the geometric constraint minimization solver, but requires some extensions to the current codebase.
Method 1: use two Hookean springs
We can use two permanently bound springs for each joint, as shown in Appendix 7—figure 1, to implement the bending rigidity. The separations in the figure is exaggerated to show the geometry clearly. The energy of the two springs depend on their lengths geometrically:
With the deformed geometry, the lengths of the two springs are:
When , the energy of the two springs can be expanded as:
Here in the first term is simply the linear extension of both springs when α is small. The two-spring system generate a equivalent extensional rigidity . The second term governs the bending energy. We can tune the five parameters such that the connected segments reproduce the desired mechanical behavior of a flexible filament. Although the expansion Equation 77 is general and can be fitted to many different models by tuning the five parameters, it is too complicated to be conveniently used in an actual simulation. In the following we discuss simpler special cases which are more relevant to biological filaments.
Special case 1 When model some bio-filaments such as microtubules, we sometimes assume filaments are inextensible, i.e., and . In this special case, the energy of the two springs depends only on . By imposing , we can solve for :
Then in this case depends on in the limit of . To simplify the notations of the expansion, we define:
The value defines three cases of the equilibrium configuration:
. The equilibrium configuration of the joint is a straight line, and the bending spring is compressed at equilibrium.
. The equilibrium configuration of the joint is a straight line, and the bending spring is not compressed nor stretched at equilibrium.
. The equilibrium configuration of the joint is bent.
For the first two cases, the equilibrium configuration is a straight line and we can expand in the limit of :
With this form, it is clear that the bending energy is tunable with the parameter , that is, how much the bending spring is strained in the equilibrium configuration. Note that here is a constant determined by the lengths only. Therefore, the first term only ‘shifts’ the zero-point of the energy. This term does not contribute to the stretching or bending energy of the joint. When , the leading order terms all vanish and . When , the leading order terms are non-zero and the energy is asymptotically a quadratic function of α: .
Special case 2 If we further assume that and , we have that . This means the extension spring degenerates into a point joint between the two segments. In this case the energy can be further simplified:
The expansion of as is also further simplifed:
Here we have the same conclusion as the previous special case, that the dependence of on α can be tuned between and by choosing a proper value of .
Method 2: use bilateral constraints
Here we briefly derive the constraint optimization formulation for handling the bending rigidity of flexible fibers with bilateral constraints. To fit in the geometric constraint formulation, we represent a long and flexible fiber as many short rigid straight fibers chained together by joints. The linear extension of each joint can be straightforwardly handled by the bilateral spring constraints as for those doubly bound motors. For the bending rigidity, we first realize that for each joint the two norm orientation vectors and form a plane. This plane is orthogonal to a unit norm vector
For most relevant biological filaments, the bending rigidity is isotropic along different directions on a cross-section of the filament. In other words, the recovering torque is always co-linear with the vector and the recovering deformation is always in plane spanned by and . This is important because we can simplify the deformation to in-plane rotations in the following derivations. Note that this plane can be different for each joint since each joint is handled as an independent constraint in our method.
There are different models of how the bending energy depends on the deformation, .
Case 1:
When the angle α between and is small, we have:
Case 2:
In this form when the energy depends on the second order instead of the fourth order of the angle:
The following derivation and method still applies.
The two cases can be handled in the same way. In the following we derive the equations for the first case, where the second case only requires a simpler small α expansion in the derivation.
There is one more relation we can utilize to simplify the derivation. Assume that and have arbitrary directions, and to the first order of the orientation vectors and rotates within :
Then, the bending energy after this rotation is:
where we have utilized the vector triple product identity:
This means, to the first order of only the component of rotation and that is inside this plane spanned by affect the bending energy. Therefore, to the first order of we can simplify the bending rigidity problem inside this spanned plane, although in reality the filament segments have true 3D rotations.
We denote the current and next timesteps by and . We have, to the first order:
The rotational mobility matrix for these two rods is:
where and are inverse of rotational drag coefficients for those two segments. The torque generated by the joint on each segment can be calculated by the derivative of bending energy . More specifically:
where the scalar torque is:
where the higher order terms in have been neglected. If the bending energy Case 2 is used, instead of we have . We can replace the expansion accordingly and the derivation remains valid.
Combining all of the above, we are effectively integrating the dynamics of all rods while ensuring Equation 90. Skipping the timestep index , we can write the result in the same way as the bilateral Hookean spring constraints as:
where and the geometric matrix defines the direction of torque on each rod:
The left side of Equation 94 means the motion of filament segments must satisfy the torque-deformation relation, while the right side means the torque can take any values.
Equation 94 is mathematically identical to the Hookean spring constraints and can be incorporated in the constraint minimization problem in the same way.
We can solve this two-segment problem analytically if the constraint optimization problem contains only Equation 94, in the absence of collisions and Hookean springs:
This simply means that if a straight fiber is bent to angle α, its recovering motion within each timestep is proportional to the current angle α. More importantly, as the bending rigidity modulus increases to infinity. In this case, and the above solution is still stable, and is simplified to . This means the solution to Equation 94 has very strong temporal integration stability even when the deformation force is infinitely stiff, the same as what we discussed for the infinitely stiff Hookean spring case.
Data availability
The current manuscript is a computational study, so no data have been generated for this manuscript. The open-source modeling software is hosted at GitHub.
References
-
A mechanochemical model explains interactions between cortical microtubules in plantsBiophysical Journal 99:1082–1090.https://doi.org/10.1016/j.bpj.2010.05.037
-
Formulating Three-Dimensional Contact Dynamics ProblemsMechanics of Structures and Machines 24:405–437.https://doi.org/10.1080/08905459608905271
-
A theory that predicts behaviors of disordered cytoskeletal networksMolecular Systems Biology 13:941.https://doi.org/10.15252/msb.20177796
-
Organelle positioning and cell polarityNature Reviews. Molecular Cell Biology 9:874–886.https://doi.org/10.1038/nrm2524
-
Prime movers: the mechanochemistry of mitotic kinesinsNature Reviews. Molecular Cell Biology 15:257–271.https://doi.org/10.1038/nrm3768
-
Orientational order of motile defects in active nematicsNature Materials 14:1110–1115.https://doi.org/10.1038/nmat4387
-
Brownian dynamics of confined rigid bodiesThe Journal of Chemical Physics 143:144107.https://doi.org/10.1063/1.4932062
-
Intrinsic rates and activation free energies from single-molecule pulling experimentsPhysical Review Letters 96:108101.https://doi.org/10.1103/PhysRevLett.96.108101
-
Dynamic strength of molecular adhesion bondsBiophysical Journal 72:1541–1555.https://doi.org/10.1016/S0006-3495(97)78802-7
-
Self-straining of actively crosslinked microtubule networksNature Physics 15:1295–1300.https://doi.org/10.1038/s41567-019-0642-1
-
Multiscale modeling and simulation of microtubule-motor-protein assembliesPhysical Review. E, Statistical, Nonlinear, and Soft Matter Physics 92:062709.https://doi.org/10.1103/PhysRevE.92.062709
-
Multiscale polar theory of microtubule and motor-protein assembliesPhysical Review Letters 114:048101.https://doi.org/10.1103/PhysRevLett.114.048101
-
Brownian dynamics simulations of model hard-sphere suspensionsJournal of Non-Newtonian Fluid Mechanics 46:1–28.https://doi.org/10.1016/0377-0257(93)80001-R
-
BookMechanics of Motor Proteins and the CytoskeletonSunderland: Sinauer associates.
-
Implementation and performance of FDPS: a framework for developing parallel particle simulation codesPublications of the Astronomical Society of Japan 68:54.https://doi.org/10.1093/pasj/psw053
-
Theory of Cytoskeletal Reorganization during Cross-Linker-Mediated Mitotic Spindle AssemblyBiophysical Journal 116:1719–1731.https://doi.org/10.1016/j.bpj.2019.03.013
-
Comparison of explicit and mean-field models of cytoskeletal filaments with crosslinking motorsThe European Physical Journal. E, Soft Matter 44:45.https://doi.org/10.1140/epje/s10189-021-00042-9
-
Beyond polymer polarity: how the cytoskeleton builds a polarized cellNature Reviews Molecular Cell Biology 9:860–873.https://doi.org/10.1038/nrm2522
-
Integral-based spectral method for inextensible slender fibers in Stokes flowPhysical Review Fluids 6:014102.https://doi.org/10.1103/PhysRevFluids.6.014102
-
MitosisCold Spring Harbor Perspectives in Biology 8:a023218.https://doi.org/10.1101/cshperspect.a023218
-
A fast platform for simulating semi-flexible fiber suspensions applied to cell mechanicsJournal of Computational Physics 329:173–209.https://doi.org/10.1016/j.jcp.2016.10.026
-
Collective Langevin dynamics of flexible cytoskeletal fibersNew Journal of Physics 9:427.https://doi.org/10.1088/1367-2630/9/11/427
-
Active matter at the interface between materials science and cell biologyNature Reviews Materials 2:1–14.https://doi.org/10.1038/natrevmats.2017.48
-
Mechanisms of Mitotic Spindle AssemblyAnnual Review of Biochemistry 85:659–683.https://doi.org/10.1146/annurev-biochem-060815-014528
-
Molecular Mechanism of CytokinesisAnnual Review of Biochemistry 88:661–689.https://doi.org/10.1146/annurev-biochem-062917-012530
-
MEDYAN: Mechanochemical Simulations of Contraction and Polarity Alignment in Actomyosin NetworksPLOS Computational Biology 12:e1004877.https://doi.org/10.1371/journal.pcbi.1004877
-
Multimotor transport in a system of active and inactive kinesin-1 motorsBiophysical Journal 107:365–372.https://doi.org/10.1016/j.bpj.2014.06.014
-
The Dynamics of Microtubule/Motor-Protein Assemblies in Biology and PhysicsAnnual Review of Fluid Mechanics 48:487–506.https://doi.org/10.1146/annurev-fluid-010814-013639
-
Wrinkling Instability in 3D Active NematicsNano Letters 20:6281–6288.https://doi.org/10.1021/acs.nanolett.0c01546
-
A compliant visco-plastic particle contact model based on differential variational inequalitiesInternational Journal of Non-Linear Mechanics 53:2–12.https://doi.org/10.1016/j.ijnonlinmec.2013.01.010
-
Confinement-Induced Self-Pumping in 3D Active FluidsPhysical Review Letters 125:268003.https://doi.org/10.1103/PhysRevLett.125.268003
-
The load dependence of rate constantsThe Journal of Chemical Physics 128:215101.https://doi.org/10.1063/1.2920475
-
Computing collision stress in assemblies of active spherocylinders: Applications of a fast and generic geometric methodThe Journal of Chemical Physics 150:064109.https://doi.org/10.1063/1.5080433
-
A scalable computational platform for particulate Stokes suspensionsJournal of Computational Physics 416:109524.https://doi.org/10.1016/j.jcp.2020.109524
-
Energetics and dynamics of constrained actin filament bundlingBiophysical Journal 90:4295–4304.https://doi.org/10.1529/biophysj.105.076968
Article and author information
Author details
Funding
National Science Foundation (DMR-2004469)
- Michael Shelley
National Science Foundation (CMMI-1762506)
- Michael Shelley
National Science Foundation (DMS-1821305)
- Saad Ansari
- Matthew A Glaser
- Meredith D Betterton
National Science Foundation (ACI-1532235)
- Saad Ansari
- Matthew A Glaser
- Meredith D Betterton
National Science Foundation (ACI-1532236)
- Saad Ansari
- Matthew A Glaser
- Meredith D Betterton
National Science Foundation (RGM124371A)
- Saad Ansari
- Matthew A Glaser
- Meredith D Betterton
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
MJS acknowledges support from NSF grants DMR-2004469 and CMMI-1762506. SA, ARL, MAG, and MB acknowledge support from NSF grants DMS-1821305, and NIH grant RGM124371A. This work utilized resources from the University of Colorado Boulder Research Computing Group, which is supported by the National Science Foundation (awards ACI-1532235 and ACI-1532236), the University of Colorado Boulder, and Colorado State University. We thank Prof. Dimitrios Vavylonis for discussions on implementing flexible filaments.
Copyright
© 2022, Yan et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,068
- views
-
- 406
- downloads
-
- 14
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Microbiology and Infectious Disease
Timely and effective use of antimicrobial drugs can improve patient outcomes, as well as help safeguard against resistance development. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) is currently routinely used in clinical diagnostics for rapid species identification. Mining additional data from said spectra in the form of antimicrobial resistance (AMR) profiles is, therefore, highly promising. Such AMR profiles could serve as a drop-in solution for drastically improving treatment efficiency, effectiveness, and costs. This study endeavors to develop the first machine learning models capable of predicting AMR profiles for the whole repertoire of species and drugs encountered in clinical microbiology. The resulting models can be interpreted as drug recommender systems for infectious diseases. We find that our dual-branch method delivers considerably higher performance compared to previous approaches. In addition, experiments show that the models can be efficiently fine-tuned to data from other clinical laboratories. MALDI-TOF-based AMR recommender systems can, hence, greatly extend the value of MALDI-TOF MS for clinical diagnostics. All code supporting this study is distributed on PyPI and is packaged at https://github.com/gdewael/maldi-nn.
-
- Computational and Systems Biology
- Genetics and Genomics
Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.