Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis

Zhang, Zhenyi; Sun, Yuhao; Peng, Qiangwei; Li, Tiejun; Zhou, Peijie

doi:10.3390/e27050453

Open AccessReview

Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis

by

Zhenyi Zhang

^1,†

,

Yuhao Sun

^2,†,

Qiangwei Peng

^1,†

,

Tiejun Li

^1,2,3,* and

Peijie Zhou

^2,4,5,6,*

¹

School of Mathematical Sciences, Peking University, Beijing 100871, China

²

Center for Machine Learning Research, Peking University, Beijing 100871, China

³

Laboratory of Mathematics and Its Applications (LMAM), Peking University, Beijing 100871, China

⁴

Center for Quantitative Biology, Peking University, Beijing 100871, China

⁵

National Engineering Laboratory for Big Data Analysis and Applications, Peking University, Beijing 100871, China

⁶

AI for Science Institute, Beijing 100080, China

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Entropy 2025, 27(5), 453; https://doi.org/10.3390/e27050453

Submission received: 14 March 2025 / Revised: 18 April 2025 / Accepted: 19 April 2025 / Published: 22 April 2025

(This article belongs to the Special Issue Complexity, Information and Quantitative Modelling in Single Cell Multiomics)

Download

Browse Figures

Versions Notes

Abstract

:

Understanding the dynamic nature of biological systems is fundamental to deciphering cellular behavior, developmental processes, and disease progression. Single-cell RNA sequencing (scRNA-seq) has provided static snapshots of gene expression, offering valuable insights into cellular states at a single time point. Recent advancements in temporally resolved scRNA-seq, spatial transcriptomics (ST), and time-series spatial transcriptomics (temporal-ST) have further revolutionized our ability to study the spatiotemporal dynamics of individual cells. These technologies, when combined with computational frameworks such as Markov chains, stochastic differential equations (SDEs), and generative models like optimal transport and Schrödinger bridges, enable the reconstruction of dynamic cellular trajectories and cell fate decisions. This review discusses how these dynamical system approaches offer new opportunities to model and infer cellular dynamics from a systematic perspective.

Keywords:

single-cell RNA sequencing; spatiotemporal dynamics; computational modeling; cellular trajectories

1. Introduction

Understanding the dynamic change of biological systems has played a central role in life sciences, with important applications in developmental biology, disease modeling, and medicine [1,2,3,4,5]. One key framework for understanding these dynamic processes is Waddington’s developmental landscape [6,7,8], which illustrates how cells navigate various potential fates as they differentiate during development. However, how to construct such developmental landscapes or understand the cellular dynamics within the biological systems, presents a significant challenge. To fully understand these complex cellular transitions, a deep understanding of gene expression at the single-cell level is essential. Advancements in high-throughput sequencing technologies have enabled unprecedented resolutions into the molecular signatures of individual cells, with single-cell RNA sequencing (scRNA-seq) emerging as a revolutionary tool [9,10,11]. scRNA-seq allows for the dissection of cellular heterogeneity and the identification of transcriptional programs underlying complex biological processes, offering a snapshot of gene expression in single cells at a given moment. Despite its powerful capabilities, traditional scRNA-seq provides only a static picture of gene expression across individual cells, missing the temporal information for understanding how cells transition through different states.

In recent years, the development of temporally resolved scRNA-seq technologies has begun to gain increasing attention, enabling the capture of gene expression profiles across multiple time points [4,12,13]. Another breakthrough in transcriptomics is spatial transcriptomics (ST), which integrates spatial context into gene expression data by mapping RNA profiles within tissue architectures [14,15,16,17,18,19,20,21]. When combined with temporal resolution, this approach leads to temporally resolved spatial transcriptomics (temporal-ST), which provides an enhanced tool for studying the spatiotemporal dynamics of single cells [22].

Extracting meaningful dynamical features from spatiotemporal single-cell transcriptomic data remains a significant challenge. Since the inherently destructive nature of single-cell sequencing, each cell can only be measured once during the dynamical process. As a result, continuous dynamics cannot be directly obtained from the data. Even with temporally resolved single-cell RNA sequencing, we can only obtain unpaired gene expression snapshots at discrete time points, capturing cell distribution changes over time rather than the continuous movement of individual cells. Consequently, inferring cell-state transitions and dynamic regulatory mechanisms from such snapshot-based data necessitates computational modeling approaches, which is an important problem in computational system biology and has gained increasing importance.

To address the challenges, numerous computational frameworks have been developed. For single-cell transcriptomics data, several methods have been proposed to approximate cellular trajectories and dissect dynamic cellular states. Pseudotime inference methods [23,24,25], for instance, arrange snapshot data along an inferred developmental axis, offering a continuous perspective of cell-state transitions over time. In addition, RNA velocity analysis [26,27,28,29,30] has emerged as a powerful tool for understanding cellular dynamics by leveraging splicing kinetics to infer the direction of future gene expression changes. Recently, with the development of temporally resolved sequencing technology, there has been a growing interest in dissecting single-cell dynamics from multiple snapshot data. Simultaneously, the development of generative modeling techniques, such as diffusion models [31,32,33,34], optimal transport theory [13,35,36], flow-based model [37,38], and the Schrödinger bridge problem [39,40] have emerged as key mathematical frameworks for modeling distribution transitions in dynamic biological systems. More recently, the rapid development of spatial transcriptomics has opened exciting new avenues for integrating spatial and temporal data. Extending these computational methods to the ST data and capturing spatiotemporal cellular transitions also has inspired many recent kinds of research [41].

Recently, many reviews have provided comprehensive summaries of the methods and advancements in the study of single-cell dynamics. For example, ref. [42] reviewed various pseudotime inference methods. Ref. [43] conducted a comprehensive benchmarking study on pseudotime inference methods. Refs. [27,42,44] reviewed RNA velocity methods in single-cell transcriptomics. [27] also discussed the limitations and potential extensions of RNA velocity. Refs. [4,5,13,45] provided an in-depth analysis of the application of optimal transport theory in single-cell or spatial omics data. Additionally, refs. [1,3] examined various perspectives on cellular dynamics, exploring how the reconstruction of cell states and energy landscapes can contribute to our understanding of cellular behavior and development. The current review takes a distinct perspective by systematically discussing modeling strategies for different types of data from a dynamical modeling perspective, aiming to unify and expand upon the current methodologies in the field.

This paper mainly focuses on how dynamic insights can be extracted from high-resolution biological data, including scRNA-seq, temporally resolved scRNA-seq, spatial transcriptomics (ST), and temporally resolved spatial transcriptomics. We examine how key concepts from dynamical systems modeling—such as Markov chains, stochastic differential equations (SDEs), ordinary differential equations (ODEs), and partial differential equations (PDEs)—can be effectively applied to the analysis of cellular processes reflected in these high-dimensional data. Furthermore, we explore the application of emerging generative modeling techniques, including optimal transport theory, flow matching, and the Schrödinger bridge problem, as approaches for inferring spatiotemporal cellular trajectories and transitions. By focusing on these modeling strategies, this review aims to provide a systematic framework for understanding cellular dynamics across different types of data, thus advancing the study of spatiotemporal biological processes.

This paper is organized as follows: In Section 2, we provide an overview of the data and models, laying the foundation for understanding the types of biological data and the mathematical frameworks. Section 3 delves into the dynamic modeling of single-cell transcriptomics, with a focus on both single-cell RNA sequencing (scRNA-seq) and temporal-scRNA-seq. In Section 4, we explore the dynamic modeling of spatial transcriptomics, examining both snapshot-based and temporally resolved approaches to analyze the spatial and temporal dynamics of gene expression. Section 5 discusses the extensions, challenges, and future directions in the field, highlighting the key limitations and opportunities for advancing the study of cellular dynamics. Finally, we summarize the insights and outline potential areas for future research in Section 6.

2. Overview of the Data and Models

In this section, we provide preliminary background on the structure of the scRNA-seq data as well as the mathematical models to describe dynamic cellular processes. An overview of the data and models is provided in Figure 1.

2.1. Spatiotemporal scRNA-Seq Data

Single-cell RNA sequencing (scRNA-seq) has emerged as a prevalent tool for dissecting cellular heterogeneity by providing high-resolution snapshots of gene expression profiles at the individual cell level. Traditionally, scRNA-seq experiments capture only single-time-point data, that is, a static “snapshot” of the cellular landscape. Recent advances in technologies have enhanced the spatiotemporal resolutions of the datasets, enabling finer resolutions to investigate the underlying dynamic biological processes such as development, differentiation, and disease progression. Below we describe the various types of scRNA-seq datasets as inputs to infer spatiotemporal dynamics through the dynamical systems models.

2.1.1. Snapshot scRNA-Seq Data

In snapshot RNA sequencing (RNA-seq) data, gene expression is measured across multiple cells at a single time point. The gene expression matrix is represented as

X \in R^{n \times d}

, where

X

denotes the count matrix of gene expression, n is the number of cells or spots, and d is the number of genes measured. Each entry

X_{i j}

in

X

represents the expression level of gene j in cell i, typically measured as the number of mRNA molecules (transcripts) for that gene in the corresponding cell. Additionally, the total RNA-seq data can further be separated into counts for spliced and unspliced transcripts, useful in certain analyses such as the RNA velocity model described below. The spliced and unspliced counts are represented as

U \in R^{n \times d}

and

S \in R^{n \times d}

, respectively, denoting the matrices of unspliced and spliced counts for the n cells or spots. Over time, unspliced RNA (

u

) can undergo splicing process to become spliced RNA (

s

).

2.1.2. Temporally and Spatially Resolved scRNA-Seq

Recently, a growing number of temporally resolved scRNA-seq datasets have been generated, where single-cell measurements are performed at multiple time points during a dynamic process. Such datasets could offer deeper insights into how cell populations evolve over time [4,12,13].

For temporally resolved scRNA-seq dataset, at each fixed time point

i \in {0, \dots, T - 1}

, the gene expression matrix is represented as

X_{i} \in R^{n_{i} \times d}

, where

X_{i}

denotes the matrix of gene expression data,

n_{i}

is the number of cells at time i, and d is the number of genes. Notably, the gene expression data across time points are unpaired and can be assumed to be sampled from a distribution from a certain time point.

The development of spatial transcriptomics (ST) technology allows gene expression to be captured alongside spatial coordinates [14,15,16,17,18,19,20,21]. ST methods are broadly divided into image-based and sequencing-based approaches. Image-based techniques [19,20,21] detect hundreds to thousands of genes with cellular or sub-cellular resolution, while sequencing-based methods [15,16,17,18] allow whole-transcriptome analysis but are usually limited to spot-level resolution. Advances like Stereo-seq [17] and 10x Visium HD [18] have significantly improved spatial resolution to single-cell or even subcellular precision.

Similarly to temporally resolved scRNA-seq data, ST time series data could be represented as

(Z_{(0 : K)}, X_{(0 : K)})

at

t_{0}, t_{1} \dots t_{K}

totaling K time points, and the number of cells in each observation is

n_{0}, n_{1} \dots n_{K}

. In addition to the gene expression matrices

X_{i} \in R^{n_{i} \times d}

, the associated spatial coordinate matrices

Z_{i} \in R^{n_{i} \times 2}

or

R^{n_{i} \times 3}

represent the spatial coordinates (2D or 3D) of each sequenced cell (or spot), respectively.

2.2. Models for Cell-State Transitions

In computational systems biology, several modeling strategies have been formulated to quantify the cell-state transition dynamics. In general, they could be categorized into two types: discrete models, which are usually defined on observed samples and evolve in discrete time steps, as well as continuous models, which are extrapolated into the continuous cell state space and described by differential equation models.

2.2.1. Discrete Dynamics: Markov Chain Model

Random walk or Markov chain models are simple yet powerful tools for studying stochastic dynamic processes, particularly in the context of complex systems such as gene expression dynamics and cell trajectories. In these models, a system evolves over time as a series of transitions between discrete states, where each state corresponds to a possible configuration or position in the system (such as a specific gene expression profile or a cell’s position in a developmental trajectory). The transitions between states are governed by transition probabilities, which can be represented in the form of a transition matrix

P

. The transition matrix is defined as

P_{i j} = \frac{W_{i j}}{W_{i}},

where

P_{i j}

represents the probability of transitioning from cell state i to state j, and

W_{i j}

is the weight (or similarity) between cells i and j in a weighted graph, and

W_{i} = \sum_{k} W_{i k}

is the degree of cell i. The weights

W_{i j}

typically reflect some measure of similarity or distance between the corresponding gene expression profiles of the cells, or induced from other quantities such as RNA velocity or optimal transport plan.

The stationary distribution

π = (π_{1}, π_{2}, \dots, π_{n})

of the Markov chain is a probability distribution over the states that remains unchanged under the dynamics of the chain. In other words, the distribution is invariant under the transition probabilities, and we have

π_{i} = \sum_{j} P_{i j} π_{j} .

If the cellular state graph is undirected (for example, induced by gene expression similarity), i.e.,

W

is a symmetric matrix and we have the expression

π_{i} = \frac{W_{i}}{\sum_{i} W_{i}}

, then the stationary Markov chain is in detailed balance such that

P_{i j} π_{i} = P_{j i} π_{j}

.

A more realistic assumption in biology is that the state-transition graph can be directional (for example, induced by RNA velocity or optimal transport discussed below), with the cells ultimately reaching terminal states such as fully differentiated or mature cell types. In this setup, recurrent states represent the final, stable cell types or fates that the system eventually reaches. Once a cell enters one of these recurrent states, it remains there, similar to how a fully differentiated cell does not revert back to an undifferentiated or less specialized state. On the other hand, transient states correspond to intermediate stages of cellular development, such as precursor or progenitor cells that are still undergoing differentiation or division. These cells are in transition, with the potential to eventually reach one of the recurrent, stable cell types. The transition matrix governing this system can be partitioned into blocks that reflect these different types of cell states. Specifically, the matrix

P

can be written as the canonical form

P = [\begin{matrix} \tilde{P} & 0 \\ S & Q \end{matrix}] .

(1)

Here,

\tilde{P}

corresponds to the transitions between recurrent (terminal) states, where once a cell reaches these states, it remains there in absorbing states.

Q

represents transitions between transient states (cells that are still in intermediate stages of differentiation or cell cycle).

S

denotes the transitions from transient to recurrent states, representing cells’ eventual differentiation or maturation into their terminal stable types. Since recurrent states are absorbing, the upper right block of the matrix is zero, indicating no transitions from recurrent to transient states.

2.2.2. Continuous Dynamics: From Trajectories to Population Dynamics

In modeling cellular dynamics, we are interested in both the trajectories of individual cells and the distribution of cell states across a population. To capture the behavior of cells in response to both deterministic and stochastic influences, we can approach the problem from two perspectives: (1) cellular trajectories, which describe the path of individual cells over time, and (2) population distribution, which describes how the overall distribution of cell states evolves. The first perspective, trajectory-based models, provides insight into the detailed behavior of a single cell, often described through ordinary or stochastic differential equations (ODEs or SDEs). The second perspective, population-level models, focuses on the evolution of the density of cells across different states, typically captured by partial differential equations (PDEs). Together, these models offer a comprehensive understanding of how individual cell behaviors aggregate to produce population-level dynamics.

Trajectory Dynamics: Stochastic Differential Equations (SDEs) To model the evolution of individual cell trajectories, we consider that cellular dynamics can be governed by a stochastic differential equation (SDE). This accounts for both deterministic factors, such as gene expression regulation, and stochastic factors, like noise from cellular environments or molecular fluctuations. The SDE for the state

x_{t}

of a single cell at time t is given by

d x_{t} = b (x_{t}, t) d t + σ (x_{t}, t) d w_{t},

(2)

where

x_{t} \in R^{d}

represents the state of the cell (e.g., gene expression profile) at time t, and

w_{t} \in R^{d}

is the standard d-dimensional Brownian motion. The term

b (x, t)

represents the drift vector, which defines the deterministic flow of the system, while

σ (x, t) \in R^{d \times d}

represents the diffusion coefficient matrix, which describes the random fluctuations in the system.

Specifically, when the diffusion coefficient

σ (x, t)

is zero, the system reduces to an ordinary differential equation (ODE), which describes the deterministic evolution of the cell state without random fluctuations. In this case, the evolution of the cell is entirely governed by the drift term

b (x, t)

, and the system follows a deterministic trajectory. A useful concept in understanding the long-term behavior of the system is that of an attractor. In the context of the ODE, an attractor corresponds to a stable fixed point of the system, where the rate of change

b (x, t)

of the cell state

x

becomes zero. In cellular dynamics, such attractors can represent stable gene expression profiles, such as differentiated or quiescent states, where the cell remains in a stable state over time.

Furthermore, since

b (x, t)

is time-dependent, the system may exhibit bifurcations, where the qualitative characteristics of attractors could change with respect to time t. In cellular contexts, bifurcations are important for understanding processes like cell fate decisions, where a small change in the environment or internal signaling can push the cell toward a new distinct state (e.g., differentiation into a different cell type).

Population Dynamics: Partial Differential Equations (PDEs) To capture the evolution of the entire population of cells, we consider the density of cells with respect to their state

x

, represented by the probability density function

p (x, t)

. The population distribution evolves according to a partial differential equation (PDE), which incorporates both deterministic and stochastic dynamics at the population level. As a simple model, the evolution of

p (x, t)

can be described by

\partial_{t} p (x, t) = - \nabla_{x} \cdot (p (x, t) b (x, t)) + \frac{1}{2} \nabla_{x}^{2} : (a (x, t) p (x, t)) + g (x, t) p (x, t),

(3)

where

\nabla_{x}^{2} : (a (x, t) p (x, t)) = \sum_{i j} \partial_{i j} (a_{i j} (x, t) p (x, t))

, and

a (x, t) = σ (x, t) σ^{T} (x, t)

represents the diffusion matrix at the population level.

The terms on the right-hand side of the equation represent the key dynamics driving the population evolution. The drift term

\nabla_{x} \cdot (p (x, t) b (x, t))

quantifies the deterministic flow of the population, describing how cells move through different states based on the drift vector

b (x, t)

. The diffusion term

\frac{1}{2} \nabla_{x}^{2} : (a (x, t) p (x, t))

models the spread of the population due to random fluctuations, where

a (x, t) = σ (x, t) σ^{T} (x, t)

represents the diffusion matrix, capturing the effects of stochasticity. Finally, the growth term

g (x, t) p (x, t)

governs the birth and death rates of cells, modeling cell proliferation and mortality, thus controlling the population’s size and dynamics over time. When

g (x, t) = 0

, the PDE reduces to the Fokker–Planck equation associated with the SDE in the Itô integral sense, describing the evolution of the probability density

p (x, t)

for the stochastic process defined by the SDE.

3. Dynamic Modeling of Single-Cell Transcriptomics

In this section, we describe the dynamical systems models for scRNA-seq datasets. We begin with methods for snapshot data, i.e., cells sequenced at a single time point, including pseudotime methods, discrete Markov chain methods, and continuous RNA velocity methods and their extensions. Next, we review methods targeted for temporally resolved scRNA-seq data, majorly based on various formulations and extensions of optimal transport (OT) based methods.

3.1. Snapshot Single-Cell RNA-Seq

A key challenge in using snapshot data to infer dynamic cellular trajectories lies in the inability to directly observe the temporal evolution of cells. The destructive nature of the measurement process, where cells are disassociated after sequencing, means that we lack direct access to the temporal trajectory of individual cells.

When analyzing such a snapshot of “cell state ensembles”, several approaches have been developed to uncover the underlying dynamical processes. One popular type of method, pseudotime, ranks individual cells temporally based on the structure of the data manifold or prior biological knowledge. Other techniques focus on modeling stochastic dynamics over the point clouds of observed cells, yielding discrete random walk analyses. Additionally, continuous differential equation models have been proposed to infer the data-generating process of snapshot scRNA-seq dataset, to use these models for future dynamical predictions. In the following sections, we will explore these methods in more detail. An overview of these approaches is provided in Figure 2.

3.1.1. Pseudotime Methods

Given snapshot data from single-cell sequencing, where the data matrix

X \in R^{n \times d}

, pseudotime assigns a positive real number for each cell to reflect its order during a dynamical process. Let

x_{i} \in R^{d}

represent the state vector (such as mRNA expression) for the i-th cell. The pseudotime

t_{i} \in R

is then a mapping from the state vector

x_{i}

to a real number, i.e.,

x_{i} \mapsto t_{i}

.

Pseudotime can be viewed from two perspectives. First, the geodesic perspective considers that the cell’s state is constrained by a limited number of biological pathways, limiting the evolution of the state to a low-dimensional manifold embedded in the high-dimensional gene expression space. Given an initial state

x_{0}

, the task of finding the mapping

x_{i} \mapsto t_{i}

becomes equivalent to determining the length of the evolutionary path between

x_{0}

and

x_{i}

along this manifold. Several tools, such as Monocle, Slingshot, DPT, and PAGA, have been developed based on this idea. Second, the entropy perspective recognizes that during natural biological development, as cells differentiate, they become more specialized and lose their potential for further differentiation.

Geodesic-Based Pseudotime Geodesic -based pseudotime aims to reconstruct these trajectories by leveraging graph-based methods and principal curve algorithms [46]. Two widely used approaches include Monocle [23,24] and Slingshot [25], along with numerous other methods. Monocle focuses on ordering cells using a minimum spanning tree and a refined PQ tree approach, and Slingshot can handle multiple lineages and smooth pseudotime across branching events. Both methods offer insights into cellular dynamics, helping to uncover the paths cells take through different states.

Monocle estimates pseudotime through two main steps: (1) ordering cells and (2) assigning pseudotime values. First, each cell’s state is reduced to a d-dimensional vector

x_{i} \in R^{d}

using Independent Component Analysis (ICA). A complete graph is created where vertices represent cells, and edges are weighted by Euclidean distance. Cell ordering relies on the minimum spanning tree (MST) of this graph, which is refined using a PQ tree to mitigate noise from sequencing. The tree is constructed by first identifying the longest path (diameter path), classifying vertices as decisive or indecisive, and recursively building the tree by ordering decisive vertices and handling indecisive vertices through new P nodes. Once ordered, pseudotime is calculated as

t (x_{i}) = t (x_{Parent (i)}) + ∥ x_{i} - x_{Parent (i)} ∥,

where

Parent (i)

is the parent node of cell i, and the root node (selected based on prior knowledge) is initialized with pseudotime 0.

Slingshot estimates pseudotime across multiple lineages. It begins by clustering cells into K clusters and identifying lineages using the MST. After constructing the MST, a new lineage is formed at each branching point. Pseudotime is assigned using the principal curve algorithm, which involves projecting cells onto the curve, computing arc lengths, and smoothing iteratively. To handle inconsistent pseudotime across multiple lineages, Slingshot modifies the standard principal curve approach. It initializes the curve for each lineage through the centroids of its clusters, assigns weights to cells in multiple lineages based on projection distances, and constructs an average curve for smooth transitions at shared cell regions. The average curve is defined as

c_{avg} (t) = \frac{1}{M} \sum_{m = 1}^{M} c_{m} (t),

where

c_{m}

is the principal curve of the m-th lineage at the branching point. The shrinkage process is defined as

c_{m}^{new} (t) = w_{m} (t) c_{avg} (t) + (1 - w_{m} (t)) c_{m} (t),

where

w_{m} (t)

is the weight of the m-th lineage. These modifications allow Slingshot to produce consistent pseudotime values across multiple lineages.

Entropy-Based Pseudotime One major challenge of the geodesic-based pseudotime is the appropriate determination of root cells, which often relies on prior biological knowledge. From a physical understanding, a cell’s pseudotime reflects the directionality of the underlying dynamical process, which the concept of entropy could quantify. Heuristically, higher entropy values typically indicate a more undifferentiated or pluripotent state where genes are more randomly expressed and the association between genes could be more prevalent. In comparison, lower entropy values suggest differentiated states, where the gene expression profile could be more concentrated on only a small number of pathways, and the gene interaction network could be more modular [47]. A proper entropy score based on such intuitions can thus be leveraged to estimate a cell’s relative position along developmental trajectories. Several methods have been developed from this perspective [48,49,50,51,52,53].

As a simple implementation, the entropy for one cell i is defined as [48]

H_{i} = - \sum_{j = 1}^{d} p_{i j} log p_{i j},

where

p_{i j} = \frac{X_{i j}}{N_{i}}

, and

X_{i j}

represents the transcript count of gene j in cell i, and

N_{j}

is the total transcript count for cell i.

One can extend this concept to a Markov chain model [50] to consider the interaction between the genes. Assume there is a predefined graph representing the gene–gene interaction (e.g., protein–protein interaction (PPI) network) from the existing database. The transition probability between genes i and j in cell c is

p_{i j}^{(c)} = \frac{x_{j}^{(c)}}{\sum_{k \in N (i)} x_{k}^{(c)}} = \frac{x_{j}^{(c)}}{{(A x^{(c)})}_{i}},

where

x_{i}^{(c)}

is the expression level of gene i in cell c,

N (i)

are the neighbors of gene i in the graph, and

A

is the adjacency matrix of the graph. The corresponding stationary distribution is

π_{i}^{(c)} = \frac{x_{i}^{(c)} {(A x^{(c)})}_{i}}{x^{{(c)}^{T}} A x^{(c)}},

and the Markov chain entropy (MCE) is defined as

MCE = - \sum_{(i, j) \in \tilde{E}} π_{i} p_{i j} log (π_{i} p_{i j}),

where

\tilde{E}

includes all edges on the graph. The entropy of cell c is given by

{MCE}^{(c)}

, computed using

π_{i}^{(c)}

and

p_{i j}^{(c)}

. To determine the weights, ref. [51] proposes to optimize interaction weights based on cell expression

π^{(c)} = \frac{x^{(c)}}{∥ x^{(c)} ∥_{L_{1}}}

. For each cell, its MCE is maximized by solving

max_{p_{i j}^{(c)} \geq 0} - \sum_{(i, j) \in \bar{E}} π_{i}^{(c)} p_{i j}^{(c)} log (π_{i}^{(c)} p_{i j}^{(c)}),

subject to

\sum_{j \in N (i)} p_{i j}^{(c)} = 1 and \sum_{i \in N (j)} π_{i}^{(c)} p_{i j}^{(c)} = π_{j}^{(c)} .

3.1.2. Discrete Dynamics Modeling

Diffusion Pseudotime Previous pseudotime methods, such as those based on geodesic paths or simple assumptions of pseudotime, typically lacked an underlying dynamical model to explain cell state transitions. These methods often relied on the assumption of continuous trajectories without explicitly modeling the stochastic processes driving those transitions. In contrast, methods like DPT [54] and PAGA [55] introduce stochastic dynamics through Markov chains. While still estimating pseudotime, these methods incorporate random walk-defined observed samples of single cells, allowing for a more quantitative treatment of how cell states evolve over time, with transitions captured probabilistically. As a result, they provide a more mechanistic approach to pseudotime estimation, making them a natural progression from traditional geodesic-based methods.

Motivated by the DiffusionMap [56] algorithm for dimensionality reduction, DPT constructs a Markov chain between cells, defines a distance metric, and uses this distance as pseudotime. The transition probability of cell i moving to cell j is computed using a simple Gaussian kernel, defined as

T_{i j} = \frac{1}{Z} W_{i j} = \frac{1}{Z} (\frac{2 σ_{i} σ_{j}}{σ_{i}^{2} + σ_{j}^{2}}) exp (- \frac{∥ x_{i} - x_{j} ∥^{2}}{2 (σ_{i}^{2} + σ_{j}^{2})}),

where

Z_{i} = \sum_{j \in N (i)} W_{i j}

is the normalization factor, and the hyperparameters

σ_{i}, σ_{j}

are the Gaussian kernel widths for cell i and cell j. DPT assumes that the distance in the eigenspace of the transition matrix T is related to the pseudotime ordering of cells. After removing steady-state eigenspace, the system’s dynamics are captured by the transition matrix

\bar{T} = T - ψ_{1} ψ_{1}^{T}

, where

ψ_{1}

represent the eigenvector corresponding to the largest eigenvalue of the transition matrix. The dynamics are analyzed by summing all t-step transition matrices to compute the cumulative probability of state transitions across multiple walk lengths

M = \sum_{t = 1}^{\infty} {\bar{T}}^{t} = {(1 - \bar{T})}^{- 1} - I .

Using this matrix

M

, a new distance metric is defined as

{dpt}^{2} (i, j) = ∥ M_{i, \cdot} - M_{j, \cdot} ∥ = \sum_{k = 2}^{n} {(\frac{λ_{k}}{1 - λ_{k}})}^{2} {(ψ_{k} (i) - ψ_{k} (j))}^{2},

where

ψ_{k} (i)

represents the i-th component of the eigenvector corresponding to the k-th largest eigenvalue of the transition matrix. This metric simultaneously captures both short-range and long-range cell state transitions, making it useful for understanding the trajectory of cell states over time.

PAGA generalizes the DPT distance metric to disconnected graphs to deal with the existence of multiple distinct lineages in the dataset. In PAGA, graph construction begins by reducing the dimensionality of the gene expression data using PCA, followed by the construction of a KNN graph where the nodes represent cells. The graph is then partitioned into cell clusters using the Louvain algorithm, reminiscent of the attractors concept in the random walk. Two groups are considered connected if the actual number of edges

ϵ_{i j}

between them significantly exceeds the expected number of edges. The DPT distance metric is then extended to the disconnected graph. In practice, one treats cells that belong to separate clusters as being at an infinite distance from each other. For cells within the same connected region, one calculates distances between them similar to the calculation in DPT. This modification allows PAGA to estimate pseudotime and infer trajectories even in the presence of disconnected or sparse data regions.

Random Walk with Directionality Building on the inferred DPT, Palantir [57] introduces directionality into the cellular random walk, which is further developed in [58,59]. One simple approach is to prune the weight matrix as follows:

{\bar{W}}_{i j} = \{\begin{matrix} W_{i j} & if t_{i} \leq t_{j} or 0 < t_{i} - t_{j} < σ_{i} \\ 0 & if t_{i} - t_{j} > σ_{i} \end{matrix}

For the directional Markov chain induced by the weight matrix

\bar{W}

, terminal states can be determined from its stationary distribution. Using absorption Markov chain theory, a cell fate matrix

F

can be derived from the canonical form of transition probability matrix in Equation (1), where

F = {(I - Q)}^{- 1} S

. Here, the element

F_{i j}

represents the probability that a random walk starting from transient cell i will eventually be absorbed by terminal cell j. Specifically, the fate vector

f_{i}

of a transient cell i corresponds to the i-th row of

F

, capturing the probability of differentiation into various states. To quantify the differentiation potential of a cell, one can then determine the entropy of the fate vector, or the Kullback–Leibler (KL) divergence between the fate vector

f_{i}

and the average fate vector

\bar{f}

. Author: Please check that the intended meaning has been retained.

Another method to define the directional random walk on cells is Population Balance Analysis (PBA) [60]. Let

G

be the k-nearest neighbor graph of

{X}

and

L

its graph Laplacian. A potential function is defined by

V = \frac{1}{2} L_{N}^{†} R

, where

L_{N}^{†}

denotes the pseudo-inverse of

L_{N}

and

R

is the estimated cell population production rate vector at each node, using the gene expression of predefined lists of proliferation relevant genes. The transition probabilities of this Markov chain are then directed by the potential function

P_{i j} = \{\begin{matrix} exp (\frac{V_{i} - V_{j}}{D}) & if (i, j) is in G_{N} \\ 0 & otherwise \end{matrix}

After the random walk is constructed, PBA utilizes the conditional mean first passage time to quantify the difference of pseudotime for any pair of transient cells i and j.

Dissecting Dynamical Structure While previous methods such as DPT, Palantir, and PBA construct the random walk dynamics on snapshots of individual cells, methods like MuTrans [61] and CellRank [62,63] take a deeper look into the dynamical structure of the system itself, especially focusing on metastability and attractor structures, therefore robustly dissecting the system’s latent dynamics to identify long-term patterns of the cell-state transition.

MuTrans [61] adopts a multi-scale reduction technique for a diffusion-based, unidirectional cellular random walk, to infer stable and transient cells from snapshot scRNA-seq. Central to MuTrans is a membership matrix

χ_{i, k}

representing the soft clustering probability that cell i belongs to attractor

S_{k}, k = 1, . . ., K

, which could be interpreted as a cell cluster. A transient cell might have multiple positive components in its attractor membership, while the distribution of stable cells tends to be concentrated in one specific attractor. Meanwhile, MuTrans also reduces the dynamics on the attractor level, using

P^{(coar)} \in R^{K \times K}

to represent the coarsened transition matrix, and

π^{(coar)}

to represent the stationary distribution of the coarse-grained Markov chain. The original cell-to-cell dynamics can then be reconstructed from the coarsened dynamics, and the transition probability between cells i and j is given by

{\hat{P}}_{i, j} = \sum_{m, n = 1}^{K} χ_{i, m} P_{m, n}^{(coar)} χ_{j, n} \frac{π_{j}}{π_{n}^{(coar)}} .

The goal is to minimize the discrepancy between the reconstructed cell–cell dynamics

\hat{P}

and the actual dynamics

P

, which is achieved by minimizing

∥ \hat{P} {- P ∥}^{2}

. This can be done using an EM-like algorithm, alternately optimizing the elements of

P^{(coar)}

and

χ_{i, j}

. With the inferred attractor membership matrix and coarse-grained transition probabilities among clusters, MuTrans then constructs a dynamical manifold inspired by the energy landscape concept [64] to visualize the transient and stable cells, and uses transition path theory [65] based on

P^{(coar)}

to calculate the most probable transition paths among attractors.

CellRank extends the analysis by introducing a coarse-graining strategy for directed cellular random walks, such as those induced by pseudotime [57] or RNA velocity as described in Section 5.1. The approach begins with the clustering of cells into macro-states (i.e., attractors) using the GPCCA (Generalized Perron Cluster Cluster Analysis) algorithm [66], which is based on the Shur decomposition of the directed transition matrix

P

. The membership matrix

χ

and the coarse-grained transition probability matrix between attractors

P^{(coar)}

can be computed based on the decomposition. Once the attractors are identified, the terminal states can be determined in which the diagonal elements of the coarse transition matrix

P^{(coar)}

exceed a certain threshold. Cells in terminal states can then be treated as absorption sets of the random walk, and the cell fate vector could be computed similarly to Palantir.

3.1.3. Continuous Dynamics Modeling

RNA Velocity Model and Parameter Estimation Based on the unspliced RNA and spliced RNA counts

u_{g}

and

s_{g}

for each gene, an underlying ODE model could be naturally derived based on mass-action law such that

\begin{matrix} \frac{d u_{g}}{d t} & = α_{g} (t) - β_{g} (t) u_{g} (t), \\ \frac{d s_{g}}{d t} & = β_{g} (t) u_{g} (t) - γ_{g} (t) s_{g} (t), \end{matrix}

(4)

where

α_{g} (t)

,

β_{g} (t)

, and

γ_{g} (t)

represent the rates of mRNA transcription, splicing, and degradation, respectively. Here,

v_{g} = \frac{d s_{g}}{d t}

is defined as the RNA velocity of gene g [26]. By concatenating the RNA velocities of all genes in a cell, a vector

v = (v_{1}, v_{2}, \dots, v_{n})

is formed, which contains information about how the amounts of spliced RNA in the cell are changing. This vector represents the potential direction of the cell state evolution and can be used for downstream tasks of cell fate inference.

From an algorithm perspective, the central issue in the RNA velocity is to determine the parameters of the Equation (4) from static snapshot data, where the time t of each cell is not explicitly known. In the following sections, we will summarize methods for solving

v_{g}

and downstream tasks that utilize

v_{g}

.

Steady-State Assumption: Parameter Estimation in Velocyto [26]

In the original RNA velocity paper [26], parameter estimation was performed using linear regression with steady-state assumption to avoid the reliance on latent time t. Firstly, it assumes that for all genes g,

α_{g} (t)

,

β_{g} (t)

, and

γ_{g} (t)

are time-invariant. Secondly, it assumes that all genes share the same splicing rate

β

. Denote

\tilde{α} = \frac{α}{β}

and

\tilde{γ} = \frac{γ}{β}

. To compute RNA velocity, one only needs to estimate

{\tilde{γ}}_{g}

with the steady-state assumption that

\frac{d s_{g}}{d t} = 0

. Indeed, we have the linear relation

{\tilde{γ}}_{g} = \frac{u_{g} (t)}{s_{g} (t)},

suggesting that under the steady-state assumption,

{\tilde{γ}}_{g}

can be estimated using linear regression. In practice, since most cells do not satisfy this assumption, it is commonly assumed that cells in the upper-right or lower-left regions of a scatter plot with unspliced RNA on the x-axis and spliced RNA on the y-axis are in equilibrium. Therefore, the common algorithm implementation is to limit the linear regression to the top or bottom

5 %

of cells based on unspliced and spliced RNA levels.

If the dynamic equations Equation (4) are expressed probabilistically, the parameter estimation could be enhanced based on the linear regression formulation of steady-state stochastic model [30]. The equation could be expressed as the regression problem

[\begin{matrix} 〈 u_{g} (t) 〉 \\ 〈 u_{g} (t) 〉 + 2 〈 u_{g} (t) s_{g} (t) 〉 \end{matrix}] = {\tilde{γ}}_{g} [\begin{matrix} 〈 s_{g} (t) 〉 \\ 2 〈 s_{g}^{2} (t) 〉 - 〈 s_{g} (t) 〉 \end{matrix}] + ϵ,

where

〈 x 〉

denotes the expectation of random variable x. The regression equation incorporates both first-order and second-order moment information of

u_{g} (t)

and

s_{g} (t)

and can be solved using generalized least squares.

Dynamic Inference: Parameter Estimation in scVelo [30]

A major issue with steady-state analysis is that many transient cells would be discarded in the parameter estimation. The scVelo approach [30] circumvents the issues through the estimation of kinetic parameters (

α_{g}

,

β_{g}

,

γ_{g}

) via an expectation-maximization (EM) algorithm by modeling all the cells with the dynamical process.

Guided by transcriptional regulation principles, scVelo models gene expression dynamics through two distinct transcriptional phases: (1) an induction phase (

k = 0

) characterized by promoter activation and transcriptional upregulation, and (2) a subsequent repression phase (

k = 1

) marked by transcriptional suppression. This phase-specific regulation manifests through different unspliced RNA production rates (

α_{g}^{(0)} \neq α_{g}^{(1)}

). Let

t_{g}^{(k)}

denote the transition time from phase

k - 1

to k for gene g, with initial conditions

u_{g}^{★} = u_{g} (t_{g}^{(k)})

and

s_{g}^{★} = s_{g} (t_{g}^{(k)})

. The analytical solution to Equation (4) during phase k yields

\begin{matrix} u_{g} (t) & = u_{g}^{★} e^{- β_{g} τ} + \frac{α_{g}^{(k)}}{β_{g}} (1 - e^{- β_{g} τ}), \\ s_{g} (t) & = s_{g}^{★} e^{- γ_{g} τ} + \frac{α_{g}^{(k)}}{γ_{g}} (1 - e^{- γ_{g} τ}) + \frac{α_{g}^{(k)} - β_{g} u_{g}^{★}}{γ_{g} - β_{g}} (e^{- γ_{g} τ} - e^{- β_{g} τ}), τ & = t - t_{g}^{(k)} . \end{matrix}

For each gene g, one can estimate the parameter set

θ_{g} = {α_{g}^{(k)}, β_{g}, γ_{g}, t_{g}^{(k)}}

by minimizing the discrepancy between modeled trajectories

{\hat{x}}_{g} (t) = (u_{g} (t), s_{g} (t))

and observed single-cell measurements

x_{g, c} = (u_{g, c}, s_{g, c})

across cells c. Assuming Gaussian residuals

e_{c} = ∥ x_{g, c} - {\hat{x}}_{g} (t_{c}) ∥

with variance

σ^{2}

, the log-likelihood function becomes

max_{θ_{g}, t_{c}} L (θ_{g},, t_{c}) = - \frac{1}{2 σ^{2}} \sum_{c} {∥ x_{g, c} - {\hat{x}}_{g} (t_{c}) ∥}^{2} + constant .

(5)

The EM implementation proceeds as follows:

Initialization: Using steady-state estimation as the initial value for iteration.

$\begin{matrix} β_{g} & = 1, γ_{g} = \frac{u_{g}^{⊤} s_{g}}{∥ s_{g} ∥^{2}}, \\ k_{g, c} & = I (u_{g, c} - \tilde{γ} s_{g, c} \geq 0), α_{g}^{(1)} = max_{c} s_{g, c}, α_{g}^{(0)} = 0 . \end{matrix}$
E-step: Assigning hidden latent time $t_{c}$ for each cell by projecting observations onto the current estimated trajectory ${\hat{x}}_{g} (t | θ_{g})$ .
M-step: Updating $θ_{g}$ via maximum likelihood estimation given current latent time assignments.

Function Class-Based Estimation

While traditional RNA velocity methods focus on estimating parameters

α_{g}, β_{g}, γ_{g}

in dynamic Equation (4), alternative approaches such as UniTVelo [67] and TF Velo [68] take a different path by directly parameterizing the dynamics of spliced RNA. UniTVelo [67] models transcriptional phases through radial basis functions

\begin{matrix} s_{g} (t_{g, c}) & = h_{g} exp (- a_{g} {(t_{g, c} - τ_{g})}^{2}) + o_{g}, \\ u_{g} (t_{g, c}) & = \frac{1}{β_{g}} ({\dot{s}}_{g} (t_{g, c}) + γ_{g} s_{g} (t_{g, c})) + i_{g}, \end{matrix}

where the velocity derives directly from function differentiation

{\dot{s}}_{g} (t_{g, c}) = - 2 a_{g} (t_{g, c} - τ_{g}) s_{g} (t_{g, c}) .

The full parameter set

(h_{g}, a_{g}, τ_{g}, o_{g}, γ_{g}, β_{g}, i_{g}, t_{g, c})

is estimated via maximum likelihood framework as Equation (5), comparing model predictions

{\hat{x}}_{c} = (u_{g} (t_{g, c}), s_{g} (t_{g, c}))

against observations

x_{c} = (u_{g, c}, s_{g, c})

under Gaussian residuals.

TF Velo [68] introduces transcription factor coupling through linear dynamics

{\dot{s}}_{g} (t) = w_{g}^{⊤} f_{g} (t) - γ_{g} s_{g} (t),

combined with sinusoidal splicing

s_{g} (t) = A_{g} sin (ω_{g} t + θ_{g}) + b_{g} .

This functional specification enables the analytical resolution of TF interactions

w_{g}^{⊤} f_{g} (t) = A_{g} \sqrt{4 π^{2} + γ_{g}^{2}} sin (2 π t + θ_{g} + ϕ_{g}) + b_{g} γ_{g}, ϕ_{g} = arctan (2 π / γ_{g}) .

Parameters are optimized by matching predicted trajectories

{\hat{x}}_{c} = (w_{g}^{⊤} f_{g} (t_{g, c}), s_{g} (t_{g, c}))

to observed data

x_{c} = (w_{g}^{⊤} f_{g, c}, s_{g, c})

using the same likelihood framework as Equation (5).

Deep Learning-Based RNA Velocity Recently, the application of deep learning methods has expanded the possibilities for RNA velocity estimation [69,70,71]. In RNA velocity analysis, the expressive power of neural networks is especially useful in inferring the latent state of cells, as well as encouraging the consistency of the learned vector field.

Latent State: VAE-Based Methods

The Variational Autoencoder (VAE) [72] is an effective approach to model the distribution of data through latent variables to achieve high-dimensional data reconstruction. Its core idea is to introduce a latent variable

z

and express the distribution of data

x

as the following conditional distribution:

p (x) = \int p (x | z) p (z) d z,

where the generative process is

z \sim p (z), x \sim p (x | z) .

The VAE employs a decoder

p_{θ} (x | z)

and an encoder

q_{ϕ} (z | x)

to map between the latent space and data space. The training objective is the Evidence Lower Bound (ELBO):

L_{ELBO} = L_{rec} + L_{reg} = E_{q_{ϕ} (z | x)} [log p_{θ} (x | z)] - D_{KL} [q_{ϕ} (z | x) ∥ p (z)], x \sim p_{data} (x),

where the first term is the reconstruction loss, enforcing similarity between decoded data and observations, and the second term acts as a regularizer, aligning the latent distribution with the prior. Typically, the decoder outputs the mean

μ_{θ} (z)

of a Gaussian distribution with fixed variance

σ^{2}

, leading to

L_{rec} \propto - \frac{1}{2 σ^{2}} E_{q_{ϕ} (z | x)} [∥ x - μ_{θ} {(z) ∥}^{2}] .

Several methods are based on VAE to improve the RNA velocity model by taking advantage of the latent space.

VeloAE [73] computes RNA velocity using latent space representations. Its encoder maps the spliced RNA matrix

S \in R^{n_{c} \times n_{g}}

and unspliced RNA matrix

U \in R^{n_{c} \times n_{g}}

to latent representations

\tilde{S} \in R^{n_{c} \times d_{z}}

and

\tilde{U} \in R^{n_{c} \times d_{z}}

, while the decoder reconstructs

\hat{S}

and

\hat{U}

. Velo AE enforces the steady-state constraint on latent representations

{\tilde{u}}_{i}, {\tilde{s}}_{i}

, resulting in a composite loss:

L = L_{rec} + L_{reg} = \sum_{i = 1}^{d_{z}} MSE ({\tilde{u}}_{i} - γ_{i} {\tilde{s}}_{i}) + [MSE (S, \hat{S}) + MSE (U, \hat{U})] .

The RNA velocity is derived as

\tilde{u_{i}} - γ_{i} \tilde{s_{i}}

after training.

LatentVelo [74] incorporates latent variables

z_{c} = (u_{c}^{(z)}, s_{c}^{(z)})

and pseudotime

t_{c}

, representing latent-space unspliced/spliced RNA levels and cellular pseudotime. It assumes the following dynamics:

\begin{matrix} \frac{d u_{c}^{(z)} (t)}{d t} & = f_{u} (u_{c}^{(z)} (t), r_{c}^{(z)} (t)), \\ \frac{d s_{c}^{(z)} (t)}{d t} & = f_{s} (u_{c}^{(z)} (t), s_{c}^{(z)} (t)), \\ \frac{d r_{c}^{(z)} (t)}{d t} & = f_{r} (s_{c}^{(z)} (t), r_{c}^{(z)} (t), h_{c}), \\ h_{c} & = f_{h} (s_{c}^{(z), obs}, u_{c}^{(z), obs}), \end{matrix}

where

s_{c}^{(z), obs}

and

u_{c}^{(z), obs}

are latent-space observations,

r_{c}^{(z)}

governs chromatin dynamics, and

f_{u}, f_{s}, f_{r}, f_{h}

are neural networks with

f_{h}

computing the cell state encoding

h_{c}

. Beyond standard VAE losses, an evolution loss ensures correct dynamics:

L_{evol} = \sum_{c = 1}^{n_{c}} E_{t_{c} \sim q (t_{c} | x)} [\frac{∥ z_{c}^{obs} - z_{c} (t_{c}) ∥^{2} + ∥ x_{c} - {\hat{x}}_{c} (t_{c}) ∥}{σ^{2}}] .

The total loss is

L = L_{rec} + L_{reg} + L_{evol}

.

VeloVI [75] models gene-specific state distributions

π_{g, c}

with a Dirichlet prior

π_{g c} \sim Dirichlet (\frac{1}{4}, \frac{1}{4}, \frac{1}{4}, \frac{1}{4})

, where states

k_{g, c} \in {1, 2, 3, 4}

correspond to induction, repression, induction steady, and repression steady. Key parameters include gene-specific rates

α_{g}, β_{g}, γ_{g}

, pseudotime

t_{g, c}

, and switching time

t_{g}^{s}

. For repression-related states,

α_{g} = 0

is fixed. Genes in transient states follow Equation (4), while steady states use analytic solutions. Reconstruction losses follow standard VAE training.

VeloVAE [76] uses latent variables

z_{c} \sim N (0, I)

and pseudotime

t_{c} \sim N (t_{0}, σ_{0}^{2})

. A fully connected network maps

z

to gene-specific parameters

α_{c, g}, β_{c, g}, γ_{c, g}

, enabling reconstruction via Equation (4) and training with standard VAE losses.

Enhancing Velocity: Continuity-Based Methods

Several methods leverage the continuity assumption in single-cell data: If the observed data fully capture the continuous dynamics of cellular evolution, a cell’s state at the next timestep should align with its neighbors. Several methods use such prior information for defining the loss function to further refine the RNA velocity.

DeepVelo [77] parameterizes

α_{g, c}, β_{g, c}, γ_{g, c}

via neural networks to compute RNA velocity

v_{g, c}

. Its loss function incorporates temporal consistency: The state

s_{c} (t + 1)

is approximated as a weighted sum of neighboring cell states

s_{j} (t (j))

, where the transition probability

P_{+} (c \to j)

is defined as

P_{+} (c \to j) = \{\begin{matrix} \frac{1}{Z} & if cos (s_{j} - s_{c}, {\hat{v}}_{c}) > 0 and j \in N (c), \\ 0 & otherwise . \end{matrix}

where Z is the normalization constant. The forward consistency loss enforces this assumption as follows:

L_{+} = \sum_{c = 1}^{n_{c}} {∥s_{c} (t) + {\hat{v}}_{c} - s_{c} (t + 1)∥}^{2},

with

s_{c} (t + 1) = \sum_{j \in N (c)} s_{j} (t (j)) P_{+} (c \to j)

. A symmetric backward consistency loss

L_{-}

is similarly constructed. Additionally, a correlation loss ensures consistency with transcriptional dynamics in the following way:

L_{corr} = - [λ_{u} \frac{{\hat{v}}_{c} \cdot u_{c}}{∥ {\hat{v}}_{c} ∥ ∥ u_{c} ∥} + λ_{s} \frac{{\hat{v}}_{c} \cdot s_{c}}{∥ {\hat{v}}_{c} ∥ ∥ s_{c} ∥}],

yielding the total loss

L = L_{+} + L_{-} + L_{corr}

.

CellDancer [78] similarly parameterizes

α_{g, c}, β_{g, c}, γ_{g, c}

with neural networks, and estimates velocity as

{\hat{v}}_{c} = (Δ u_{c}, Δ s_{c}), where Δ u_{c} = α_{c} - β_{c} ⊙ u_{c}, Δ s_{c} = β_{c} - γ_{c} ⊙ s_{c} .

Its loss maximizes velocity alignment with neighbors, as follows:

L = \sum_{c} [1 - max_{j \in N (c)} cos ({\hat{v}}_{c}, v_{j})], v_{j} = (u_{j} - u_{c}, s_{j} - s_{c}) .

Vector Field Reconstruction Based on RNA Velocity After estimating the model parameters and obtaining the RNA velocity for each cell, an important downstream task is to predict the fate of cells based on the estimated velocity, e.g., the continuous differentiation trajectory of cells. To construct a continuous vector field, the RNA velocity of each cell could be treated as the value of a vector field at discrete points. Then, the vector field is reconstructed by formulating a regression problem. Subsequently, quantities derived from vector field analysis, such as equilibrium points, streamlines, gradients, divergence, and curl, are used to study cell fate in continuous dynamics setup [79].

Estimating the Vector Field

Denote the spliced RNA counts and RNA velocities at various data points

{x_{i}, v_{i}}_{i = 1}^{n}

. The task is to find a continuous vector field function

f^{★}

that minimizes the regression loss

L = \sum_{i} p_{i} {∥ v_{i} - f (x_{i}) ∥}^{2}

where

p_{i}

denotes the weights of each data point.

Dynamo [79] approximates the unknown vector-valued function in a sparse reproducing kernel Hilbert space (RKHS). For a vector-valued function

f \in H

in RKHS, it can be represented as a sum of Gaussian kernels as follows:

f (x) = \sum_{i = 1}^{m} Γ (x, {\tilde{x}}_{i}) c_{i}, Γ (x, \tilde{x}) = exp (- w ∥ x - \tilde{x} ∥^{2}),

where

{\tilde{x}}_{i}

are called control points. Additionally, the norm of

f

in

H

can be computed as

{∥ f ∥}^{2} = \sum_{i, j = 1}^{m} c_{i}^{T} Γ ({\tilde{x}}_{i}, {\tilde{x}}_{j}) c_{j}^{T} .

The loss function for the vector field estimation problem includes a regularization term based on the norm of

f

such that

L_{λ} = \sum_{i} p_{i} ∥ v_{i} - f (x_{i}) ∥^{2} + \frac{λ}{2} {∥ f ∥}^{2},

where

λ

is the regularization coefficient.

Another popular fitting strategy to reconstruct the continuous vector field is to use the neural network, where a VAE-based deep learning method was proposed in [80].

Geometric Analysis of Vector Field

Based on the estimated continuous vector field, Dynamo [79] proposed several analyses to reveal the differential geometry of the RNA velocity. First, the Jacobian matrix is essential for analyzing the stability of equilibrium points in a dynamical system and for studying gene–gene interactions. In RKHS context, the Jacobian matrix can be analytically computed as

J = \frac{\partial f (x)}{\partial x} = - 2 w \sum_{i = 1}^{m} Γ (x, {\tilde{x}}_{i}) c_{i} {(x - {\tilde{x}}_{i})}^{T}

where

Γ (x, {\tilde{x}}_{i})

is the Gaussian kernel. Since the

(i, j)

-th entry of the Jacobian matrix

J_{i j}

represents the effect of unspliced RNA levels of gene j on the RNA velocity of gene i, the Jacobian matrix can be used to analyze the strength of gene–gene interactions. By averaging the Jacobian matrix across all data points, an average Jacobian matrix

〈 J 〉

can be obtained. By sorting the elements in each row of

〈 J 〉

, the top regulators for each effector can be identified. Conversely, by sorting the elements in each column of

〈 J 〉

, the top effectors for each regulator can be identified. The Jacobian matrix can also be used to compute the effect of perturbations. If the system state changes by

Δ x

at

x

, the resulting change in the vector field is

Δ f = J Δ x .

Several other quantities could also be conveniently derived based on the Jacobian matrix.

The divergence represents the net flux generated or dissipated per unit time at each point in the vector field:

$\nabla \cdot f = Tr (J),$

where $Tr (J)$ denotes the trace of the Jacobian matrix. Regions with divergence greater than 0 (sources) may correspond to the initial states of cells, while regions with divergence less than 0 (sinks) may correspond to the terminal states of cells.
The acceleration of a particle moving along the streamlines of the vector field can be directly computed from the Jacobian matrix:

$a = \frac{d v}{d t} = J v .$
The curvature vector of the streamlines is defined as the derivative of the unit tangent vector with respect to time:

$κ = \frac{1}{∥ v ∥} \frac{d}{d t} \frac{v}{∥ v ∥} = \frac{(v^{T} v) J v - (v^{T} J v) v}{{∥ v ∥}^{4}} .$

Transition Path Analysis

Based on the learned vector field, continuous cell trajectories could also be constructed in Dynamo [79] using the concept of most probable path [64]. Given the SDE (2), the action along any path

Ψ

is defined as

S_{T} [ψ] = \int_{0}^{T} L^{F W} (ψ, \dot{ψ}) d t, L^{F W} (ψ, \dot{ψ}) = \frac{1}{4} {[\dot{ψ} (s) - b (ψ (s))]}^{t} D^{- 1} (ψ (s)) [\dot{ψ} (s) - b (ψ (s)) .]

According to the Freidlin–Wentzell theorem [64], the path of least action is indeed the most probable path to make transitions between two attractors. In actual computation, Dynamo assumes the constant noise coefficient by taking

D = \frac{σ^{2}}{2}

since only a continuous vector field is reconstructed.

For a transition connecting two meta-stable states along the optimal path with action

S^{★} \geq 0

, the transition rate between the attractors is given by

R (x_{s} \to x_{t}) \approx C exp (- S^{★}),

(6)

where C is a proportionality constant.

3.2. Temporally Resolved Single-Cell RNA-Seq

Temporally resolved scRNA-seq provides us with a deeper understanding of the dynamics process in single cells. However, due to the destructive nature of scRNA-seq technology, we cannot track the trajectories of individual cells. Instead, we can only observe the changes in cellular distribution with time. Thus, reconstructing the trajectories of single cells from samples collected at discrete and sparse temporal points becomes crucial for understanding developmental processes and other dynamic biological processes and remains a challenging problem [13,35,41,81,82,83,84,85,86,87,88,89,90,91]. To overcome these challenges, many methods have been developed in recent years. From a dynamical perspective, these approaches can be broadly classified into two categories: those that model dynamics on discrete cell states and those that model dynamics in continuous spaces. In the following, we will introduce these methods separately from these two viewpoints. Figure 3 summarizes these approaches.

3.2.1. Discrete Temporal Dynamics Modeling

Among the methods that model dynamics on discrete cell states, pioneering work includes Waddington OT [35] and Moscot [36]. These approaches employ static optimal transport as the main tools.

Static Optimal Transport To formulate this problem, consider

X \in R^{N \times G}

and

Y \in R^{M \times G}

represent two unpaired datasets of N and M cells observed at different time points (

t_{1}, t_{2}

), respectively, in the G dimensional gene expression space. Then, one can define two marginal distributions

ν_{0} \in C_{N}

,

ν_{1} \in C_{M}

at

t_{1}

and

t_{2}

respectively on the probability simplex

C_{N} = {a \in R^{N} | \sum_{i = 1}^{N} a_{i} = 1, a \geq 0} .

The goal of optimal transport is to find the optimal coupling

π \in R_{+}^{N \times M}

that transports a distribution to another while minimizing the cost associated with the transportation. The feasible transport plan is defined as

Π (ν_{0}, ν_{1}) = \{π \in R^{N \times M} : π 1_{M} = ν_{0}, π^{T} 1_{N} = ν_{1}, π \geq 0\} .

So the static optimal transport problem is formally defined as

\begin{matrix} min_{π \in Π (ν_{0}, ν_{1})} 〈 π, c 〉 = \sum_{i, j} c_{i, j} π_{i, j} . \end{matrix}

(7)

The cost matrix

c \in R_{+}^{n \times m}

defines the transportation cost between each pair of points, where

c_{i j} : = c (x_{i}, y_{j})

quantifies the expense of transferring a unit mass from the source point

x_{i}

to the target point

y_{j}

. By solving the static optimal transport problem, one can determine the transport matrix that couples the two distributions. This static optimal transport can be effectively addressed using the Python Optimal Transport (POT) library [92].

Entropic Optimal Transport Additionally, to enhance the efficiency of solving the optimal transport problem, a regularized optimal transport approach is often introduced. The discrete entropy of a coupling matrix is defined as

H (π) \overset{def}{=} - \sum_{i, j} π_{i, j} (log (π_{i, j}) - 1) .

The function H is 1 -strongly concave, because its Hessian is

\partial^{2} H (π) = - diag (1 / π_{i, j})

and

0 < π_{i, j} \leq 1

. The idea of the entropic regularization of optimal transport is to use

- H

as a regularizing function to obtain approximate solutions to the original transport problem:

min_{π \in Π (ν_{0}, ν_{1})} 〈 π, c 〉 - ε H (π)

(8)

Since the objective is an

ε

-strongly convex function, problem (8) has a unique optimal solution. Using the KKT conditions, the solution to (8) is unique and has the form [93]

\forall (i, j) \in (N \times M), π_{i, j} = a_{i} K_{i, j} b_{j}

for two unknown variables

(a, b) \in R_{+}^{N} \times R_{+}^{M}

, where

K_{i, j} = e^{- \frac{c_{i, j}}{ε}}

.

Sinkhorn Algorithm From above, the optimal solution of problem (8) can be expressed in matrix form as

π = diag (a) K diag (b)

. Then it is necessary to satisfy the constraints

Π (ν_{0}, ν_{1})

, i.e.,

diag (a) K diag (b) 1_{M} = ν_{0}, diag (a) K^{T} diag (b) 1_{N} = ν_{1} .

Note that

diag (b) 1_{M}

is

b

and the multiplication of

diag (a)

times

K b

is

a ⊙ (K b) = ν_{0}, a ⊙ (K^{T} b) = ν_{1},

where ⊙ represents the entry-wise multiplication. An intuitive way is to solve them iteratively, and these two updates yield the Sinkhorn algorithm,

a^{(ℓ + 1)} \overset{def .}{=} \frac{ν_{0}}{K b^{(ℓ)}}, b^{(ℓ + 1)} \overset{def .}{=} \frac{ν_{1}}{K^{T} a^{(ℓ + 1)}} .

The division used above between two vectors is entry-wise and can be computed in time and memory quadratically of cell number.

Unbalanced Optimal Transport Static optimal transport inherently conserves mass. However, during cellular development and differentiation, processes such as cell proliferation and apoptosis result in mass non-conservation. Consequently, it is essential to consider unnormalized distributions that account for cell growth and death. Additionally, the marginals must be adjusted to incorporate these factors effectively. So, the unbalanced optimal transport can be defined as follows:

min_{π \in R_{+}^{N \times M}} 〈 π, c 〉 + τ_{1} KL (π 1_{M} | | ν_{0}) + τ_{2} KL (π^{T} 1_{N} | | ν_{1}),

(9)

where

τ_{1}

and

τ_{2}

are hyperparameters that control the degree of penalization. When

τ_{1} = τ_{2} \to + \infty

, then one can recover the original optimal transport. To further adjust the marginal distributions accounting for growth and death [35,36], for the left marginal distribution

ν_{0}

we set

{(ν_{0})}_{i} = \frac{g {(x_{i})}^{t_{2} - t_{1}}}{\sum_{j = 1}^{N} g {(x_{j})}^{t_{2} - t_{1}}}, \forall i \in {1, \dots, N} .

(10)

where g is the growth/death function and can be estimated through the gene sets. For the right marginal distribution

ν_{1}

, set it as the uniform distribution, i.e.,

{(ν_{1})}_{j} = 1 / M, \forall j \in {1, \dots, M} .

By the obtained coupling matrix

π \in R_{+}^{N \times M}

, one can perform biological downstream analysis such as computing ancestors or descends of a cell state and imputing gene expressions [35,36]. Naturally, a Markov chain model can be formulated to quantify the transition probability among cells across time points based on the optimal transport plan [36,63]. By weighting this random walk with those induced by other quantities such as gene expression similairity (e.g., DiffuionMap), pseudotime (e.g., Palantir), or RNA velocity, the CellRank analysis could also be conducted to dissect the underlying structure of the transitional dynamics [63].

3.2.2. Continuous Temporal Dynamics Modeling

Although static optimal transport provides a robust framework for coupling distributions at different time points, there is a substantial interest in capturing continuous cellular dynamics over time and fitting mechanistic models that transform the source distribution into the target distribution. This interest has driven the development of various dynamical optimal transport (OT) methods and flow-based generative models. Prominent approaches include those based on the Benamou–Brenier formulation [94], such as TrajectoryNet [95], MIOFlow [96], and other related methodologies [90,97,98,99,100,101]. Additionally, unbalanced dynamic OT methods [41,81,86,89], Gromov–Wasserstein OT approaches [102], continuous normalizing flows (CNF), and conditional flow matching techniques (CFM) [37,38,103,104,105,106,107,108,109,110,111] have also been proposed. Despite these advancements, many of these methods do not fully account for stochastic dynamical effects, particularly the intrinsic noise inherent in gene expression and cell differentiation [61,112], which are prevalent in single-cell biological processes [113].

In the realm of stochastic dynamics, the Schrödinger bridge (SB) problem seeks to identify the most probable stochastic transition path between two arbitrary distributions relative to a reference stochastic process [114]. Variants of the SB problem have been applied across various domains, including single-cell RNA sequencing (scRNA-seq) analysis and generative modeling. These approaches encompass static methods [82,115,116,117,118,119,120,121,122], dynamic methods [84,88,91,123,124,125,126,127,128,129,130,131,132,133,134,135], and flow-matching techniques [136]. However, these methods often fail to address unnormalized distributions resulting from cell growth and death. To address this problem, some methods have been developed to account for the unbalanced stochastic dynamics, for example, those based on branching SDE theory (e.g, gWOT) [82,87,116,117] and those based on the Feynman–Kac formula with forward-backward SDE theory [115]. Among those, most methods often require prior knowledge of these processes, such as growth or death rates [82,87,115,117] or depend on additional information like cell lineage data [116].

Recently, regularized unbalanced optimal transport (RUOT), also known as unbalanced Schrödinger bridge [137], has emerged as a promising approach for modeling stochastic unbalanced continuous dynamics [137,138,139,140]. RUOT can be viewed as an unbalanced relaxation of the dynamic Schrödinger bridge formulation. For instance, ref. [138] elucidates the connection between certain RUOT formulations and branching Schrödinger bridges. Meanwhile, a new deep learning framework (DeepRUOT) [40], has been developed to learn general RUOT and infer continuous unbalanced stochastic dynamics from sample data based on derived Fisher regularization forms without requiring prior knowledge.

The primary objective now transforms to determine the dynamics described by Equations (2) and (3) from observed data, given unnormalized distributions at T discrete time points where

x_{i} \in R^{d} \sim ν_{i}

for each fixed time point

i \in {0, \dots, T - 1}

. Note that solely satisfying Equation (3) does not admit a unique solution. Consequently, we need to ensure that the inferred dynamics also adhere to certain energy minimization principles. Building upon Equation (3), the problem can be categorized into four distinct scenarios: (1)

g (x, t) = 0 and σ (x, t) = 0,

(2)

g (x, t) = 0 and σ (x, t) \neq 0,

(3)

g (x, t) \neq 0 and σ (x, t) = 0,

(4)

g (x, t) \neq 0 and σ (x, t) \neq 0 .

Each of these cases is examined to systematically address the learning of the underlying dynamics.

Dynamical Optimal Transport (

g = 0, σ = 0

) In this case, it means the dynamics do not account for unblancedness and stochastic, i.e., the cellular dynamics are governed by

d x_{t} = b (x_{t}, t) d t

. Then we can use the dynamical optimal transport to model these dynamics, also known as the Benamou–Brenier formulation [94], which can be stated as follows:

\begin{matrix} \frac{1}{2} W_{2}^{2} (ν_{0}, ν_{1}) = \inf_{(p (x, t), b (x, t))} \int_{0}^{1} \int_{R^{d}} \frac{1}{2} {∥ b (x, t) ∥}_{2}^{2} p (x, t) d x d t, \\ s . t . \partial_{t} p + \nabla \cdot (b (x, t) p) = 0, p {|_{t = 0} = ν_{0}, p |}_{t = 1} = ν_{1} . \end{matrix}

(11)

The inclusion of the factor

\frac{1}{2}

on the left-hand side ensures that the Wasserstein distance has a more physically meaningful interpretation; for example, it represents the total action required to transport one distribution to another. In this formulation, probability distributions are connected through a deterministic transport equation. It has been demonstrated that this dynamic formulation is equivalent to static optimal transport problem (Equation (7)) when employing the cost function

c (x, y) = {∥ x - y ∥}_{2}^{2}

.

Neural ODE Solver

Numerous methodologies have been proposed to solve dynamical optimal transport or its variants numerically. The basic approach involves employing a neural network, denoted as

b_{θ} (x, t)

, to parameterize

b (x, t)

, and subsequently utilizing the ordinary differential equations (ODEs) that govern particle trajectories. From Problem (11), it is evident that the optimization process must address two distinct loss components: the first pertains to the computation of an energy-related loss, while the second concerns the reconstruction error (

p (x, 1) = ν_{1}

).

Regarding energy loss, the high-dimensional nature of the integral presents significant challenges due to the curse of dimensionality. To mitigate this issue, the integral is approximated using Monte Carlo integration and continuous normalizing flow (CNF) techniques [95]. The strategy involves performing integration along the particle trajectories dictated by the ODE, i.e.,

\int_{0}^{1} \int_{R^{d}} \frac{1}{2} {∥ b (x, t) ∥}_{2}^{2} p (x, t) d x d t = E_{x_{0} \sim ν_{0}} \int_{0}^{1} \frac{1}{2} {∥ b (x (t), t) ∥}_{2}^{2} d t,

where

x (t)

satisfies the ODE

\frac{d x}{d t} = b (x, t), x_{0} \sim ν_{0}

. For the distribution reconstruction loss, the authors incorporate an additional penalizing constraint. By integrating these two loss components, there exists a sufficiently large

λ \geq 0

such that [95,96]

\frac{1}{2} W_{2}^{2} (ν_{0}, ν_{1}) = inf_{(p (x, t), b (x, t))} E_{x_{0} \sim ν_{0}} \int_{0}^{1} \frac{1}{2} {∥ b (x (t), t) ∥}_{2}^{2} d t + λ D (p (x, 1), ν_{1}) .

Based on this formulation, TrajectoryNet [95] computes both the energy and the reconstruction errors by using neural ODE [141] to parametrize the velocity

b (x, t)

.

Conditional Flow Matching

Recently, conditional flow matching (CFM) presents another efficient dynamical OT solver especially in high dimensionality case [37,38,104,129]. Assume that the probability path

p (x, t)

and the corresponding vector field

b (x, t)

generating it are known, and that

p (x, t)

can be efficiently sampled. Under these conditions, a neural network

b_{θ} (x, t)

can be trained to approximate

b (x, t)

by minimizing the flow matching (FM) objective:

L_{FM} (θ) = E_{t \sim U (0, 1), x \sim p (x, t)} {∥ b_{θ} (x, t) - b (x, t) ∥}_{2}^{2} .

However, this objective is computationally intractable when dealing with general source and target distributions. Consider the specific case of Gaussian marginal densities, defined as

p (x, t) = N (x | μ (t), σ {(t)}^{2} I)

. The corresponding unique vector field that generates this density from

N (x | μ (0), σ {(0)}^{2} I)

is

b (x, t) = μ^{'} (t) + \frac{σ (t)}{σ^{'} (t)} (x - μ (t)),

where

μ^{'} (t)

and

σ^{'} (t)

means the time derivative [37,38]. Now, assume the marginal probability trajectory

p (x, t)

is a mixture of conditional probability paths

p (x, t | z)

. Specifically, this can be expressed as:

p (x, t) = \int p (x, t | z) q (z) d z .

If the

p (x, t | z)

is generated by the vector field

b (x, t | z)

from

p (x, 0 | z)

, then

p (x, t)

can be generated by

b (x, t)

defined as follows:

b (x, t) : = E_{q} (z) \frac{b (x, t | z) p (x, 0 | z)}{p (x, t)} .

This is also intractable since

p (x, t)

is difficult to compute. The key is to introduce the conditional flow matching objective:

L_{CFM} (θ) = E_{t \sim U (0, 1), q (z), p (x, t | z)} {∥ b_{θ} (x, t) - b (x, t | z) ∥}_{2}^{2} .

One can prove that

\nabla_{θ} L_{CFM} = \nabla_{θ} L_{FM}

, so training with CFM is equivalent with FM. The CFM objective is very useful when the

b (x, t)

is intractable but the conditional

b (x, t | z)

is tractable. So to approximate the dynamical optimal transport (11) is to use CFM. Assume

q (z) = q (z_{0}, z_{1})

, and set

q (z)

to be the Wasserstein optimal transport map

π

between the source distribution

ν_{0}

and the target distribution

ν_{1}

, i.e.,

q (z) = π (z_{0}, z_{1})

, where

z_{0} \sim ν_{0}

,

z_{1} \sim ν_{1}

. Then one can construct the Gaussian flow between

z_{0}

and

z_{1}

with standard deviation

σ

,

p (x, t | z) = N (x | t z_{1} + (1 - t) z_{0} | σ^{2}), b (x, t | z) = z_{1} - z_{0} .

It can be proved that when

σ \to 0

, this also gives a way to solve the dynamical optimal transport [38]. The advantage of CFM is that it is simulation-free and can handle the thousand gene dimensions without reducing dimensionality.

Schrödinger Bridge Problem (

g = 0, σ \neq 0

) In this case, the model can account for the stochastic effects, yet without unbalanced effects. We employ the Schrödinger Bridge problem to model the SDE dynamics, i.e., the cellular dynamics are

d x_{t} = b (x_{t}, t) d t + σ (x_{t}, t) d w_{t}

. The Schrödinger Bridge problem seeks to determine the most probable evolution between a specified initial distribution

ν_{0}

and a terminal distribution

ν_{1}

(assumed to possess a density in this study) relative to a given reference stochastic process. Formally, this problem is formulated as the minimization of the Kullback–Leibler (KL) divergence from the perspective of optimal control [142], as shown below:

min_{μ_{0}^{X} = ν_{0}, μ_{1}^{X} = ν_{1}} D_{KL} (μ_{[0, 1]}^{X} | μ_{[0, 1]}^{Y}),

(12)

where

μ_{[0, 1]}^{X}

denotes the probability measure induced by the stochastic process

x_{t}

for

0 \leq t \leq 1

, defined on the space of all continuous paths

C ([0, 1], R^{d})

. The distribution of

x_{t}

at a given time t is characterized by the measure

μ_{t}^{X}

with density function

p (x, t)

. The reference measure

μ_{[0, 1]}^{Y}

is chosen as the probability measure induced by the process

d Y_{t} = σ (Y_{t}, t) d w_{t},

where

w_{t} \in R^{d}

represents the standard multidimensional Brownian motion.

Interestingly, the problem can be equivalently transformed into a dynamical form [39,40,142,143]

inf_{(p, b)} \int_{0}^{1} \int_{R^{d}} [\frac{1}{2} b^{⊤} (x, t) a {(x, t)}^{- 1} b (x, t)] p (x, t) d x d t,

(13)

where the infimum is taken over all pairs of functions

(p, b)

satisfying

p (\cdot, 0) = ν_{0}

,

p (\cdot, 1) = ν_{1}

, and

p (x, t)

is absolutely continuous with respect to time. Additionally, the pair

(p, b)

must satisfy the Fokker–Planck Equation (3). We denote minimization problem (13) and the constraints (3) as the dynamic diffusion Schrödinger bridge formulation. Methods for modeling stochastic dynamics based on it have been widely developed [84,88,91,123,134,136], involving neural SDE, neural ODE, or flow matching techniques. We will next provide an overview of the methodologies in these approaches.

Neural SDE Solver

Similar to dynamical OT, one can solve the dynamical SB problem through the CNF formulation

inf_{(p (x, t), b (x, t))} E_{x_{0} \sim ν_{0}} \int_{0}^{1} [\frac{1}{2} b^{⊤} (x (t), t) a {(x (t), t)}^{- 1} b (x (t), t)] d t + λ D (p (x, 1), ν_{1}) .

Building upon this, one can parametrize

b (x, t)

and

σ (x, t)

using neural networks respectively and solve this formulation through POT and the neural SDE solver. However, besides these two terms, some work also introduces the idea of the principle of least action along the trajectory in which the optimal path has the smallest action value [84,123]. Thus, they introduce a new Hamilton–Jacobi–Bellman (HJB) regularization term [84] when assuming

b (x, t) = - \nabla_{x} Φ (x, t)

, i.e.,

R_{h} = \int_{0}^{1} \int_{R^{d}} |\partial_{t} Φ (x, t) - {∥ \nabla_{x} Φ (x, t) ∥}_{2}^{2}| p (x, t) d x d t .

or a general form derived in [123].

Shrödinger Bridge Conditional Flow Matching

By leveraging CFM techniques, the simulation-free Shrödinger bridge [136] has also been recently developed. The core idea is to decompose the problem into a sequence of elementary conditional subproblems, each of which is more tractable, and subsequently express the overall solution as a mixture of the solutions to these conditional subproblems. Let the reference process be a Brownian motion (i.e.,

Y = σ W

). In this case, the Schrödinger bridge problem admits a unique solution

P^{*}

, which is expressed as a mixture of Brownian bridges weighted by an entropic optimal transport (OT) plan:

P^{*} ({(x_{t})}_{t \in [0, 1]}) = \int W ((x_{t}) ∣ x_{0}, x_{1}) d π_{2 σ^{2}}^{★} (x_{0}, x_{1}),

(14)

where

W ((x_{t}) ∣ t \in (0, 1) ∣ x_{0}, x_{1})

denotes the Brownian bridge between

x_{0}

and

x_{1}

with a diffusion rate

σ

, and

π_{2 σ^{2}}^{★} (x_{0}, x_{1})

represents the entropic optimal transport plan between the distributions. The calculation of

W ((x_{t}) ∣ t \in (0, 1) ∣ x_{0}, x_{1})

can be framed as an optimal control problem:

\begin{matrix} min_{b} E \int_{0}^{1} {∥b (x_{t}, t)∥}^{2} d t, \\ d x_{t} = b (x_{t}, t) d t + σ d w_{t}, \\ X_{0} \sim δ_{x_{0}}, X_{1} \sim δ_{x_{1}}, \end{matrix}

where

δ_{x_{0}}

and

δ_{x_{1}}

are Dirac delta functions centered at

x_{0}

and

x_{1}

, respectively.

Assume

σ

is constant and then the corresponding Fokker–Planck equation in (12) yields

\partial_{t} p (x, t) = - \nabla_{x} \cdot (p (x, t) b (x, t)) + \frac{1}{2} σ^{2} Δ p (x, t) .

From this equation, it can be derived that the ODE

d X_{t} = \underset{v (X_{t}, t)}{\underset{︸}{(b (X_{t}, t) - \frac{1}{2} σ^{2} \nabla_{x} log p (X_{t}, t))}} d t,

(15)

together with the initial distribution generate the same distribution as SDE. The (15) is called the probability flow ODE. Conversely, if the probability flow ODE

v (x, t)

and

\nabla_{x} log p (x, t)

(also known as score function) are known, one can recover the SDE drift through

v (x, t) = b (x, t) + \frac{1}{2} σ^{2} \nabla_{x} log p (x, t)

. So the flow-matching objective is

L_{U {[SF]}^{2} M} (θ) = E [\underset{flow matching loss}{\underset{︸}{{∥v_{θ} (x, t) - v (x, t)∥}^{2}}} + λ {(t)}^{2} \underset{score matching loss}{\underset{︸}{{∥\nabla s_{θ} (x, t) - \nabla log p (x, t)∥}^{2}}}] .

However, this loss is intractable; by (14) and the CFM objective, one can transform it into a tractable loss

L_{{[SF]}^{2} M} (θ) = E_{Q^{'}} \underset{conditional flow matching loss}{\underset{︸}{{∥v_{θ} (x, t) - v (x, t ∣ (x_{0}, x_{1})∥}^{2}}} + E_{Q^{'}} λ {(t)}^{2} \underset{conditional score matching loss}{\underset{︸}{{∥\nabla s_{θ} (x, t) - \nabla log p (x, t ∣ (x_{0}, x_{1})∥}^{2}}},

where

Q^{'} = t \sim U (0, 1) \otimes q (x_{0}, x_{1}) \otimes p (x, t | (x_{0}, x_{1}))

. Since the conditional path is a Brownian bridge, the analytic form can be derived, i.e.,

p (x, t ∣ (x_{0}, x_{1})) = N (x; t x_{1} + (1 - t) x_{0}, σ^{2} t (1 - t))

and

v (x, t ∣ (x_{0}, x_{1})) = \frac{1 - 2 t}{t (1 - t)} (x - (t x_{1} + (1 - t) x_{0})) + (x_{1} - x_{0}),

\nabla_{x} log p (x, t ∣ (x_{0}, x_{1})) = \frac{t x_{1} + (1 - t) x_{0} - x}{σ^{2} t (1 - t)}, t \in [0, 1] .

And

q (x_{0}, x_{1})

can be computed by the entropic optimal transport.

Unbalanced Wasserstein–Fisher–Rao metric (

g \neq 0, σ = 0

) In this case, the model can account for the unbalanced dynamics, however, it can not account for stochastic dynamics. The cellular dynamics are also governed by the ODE model by

d x_{t} = b (x_{t}, t) d t

. Then, one can use the dynamical unbalanced optimal transport to model these dynamics, also known as Wasserstein–Fisher–Rao metric [144,145,146], which can be stated as

\begin{matrix} \inf_{(p (x, t), b (x, t), g (x, t))} \int_{0}^{1} \int_{R^{d}} (\frac{1}{2} {∥ b (x, t) ∥}_{2}^{2} + α {| g (x, t) |}_{2}^{2}) p (x, t) d x d t, \\ s . t . \partial_{t} p + \nabla \cdot (b (x, t) p) = g (x, t) p, p {|_{t = 0} = ν_{0}, p |}_{t = 1} = ν_{1} . \end{matrix}

(16)

Here,

α

denotes a hyperparameter that controls the weighting. It is also important to note that in this context,

ν_{0}

and

ν_{1}

do not necessarily correspond to normalized probability densities; rather, they generally represent mass densities.

Recent works such as TrajectoryNet and TIGON utilize (16) to infer unbalanced dynamics from scRNA-seq data [81,86]. To derive a CNF solver for (16), TIGON [81] observes that along the characteristic line

\frac{d x}{d t} = b (x, t)

, one has

\int_{0}^{1} \int_{R^{d}} f (x, t) p (x, t) d x d t = E_{x_{0} \sim p_{0}} \int_{0}^{1} f (x, t) e^{\int_{0}^{t} g (x, s) d s} d t

and

\frac{d (ln p)}{d t} = g - \nabla \cdot v

. This can make the computation of both energy loss and reconstruction loss in high dimensional space tractable. Therefore, one can parameterize

b (x, t)

and

g (x, t)

using neural networks, respectively, and train them by minimizing the overall loss.

Regularized Unbalanced Optimal Transport (

g \neq 0, σ \neq 0

) In this case, the model can account for both the unbalanced and stochastic dynamics. The cellular dynamics are governed by the SDE model

d x_{t} = b (x_{t}, t) d t + σ (t) I d w_{t}

. We can use the regularized unbalanced optimal transport to model it [40,138]. It can be viewed as an unbalanced relaxation of the dynamic formulation of the Schrödinger bridge problem. Consider

inf_{(p, b, g)} \int_{0}^{1} \int_{R^{d}} \frac{1}{2} {∥b (x, t)∥}_{2}^{2} p (x, t) d x d t + \int_{0}^{1} \int_{R^{d}} α Ψ (g (x, t)) p (x, t) d x d t,

(17)

where

Ψ : R \to [0, + \infty]

corresponds to the growth penalty function, the infimum is taken over all pairs

(p, b)

such that

p (\cdot, 0) =

ν_{0}, p (\cdot, 1) = ν_{1}, p (x, t)

absolutely continuous, and

\partial_{t} p = - \nabla_{x} \cdot (p b) + \frac{1}{2} \nabla_{x}^{2} : (σ^{2} (t) I p) + g p

(18)

with vanishing boundary condition:

lim_{| x | \to \infty} p (x, t) = 0

.

One can similarly develop a dynamical OT solver relying on a neural SDE solver, which might be less efficient compared to a neural ODE solver. Recently, DeepRUOT [40] reformulates the RUOT problem with the Fisher information regularization, equivalently expressed as

inf_{(p, v, g)} \int_{0}^{1} \int_{R^{d}} [\frac{1}{2} {∥v (x, t)∥}_{2}^{2} + \frac{σ^{4} (t)}{8} {∥\nabla_{x} log p∥}_{2}^{2} - \frac{σ^{2} (t)}{2} (1 + log p) g + α Ψ (g)] p (x, t) d x d t,

(19)

where the infimum is taken over all triplets

(p, v, g)

such that

p (\cdot, 0) =

ν_{0}, p (\cdot, 1) = ν_{1}, p (x, t)

absolutely continuous, and

\partial_{t} p = - \nabla_{x} \cdot (p v (x, t)) + g (x, t) p

(20)

with vanishing boundary condition:

lim_{| x | \to \infty} p (x, t) = 0

. Here

v (x, t)

is a new vector field, representing the probability flow ODE field.

Thus, the original SDE

d x_{t} = (b (x_{t}, t)) d t + σ (t) d w_{t}

now can be transformed into the probability flow ODE

d x_{t} = \underset{v (x_{t}, t)}{\underset{︸}{(b (x_{t}, t) - \frac{1}{2} σ^{2} (t) \nabla_{x} log p (x_{t}, t))}} d t .

If the probability flow ODE’s drift

v (x, t)

,

σ (t)

and the score function

\nabla_{x} log p (x, t)

are specified, then the the drift term

b (x, t)

of the SDE can be recovered by

b (x, t) = v (x, t) + \frac{1}{2} σ^{2} (t) \nabla_{x} log p (x, t) .

Therefore, specifying an SDE is equivalent to specifying the probability flow ODE and the score function

\nabla_{x} log p (x, t)

. One can then use neural networks

v_{θ}, g_{θ}

and

s_{θ}

to parameterize

v (x, t)

,

g (x, t)

, and

\frac{1}{2} σ^{2} (t) log p (x, t)

, respectively.

To train DeepRUOT, the overall loss is composed of three parts, i.e., the energy loss, reconstruction loss, and the Fokker–Planck constraint:

L = L_{Energy} + λ_{r} L_{Recons} + λ_{f} L_{FP} .

(21)

The

L_{Energy}

loss aims for the least action of kinetic energy in Equation (19), which can be computed via CNF by adopting the similar approach in TIGON [81]. The reconstruction loss

L_{Recons}

aims the dynamics to match data distribution at the later time point (i.e.,

p (\cdot, 1) = ν_{1}

). To achieve the matching in unbalanced settings, DeepRUOT further decomposes it into two parts:

L_{Recons} = λ_{m} L_{Mass} + λ_{d} L_{OT}

(22)

where

L_{Mass}

aims to align the number of cells and

L_{OT}

uses normalized weights to perform optimal transport matching. Lastly,

L_{FP}

aims to let the three parameterized neural networks satisfy the Fokker–Planck constraints (20). DeepRUOT first utilizes a Gaussian mixture model to estimate the initial distribution, ensuring that it satisfies the initial conditions

p_{0}

, and the physics-informed (PINN) loss [147] is defined as

L_{FP} = ∥\partial_{t} p_{θ} + \nabla_{x} \cdot (p_{θ} v_{θ}) - g_{θ} p_{θ}∥ + λ_{w} ∥p_{θ} (x, 0) - p_{0}∥, p_{θ} = exp \frac{2}{σ^{2}} s_{θ} .

(23)

In [40], DeepRUOT adopts a two-stage training approach to stabilize the training process. For the pre-training stage, they use reconstruction loss only to train

v_{θ}

and

g_{θ}

. Then, they fix

v_{θ}

and

g_{θ}

and employ conditional flow-matching [37,38,136] to learn the log density function (

s_{θ} (x, t)

). Finally, for the training stage, they use the

v_{θ}

,

g_{θ}

, and the log density function as the starting point, then obtain the final result by minimizing the total loss (21).

4. Dynamic Modeling of Spatial Transcriptomics

In this section, we review the dynamical modeling approaches for spatial transcriptomics data. We will first present several random walk- or ODE-based methods to model the snapshot spatial data. Next, we focus on the recent progress to dissect the spatiotemporal dynamics underlying datasets with both space and time resolutions. Figure 4 provides an overview of these spatial transcriptomics modeling approaches.

4.1. Snapshot Spatial Transcriptomics

Below we describe several modeling strategies for single snapshot spatial transcriptomics data, including pseudotime, random walk, and continuous differential equation models, respectively.

4.1.1. Pseudotime Methods

In the context of spatiotemporal trajectory inference, stLearn [148] proposes the PSTS algorithm that combines spatial information and geodesic-based pseudotime information to infer the spatiotemporal developmental trajectory of cells. The pseudotime distance between two clusters is defined as

d_{PT} (u, v) = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{n} (1 - \frac{p_{u, i} \cdot p_{v, i}}{∥ p_{u, i} ∥ \cdot ∥ p_{v, j} ∥}),

where

p_{u, i}

and

p_{v, i}

are the PCA vectors of gene expression data points in two clusters. The spatial distance is defined as the Euclidean distance between the centroids of the two clusters. The spatiotemporal distance between clusters is the weighted sum of pseudotime distance and spatial distance

d_{PTS} (u, v) = ω d_{PT} (u, v) + (1 - ω) d_{S} (u, v) .

Each cluster is then treated as a node in a graph, and the edge weights are determined by

d_{PTS}

. By optimizing edge selection using a minimum spanning tree, the optimal trajectory structure can be identified.

4.1.2. Discrete Spatial Dynamics Modeling

STT [149] is a random walk-based algorithm to detect multi-stable attractors in spatial transcriptomics. Central to STT is the incorporation of a space coordinate-aware random walk, with the transition probability matrix having the form

P = w_{1} P_{v} + w_{2} P_{c} + (1 - w_{1} - w_{2}) P_{s}

, where

P_{v}

is induced by an attractor-specific RNA velocity (named spatial transition tensor),

P_{c}

is induced by gene expression similarity (i.e., diffusion in the gene space), and

P_{s}

is induced by space coordinates (i.e., diffusion in the physical space). By iteratively (1) decomposing P to identify attractors and assign attractor membership to each individual cell and (2) improving attractor-specific RNA velocity estimation, STT is able to identify transitional cells in the snapshot spatial transcriptomics data and plot the local streamlines within attractors.

SpaTrack [150] is a spatial transcriptomics analysis tool based on optimal transport theory, which reconstructs cell differentiation trajectories by integrating gene expression profiles and spatial coordinates of cells. When processing Snapshot data, SpaTrack defines the transition cost matrix between cells by weighting gene expression distance and spatial distance as follows:

C_{i j} = α_{1} ∥ g_{i} - g_{j} ∥^{2} + α_{2} {∥ z_{i} - z_{j} ∥}^{2}

where

g_{i}

represents the gene expression of cell i,

x_{i}

denotes the spatial coordinates of cell i, and

α_{1}, α_{2}

are weighting coefficients. The transition probability matrix between cells is obtained by solving the following entropy-regularized optimal transport (OT) problem

P = arg min_{P} \sum_{i j} C_{i j} P_{i j} + ϵ H (P) s . t . \sum_{i} P_{i j} = 1, \sum_{j} P_{i j} = 1

where

H (P)

denotes the entropy regularization term and

ϵ

is the regularization coefficient. SpaTrack identifies trajectory starting points using single-cell entropy. Let the identified starting points be cells

1, 2, \dots, s

. The probability of transitioning from starting cells to cell i can be calculated as

γ_{i} = \sum_{j = 1}^{s} P_{j i}

. By sorting cells in ascending order based on their

γ_{i}

values, the position of each cell in the differentiation trajectory can be determined.

4.1.3. Continuous Spatial Dynamics Modeling

Several methods aim to extend the continuous RNA velocity model of scRNA-seq data toward snapshot spatial transcriptomics. One recent method, iSORT [151] uses transfer learning to obtain a mapping of gene expression to spatial location

z = f (x)

and proposes the concept of spatial RNA velocity, which utilizes the velocity of gene expression and the mapping

z = f (x)

to obtain spatial RNA velocity, formally

\frac{d z}{d t} = \nabla f \cdot \frac{d x}{d t} .

In addition, Topovelo [152] uses a graph neural network to infer RNA velocity for spatial transcriptomics data and suggests that a decoder could be trained to further infer continuous spatial velocities.

4.2. Temporally Resolved Spatial Transcriptomics

The availability of time-series ST data opens new avenues to explore cellular migration within physical space [41,152,153]. Nevertheless, the inherently destructive nature of sequencing limits ST data to static snapshots rather than continuous trajectories. Particularly, when sequencing is performed at various time points during embryonic development, the resulting time-series ST data are often derived from distinct biological samples, leading to multiple unpaired snapshots [17,154,155]. In addition, due to possible rotation, translation, and stretching of different slices, the spatial coordinates of different samples are not in the same coordinate system [41,153,156]. Therefore, reconstructing trajectories of cell state transition, proliferation, and migration for time-series ST data is a challenging task.

To overcome these challenges, many methods have been developed in recent years. Similar to modeling temporal single-cell data, these methods can be divided into two categories: those that model dynamics on discrete cell states [36,153,157] and those that model dynamics in continuous spaces [41].

4.2.1. Discrete Spatiotemporal Dynamics Modeling

Among the methods that model spatiotemporal dynamics on discrete cell states, recent work includes Moscot [36], DeST-OT [157], and Spateo [153]. These approaches employ fused Gromov–Wasserstein optimal transport [158,159] as the main tool, which was first used by PASTE [156] to align adjacent 2D slices to reconstruct the 3D structure of the tissue.

Fused Gromov–Wasserstein Optimal Transport We consider two adjacent unpaired slices

(X, Z)

and

(X^{'}, Z^{'})

with spots (or cell) numbers N and M, where

X \in R^{N \times G}

,

X^{'} \in R^{M \times G}

are the gene expression of the two slices, and

Z \in R^{N \times 2}

and

Z^{'} \in R^{M \times 2}

are the spatial coordinates. In addition, the spatial coordinates of each slice can be converted into a distance matrix

D \in R_{+}^{N \times N}

, where

d_{i j} = {∥ z_{i} - z_{j} ∥}_{2}

. The fused Gromov–Wasserstein optimal transport problem reads as follows:

\begin{matrix} min_{π \in Π (ν_{0}, ν_{1})} (1 - α) \sum_{i, j} c_{i, j} π_{i, j} + α \sum_{i, j, k, l} {(d_{i, k} - d_{j, l}^{'})}^{2} π_{i, j} π_{k, l}, \end{matrix}

(24)

where

π, Π (ν_{0}, ν_{1})

and c have the same meaning as before, and

α

is a hyperparameter that weighs the importance of gene expression and spatial location. The fused Gromov–Wasserstein optimal transport (FGW-OT) problems can also be solved by calling the POT (version 0.9.5) package [92].

Generalized Weighted Procrustes Problem When the mapping

π

is found, in order to unify the spatial coordinates of adjacent slices into the same coordinate system by a rigid body transformation (rotation and translation), we need to solve a generalized weighted Procrustes problem. Formally,

\hat{R}, \hat{r} = \underset{\begin{matrix} R \in R^{2 \times 2}, r \in R^{2} \\ R^{T} R = I, det R = 1 \end{matrix}}{arg min} \sum_{i, j} π_{i j} {∥z_{i} - (R z_{j}^{'} + r)∥}_{2}^{2},

(25)

where

R

is the rotation matrix and

r

is the translation vector.

Applications in ST Data Moscot [36] extends FGW-OT to model slices of adjacent time points by adding the entropy regularization mentioned in Equation (8) and the unbalanced settings mentioned in Equations (9) and (10). Formally,

\begin{matrix} min_{π \in R_{+}^{N \times M}} & (1 - α) \sum_{i, j} c_{i, j} π_{i, j} + α \sum_{i, j, k, l} {(d_{i, k} - d_{j, l}^{'})}^{2} π_{i, j} π_{k, l} \\ + τ_{1} KL (π 1_{M} | | ν_{0}) + τ_{2} KL (π^{T} 1_{N} | | ν_{1}) - ε H (π), \end{matrix}

where

τ_{1}, τ_{2}, ε

and

H (π)

have the same meaning as before and

ν_{0}

is calculated from a pre-selected gene set according to Equation (10). Note that when we talk about ST data at different time points, the spatial coordinates can not only be 2D but can also be reconstructed in 3D. We use

d_{spa}

to refer to the dimension of the spatial coordinates.

DeST-OT [157] designs methods that enable simultaneous inference of cell growth rate and mapping from data. The DeST-OT optimization problem with the semi-relaxed constraints and entropic regularization is

\begin{matrix} min & E_{DeST - OT} (π) + τ_{1} KL (π 1_{M} | | ν_{0}) - ε H (π) \\ s . t . & π^{T} 1_{N} = ν_{1}, π \in R_{+}^{N \times M}, \end{matrix}

where

E_{DeST - OT} (π)

includes the Wasserstein OT term and another term

E^{M} (π)

related to growth and GW-OT, that is,

E_{DeST - OT} : = (1 - α) \sum_{i, j^{'}} c_{i, j^{'}} π_{i, j^{'}} + α E^{M} (π) .

E^{M} (π)

is defined as

\begin{matrix} E^{M} (π) : = \frac{1}{2} & (\sum_{i, j^{'}, k^{'}} π_{i j^{'}} π_{i k^{'}} M_{j^{'} k^{'}}^{' 2} + \sum_{i, j, k^{'}} π_{i k^{'}} π_{j k^{'}} M_{i j}^{2} \\ + \sum_{i, j^{'}, k, l^{'}} {(M_{i k} - M_{j^{'} l^{'}}^{'})}^{2} π_{i j^{'}} π_{k l^{'}}), \end{matrix}

where

M = D^{spa} ⊙ D^{\exp}

measures the distance between two cells in the same slice, and

D^{spa}

and

D^{\exp}

are distance matrices constructed on each slice according to spatial coordinates

S

and gene expression

X

, respectively. That is,

d_{i, j}^{spa} = {∥ z_{i} - z_{j} ∥}_{2}

and

d_{i, j}^{\exp} = {∥ x_{i} - x_{j} ∥}_{2}

. In addition, the first and second terms in

E^{M} (π)

promote the proximity of different descendants of a cell at the previous moment and the proximity of different ancestors of a cell at the later moment, respectively, and the third term is the usual GW term that only replaces the spatial distance matrix

D

with the

M

distance matrix. In DeST-OT, the authors define the growth vector

ξ = π 1_{M} - ν_{0}

and the growth rate

g = log (1 + N ξ) / (t_{1} - t_{0})

.

Spateo [153] uses maps

π

obtained by other OT-based methods to unify spatial coordinates at different times into the reference coordinate system by solving the generalized weighted Procrustes problem in Equation (25) (possibly in 3D). Next, Spateo selects the spatial coordinates of the cell with the most weight at the late time point mapped from each cell at the early time point as its future state, formally

z_{i, future} = z_{arg max π_{i, :}}^{'},

where

π_{i, :}

refers to the i row of

π

, that is, the weight of the i cell mapped from the early time point to each cell at the late time point. When the future spatial coordinates of each cell are determined, we can define the spatial velocity of each cell

v_{i}^{spa} = z_{i, future} - z_{i} .

Finally, Spateo recovers a continuous spatial velocity field

\frac{d z}{d t} = f (z),

from the spatial velocity of each cell, allowing for a series of differential geometry analyses, including divergence, acceleration, curvature, and torsion.

4.2.2. Spatiotemporal Dynamics Modeling

The majority of current approaches model spatial coordinates based on Gromov–Wasserstein OT, which has no dynamic form. Recently, stVCR [41] proposes to model spatial coordinates using rigid-body transformation invariant OT, as well as using the widely used Wasserstein OT for modeling gene expression and unbalanced OT for modeling cellular proliferation. Next, stVCR integrates all modules into dynamic forms, making it possible to reconstruct dynamic continuous trajectories of cell differentiation, migration, and proliferation simultaneously.

Rigid body transformation invariant optimal transport The method in [160] considers the optimal transport problem invariant to a given set of manipulations

G

. It simultaneously searches for the optimal mapping

π

and the optimal transformation g through the optimization problem:

(π^{★}, g^{★}) = \underset{π \in Π (ν_{0}, ν_{1}), g \in G}{arg min} 〈 c (g), π 〉 \overset{def .}{=} \sum_{i, j} π_{i, j} d (z_{i}, g (Z_{j}^{'})) .

(26)

Solving problem (26) directly is difficult, and it can be solved iteratively by

\begin{matrix} π^{(n)} & = \underset{π \in Π (ν_{0}, ν_{1})}{arg min} \sum_{i, j} π_{i j} d (z_{i}, g^{(n)} (Z_{j}^{'})), \end{matrix}

(27)

\begin{matrix} g^{(n + 1)} & = \underset{g \in G}{arg min} \sum_{i, j} π_{i j}^{(n)} d (z_{i}, g (Z_{j}^{'})) . \end{matrix}

(28)

The subproblem (27) is to solve a static OT. In addition, when we choose the set

G

as the set of rigid body transformations, we call this problem rigid body transformation invariant optimal transport. At this point, subproblem (28) is the generalized weighted Procrustes problem mentioned before.

Consider the ST data

(Z^{(0 : K)}, X^{(0 : K)})

at

t_{0}, t_{1} \dots t_{K}

totaling K time points, and the number of cells in each observation is

n_{0}, n_{1} \dots n_{K}

. stVCR uses the spatial coordinate system of the data at

t_{0}

as a reference, and searches for the optimal dynamics and the optimal rigid-body transformation

(r_{1 : k}, R_{1 : k})

by interpolating the empirical probability distributions of the data after the rigid-body transformation

{\hat{p}}^{k}

and the number of cells

n_{k}

using a transport-with-growth partial differential equation (PDE)

\partial_{t} p_{t} (z, x) + \nabla \cdot ((v_{t} (z, x), b_{t} (z, x)) p_{t} (z, x)) = g_{t} (z, x) p_{t} (z, x),

(29)

where

v_{t} (z, x)

is cell spatial migration velocity,

b_{t} (z, x)

is RNA velocity and

g_{t} (z, x)

is cell growth rate. Thus, the feasible state space

S

for the arguments under constraints is

\begin{matrix} S ( & Z^{(0 : K)}, X^{(0 : K)}) : = {(p_{t}, v_{t}, b_{t}, g_{t}; R_{1 : K}, r_{1 : K}) | \partial_{t} p_{t} + \nabla \cdot ((v_{t}, b_{t}) p_{t}) = g_{t} p_{t}, \\ p_{t_{0}} = p^{(0)}, {∥ p_{t_{k}} ∥}_{1} = n_{k} / n_{0}, {\bar{p}}_{t_{k}} = p^{(k)}, R_{k}^{T} R_{k} = I, det R_{k} = 1, k = 1, 2, \dots, K}, \end{matrix}

(30)

where

∥ p_{t} ∥_{1} : = \int p_{t} d z d x

is the total mass of

p_{t}

and

{\bar{p}}_{t} : = p_{t} / {∥ p_{t} ∥}_{1}

. stVCR finds optimal dynamics

(p_{t}, v_{t}, b_{t}, g_{t})

and optimal rigid-body transformations

(R_{1 : K}, r_{1 : K})

by minimizing the Wasserstein–Fisher–Rao (WFR) distance

\int_{t_{0}}^{t_{K}} \int_{R^{G + d_{spa}}} ({∥v_{t}∥}^{2} + α_{Exp} {∥b_{t}∥}^{2} + α_{Gro} g_{t}^{2}) p_{t} (z, x) d z d x d t

(31)

for

(p_{t}, v_{t}, b_{t}, g_{t}; R_{1 : K}, r_{1 : K}) \in S (Z^{(0 : K)}, X^{(0 : K)})

. According to the direct derivation of the solution of the Feynman–Kac type PDE (29) by characteristics, Equation (31) has a dimensionally independent form

\begin{matrix} L_{Dyn} = E_{(z^{(t_{0})}, x^{(t_{0})}) \sim p^{(0)}} \int_{t_{0}}^{t_{K}} ( & ∥ v_{t} (z^{(t)}, x^{(t)}) ∥^{2} + α_{Exp} {∥ b_{t} (z^{(t)}, x^{(t)}) ∥}^{2} \\ + α_{Gro} {∥ g_{t} (z^{(t)}, x^{(t)}) ∥}^{2}) w_{t} [z, x] d t, \end{matrix}

(32)

where

z^{(t)}, x^{(t)}, w_{t} [z, x]

satisfies the characteristic ordinary differential equations (ODEs)

\begin{matrix} \frac{d z^{(t)}}{d t} = v_{t} (z^{(t)}, x^{(t)}), \frac{d x^{(t)}}{d t} & = b_{t} (z^{(t)}, x^{(t)}), \frac{d ln w_{t}}{d t} = g_{t} (z^{(t)}, x^{(t)}), \\ (z^{(t)}, x^{(t)}, w_{t}) |_{t = t_{0}} & = (z^{(t_{0})}, x^{(t_{0})}, 1) . \end{matrix}

(33)

stVCR implemented the constraints

∥ p_{t_{k}} ∥_{1} = n_{k} / n_{0}

and

{\bar{p}}_{t_{k}} = {\hat{p}}^{(k)}

in Equation (30) as soft penalties by performing distribution matching

L_{Mch} = \sum_{k = 1}^{K} {(W_{2} ({\bar{p}}_{t_{k}}, {\hat{p}}^{(k)}))}^{2} + κ_{Gro} \sum_{k = 1}^{K} \frac{| \sum_{j = 1}^{n_{0}} w_{t_{k}, j} - n_{k} |}{n_{k}},

(34)

where the second term promotes a reduction in the relative error of the total mass, and the first term is the 2-Wasserstein distance between the normalized distribution corresponding to the dynamics

{\bar{p}}_{t_{k}}

and the probability distribution of the observed data after rigid body transformation

{\hat{p}}^{(k)}

, where the cost function is defined as

c_{i j}^{(k)} = κ_{Exp} ∥ x_{i}^{(t_{k})} - {\hat{x}}_{j}^{(k)} ∥_{2}^{2} + (1 - κ_{Exp}) {∥ z_{i}^{(t_{k})} - {\hat{z}}_{j}^{(k)} ∥}_{2}^{2}, i = 1 : n_{0}, j = 1 : n_{k},

where

κ_{Exp}

weighs the importance of gene expression and spatial coordinates in distribution matching. In addition, for annotated data, stVCR achieves modeling of known type transitions by modifying

L_{Mch}

, and spatial structure preservation for specified organs or tissues by adding an optional objective function

L_{SSP}^{opt}

.

In summary, the loss function of stVCR contains two required items and one optional item

L = L_{Dyn} + λ_{Mch} L_{Mch} + λ_{SSP} L_{SSP}^{opt} .

(35)

In practice, stVCR parameterizes the dynamics

v_{t} (z^{(t)}, x^{(t)})

,

b_{t} (z^{(t)}, x^{(t)})

and

g_{t} (z^{(t)}, x^{(t)})

into neural networks as well as parameterizes the rotation matrix

R_{1 : K}

into rotation angles

α_{1 : K}

(or Euler angles

(α_{1 : K}, β_{1 : K}, γ_{1 : K})

in 3D) and solves iteratively using back-propagation algorithm.

5. Extensions, Challenges, and Future Directions

Recent advancements in single-cell transcriptomics, spatial transcriptomics, and computational modeling have significantly improved our ability to reconstruct cellular dynamics. However, several outstanding challenges remain, particularly in integrating different discrete and continuous models, handling the complexity of single-cell dynamics, and ensuring the biological interpretability of inferred dynamical systems. This section discusses key areas for further development, focusing on new methodologies that combine discrete and continuous modeling approaches, the construction of comprehensive dynamical frameworks, and the broader applications for modeling cellular fate decisions.

5.1. Bridging Discrete and Continuous Dynamics Modeling

One interesting topic to explore is building connections between discrete dynamic models (e.g., Markov chain) with continuous differential equations when dealing with scRNA-seq data. In CellRank [62,63], the output from continuous models could help to refine the random walk on the data point cloud by introducing various kernels. Let the spliced RNA counts of cells i and j be

s_{i}

and

s_{j}

, respectively. The Gaussian Kernel is defined as

d_{g} (s_{i}, s_{j}) = exp (- \frac{∥ s_{i} - s_{j} ∥^{2}}{σ^{2}}),

Let the position vector from cell i to cell j be

δ_{i j} = s_{j} - s_{i}

and

v_{i}

denotes the estimated RNA velocity of cell i. Then, the three velocity kernels can be introduced as

Cosine Kernel: $v_{cos} (s_{i}, s_{j}) = g (cos (δ_{i j}, v_{i}))$ ,
Correlation Kernel: $v_{corr} (s_{i}, s_{j}) = g (corr (δ_{i j}, v_{i}))$ ,
Inner Product Kernel: $v_{ip} (s_{i}, s_{j}) = g (δ_{i j}^{T} v_{i})$ .

Here, g is a bounded, positive, monotonic increasing function such as an exponential function. In CellRank [62], the actual transition kernel can combine these two parts either by weighted summation or multiplication. For example, the kernel used in the original RNA velocity method is

Ker (s_{i}, s_{j}) = λ v_{cos} (s_{i}, s_{j}) + (1 - λ) d_{g} (s_{i}, s_{j}),

where

λ

is a weighting coefficient. The Markov chain transition matrix is constructed as

p_{i j} = \frac{Ker (s_{i}, s_{j})}{\sum_{j} Ker (s_{i}, s_{j})} .

CellRank2 [63] provides more flexible options for the transition kernel and incorporates prior knowledge. For example, if pseudotime is known in advance, it can be used to adjust the transition kernel

{Ker}_{adj} (s_{i}, s_{j}) = Ker (s_{i}, s_{j}) f (Δ t_{i j}),

where

f (Δ t) = \{\begin{matrix} \frac{2}{\sqrt{1 + exp (b Δ t)}} & Δ t < 0, \\ 1 & Δ t \geq 0 . \end{matrix}

Additionally, a unified transition kernel can be constructed for multi-time point data. For data at different time points, a transport map such as optimal transport (OT) can be used to define

π_{t_{j}, t_{j + 1}}

. By placing the same time point data on the diagonal of a global transition matrix and the transport map between different time points on the off-diagonal, a global transition matrix T can be obtained, enabling the construction of a Markov chain across different time points.

Another direction is to analyze the theoretical convergence of discrete dynamics as the number of data points tends to infinity. A well-known example is the study of the continuum limit of diffusion map random walk [161], stating that the random walk induced by the Gaussian kernel would converge to the dynamics of the Fokker–Planck equation. When considering growth, the directed random walk defined by PBA would converge to (3). Interestingly, ref. [61] also proves that the coarse-grained transition probabilities yield the continuum limit of transition rate among attractors (6), therefore validating the rationale of MuTrans.

Once such a theoretical connection is built, new theoretical insights could be drawn toward the algorithm design. For instance, ref. [44] systematically investigates the continuous limit of RNA-velocity-induced random walk kernels. For example, if the transition kernel is

Ker (s_{i}, s_{j}) = d_{g} (s_{i}, s_{j}) \cdot v_{cos} (s_{i}, s_{j}),

the corresponding ODE yields the desired streamlined equation that correctly reveals the vector field directionality

\frac{d x}{d t} = \frac{v}{∥ v ∥} .

Meanwhile, for

Ker (s_{i}, s_{j}) = d_{g} (s_{i}, s_{j}) \cdot v_{corr} (s_{i}, s_{j})

, the corresponding ODE is

\frac{d x}{d t} = \frac{P_{1} v}{∥ P_{1} v ∥},

where

P

is the projection operator defined as

P_{n} x = (I - \hat{n} \otimes \hat{n}) \cdot x, \hat{n} = \frac{n}{∥ n ∥}, 1 = {(1, 1, \dots, 1)}^{T} .

which indicates that the correlation kernel might alter both the direction and magnitude of RNA velocity in the continuous limit.

Some approaches also use discrete graphs to represent the geometric structure of data [162]. Graphdynamo [163] and Graphvelo [164] propose a method that leverages geometric structure to correct RNA velocity. They assume that the cell data points

x_{i}

lie on a low-dimensional manifold embedded in a high-dimensional space (in classical mechanics, this is known as the “configuration manifold”) and that each cell’s RNA velocity vector lies in the tangent space

T_{x} M

at the point

x_{i}

on the manifold. Let

δ_{i j}

denote the displacement vector from cell i to its neighboring cell j. With a sufficient number of such

δ_{i j}

, one can construct a non-orthogonal normalized basis for

T_{x} M

. Thus, the RNA velocity vector in

T_{x} M

can be expressed as

v_{‖} (x_{i}) = \sum_{j \in N_{i}} ϕ_{i j} δ_{i j}

The coefficients of the linear combination,

ϕ_{i} = {ϕ_{i j} ∣ j \in N_{i}}

, are determined by minimizing the following loss:

L (ϕ_{i}) = ∥ v_{i} - v_{‖} (x_{i}) ∥^{2} - b cos (ϕ_{i}, ϕ_{i}^{corr}) + λ {∥ ϕ_{i} ∥}^{2}

Here,

ϕ_{i}^{corr}

denotes the transition probabilities provided by the Cosine Kernel, and the last term is a regularization term. Thus,

v_{‖}

serves as a geometry-aware correlation of

v_{i}

, ensuring greater coherence with the underlying manifold structure.

Furthermore, the population dynamics in the feature space can be transferred to the dynamics on the graph. The unbalanced Fokker–Planck Equation (3) could be generalized to a graph, such that the mass evolution at node i is given by the following equation:

\frac{d p_{i}}{d t} = - \frac{1}{2} \sum_{j \neq i} (p_{i} ϕ_{i j} - p_{j} ϕ_{i j}) + \sum_{j \neq i} \frac{D_{i j}}{{| e_{i j} |}^{2}} (p_{j} - p_{i}) + g_{i} p_{i}

5.2. Modeling Cell–Cell Interaction Dynamically

For temporally resolved single-cell RNA-seq data, previous modeling approaches have typically been developed in the continuous space of

R^{d}

. However, the data inherently exist within a discrete space. In addition, incorporating cell–cell communication or interaction into these dynamics are important for constructing accurate spatiotemporal developmental landscapes and for advancing our understanding of complex biological systems [165,166,167,168,169,170,171]. Therefore, an interesting question is how to construct continuous cellular dynamics from a discrete space of interacting cells, e.g., those represented by a graph [172].

In [83], it proposes GraphFP, a graph Fokker–Planck equation-based method to model cellular dynamics by explicitly considering cell interactions. Assume that data can be clustered or annotated into M cell types, GraphFP constructs a cell state transition graph

G = (V, E)

, where each vertex in V represents a cell type and each edge

{i, j}

in E means the cell type i can transit to cell type j. Unlike other methods considering probability distribution in

R^{d}

, GraphFP consider the probability distribution on graph G. Suppose there are M vertices in graph G, consider the probability simplex supported on all vertices of G

P (G) = {p (t) = {(p_{i} (t))}_{i = 1}^{M} ∣ \sum_{i = 1}^{M} p_{i} (t) = 1, p_{i} (t) \geq 0} .

The aim is also to transport the distribution from

p_{0}

to

p_{1}

on G, satisfying least action principles. Similar to the continuous space case, one need to define the Fokker–Planck equation and the Wasserstein distance on graph G. First, one can define a free energy

F : P (G) \to R

, then the Fokker–Planck equation can be defined as follows:

\frac{d p_{i} (t)}{d t} = \sum_{j \in N (i)} (\frac{\partial F (p)}{\partial p_{j}} - \frac{\partial F (p)}{\partial p_{i}}) g_{i j} (p),

where

N (i)

is the neighbor set of vertex i and

g_{i j} (p)

satisfy certain constraints that could constructed from

p

[83]. Next, the discrete L2-Wasserstein distance on graph G between

p_{0}, p_{1} \in P (G)

can be defined as

W_{2, G}^{2} (p_{0}, p_{1}) = inf_{F} \frac{1}{2} \int_{0}^{1} \sum_{i, j \in E} {(\frac{\partial F (p)}{\partial p_{j}} - \frac{\partial F (p)}{\partial p_{i}})}^{2} g_{i j} (p) .

Next, the target is to find the minimum energy path. In [83], they parameterize

F

by a linear energy form

\begin{matrix} F (p ∣ Φ, W) & = V (p) + W (p) + β H (p), \\ = \sum_{i = 1}^{n} Φ_{i} p_{i} + \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} w_{i j} p_{i} p_{j} + β \sum_{i = 1}^{n} p_{i} log p_{i}, \\ = p^{T} Φ + \frac{1}{2} p^{T} W p + β \sum_{i = 1}^{n} p_{i} log p_{i}, \end{matrix}

where

Φ = {(Φ_{i})}_{i = 1}^{M}

,

W = {(w_{i, j})}_{1 \leq i, j \leq M}

represents the interactions among cell types, and

β \geq 0

is a hyper-parameter. After this parametrization, one denotes the parameters of the free energy as

θ = {Φ, W}

, and the goal is to find the parameter

θ

such that

\begin{matrix} θ^{*} = arg min_{θ} \int_{t_{1}}^{t_{f}} \frac{1}{2} \sum_{{i, j} \in E} {(\frac{\partial F (p)}{\partial p_{i}} - \frac{\partial F (p)}{\partial p_{j}})}^{2} \cdot g_{i j} (p (t)) d t, \end{matrix}

subject to the constraints

\begin{matrix} \frac{d p (t)}{d t} & = {(\sum_{j \in N (i)} (\frac{\partial F (p)}{\partial p_{j}} - \frac{\partial F (p)}{\partial p_{i}}) g_{i j} (p (t)))}_{i = 1}^{M} \\ p (1) & = p_{1} \end{matrix}

This problem can be solved by the adjoint method. Once these dynamics are solved, one can then use them for downstream analysis, e.g., cell–cell interaction, probability flow of cell types, and the potential energy [83].

Looking ahead, we anticipate that important future directions based on GraphFP include the expansion of the current framework to achieve single-cell resolution instead of cellular types, incorporating the matching of unnormalized distribution results from cell proliferation and death, and extending the model to spatial transcriptomics.

5.3. Reconstructing Waddington Developmental Landscapes

Waddington’s landscape metaphor is a widely recognized framework to depict the cell fate decision process. This conceptual model suggests that metastable cellular states are analogous to wells within a potential landscape, and transitions between these states can be understood as movements or “hops” between these potential wells. While the development of such potential landscapes has been extensively explored [4,64,149,173,174,175,176,177,178,179,180,181,182], effectively constructing these landscapes using single-cell omics data remains the major challenge. In recent works [183,184], the authors utilize RNA velocity to construct a vector field from the snapshot scRNA seq data and then compute the potential landscape based on the Boltzmann distribution-like relations proposed by Wang et al. [185]. To be precise, the landscape is characterized by the expression

U = - σ^{2} log p_{ss} / 2

where

p_{ss}

represents the steady-state probability density function (PDF) that satisfies the steady-state Fokker–Planck equation:

- \nabla \cdot (p_{s s} b) + \frac{σ^{2}}{2} Δ p_{ss} = g p_{ss}

. For the temporal scRNA-seq data, following [40,136], it enables a natural inference of the time-evolving potential energy landscape by leveraging the learned log-density function. Specifically, one can define the landscape at time t as

U (x, t) = - \frac{σ^{2} (t)}{2} log p (x, t) .

Regions of lower energy correspond to more stable cell fates, providing a quantitative measure of stability in the cellular state space.

5.4. Challenges and Further Directions

Integrating multiomics data is critical for comprehensively characterizing cell–cell interaction dynamics and regulatory mechanisms [186,187,188,189,190,191,192,193,194,195]. Furthermore, aligning both temporal and spatial scales within time-series spatial transcriptomics data (e.g., using non-rigid or non-linear spatial transformations, and latent space) presents a significant challenge. Additionally, the application of SDEs in modeling spatial transcriptomics data, and subsequently constructing spatiotemporal developmental landscapes, represents an important avenue for further exploration.

6. Discussion and Conclusions

Inferring dynamical processes from high-throughput single-cell sequencing data is a critical problem in understanding cellular development and fate decisions. With the advancements in sequencing technologies, the field has evolved from dynamic inference based on snapshot single-cell RNA sequencing (scRNA-seq) data to inferring dynamics from temporally resolved scRNA-seq data. Moreover, the development of spatial transcriptomics and time-series spatial transcriptomics (ST) data now offers the potential to decode the spatiotemporal developmental trajectories of single cells and construct their spatiotemporal dynamics. In this review, we have focused on dissecting biological data through the lens of dynamical systems models, specifically investigating how various kinds of models can be applied to study cellular development and fate decisions.

When presenting existing approaches, we chose various types of dynamical systems modeling approaches as the main focus, providing a systematic overview of their applications across different contexts. Specifically, we examine the utility and limitations of dynamic modeling techniques in four distinct data scenarios: (1) single time-point scRNA-seq data, (2) multi time-point scRNA-seq data, (3) single time-point spatial transcriptomics data, and (4) multi time-point spatial transcriptomics data, i.e., spatiotemporal single-cell data. For each data type, we explore how different modeling paradigms—ranging from discrete Markov chain models to continuous Ordinary Differential Equations (ODEs), Stochastic Differential Equations (SDEs), and Partial Differential Equations (PDEs)—can be used to study the underlying biological processes embedded in the snapshot and high-dimensional nature of sequencing data.

For static single snapshot scRNA-seq data, where explicit temporal information is unavailable, top-down discrete-state models such as Markov chains have been widely adopted to infer latent cell state transitions and developmental trajectories. We also address how bottom-up mechanism models like RNA velocity methods provide insights into modeling complex state transition dynamics. When explicit temporal resolution is introduced as in time-series scRNA-seq datasets, continuous dynamical models become more suitable. Incorporating the dynamical optimal transport (OT) theoretical framework, the Fokker–Planck PDE-based models allow us to track the evolution of cellular states over time. The integration of spatial transcriptomics adds another dimension to the modeling challenge. In the case of multi-time-point spatial transcriptomics, we review emerging approaches that combine dynamical OT and geometric transformation to simultaneously account for trajectory inference and batch correction across time points.

Beyond summarizing the mathematical aspects of existing models, we also provide a forward-looking perspective on potential future directions. For instance, integrating both discrete and continuous models, as well as cellular interaction effects, could provide new insights to handle more realistic dynamics with increasing biological interpretability. Additionally, the use of multi-modal omics could also enhance the resolution of dynamical models. To sum up, by examining the intersection of dynamical systems theory and single-cell data modeling, this review provides conceptual and methodological insights that may inspire the development of novel algorithms to dissect the spatiotemporal dynamics underlying single-cell sequencing data.

Due to the limited scope of the current review, several important aspects of the dynamical models of scRNA-seq have not been discussed thoroughly here and remain for further exploration. Firstly, lineage tracing plays a critical role in understanding cellular history and developmental trajectories, and integrating such data into trajectory inference could provide deeper insights into cellular fate decisions [116,196,197]. Secondly, incorporating gene regulatory networks (GRNs) [198,199,200,201,202,203,204] into spatiotemporal trajectory inference is an exciting avenue for future research, as it could enhance the understanding of the regulatory mechanisms driving cellular transitions. Lastly, the concepts of dynamic network biomarkers (DNBs) and critical transitions [205,206,207,208] are promising for understanding cellular fate shifts, particularly in disease progression and cellular development.

Overall, this review demonstrates how dynamic modeling approaches can provide insight into the underlying biological processes underlying single-cell transcriptomics, spatial transcriptomics, and their temporal extensions. In the future, these techniques, when combined with machine learning and other computational advancements, will enable more comprehensive models of cellular dynamics, promising new therapeutic strategies and a deeper understanding of development, disease, and tissue regeneration.

Author Contributions

Conceptualization, Z.Z., P.Z. and T.L.; investigation, Z.Z., Y.S., Q.P., P.Z. and T.L.; writing—original draft preparation, Z.Z., Y.S., Q.P. and P.Z.; writing—review and editing, Z.Z., Y.S., Q.P., P.Z. and T.L.; visualization, Z.Z., Y.S., Q.P. and P.Z.; supervision, P.Z. and T.L.; funding acquisition, P.Z. and T.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key R&D Program of China (No. 2021YFA1003301 to T.L.) and National Natural Science Foundation of China (NSFC No. 12288101 to T.L. & P.Z., and 8206100646, T2321001 to P.Z.).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

scRNA-seq	single-cell RNA sequencing
SDE	Stochastic Differential Equation
ODE	Ordinary Differential Equation
PDE	Partial Differential Equation
OT	Optimal Transport
ICA	Independent Component Analysis
PCA	Principle Component Analysis
MST	Minimum Spanning Tree
MCE	Markov Chain Entropy
GPCCA	Generalized Perron Cluster Cluster Analysis
EM	Expectation Maximum
VAE	Variational Autoencoder
RKHS	Reproducing Kernel Hilbert Space
CNF	Continuous Normalizing Flow
FM	Flow Matching
CFM	Conditional Flow Matching
SB	Schrödinger Bridge
RUOT	Regularized Unbalanced Optimal Transport
GWOT	Gromov–Wasserstein optimal transport
FGWOT	Fused Gromov–Wasserstein optimal transport

References

Lei, J. Mathematical modeling of heterogeneous stem cell regeneration: From cell division to Waddington’s epigenetic landscape. arXiv 2023, arXiv:2309.08064. [Google Scholar]
Hong, T.; Xing, J. Data-and theory-driven approaches for understanding paths of epithelial–mesenchymal transition. Genesis 2024, 62, e23591. [Google Scholar] [CrossRef] [PubMed]
Xing, J. Reconstructing data-driven governing equations for cell phenotypic transitions: Integration of data science and systems biology. Phys. Biol. 2022, 19, 061001. [Google Scholar] [CrossRef] [PubMed]
Schiebinger, G. Reconstructing developmental landscapes and trajectories from single-cell data. Curr. Opin. Syst. Biol. 2021, 27, 100351. [Google Scholar] [CrossRef]
Heitz, M.; Ma, Y.; Kubal, S.; Schiebinger, G. Spatial Transcriptomics Brings New Challenges and Opportunities for Trajectory Inference. Annu. Rev. Biomed. Data Sci. 2024. [Google Scholar] [CrossRef]
Waddington, C.H. The Strategy of the Genes; Routledge: London, UK, 2014. [Google Scholar]
Moris, N.; Pina, C.; Arias, A.M. Transition states and cell fate decisions in epigenetic landscapes. Nat. Rev. Genet. 2016, 17, 693–703. [Google Scholar] [CrossRef]
MacLean, A.L.; Hong, T.; Nie, Q. Exploring intermediate cell states through the lens of single cells. Curr. Opin. Syst. Biol. 2018, 9, 32–41. [Google Scholar] [CrossRef]
Ziegenhain, C.; Vieth, B.; Parekh, S.; Reinius, B.; Guillaumet-Adkins, A.; Smets, M.; Leonhardt, H.; Heyn, H.; Hellmann, I.; Enard, W. Comparative analysis of single-cell RNA sequencing methods. Mol. Cell 2017, 65, 631–643. [Google Scholar] [CrossRef]
Tang, F.; Barbacioru, C.; Wang, Y.; Nordman, E.; Lee, C.; Xu, N.; Wang, X.; Bodeau, J.; Tuch, B.B.; Siddiqui, A.; et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat. Methods 2009, 6, 377–382. [Google Scholar] [CrossRef]
Stark, R.; Grzelak, M.; Hadfield, J. RNA sequencing: The teenage years. Nat. Rev. Genet. 2019, 20, 631–656. [Google Scholar] [CrossRef]
Ding, J.; Sharon, N.; Bar-Joseph, Z. Temporal modelling using single-cell transcriptomics. Nat. Rev. Genet. 2022, 23, 355–368. [Google Scholar] [CrossRef] [PubMed]
Bunne, C.; Schiebinger, G.; Krause, A.; Regev, A.; Cuturi, M. Optimal transport for single-cell and spatial omics. Nat. Rev. Methods Prim. 2024, 4, 58. [Google Scholar] [CrossRef]
Ståhl, P.L.; Salmén, F.; Vickovic, S.; Lundmark, A.; Navarro, J.F.; Magnusson, J.; Giacomello, S.; Asp, M.; Westholm, J.O.; Huss, M.; et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 2016, 353, 78–82. [Google Scholar] [CrossRef] [PubMed]
Rodriques, S.G.; Stickels, R.R.; Goeva, A.; Martin, C.A.; Murray, E.; Vanderburg, C.R.; Welch, J.; Chen, L.M.; Chen, F.; Macosko, E.Z. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science 2019, 363, 1463–1467. [Google Scholar] [CrossRef]
Stickels, R.R.; Murray, E.; Kumar, P.; Li, J.; Marshall, J.L.; Di Bella, D.J.; Arlotta, P.; Macosko, E.Z.; Chen, F. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 2021, 39, 313–319. [Google Scholar] [CrossRef]
Chen, A.; Liao, S.; Cheng, M.; Ma, K.; Wu, L.; Lai, Y.; Qiu, X.; Yang, J.; Xu, J.; Hao, S.; et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 2022, 185, 1777–1792. [Google Scholar] [CrossRef]
Oliveira, M.F.; Romero, J.P.; Chung, M.; Williams, S.; Gottscho, A.D.; Gupta, A.; Pilipauskas, S.E.; Mohabbat, S.; Raman, N.; Sukovich, D.; et al. Characterization of immune cell populations in the tumor microenvironment of colorectal cancer using high definition spatial profiling. bioRxiv 2024. [Google Scholar] [CrossRef]
Moffitt, J.R.; Bambah-Mukku, D.; Eichhorn, S.W.; Vaughn, E.; Shekhar, K.; Perez, J.D.; Rubinstein, N.D.; Hao, J.; Regev, A.; Dulac, C.; et al. Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science 2018, 362, eaau5324. [Google Scholar] [CrossRef]
Eng, C.H.L.; Lawson, M.; Zhu, Q.; Dries, R.; Koulena, N.; Takei, Y.; Yun, J.; Cronin, C.; Karp, C.; Yuan, G.C.; et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature 2019, 568, 235–239. [Google Scholar] [CrossRef]
Wang, X.; Allen, W.E.; Wright, M.A.; Sylwestrak, E.L.; Samusik, N.; Vesuna, S.; Evans, K.; Liu, C.; Ramakrishnan, C.; Liu, J.; et al. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science 2018, 361, eaat5691. [Google Scholar]
Liu, L.; Chen, A.; Li, Y.; Mulder, J.; Heyn, H.; Xu, X. Spatiotemporal omics for biology and medicine. Cell 2024, 187, 4488–4519. [Google Scholar] [CrossRef] [PubMed]
Qiu, X.; Mao, Q.; Tang, Y.; Wang, L.; Chawla, R.; Pliner, H.A.; Trapnell, C. Reversed graph embedding resolves complex single-cell trajectories. Nat. Methods 2017, 14, 979–982. [Google Scholar] [CrossRef] [PubMed]
Cao, J.; Spielmann, M.; Qiu, X.; Huang, X.; Ibrahim, D.M.; Hill, A.J.; Zhang, F.; Mundlos, S.; Christiansen, L.; Steemers, F.J.; et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature 2019, 566, 496–502. [Google Scholar] [CrossRef]
Street, K.; Risso, D.; Fletcher, R.B.; Das, D.; Ngai, J.; Yosef, N.; Purdom, E.; Dudoit, S. Slingshot: Cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genom. 2018, 19, 1–16. [Google Scholar] [CrossRef]
La Manno, G.; Soldatov, R.; Zeisel, A.; Braun, E.; Hochgerner, H.; Petukhov, V.; Lidschreiber, K.; Kastriti, M.E.; Lönnerberg, P.; Furlan, A.; et al. RNA velocity of single cells. Nature 2018, 560, 494–498. [Google Scholar] [CrossRef]
Bergen, V.; Soldatov, R.A.; Kharchenko, P.V.; Theis, F.J. RNA velocity—current challenges and future perspectives. Mol. Syst. Biol. 2021, 17, e10282. [Google Scholar] [CrossRef]
Wang, K.; Hou, L.; Wang, X.; Zhai, X.; Lu, Z.; Zi, Z.; Zhai, W.; He, X.; Curtis, C.; Zhou, D.; et al. PhyloVelo enhances transcriptomic velocity field mapping using monotonically expressed genes. Nat. Biotechnol. 2024, 42, 778–789. [Google Scholar] [CrossRef]
Liu, Y.; Huang, K.; Chen, W. Resolving cellular dynamics using single-cell temporal transcriptomics. Curr. Opin. Biotechnol. 2024, 85, 103060. [Google Scholar] [CrossRef]
Bergen, V.; Lange, M.; Peidli, S.; Wolf, F.A.; Theis, F.J. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 2020, 38, 1408–1414. [Google Scholar] [CrossRef]
Ho, J.; Jain, A.; Abbeel, P. Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2020; Volume 33, pp. 6840–6851. [Google Scholar]
Sohl-Dickstein, J.; Weiss, E.; Maheswaranathan, N.; Ganguli, S. Deep unsupervised learning using nonequilibrium thermodynamics. In Proceedings of the International Conference on Machine Learning, PMLR, Lille, France, 6–11 July 2015; pp. 2256–2265. [Google Scholar]
Song, Y.; Sohl-Dickstein, J.; Kingma, D.P.; Kumar, A.; Ermon, S.; Poole, B. Score-Based Generative Modeling through Stochastic Differential Equations. In Proceedings of the International Conference on Learning Representations, Vienna, Austria, 4 May 2021. [Google Scholar]
Ren, T.; Zhang, Z.; Li, Z.; Jiang, J.; Qin, S.; Li, G.; Li, Y.; Zheng, Y.; Li, X.; Zhan, M.; et al. Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer. arXiv 2025, arXiv:2502.00639. [Google Scholar]
Schiebinger, G.; Shu, J.; Tabaka, M.; Cleary, B.; Subramanian, V.; Solomon, A.; Gould, J.; Liu, S.; Lin, S.; Berube, P.; et al. Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming. Cell 2019, 176, 928–943. [Google Scholar] [CrossRef] [PubMed]
Klein, D.; Palla, G.; Lange, M.; Klein, M.; Piran, Z.; Gander, M.; Meng-Papaxanthos, L.; Sterr, M.; Saber, L.; Jing, C.; et al. Mapping cells through time and space with moscot. Nature 2025, 638, 1065–1075. [Google Scholar] [CrossRef] [PubMed]
Lipman, Y.; Chen, R.T.Q.; Ben-Hamu, H.; Nickel, M.; Le, M. Flow Matching for Generative Modeling. In Proceedings of the Eleventh International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
Tong, A.; FATRAS, K.; Malkin, N.; Huguet, G.; Zhang, Y.; Rector-Brooks, J.; Wolf, G.; Bengio, Y. Improving and generalizing flow-based generative models with minibatch optimal transport. Trans. Mach. Learn. Res. 2024. Available online: https://openreview.net/forum?id=CD9Snc73AW (accessed on 18 April 2025).
Gentil, I.; Léonard, C.; Ripani, L. About the analogy between optimal transport and minimal entropy. Ann. Fac. Sci. Toulouse Math. 2017, 26, 569–600. [Google Scholar] [CrossRef]
Zhang, Z.; Li, T.; Zhou, P. Learning stochastic dynamics from snapshots through regularized unbalanced optimal transport. In Proceedings of the Thirteenth International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]
Peng, Q.; Zhou, P.; Li, T. stVCR: Reconstructing spatio-temporal dynamics of cell development using optimal transport. bioRxiv 2024. [Google Scholar] [CrossRef]
Wang, L.; Zhang, Q.; Qin, Q.; Trasanidis, N.; Vinyard, M.; Chen, H.; Pinello, L. Current progress and potential opportunities to infer single-cell developmental trajectory and cell fate. Curr. Opin. Syst. Biol. 2021, 26, 1–11. [Google Scholar] [CrossRef]
Saelens, W.; Cannoodt, R.; Todorov, H.; Saeys, Y. A comparison of single-cell trajectory inference methods. Nat. Biotechnol. 2019, 37, 547–554. [Google Scholar] [CrossRef]
Li, T.; Shi, J.; Wu, Y.; Zhou, P. On the mathematics of RNA velocity I: Theoretical analysis. bioRxiv 2020. [Google Scholar] [CrossRef]
Jiang, Q.; Wan, L. Dynamic modeling, optimization, and deep learning for high-dimensional complex biological data. Sci. Sin. Math. 2025, 55, 1–14. [Google Scholar]
Deconinck, L.; Cannoodt, R.; Saelens, W.; Deplancke, B.; Saeys, Y. Recent advances in trajectory inference from single-cell omics data. Curr. Opin. Syst. Biol. 2021, 27, 100344. [Google Scholar] [CrossRef]
Gandrillon, O.; Gaillard, M.; Espinasse, T.; Garnier, N.B.; Dussiau, C.; Kosmider, O.; Sujobert, P. Entropy as a measure of variability and stemness in single-cell transcriptomics. Curr. Opin. Syst. Biol. 2021, 27, 100348. [Google Scholar] [CrossRef]
Grün, D.; Muraro, M.J.; Boisset, J.C.; Wiebrands, K.; Lyubimova, A.; Dharmadhikari, G.; van den Born, M.; Van Es, J.; Jansen, E.; Clevers, H.; et al. De novo prediction of stem cell identity using single-cell transcriptome data. Cell Stem Cell 2016, 19, 266–277. [Google Scholar] [CrossRef] [PubMed]
Guo, M.; Bao, E.L.; Wagner, M.; Whitsett, J.A.; Xu, Y. SLICE: Determining cell differentiation and lineage based on single cell entropy. Nucleic Acids Res. 2017, 45, e54. [Google Scholar] [CrossRef]
Teschendorff, A.E.; Enver, T. Single-cell entropy for accurate estimation of differentiation potency from a cell’s transcriptome. Nat. Commun. 2017, 8, 15599. [Google Scholar] [CrossRef]
Shi, J.; Teschendorff, A.E.; Chen, W.; Chen, L.; Li, T. Quantifying Waddington’s epigenetic landscape: A comparison of single-cell potency measures. Briefings Bioinform. 2020, 21, 248–261. [Google Scholar] [CrossRef]
Jin, S.; MacLean, A.L.; Peng, T.; Nie, Q. scEpath: Energy landscape-based inference of transition probabilities and cellular trajectories from single-cell transcriptomic data. Bioinformatics 2018, 34, 2077–2086. [Google Scholar] [CrossRef]
Liu, J.; Song, Y.; Lei, J. Single-cell entropy to quantify the cellular order parameter from single-cell RNA-seq data. Biophys. Rev. Lett. 2020, 15, 35–49. [Google Scholar] [CrossRef]
Haghverdi, L.; Büttner, M.; Wolf, F.A.; Buettner, F.; Theis, F.J. Diffusion pseudotime robustly reconstructs lineage branching. Nat. Methods 2016, 13, 845–848. [Google Scholar] [CrossRef]
Wolf, F.A.; Hamey, F.K.; Plass, M.; Solana, J.; Dahlin, J.S.; Göttgens, B.; Rajewsky, N.; Simon, L.; Theis, F.J. PAGA: Graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 2019, 20, 59. [Google Scholar] [CrossRef]
Coifman, R.R.; Lafon, S. Diffusion maps. Appl. Comput. Harmon. Anal. 2006, 21, 5–30. [Google Scholar] [CrossRef]
Setty, M.; Kiseliovas, V.; Levine, J.; Gayoso, A.; Mazutis, L.; Pe’Er, D. Characterization of cell fate probabilities in single-cell data with Palantir. Nat. Biotechnol. 2019, 37, 451–460. [Google Scholar] [CrossRef] [PubMed]
Stassen, S.V.; Yip, G.G.; Wong, K.K.; Ho, J.W.; Tsia, K.K. Generalized and scalable trajectory inference in single-cell omics data with VIA. Nat. Commun. 2021, 12, 5528. [Google Scholar] [CrossRef] [PubMed]
Pandey, K.; Zafar, H. Inference of cell state transitions and cell fate plasticity from single-cell with MARGARET. Nucleic Acids Res. 2022, 50, e86. [Google Scholar] [CrossRef] [PubMed]
Weinreb, C.; Wolock, S.; Tusi, B.K.; Socolovsky, M.; Klein, A.M. Fundamental limits on dynamic inference from single-cell snapshots. Proc. Natl. Acad. Sci. USA 2018, 115, E2467–E2476. [Google Scholar] [CrossRef]
Zhou, P.; Wang, S.; Li, T.; Nie, Q. Dissecting transition cells from single-cell transcriptome data through multiscale stochastic dynamics. Nat. Commun. 2021, 12, 5609. [Google Scholar] [CrossRef]
Lange, M.; Bergen, V.; Klein, M.; Setty, M.; Reuter, B.; Bakhti, M.; Lickert, H.; Ansari, M.; Schniering, J.; Schiller, H.B.; et al. CellRank for directed single-cell fate mapping. Nat. Methods 2022, 19, 159–170. [Google Scholar] [CrossRef]
Weiler, P.; Lange, M.; Klein, M.; Pe’er, D.; Theis, F. CellRank 2: Unified fate mapping in multiview single-cell data. Nat. Methods 2024, 21, 1196–1205. [Google Scholar] [CrossRef]
Zhou, P.; Li, T. Construction of the landscape for multi-stable systems: Potential landscape, quasi-potential, A-type integral and beyond. J. Chem. Phys. 2016, 144, 094109. [Google Scholar] [CrossRef]
Vanden-Eijnden, E. Transition-path theory and path-finding algorithms for the study of rare events. Annu. Rev. Phys. Chem. 2010, 61, 391–420. [Google Scholar]
Reuter, B.; Fackeldey, K.; Weber, M. Generalized Markov modeling of nonreversible molecular kinetics. J. Chem. Phys. 2019, 150, 174103. [Google Scholar] [CrossRef]
Gao, M.; Qiao, C.; Huang, Y. UniTVelo: Temporally unified RNA velocity reinforces single-cell trajectory inference. Nat. Commun. 2022, 13, 6586. [Google Scholar] [CrossRef] [PubMed]
Li, J.; Pan, X.; Yuan, Y.; Shen, H.B. TFvelo: Gene regulation inspired RNA velocity estimation. Nat. Commun. 2024, 15, 1387. [Google Scholar] [CrossRef] [PubMed]
Oller-Moreno, S.; Kloiber, K.; Machart, P.; Bonn, S. Algorithmic advances in machine learning for single-cell expression analysis. Curr. Opin. Syst. Biol. 2021, 25, 27–33. [Google Scholar] [CrossRef]
Raimundo, F.; Meng-Papaxanthos, L.; Vallot, C.; Vert, J.P. Machine learning for single-cell genomics data analysis. Curr. Opin. Syst. Biol. 2021, 26, 64–71. [Google Scholar] [CrossRef]
Gu, Y.; Blaauw, D.; Welch, J.D. Bayesian inference of rna velocity from multi-lineage single-cell data. bioRxiv 2022. [Google Scholar] [CrossRef]
Kingma, D.P.; Welling, M. Auto-encoding variational bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar]
Qiao, C.; Huang, Y. Representation learning of RNA velocity reveals robust cell transitions. Proc. Natl. Acad. Sci. USA 2021, 118, e2105859118. [Google Scholar] [CrossRef]
Farrell, S.; Mani, M.; Goyal, S. Inferring single-cell transcriptomic dynamics with structured latent gene expression dynamics. Cell Rep. Methods 2023, 3, 100581. [Google Scholar] [CrossRef]
Gayoso, A.; Weiler, P.; Lotfollahi, M.; Klein, D.; Hong, J.; Streets, A.; Theis, F.J.; Yosef, N. Deep generative modeling of transcriptional dynamics for RNA velocity analysis in single cells. Nat. Methods 2024, 21, 50–59. [Google Scholar] [CrossRef]
Gu, Y.; Blaauw, D.T.; Welch, J. Variational mixtures of ODEs for inferring cellular gene expression dynamics. In Proceedings of the International Conference on Machine Learning. PMLR, Baltimore, MD, USA, 17–23 July 2022; pp. 7887–7901. [Google Scholar]
Cui, H.; Maan, H.; Vladoiu, M.C.; Zhang, J.; Taylor, M.D.; Wang, B. DeepVelo: Deep learning extends RNA velocity to multi-lineage systems with cell-specific kinetics. Genome Biol. 2024, 25, 27. [Google Scholar] [CrossRef]
Li, S.; Zhang, P.; Chen, W.; Ye, L.; Brannan, K.W.; Le, N.T.; Abe, J.i.; Cooke, J.P.; Wang, G. A relay velocity model infers cell-dependent RNA velocity. Nat. Biotechnol. 2024, 42, 99–108. [Google Scholar] [CrossRef]
Qiu, X.; Zhang, Y.; Martin-Rufino, J.D.; Weng, C.; Hosseinzadeh, S.; Yang, D.; Pogson, A.N.; Hein, M.Y.; Min, K.H.J.; Wang, L.; et al. Mapping transcriptomic vector fields of single cells. Cell 2022, 185, 690–711. [Google Scholar] [CrossRef] [PubMed]
Chen, Z.; King, W.C.; Hwang, A.; Gerstein, M.; Zhang, J. DeepVelo: Single-cell transcriptomic deep velocity field learning with neural ordinary differential equations. Sci. Adv. 2022, 8, eabq3745. [Google Scholar] [CrossRef] [PubMed]
Sha, Y.; Qiu, Y.; Zhou, P.; Nie, Q. Reconstructing growth and dynamic trajectories from single-cell transcriptomics data. Nat. Mach. Intell. 2024, 6, 25–39. [Google Scholar] [CrossRef] [PubMed]
Lavenant, H.; Zhang, S.; Kim, Y.H.; Schiebinger, G. Toward a mathematical theory of trajectory inference. Ann. Appl. Probab. 2024, 34, 428–500. [Google Scholar] [CrossRef]
Jiang, Q.; Zhang, S.; Wan, L. Dynamic inference of cell developmental complex energy landscape from time series single-cell transcriptomic data. PLoS Comput. Biol. 2022, 18, e1009821. [Google Scholar] [CrossRef]
Jiang, Q.; Wan, L. A physics-informed neural SDE network for learning cellular dynamics from time-series scRNA-seq data. Bioinformatics 2024, 40, ii120–ii127. [Google Scholar] [CrossRef]
Bunne, C.; Stark, S.G.; Gut, G.; Del Castillo, J.S.; Levesque, M.; Lehmann, K.V.; Pelkmans, L.; Krause, A.; Rätsch, G. Learning single-cell perturbation responses using neural optimal transport. Nat. Methods 2023, 20, 1759–1768. [Google Scholar] [CrossRef]
Tong, A.; Kuchroo, M.; Gupta, S.; Venkat, A.; San Juan, B.P.; Rangel, L.; Zhu, B.; Lock, J.G.; Chaffer, C.L.; Krishnaswamy, S. Learning transcriptional and regulatory dynamics driving cancer cell plasticity using neural ODE-based optimal transport. bioRxiv 2023. [Google Scholar] [CrossRef]
Zhang, S.; Afanassiev, A.; Greenstreet, L.; Matsumoto, T.; Schiebinger, G. Optimal transport analysis reveals trajectories in steady-state systems. PLoS Comput. Biol. 2021, 17, e1009466. [Google Scholar] [CrossRef]
Maddu, S.; Chardès, V.; Shelley, M. Inferring biological processes with intrinsic noise from cross-sectional data. arXiv 2024, arXiv:2410.07501. [Google Scholar]
Eyring, L.; Klein, D.; Uscidda, T.; Palla, G.; Kilbertus, N.; Akata, Z.; Theis, F.J. Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation. In Proceedings of the Twelfth International Conference on Learning Representations, Vienna, Austria, 7–11 May 2024. [Google Scholar]
Zhang, J.; Larschan, E.; Bigness, J.; Singh, R. scNODE: Generative model for temporal single cell transcriptomic data prediction. Bioinformatics 2024, 40, ii146–ii154. [Google Scholar] [CrossRef] [PubMed]
Yeo, G.H.T.; Saksena, S.D.; Gifford, D.K. Generative modeling of single-cell time series with PRESCIENT enables prediction of cell trajectories with interventions. Nat. Commun. 2021, 12, 3222. [Google Scholar] [CrossRef] [PubMed]
Flamary, R.; Courty, N.; Gramfort, A.; Alaya, M.Z.; Boisbunon, A.; Chambon, S.; Chapel, L.; Corenflos, A.; Fatras, K.; Fournier, N.; et al. POT: Python Optimal Transport. J. Mach. Learn. Res. 2021, 22, 1–8. [Google Scholar]
Peyré, G.; Cuturi, M. Computational optimal transport: With applications to data science. Found. Trends® Mach. Learn. 2019, 11, 355–607. [Google Scholar] [CrossRef]
Benamou, J.D.; Brenier, Y. A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem. Numer. Math. 2000, 84, 375–393. [Google Scholar] [CrossRef]
Tong, A.; Huang, J.; Wolf, G.; Van Dijk, D.; Krishnaswamy, S. Trajectorynet: A dynamic optimal transport network for modeling cellular dynamics. In Proceedings of the International Conference on Machine Learning. PMLR, Virtual, 13–18 July 2020; pp. 9526–9536. [Google Scholar]
Huguet, G.; Magruder, D.S.; Tong, A.; Fasina, O.; Kuchroo, M.; Wolf, G.; Krishnaswamy, S. Manifold interpolating optimal-transport flows for trajectory inference. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2022; Volume 35, pp. 29705–29718. [Google Scholar]
Ruthotto, L.; Osher, S.J.; Li, W.; Nurbekyan, L.; Fung, S.W. A machine learning framework for solving high-dimensional mean field game and mean field control problems. Proc. Natl. Acad. Sci. USA 2020, 117, 9183–9193. [Google Scholar] [CrossRef]
Liu, S.; Ma, S.; Chen, Y.; Zha, H.; Zhou, H. Learning high dimensional Wasserstein geodesics. arXiv 2021, arXiv:2102.02992. [Google Scholar]
Cheng, Q.; Liu, Q.; Chen, W.; Shen, J. A new flow dynamic approach for Wasserstein gradient flows. arXiv 2024, arXiv:2406.14870. [Google Scholar] [CrossRef]
Wan, W.; Zhang, Y.; Bao, C.; Dong, B.; Shi, Z. A scalable deep learning approach for solving high-dimensional dynamic optimal transport. SIAM J. Sci. Comput. 2023, 45, B544–B563. [Google Scholar] [CrossRef]
Pooladian, A.A.; Domingo-Enrich, C.; Chen, R.T.Q.; Amos, B. Neural Optimal Transport with Lagrangian Costs. In Proceedings of the 40th Conference on Uncertainty in Artificial Intelligence, Barcelona, Spain, 15–19 July 2024. [Google Scholar]
Klein, D.; Uscidda, T.; Theis, F.; Cuturi, M. Generative Entropic Neural Optimal Transport To Map Within and Across Spaces. arXiv 2023, arXiv:2310.09254. [Google Scholar]
Albergo, M.S.; Vanden-Eijnden, E. Building Normalizing Flows with Stochastic Interpolants. In Proceedings of the Eleventh International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
Liu, X.; Gong, C.; Liu, Q. Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow. In Proceedings of the Eleventh International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
Jiao, Y.; Lai, Y.; Wang, Y.; Yan, B. Convergence Analysis of Flow Matching in Latent Space with Transformers. arXiv 2024, arXiv:2404.02538. [Google Scholar]
Gao, Y.; Huang, J.; Jiao, Y.; Zheng, S. Convergence of Continuous Normalizing Flows for Learning Probability Distributions. arXiv 2024, arXiv:2404.00551. [Google Scholar]
Liu, S.; Li, W.; Zha, H.; Zhou, H. Neural Parametric Fokker–Planck Equation. SIAM J. Numer. Anal. 2022, 60, 1385–1449. [Google Scholar] [CrossRef]
Wu, H.; Liu, S.; Ye, X.; Zhou, H. Parameterized wasserstein hamiltonian flow. arXiv 2023, arXiv:2306.00191. [Google Scholar] [CrossRef]
Jin, Y.; Liu, S.; Wu, H.; Ye, X.; Zhou, H. Parameterized Wasserstein Gradient Flow. arXiv 2024, arXiv:2404.19133. [Google Scholar] [CrossRef]
Chow, S.N.; Li, W.; Zhou, H. Wasserstein hamiltonian flows. J. Differ. Equ. 2020, 268, 1205–1219. [Google Scholar] [CrossRef]
Cheng, X.; Lu, J.; Tan, Y.; Xie, Y. Convergence of flow-based generative models via proximal gradient descent in Wasserstein space. IEEE Trans. Inf. Theory 2024, 70, 8087–8106. [Google Scholar] [CrossRef]
Zhou, P.; Gao, X.; Li, X.; Li, L.; Niu, C.; Ouyang, Q.; Lou, H.; Li, T.; Li, F. Stochasticity triggers activation of the S-phase checkpoint pathway in budding yeast. Phys. Rev. X 2021, 11, 011004. [Google Scholar] [CrossRef]
Elowitz, M.B.; Levine, A.J.; Siggia, E.D.; Swain, P.S. Stochastic gene expression in a single cell. Science 2002, 297, 1183–1186. [Google Scholar] [CrossRef]
Léonard, C. A survey of the Schrödinger problem and some of its connections with optimal transport. Discret. Contin. Dyn. Syst.-Ser. A 2014, 34, 1533–1574. [Google Scholar] [CrossRef]
Pariset, M.; Hsieh, Y.P.; Bunne, C.; Krause, A.; Bortoli, V.D. Unbalanced Diffusion Schrödinger Bridge. In Proceedings of the ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, Honolulu, HI, USA, 23–29 July 2023. [Google Scholar]
Ventre, E.; Forrow, A.; Gadhiwala, N.; Chakraborty, P.; Angel, O.; Schiebinger, G. Trajectory inference for a branching SDE model of cell differentiation. arXiv 2023, arXiv:2307.07687. [Google Scholar]
Chizat, L.; Zhang, S.; Heitz, M.; Schiebinger, G. Trajectory inference via mean-field langevin in path space. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2022; Volume 35, pp. 16731–16742. [Google Scholar]
Shi, Y.; De Bortoli, V.; Campbell, A.; Doucet, A. Diffusion Schrödinger bridge matching. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2024; Volume 36. [Google Scholar]
De Bortoli, V.; Thornton, J.; Heng, J.; Doucet, A. Diffusion schrödinger bridge with applications to score-based generative modeling. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2021; Volume 34, pp. 17695–17709. [Google Scholar]
Pooladian, A.A.; Niles-Weed, J. Plug-in estimation of Schrödinger bridges. arXiv 2024, arXiv:2408.11686. [Google Scholar]
Liu, G.H.; Chen, T.; So, O.; Theodorou, E. Deep Generalized Schrödinger Bridge. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2022; Volume 35, pp. 9374–9388. [Google Scholar]
Gu, A.; Chien, E.; Greenewald, K. Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior. arXiv 2024, arXiv:2406.07475. [Google Scholar]
Koshizuka, T.; Sato, I. Neural Lagrangian Schrödinger Bridge: Diffusion Modeling for Population Dynamics. In Proceedings of the Eleventh International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
Neklyudov, K.; Brekelmans, R.; Severo, D.; Makhzani, A. Action matching: Learning stochastic dynamics from samples. In Proceedings of the International Conference on Machine Learning. PMLR, Honolulu, HI, USA, 23–29 July 2023; pp. 25858–25889. [Google Scholar]
Neklyudov, K.; Brekelmans, R.; Tong, A.; Atanackovic, L.; Liu, Q.; Makhzani, A. A Computational Framework for Solving Wasserstein Lagrangian Flows. In Proceedings of the Forty-first International Conference on Machine Learning, Vienna, Austria, 21–27 July 2024. [Google Scholar]
Zhang, P.; Gao, T.; Guo, J.; Duan, J. Action Functional as Early Warning Indicator in the Space of Probability Measures. arXiv 2024, arXiv:2403.10405. [Google Scholar]
Bunne, C.; Hsieh, Y.P.; Cuturi, M.; Krause, A. The schrödinger bridge between gaussian measures has a closed form. In Proceedings of the International Conference on Artificial Intelligence and Statistics, PMLR, Valencia, Spain, 25–27 April 2023; pp. 5802–5833. [Google Scholar]
Chen, T.; Liu, G.H.; Theodorou, E. Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory. In Proceedings of the International Conference on Learning Representations, Online, 25–29 April 2022. [Google Scholar]
Albergo, M.S.; Boffi, N.M.; Vanden-Eijnden, E. Stochastic interpolants: A unifying framework for flows and diffusions. arXiv 2023, arXiv:2303.08797. [Google Scholar]
Wang, G.; Jiao, Y.; Xu, Q.; Wang, Y.; Yang, C. Deep generative learning via schrödinger bridge. In Proceedings of the International Conference on Machine Learning, PMLR, Virtual, 18–24 July 2021; pp. 10794–10804. [Google Scholar]
Jiao, Y.; Kang, L.; Lin, H.; Liu, J.; Zuo, H. Latent Schrödinger Bridge Diffusion Model for Generative Learning. arXiv 2024, arXiv:2404.13309. [Google Scholar]
Zhou, L.; Lou, A.; Khanna, S.; Ermon, S. Denoising Diffusion Bridge Models. In Proceedings of the Twelfth International Conference on Learning Representations, Vienna, Austria, 7–11 May 2024. [Google Scholar]
Liu, G.H.; Vahdat, A.; Huang, D.A.; Theodorou, E.A.; Nie, W.; Anandkumar, A. I²SB: Image-to-Image Schrödinger Bridge. arXiv 2023, arXiv:2302.05872. [Google Scholar]
Zhou, M.; Osher, S.; Li, W. Score-based Neural Ordinary Differential Equations for Computing Mean Field Control Problems. arXiv 2024, arXiv:2409.16471. [Google Scholar]
Zhu, Q.; Zhao, B.; Zhang, J.; Li, P.; Lin, W. Governing equation discovery of a complex system from snapshots. arXiv 2024, arXiv:2410.16694. [Google Scholar]
Tong, A.; Malkin, N.; Fatras, K.; Atanackovic, L.; Zhang, Y.; Huguet, G.; Wolf, G.; Bengio, Y. Simulation-Free Schrödinger Bridges via Score and Flow Matching. In Proceedings of the International Conference on Artificial Intelligence and Statistics, PMLR, Valencia, Spain, 2–4 May 2024; pp. 1279–1287. [Google Scholar]
Chen, Y.; Georgiou, T.T.; Pavon, M. The most likely evolution of diffusing and vanishing particles: Schrodinger bridges with unbalanced marginals. SIAM J. Control Optim. 2022, 60, 2016–2039. [Google Scholar] [CrossRef]
Baradat, A.; Lavenant, H. Regularized unbalanced optimal transport as entropy minimization with respect to branching brownian motion. arXiv 2021, arXiv:2111.01666. [Google Scholar]
Buze, M.; Duong, M.H. Entropic regularisation of unbalanced optimal transportation problems. arXiv 2023, arXiv:2305.02410. [Google Scholar]
Janati, H.; Muzellec, B.; Peyré, G.; Cuturi, M. Entropic optimal transport between unbalanced gaussian measures has a closed form. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2020; Volume 33, pp. 10468–10479. [Google Scholar]
Chen, R.T.Q.; Rubanova, Y.; Bettencourt, J.; Duvenaud, D. Neural Ordinary Differential Equations. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
Dai Pra, P. A stochastic control approach to reciprocal diffusion processes. Appl. Math. Optim. 1991, 23, 313–329. [Google Scholar] [CrossRef]
Chen, Y.; Georgiou, T.T.; Pavon, M. On the relation between optimal transport and Schrödinger bridges: A stochastic control viewpoint. J. Optim. Theory Appl. 2016, 169, 671–691. [Google Scholar] [CrossRef]
Chizat, L.; Peyré, G.; Schmitzer, B.; Vialard, F.X. An interpolating distance between optimal transport and Fisher–Rao metrics. Found. Comput. Math. 2018, 18, 1–44. [Google Scholar] [CrossRef]
Chizat, L.; Peyré, G.; Schmitzer, B.; Vialard, F.X. Unbalanced optimal transport: Dynamic and Kantorovich formulations. J. Funct. Anal. 2018, 274, 3090–3123. [Google Scholar] [CrossRef]
Gangbo, W.; Li, W.; Osher, S.; Puthawala, M. Unnormalized optimal transport. J. Comput. Phys. 2019, 399, 108940. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Pham, D.; Tan, X.; Balderson, B.; Xu, J.; Grice, L.F.; Yoon, S.; Willis, E.F.; Tran, M.; Lam, P.Y.; Raghubar, A.; et al. Robust mapping of spatiotemporal trajectories and cell–cell interactions in healthy and diseased tissues. Nat. Commun. 2023, 14, 7739. [Google Scholar] [CrossRef]
Zhou, P.; Bocci, F.; Li, T.; Nie, Q. Spatial transition tensor of single cells. Nat. Methods 2024, 21, 1053–1062. [Google Scholar] [CrossRef] [PubMed]
Shen, X.; Zuo, L.; Ye, Z.; Yuan, Z.; Huang, K.; Li, Z.; Yu, Q.; Zou, X.; Wei, X.; Xu, P.; et al. Inferring cell trajectories of spatial transcriptomics via optimal transport analysis. Cell Syst. 2025, 16, 556175. [Google Scholar] [CrossRef] [PubMed]
Tan, Y.; Wang, A.; Wang, Z.; Lin, W.; Yan, Y.; Nie, Q.; Shi, J. Transfer learning of multicellular organization via single-cell and spatial transcriptomics. bioRxiv 2024. [Google Scholar] [CrossRef]
Gu, Y.; Liu, J.; Li, C.; Welch, J.D. Mapping Cell Fate Transition in Space and Time. In Proceedings of the International Conference on Research in Computational Molecular Biology, Cambridge, MA, USA, 29 April–2 May 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 417–420. [Google Scholar]
Qiu, X.; Zhu, D.Y.; Yao, J.; Jing, Z.; Zuo, L.; Wang, M.; Min, K.H.; Pan, H.; Wang, S.; Liao, S.; et al. Spateo: Multidimensional spatiotemporal modeling of single-cell spatial transcriptomics. bioRxiv 2022. [Google Scholar] [CrossRef]
Wei, X.; Fu, S.; Li, H.; Liu, Y.; Wang, S.; Feng, W.; Yang, Y.; Liu, X.; Zeng, Y.Y.; Cheng, M.; et al. Single-cell Stereo-seq reveals induced progenitor cells involved in axolotl brain regeneration. Science 2022, 377, eabp9444. [Google Scholar] [CrossRef]
Wang, M.; Hu, Q.; Tu, Z.; Kong, L.; Yao, J.; Xiang, R.; Chen, Z.; Zhao, Y.; Zhou, Y.; Yu, T.; et al. A single-cell 3D spatiotemporal multi-omics atlas from Drosophila embryogenesis to metamorphosis. bioRxiv 2024. [Google Scholar] [CrossRef]
Zeira, R.; Land, M.; Strzalkowski, A.; Raphael, B.J. Alignment and integration of spatial transcriptomics data. Nat. Methods 2022, 19, 567–575. [Google Scholar] [CrossRef]
Halmos, P.; Liu, X.; Gold, J.; Chen, F.; Ding, L.; Raphael, B.J. DeST-OT: Alignment of Spatiotemporal Transcriptomics Data. bioRxiv 2024. [Google Scholar] [CrossRef]
Titouan, V.; Courty, N.; Tavenard, R.; Flamary, R. Optimal transport for structured data with application on graphs. In Proceedings of the International Conference on Machine Learning, PMLR, Long Beach, CA, USA, 9–15 June 2019; pp. 6275–6284. [Google Scholar]
Chowdhury, S.; Mémoli, F. The Gromov–Wasserstein distance between networks and stable network invariants. Inf. Inference J. IMA 2019, 8, 757–787. [Google Scholar] [CrossRef]
Cohen, S.; Guibasm, L. The earth mover’s distance under transformation sets. In Proceedings of the Seventh IEEE International Conference on Computer Vision, Corfu, Greece, 20–27 September 1999; Werner, B., Ed.; IEEE: New York, NY, USA, 1999; Volume 2, pp. 1076–1083. [Google Scholar]
Coifman, R.R.; Kevrekidis, I.G.; Lafon, S.; Maggioni, M.; Nadler, B. Diffusion maps, reduction coordinates, and low dimensional representation of stochastic systems. Multiscale Model. Simul. 2008, 7, 842–864. [Google Scholar] [CrossRef]
Hetzel, L.; Fischer, D.S.; Günnemann, S.; Theis, F.J. Graph representation learning for single-cell biology. Curr. Opin. Syst. Biol. 2021, 28, 100347. [Google Scholar] [CrossRef]
Zhang, Y.; Qiu, X.; Ni, K.; Weissman, J.; Bahar, I.; Xing, J. Graph-Dynamo: Learning stochastic cellular state transition dynamics from single cell data. bioRxiv 2023. [Google Scholar] [CrossRef]
Chen, Y.; Zhang, Y.; Gan, J.; Ni, K.; Chen, M.; Bahar, I.; Xing, J. GraphVelo allows inference of multi-modal single cell velocities and molecular mechanisms. bioRxiv 2024. [Google Scholar] [CrossRef]
Almet, A.A.; Cang, Z.; Jin, S.; Nie, Q. The landscape of cell–cell communication through single-cell transcriptomics. Curr. Opin. Syst. Biol. 2021, 26, 12–23. [Google Scholar] [CrossRef]
Jin, S.; Guerrero-Juarez, C.F.; Zhang, L.; Chang, I.; Ramos, R.; Kuan, C.H.; Myung, P.; Plikus, M.V.; Nie, Q. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 2021, 12, 1088. [Google Scholar] [CrossRef]
Jin, S.; Plikus, M.V.; Nie, Q. CellChat for systematic analysis of cell–cell communication from single-cell transcriptomics. Nat. Protoc. 2025, 20, 180–219. [Google Scholar] [CrossRef]
Cang, Z.; Zhao, Y.; Almet, A.A.; Stabell, A.; Ramos, R.; Plikus, M.V.; Atwood, S.X.; Nie, Q. Screening cell–cell communication in spatial transcriptomics via collective optimal transport. Nat. Methods 2023, 20, 218–228. [Google Scholar] [CrossRef]
Almet, A.A.; Tsai, Y.C.; Watanabe, M.; Nie, Q. Inferring pattern-driving intercellular flows from single-cell and spatial transcriptomics. Nat. Methods 2024, 21, 1806–1817. [Google Scholar] [CrossRef]
Wada, T.; Hironaka, K.i.; Kuroda, S. Cell-to-cell variability serves as information not noise. Curr. Opin. Syst. Biol. 2021, 27, 100339. [Google Scholar]
Topolewski, P.; Komorowski, M. Information-theoretic analyses of cellular strategies for achieving high signaling capacity—dynamics, cross-wiring, and heterogeneity of cellular states. Curr. Opin. Syst. Biol. 2021, 27, 100352. [Google Scholar] [CrossRef]
Gandrillon, O.; Stumpf, M.P. Editorial overview: ‘Theoretical approaches to analyze single-cell data’ (April 2021) within the theme ‘Mathematical modelling’. Curr. Opin. Syst. Biol. 2021, 28, 100382. [Google Scholar] [CrossRef]
Ao, P. Potential in stochastic differential equations: Novel construction. J. Phys. A Math. Gen. 2004, 37, L25. [Google Scholar] [CrossRef]
Shi, J.; Aihara, K.; Li, T.; Chen, L. Energy landscape decomposition for cell differentiation with proliferation effect. Natl. Sci. Rev. 2022, 9, nwac116. [Google Scholar] [CrossRef] [PubMed]
Zhao, Y.; Zhang, W.; Li, T. EPR-Net: Constructing a non-equilibrium potential landscape via a variational force projection formulation. Natl. Sci. Rev. 2024, 11, nwae052. [Google Scholar] [CrossRef]
Li, C.; Wang, J. Quantifying cell fate decisions for differentiation and reprogramming of a human stem cell network: Landscape and biological paths. PLoS Comput. Biol. 2013, 9, e1003165. [Google Scholar] [CrossRef]
Wang, J.; Li, C.; Wang, E. Potential and flux landscapes quantify the stability and robustness of budding yeast cell cycle network. Proc. Natl. Acad. Sci. USA 2010, 107, 8195–8200. [Google Scholar] [CrossRef]
Li, C.; Wang, J. Landscape and flux reveal a new global view and physical quantification of mammalian cell cycle. Proc. Natl. Acad. Sci. USA 2014, 111, 14130–14135. [Google Scholar] [CrossRef]
Bian, S.; Zhang, Y.; Li, C. An improved approach for calculating energy landscape of gene networks from moment equations. Chaos Interdiscip. J. Nonlinear Sci. 2023, 33, 023116. [Google Scholar] [CrossRef]
Bian, S.; Zhou, R.; Lin, W.; Li, C. Quantifying energy landscape of oscillatory systems: Explosion, pre-solution, and diffusion decomposition. arXiv 2024, arXiv:2401.06959. [Google Scholar]
Zhou, R.; Yu, Y.; Li, C. Revealing neural dynamical structure of C. elegans with deep learning. Iscience 2024, 27, 109759. [Google Scholar] [CrossRef]
Torregrosa, G.; Garcia-Ojalvo, J. Mechanistic models of cell-fate transitions from single-cell data. Curr. Opin. Syst. Biol. 2021, 26, 79–86. [Google Scholar] [CrossRef]
Zhu, L.; Wang, J. Quantifying Landscape-Flux via Single-Cell Transcriptomics Uncovers the Underlying Mechanism of Cell Cycle. Adv. Sci. 2024, 11, 2308879. [Google Scholar] [CrossRef] [PubMed]
Zhu, L.; Yang, S.; Zhang, K.; Wang, H.; Fang, X.; Wang, J. Uncovering underlying physical principles and driving forces of cell differentiation and reprogramming from single-cell transcriptomics. Proc. Natl. Acad. Sci. USA 2024, 121, e2401540121. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Xu, L.; Wang, E. Potential landscape and flux framework of nonequilibrium networks: Robustness, dissipation, and coherence of biochemical oscillations. Proc. Natl. Acad. Sci. USA 2008, 105, 12271–12276. [Google Scholar] [CrossRef]
Stein-O’Brien, G.L.; Ainslie, M.C.; Fertig, E.J. Forecasting cellular states: From descriptive to predictive biology via single-cell multiomics. Curr. Opin. Syst. Biol. 2021, 26, 24–32. [Google Scholar] [CrossRef]
Cang, Z.; Zhao, Y. Synchronized Optimal Transport for Joint Modeling of Dynamics Across Multiple Spaces. arXiv 2024, arXiv:2406.03319. [Google Scholar] [CrossRef]
Demetci, P.; Santorella, R.; Sandstede, B.; Noble, W.S.; Singh, R. SCOT: Single-cell multi-omics alignment with optimal transport. J. Comput. Biol. 2022, 29, 3–18. [Google Scholar] [CrossRef]
Zhou, X.; Dong, K.; Zhang, S. Integrating spatial transcriptomics data across different conditions, technologies and developmental stages. Nat. Comput. Sci. 2023, 3, 894–906. [Google Scholar] [CrossRef]
Cao, K.; Gong, Q.; Hong, Y.; Wan, L. A unified computational framework for single-cell data integration with optimal transport. Nat. Commun. 2022, 13, 7419. [Google Scholar] [CrossRef]
Xia, C.R.; Cao, Z.J.; Tu, X.M.; Gao, G. Spatial-linked alignment tool (SLAT) for aligning heterogenous slices. Nat. Commun. 2023, 14, 7236. [Google Scholar] [CrossRef]
Gao, Z.; Cao, K.; Wan, L. Graspot: A graph attention network for spatial transcriptomics data integration with optimal transport. bioRxiv 2024. [Google Scholar] [CrossRef] [PubMed]
Tang, Z.; Luo, S.; Zeng, H.; Huang, J.; Sui, X.; Wu, M.; Wang, X. Search and match across spatial omics samples at single-cell resolution. Nat. Methods 2024, 21, 1818–1829. [Google Scholar] [CrossRef] [PubMed]
Lahat, D.; Adali, T.; Jutten, C. Multimodal data fusion: An overview of methods, challenges, and prospects. Proc. IEEE 2015, 103, 1449–1477. [Google Scholar] [CrossRef]
Liu, X.; Zeira, R.; Raphael, B.J. Partial alignment of multislice spatially resolved transcriptomics data. Genome Res. 2023, 33, 1124–1132. [Google Scholar] [CrossRef]
Wagner, D.E.; Klein, A.M. Lineage tracing meets single-cell omics: Opportunities and challenges. Nat. Rev. Genet. 2020, 21, 410–427. [Google Scholar] [CrossRef]
Weinreb, C.; Rodriguez-Fraticelli, A.; Camargo, F.D.; Klein, A.M. Lineage tracing on transcriptional landscapes links state to fate during differentiation. Science 2020, 367, eaaw3381. [Google Scholar] [CrossRef]
Pratapa, A.; Jalihal, A.P.; Law, J.N.; Bharadwaj, A.; Murali, T. Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data. Nat. Methods 2020, 17, 147–154. [Google Scholar] [CrossRef]
Van de Sande, B.; Flerin, C.; Davie, K.; De Waegeneer, M.; Hulselmans, G.; Aibar, S.; Seurinck, R.; Saelens, W.; Cannoodt, R.; Rouchon, Q.; et al. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat. Protoc. 2020, 15, 2247–2276. [Google Scholar] [CrossRef]
Zhang, S.Y. Joint trajectory and network inference via reference fitting. arXiv 2024, arXiv:2409.06879. [Google Scholar]
Stumpf, M.P. Inferring better gene regulation networks from single-cell data. Curr. Opin. Syst. Biol. 2021, 27, 100342. [Google Scholar] [CrossRef]
Akers, K.; Murali, T. Gene regulatory network inference in single-cell biology. Curr. Opin. Syst. Biol. 2021, 26, 87–97. [Google Scholar] [CrossRef]
Zhao, W.; Larschan, E.; Sandstede, B.; Singh, R. Optimal transport reveals dynamic gene regulatory networks via gene velocity estimation. bioRxiv 2024. [Google Scholar] [CrossRef]
Yang, M. Topological Schrödinger Bridge Matching. In Proceedings of the Thirteenth International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]
Liu, X.; Chang, X.; Liu, R.; Yu, X.; Chen, L.; Aihara, K. Quantifying critical states of complex diseases using single-sample dynamic network biomarkers. PLoS Comput. Biol. 2017, 13, e1005633. [Google Scholar] [CrossRef] [PubMed]
Zhang, X.; Xiao, K.; Wen, Y.; Wu, F.; Gao, G.; Chen, L.; Zhou, C. Multi-omics with dynamic network biomarker algorithm prefigures organ-specific metastasis of lung adenocarcinoma. Nat. Commun. 2024, 15, 9855. [Google Scholar] [CrossRef]
Chen, Z.; Bai, X.; Ma, L.; Wang, X.; Liu, X.; Liu, Y.; Chen, L.; Wan, L. A branch point on differentiation trajectory is the bifurcating event revealed by dynamical network biomarker analysis of single-cell data. IEEE/ACM Trans. Comput. Biol. Bioinform. 2018, 17, 366–375. [Google Scholar] [CrossRef]
Han, C.; Zhong, J.; Zhang, Q.; Hu, J.; Liu, R.; Liu, H.; Mo, Z.; Chen, P.; Ling, F. Development of a dynamic network biomarkers method and its application for detecting the tipping point of prior disease development. Comput. Struct. Biotechnol. J. 2022, 20, 1189–1197. [Google Scholar] [CrossRef]

Figure 1. Overview of the data and models. (a) Discrete and Continuous Model: The discrete model constructs Markov chains between cells with dynamics encoded in a transition matrix, while the continuous model describes single-cell motion via stochastic differential equations (SDEs) and cell population dynamics through a corresponding partial differential equation (PDE). (b) Datasets: Snapshot data is an

n \times g

matrix

X

(n: cell count, g: gene count); temporal data provides gene expression matrices

X_{i}

at time points

i \in {0, \dots, T - 1}

; spatial data additionally records coordinates for each cell in

X_{i}

.

Figure 1. Overview of the data and models. (a) Discrete and Continuous Model: The discrete model constructs Markov chains between cells with dynamics encoded in a transition matrix, while the continuous model describes single-cell motion via stochastic differential equations (SDEs) and cell population dynamics through a corresponding partial differential equation (PDE). (b) Datasets: Snapshot data is an

n \times g

matrix

X

(n: cell count, g: gene count); temporal data provides gene expression matrices

X_{i}

at time points

i \in {0, \dots, T - 1}

; spatial data additionally records coordinates for each cell in

X_{i}

.

Figure 2. Dynamic modeling of snapshot single-cell transcriptomics.

Figure 3. Dynamic modeling of temporally-resolved single-cell transcriptomics.

Figure 4. Dynamic modeling of spatial transcriptomics.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, Z.; Sun, Y.; Peng, Q.; Li, T.; Zhou, P. Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis. Entropy 2025, 27, 453. https://doi.org/10.3390/e27050453

AMA Style

Zhang Z, Sun Y, Peng Q, Li T, Zhou P. Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis. Entropy. 2025; 27(5):453. https://doi.org/10.3390/e27050453

Chicago/Turabian Style

Zhang, Zhenyi, Yuhao Sun, Qiangwei Peng, Tiejun Li, and Peijie Zhou. 2025. "Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis" Entropy 27, no. 5: 453. https://doi.org/10.3390/e27050453

APA Style

Zhang, Z., Sun, Y., Peng, Q., Li, T., & Zhou, P. (2025). Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis. Entropy, 27(5), 453. https://doi.org/10.3390/e27050453

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-Seq Data Analysis

Abstract

1. Introduction

2. Overview of the Data and Models

2.1. Spatiotemporal scRNA-Seq Data

2.1.1. Snapshot scRNA-Seq Data

2.1.2. Temporally and Spatially Resolved scRNA-Seq

2.2. Models for Cell-State Transitions

2.2.1. Discrete Dynamics: Markov Chain Model

2.2.2. Continuous Dynamics: From Trajectories to Population Dynamics

3. Dynamic Modeling of Single-Cell Transcriptomics

3.1. Snapshot Single-Cell RNA-Seq

3.1.1. Pseudotime Methods

3.1.2. Discrete Dynamics Modeling

3.1.3. Continuous Dynamics Modeling

Steady-State Assumption: Parameter Estimation in Velocyto [26]

Dynamic Inference: Parameter Estimation in scVelo [30]

Function Class-Based Estimation

Latent State: VAE-Based Methods

Enhancing Velocity: Continuity-Based Methods

Estimating the Vector Field

Geometric Analysis of Vector Field

Transition Path Analysis

3.2. Temporally Resolved Single-Cell RNA-Seq

3.2.1. Discrete Temporal Dynamics Modeling

3.2.2. Continuous Temporal Dynamics Modeling

Neural ODE Solver

Conditional Flow Matching

Neural SDE Solver

Shrödinger Bridge Conditional Flow Matching

4. Dynamic Modeling of Spatial Transcriptomics

4.1. Snapshot Spatial Transcriptomics

4.1.1. Pseudotime Methods

4.1.2. Discrete Spatial Dynamics Modeling

4.1.3. Continuous Spatial Dynamics Modeling

4.2. Temporally Resolved Spatial Transcriptomics

4.2.1. Discrete Spatiotemporal Dynamics Modeling

4.2.2. Spatiotemporal Dynamics Modeling

5. Extensions, Challenges, and Future Directions

5.1. Bridging Discrete and Continuous Dynamics Modeling

5.2. Modeling Cell–Cell Interaction Dynamically

5.3. Reconstructing Waddington Developmental Landscapes

5.4. Challenges and Further Directions

6. Discussion and Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI