Research Topics

Over time, some of our work naturally clusters into coherent thematic directions. To provide a clearer structural view of our research and to improve accessibility, we organize them below by topic, with the most recently active ones listed first.
  1. Reaction-diffusion equations and wave front propagation in random media.

    Molecular motors, which are biological molecular machines that are the essential agents of movement in living organisms, can be modeled as diffusion particles traveling in a designated track. To model the environment in which the fluctuations due to thermal noise are significant, I have been considering the traveling of these motors in a narrow random channel. Under the asymptotic when the channel width is thin, I derive the limiting process as a diffusion process on a graph (see the work [1] below). Furthermore, I introduce in [2] a reaction-diffusion equation in random media, which models the change in space and time of the concentration of these motors. By making use of large deviations theory for diffusion processes in random media, I derive the wave front propagation formula for the corresponding reaction-diffusion system.

    In the work [3] done in the year 2019, we consider the asymptotic wave speed for Fisher-Kolmogorov-Petrovskii-Piscounov (FKPP) type reaction-diffusion equations on a class of infinite random metric trees. We show that a travelling wavefront emerges and we quantify it via a variational formula involving the random branching degrees and the random branch lengths of the tree. Here our key idea is to project the Brownian motion on the tree onto a one-dimensional axis along the direction of the wave propagation. The projected process is a multi-skewed Brownian motion, with skewness and interface sets that encode the metric structure of the tree. Combined with analytic arguments based on the Feynman-Kac formula, this idea connects our analysis of the wavefront propagation to the Large Deviations Principle (LDP) of the multi-skewed Brownian motion with random skewness and random interface set.

    A recent work [4] done in 2025 resolves the small reaction rate case left in [3] as well as an open problem in the classical monograph "Functional Integration and Partial Differential Equations" (Princeton University Press, 1985) by Freidlin (Section 7.6, Remark 4, pp.524-525). Here we consider the wave propagation for a reaction-diffusion equation on the real line, with a random drift and FKPP type nonlinear reaction. We show that when the average drift is positive, the asymptotic wave fronts propagating to the positive and negative directions are both pushed in the negative direction, leading to the possibility that both wave fronts propagate toward negative infinity. Our probabilistic arguments also reveal the underlying physical mechanism of the wave fronts formation: the drift acts as an external field that shifts the (quenched) free-energy reference level without altering the intrinsic fluctuation structure of the system.

    Papers.

    [4] Guan, D., He, H., Hu, W., Yang, J., Wave propagation for 1-dimensional reaction-diffusion equation with nonzero random drift. [arXiv]

    [3] Fan, W., Hu, W., Terlov, G., Wave propagation for reaction-diffusion equations on infinite random trees. Communications in Mathematical Physics, 384, Issue 1, April 2021, pages 109-163. [arXiv] [journal paper]

    [2] Freidlin, M., Hu, W., Wave front propagation for a reaction-diffusion equation in narrow random channels. Nonlinearity, 26, 8, 2013, pp. 2333-2356. [arXiv] [journal paper]

    [1] Freidlin, M., Hu, W., On diffusion in narrow random channels. Journal of Statistical Physics, 152, 2013, pp. 136-158. [arXiv] [journal paper]

    Slides.

    [2] Wave propagation for reaction-diffusion equation on infinite random trees.

    [1] Diffusion and wave front propagation in narrow random channels.

  2. Markov Decision Processes and Reinforcement Learning applied to Microgrid and Manufacturing Systems.

    Several real-world applications of Markov Decision Process (MDP) and Reinforcement Learning are considered under this project. In [1], we propose a joint dynamic decision-making model for the optimal control for both manufacturing system and onsite generation system via MDP and a neural network integrated reinforcement learning algorithm. In [2] a real-time decision-making model is proposed for the electric vehicle (EV) aggregator to dynamically control the energy flow between the grid and each individual EV in the aggregated group.

    In the work [3] finished in 2020, we propose a joint dynamic control model of microgrids and manufacturing systems using Markov Decision Process (MDP) to identify an optimal control strategy for both microgrid components and manufacturing system so that the energy cost for production can be minimized without sacrificing production throughput. The proposed MDP model has a high dimensional state/action space and is complicated in that the state and action spaces have both discrete and continuous parts and are intertwined through constraints. To resolve these challenges, a novel reinforcement learning algorithm that leverages both on-policy temporal difference control (TD-control) and deterministic policy gradient (DPG) algorithms is proposed. In this algorithm, the values of discrete decision actions are learned through neural network integrated temporal difference iteration, while the parameterized values of continuous actions are learned from deterministic policy gradients. The constraints are then addressed via proximal projection operators at the policy gradient updates. Experiments for a manufacturing system with an onsite microgrid with renewable sources have been implemented to identify optimal control actions for both manufacturing system and microgrid components towards cost optimality. The experimental results show the effectiveness of combining TD control and policy gradient methodologies in addressing the "curse of dimensionality" in dynamic decision-making with high dimensional and complicated state and action spaces. We refer to this slide.

    Papers.

    [3] Yang, J., Sun, Z., Hu, W., Steimeister, L., Joint Control of Manufacturing and Onsite Microgrid System via Novel Neural-Network Integrated Reinforcement Learning Algorithms. Applied Energy, Volume 315, 1 June 2022, 118982. [manuscript] [journal paper] [source code]

    [2] Islam, Md M., Zhong, X., Sun, Z., Xiong, H., Hu, W., Real-Time Frequency Regulation Using Aggregated Electric Vehicles in Smart Grid. Computers & Industrial Engineering, Volume 134, August 2019, pages 11-26. [journal paper]

    [1] Hu, W., Sun, Z., Zhang, Y., Li, Y., Joint Manufacturing and Onsite Microgrid System Control Using Markov Decision Process and Neural Network Integrated Reinforcement Learning. ICPR 2019 (the 25th International Conference on Production Research), Chicago, Illinois, USA, August 10-14, 2019. [conference paper]

    Slides.

    [1] Joint Control of Manufacturing and Onsite Microgrid System via Markov Decision Processes and Reinforcement Learning.
  3. Zero-Knowledge Proofs applied to Blockchain Systems and Virtual Machines.

    Zero-Knowledge Proofs (ZKPs) are interactive protocols where a prover can convince a verifier they know a secret without revealing it. The leap to Succinct Non-interactive Arguments of Knowledge (SNARKs) enabled very short proofs and one-message communication. This has become a core component of Layer-2 scaling solutions (rollups) for blockchain systems like Ethereum, leading to the so called zero-knowledge Ethereum Virtual Machine (zkEVM). For example, zk-Rollups bundle many transactions off-chain, then publish a SNARK proof on-chain that the rollup state transition was correct. This results in a relatively low, fixed on-chain verification cost, instead of processing each transaction individually on-chain.

    In a recent paper [1], we propose a novel zero-knowledge Virtual Machine (zkVM) framework. Unlike previous proof systems following the sequential execution of opcodes, here we break the execution of arbitrary code into segments (at opcode and basic-block levels) and use data-parallel circuits with dynamic duplication of identical segments. A key technical tool is our use of an asymmetric GKR protocol (Goldwasser-Kalai-Rothblum) scheme, pairing a non-uniform prover and a uniform verifier to handle dynamic-length data-parallel circuits. This allows them to commit only to the circuit's input and output (in contrast to many PLONK-style proofs where the prover must commit to all internal wires).

    Papers.

    [1] Liu, T., Zhang, Z., Zhang, Y., Hu, W., Zhang, Y., Ceno: Non-uniform, Segment and Parallel Zero-knowledge Virtual Machine. Journal of Cryptology, 38, 17(2025). [Cryptology ePrint Archive] [journal paper] [open-source project code]

    Expository Notes.

    [3] Documentations of zkEVM circuits: EVM Circuit, Bytecode Circuit, State Circuit, Transaction Circuit, MPT Circuit, Signature Circuit, Public Input Circuit, Keccak Circuit, Modexp Circuit.

    [2] Summary of some recent algorithms related to Zero-Knowledge Proofs. Notes.

    [1] Notes on Cryptographic Zero-Knowledge Proofs (based on Justin Thaler's textbook Proofs, Arguments and Zero-Knowledge). Notes.

  4. Geometry and Intepretability of Neural Networks.

    (to be revised) Subspace Indexing with Interpolation (SIM-I) on Stiefel and Grassmann manifolds is proposed in [1]. Given a partition of some original high-dimensional data set, SIM-I is constructed via two steps: in the first step we build linear affinity-aware subspace models based on each partition; in the second step we interpolate between several adjacent linear subspace models constructed in the first step using the “center of mass” calculation on Stiefel and Grassmann manifolds. Through these two steps, SIM-I builds a globally nonlinear and smoothly regularized low-dimensional embedding model of the original data set. Furthermore, given sufficiently many training samples on the data manifold either labelled by some pre-trained learning model such as Deep Neural Networks (DNNs) or provided with original natural labels, we first apply SIM-I on this data set and then perform nearest-neighbor classification on the resulting low-dimensional embedding. This helps us to build a Lightweight Inference Engine (LIE) carrying similar level of feature extraction by the pre-trained learning model. For DNNs, such LIE can be interpreted as some (nonstandard) shallow neural network with a wide first hidden layer. From this perspective, SIM-I provides a way to exchange deep network for wide but shallow ones and may provide some new insights to interpret DNNs.

    Papers.

    [1] Hu, W., Jiang, T., Kathariya, B., Abrol, V., Zhang, J., Li, Z., Subspace Interpolation and Indexing on Stiefel and Grassmann Manifolds as a Lightweight Inference Engine. IEEE Big Data 2023 (2023 IEEE International Conference on Big Data), Sorrento, Italy, December 15-18, 2023. (Acceptance Rate: 92/526=17.5%) [conference paper] [manuscript] [source code] [video]

    Slides.

    [1] Subspace Interpolation and Indexing on Stiefel and Grassmann Manifolds as a Lightweight Inference Engine.
  5. Stochastic Optimization Algorithms in Machine Learning.

    Many large-scale learning problems in modern statistics and machine learning can be reduced to solving stochastic optimization problems, i.e., the search for (local) minimum points of the expectation of an objective random function (loss function). These optimization problems are usually solved by certain stochastic approximation algorithms, which are recursive update rules with random inputs in each iteration. Under this project, we have been considering various types of such stochastic approximation algorithms, including the stochastic gradient descent, the stochastic composite gradient descent and the stochastic heavy-ball method. By introducing approximating diffusion processes to the discrete recursive schemes, we have analyzed the convergence of the diffusion limits to these algorithms via delicate techniques in stochastic analysis and asymptotic methods. We also study the generalization properties of these algorithms when they are applied to machine learning problems.

    Papers.

    [7] Hu, W., Li, C.J., Zhou, X., On the Global Convergence of Continuous-Time Stochastic Heavy-Ball Method for Nonconvex Optimization. IEEE Big Data 2019 (2019 IEEE International Conference on Big Data), Los Angeles, California, USA, December 9-12, 2019. [arXiv] [conference paper]

    [6] Wu, J., Hu, W., Xiong, H., Huan, J., Braverman, V., Zhu, Z., On the Noisy Gradient Descent that Generalizes as SGD. ICML 2020 (37th International Conference on Machine Learning), virtual conference due to COVID-19, July 12-18, 2020. [arXiv] [conference paper]

    [5] Yuan, H., Lian, X., Li, C.J., Liu, J., Hu, W., Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent. NeurIPS 2019 (Thirty-third Conference on Neural Information Processing Systems), Vancouver, Canada, December 8-14, 2019. [conference paper]

    [4] Hu, W., Li, C.J., Li, L., Liu, J., On the diffusion approximation of nonconvex stochastic gradient descent. Annals of Mathematical Science and Applications, Vol. 4, No. 1(2019), pp. 3-32. [arXiv] [journal paper]

    [3] Hu, W., Li, C.J., A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations. Discrete and Continuous Dynamical Systems, Series A, 38, 10, October 2018, pp. 4951-4977. [arXiv] [journal paper]

    [2] Hu, W., Li, C.J., Zhou, X., On the Global Convergence of Continuous-Time Stochastic Heavy-Ball Method for Nonconvex Optimization. IEEE Big Data 2019 (2019 IEEE International Conference on Big Data), Los Angeles, California, USA, December 9-12, 2019. [arXiv] [conference paper]

    [1] Yang, J., Hu, W., Li, C.J., On the fast convergence of random perturbations of the gradient flow. Asymptotic Analysis, Volume 122, 2021, pages 371-393. [arXiv] [journal paper]

    Slides.

    [4] Fast convergence of random perturbations of the gradient flow.

    [3] A random perturbation approach to some stochastic approximation algorithms in optimization.

    [2] Some Probabilistic Understandings of the Effects of Noise in the Stochastic Gradient Descent.

    [1] Stochastic Approximations, Diffusion Limit and Small Random Perturbations of Dynamical Systems - a probabilistic approach to machine learning.

    Expository Notes.

    [2] Nonlinear Optimization in Machine Learning. (A new graduate-level topic course that I developed at Missouri S&T.) Lecture Notes.

    [1] Lectures on Nonlinear Optimization in Machine Learning. (A series of lectures that I gave at the School of Mathematics and Statistics, Anhui Normal University in June 27-July 8, 2018.) Lecture 1. Lecture 2. Lecture 3. Lecture 4.

  6. Statistical Inference for High-Dimensional and Spatial-Temporal Events.

    We study statistical modeling and inference for complex structured data, with a focus on high-dimensional estimation, temporal and spatial event processes, and network-driven stochastic systems. Particular emphasis is placed on scalable inference and uncertainty quantification under structural and sample size constraints.

    In [1]-[4], we develop regularization methods for covariance matrix estimation in high-dimensional, low-sample-size regimes. These techniques are further applied to improve the inferential performance of classical Fisher's Linear Discriminant Analysis (LDA), with empirical validation on large-scale Electronic Health Records (EHR) datasets.

    In [5], we study multivariate Hawkes processes on networks to model event-driven dynamics with long-memory and self-exciting properties. This framework enables the characterization of human mobility patterns and the discovery of synchronization effects in trip purposes based on real-world datasets.

    Papers.

    [5] Wang, P., Liu, G., Fu, Y., Hu, W., Aggarwal, C., Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes. KDD 2017 (Knowledge, Discovery and Data Mining), Halifax, Nova Scotia, Canada, August 13-17, 2017. Accepted paper ID=fp1019. [conference paper] [abstract and video]

    [4] Xiong, H., Cheng, W., Bian, J., Hu, W., Sun, Z., Guo, Z., DBSDA: Lowering the Bound of Misclassification Rate for Sparse Linear Discriminant Analysis via Model Debiasing. IEEE Transactions on Neural Networks and Learning Systems, Volume 30, Issue 3, pp. 707-717, March 2019. [journal paper]

    [3] Xiong, H., Cheng, W., Fu, Y., Bian, J., Hu, W., Guo, Z., De-Biasing Covariance-Regularized Discriminant Analysis. IJCAI-ECAI 2018 (the 27th International Joint Conference on Artificial Intelligence and the 23rd European Conference on Artificial Intelligence), Stockholm, Sweden, July 13-19, 2018. [conference paper]

    [2] Bian, J., Xiong, H., Cheng, W., Fu, Y., Hu, W., Guo, Z., Multi-Party Sparse Discriminant Learning. ICDM 2017 (2017 IEEE International Conference on Data Mining), New Orleans, Louisiana, USA, November 8-21, 2017. [conference paper]

    [1] Xiong, H., Cheng, W., Bian, J., Hu, W., Guo, Z., AWDA: Adapted Wishart Discriminant Analysis. ICDM 2017 (2017 IEEE International Conference on Data Mining), New Orleans, Louisiana, USA, November 8-21, 2017. [conference paper]

  7. Two-dimensional stochastic fluid mechanics and turbulence models.

    The famous canonical picture of 2-d turbulence due to Kraichnan [Kraichnan, R.H., Inertial ranges in two dimensional turbulence, Physics of Fluids, 10(7), pp. 1417-1423, 1967] conjectures a dual cascade: an inverse energy cascade toward low Fourier modes (large scales) and a direct enstrophy cascade toward high Fourier modes (small scales), driven by the nonlinear interactions. In a series of groundbreaking works starting from [Hairer, M., Mattingly, J. C., Ergodicity of the 2-d Navier-Stokes equations with degenerate stochastic forcing. Annals of Mathematics (2), 164(3):993-1032, 2006], unique ergodicity has been developed under the physically important case of a spatially degenerate (that is frequency localized) stochastic forcing. In this project, we study problems of 2-d turbulence related to Kraichnan's conjectures from various geometric and dynamical perspectives. We show (see our work [1] below) that if we consider the 2-d Navier-Stokes equations on the torus T^2 and we modify the viscous term to damp all but finitely many modes, then all solutions converge in the long-time limit to a stationary solution to the 2-d Euler equation living on those finitely many modes. Then one can classify those stationary solutions precisely. For example, if one removes damping from exactly two modes which are of different frequency and in different directions (like sin(2x) and cos(y) for example), then solutions must choose one of the two modes and land only on one of them. This "choice" happens through a non-linear process and it is unclear whether there are even statistics of which one is most likely chosen (though, one expects that the stationary solution with the lowest frequency is generically chosen in the long-time limit). To understand this non-linear process more thoroughly, we propose geometric approaches in our works [2] and [3] below, where we study finite-dimensional model problems for the 2-d Navier-Stokes and Euler equations respecting their Hamiltonian and Lie-Poisson structures. We reveal mechanisms that lead to the interactions of the nonlinearity, the stochastic noise and the partial dissipation. These interactions lead to novel long-time limit of the solutions.

    Papers.

    [3] Hu, W., On the long time behavior of a perturbed conservative system with degeneracy. Journal of Theoretical Probability, Volume 33, pp.1266-1295, 2020. (Published online on 11, May 2019.) [arXiv] [journal paper]

    [2] Hu, W., Sverak, V., Dynamics of geodesic flows with random forcing on Lie groups with left-invariant metrics. Journal of Nonlinear Science, 28(6):2249-2274, December 2018. [arXiv] [journal paper]

    [1] Elgindi, T., Hu, W., Sverak, V., On 2d incompressible Euler equations with partial damping. Communications in Mathematical Physics, 355, Issue 1, October 2017, pp. 145-159. [arXiv] [journal paper]

    Slides.

    [5] Stochastically perturbed geodesic flows on Lie groups.

    [4] On 2d Euler equations with partial damping and some related model problems.

    [3] On the long time behavior of a perturbed conservative system with degeneracy.

    [2] 2-d incompressible Euler equations with partial damping.

    [1] Dynamics of geodesic flows with random forcing on Lie groups with left invariant metrics.

    Expository Notes.

    [1] Lectures on Stochastic Fluid Mechanics. (A series of lectures that I gave at the School of Mathematical Sciences, Peking University in July 9-17, 2015.) Poster. Abstract. Lecture Notes.

  8. Small mass limit of the Langevin equation (Smoluchowski-Kramers approximation).

    The Langevin equation is one of the most classical models in stochastic calculus for the random motion of a particle suspended in a fluid. As a second-order stochastic differential equation, it describes the dynamics of a particle subject to a deterministic drift, a friction proportional to its velocity, as well as random fluctuations. The small-mass limit of this equation, sometimes also called the Smoluchowski-Kramers approximation, has been the main justification of using a first order stochastic differential equation to replace the original second-order equation. I have been considering variable and vanishing friction case of the Langevin equation, as well as a multiscale Langevin equation.

    Papers.

    [3] Hu, W., Spiliopoulos, K., Hypoelliptic multiscale Langevin diffusions: Large deviations, invariant measures and small mass asymptotics. Electronic Journal of Probability, 22 (2017), paper no. 55, pp. 1-38. [arXiv] [journal paper]

    [2] Freidlin, M., Hu, W., Wentzell, A., Small mass asymptotic for the motion with vanishing friction. Stochastic Processes and their Applications, 123 (2013), pp. 45-75. [arXiv] [journal paper]

    [1] Freidlin, M., Hu, W., Smoluchowski-Kramers approximation in the case of variable friction. Journal of Mathematical Sciences, 79, 1, November 2011, translated from Problems in Mathematical Analysis, 61, October 2011 (in Russian). [arXiv] [journal paper]

    Slides.

    [2] Hypoelliptic multiscale Langevin diffusions.

    [1] Small mass asymptotic for the motion with variable and vanishing friction.

  9. Random Perturbations, Small Parameters, and Multiscale Problems in Dynamical Systems, PDEs, and Stochastic PDEs.

    Dynamical systems with small random inputs are ubiquitous phenomena that appear in many scientific and engineering discipline. In the understanding of the time-evolution of a complex system, one chooses a relatively few number of major factors that govern the evolution of the system while neglecting other factors that are relatively insignificant. Due to the undetectable nature of these other factors being neglected, in a mathematical model they usually present themselves as random inputs. The random inputs can be included in some parameters that characterize the system, such as diffusion coefficients, rates of chemical reactions, time scales, etc. . Neglecting these random inputs is only effective in the case of finite time evolution. In fact, on long time scales, the factors which were considered as negligible, can become important and even critical for determining the system's behavior. By making use of large deviations theory and averaging principle, I have analyzed in [1]-[4] various model problems such as small random perturbations of nearly-elastic mechanical system (a.k.a. nearly-elastic billiard system), a generalization of the Landau-Lifschitz dynamics characterizing the magnetization dynamics in ferromagnetics, as well as dynamical systems with reflecting boundary conditions.

    The close relation between the theory of second-order differential equations and Markov processes with continuous trajectories benefits each other. By making use of the averaging principle of diffusion processes, I analyzed in [5] the behavior of the solution to a second-order equation with an elliptic operator having a degenerate characteristic form, perturbed by another elliptic operator multiplied by a small parameter.

    Stochastic partial differential equations of reaction-diffusion type have been introduced to model the spatial-temporal evolution of concentrations of various components in a chemical reaction. The stochastic noises are responsible for the random changes in space-time of the rates of reaction. As a rule, the rates of chemical reactions in the system and the diffusion coefficients have different orders. Some of them are much smaller than others and this leads to the consideration of stochastic reaction-diffusion equations with a separation of slow and fast scales, i.e. multiscale stochastic reaction-diffusion equations. In [6] we considered for the first time the problem of large deviations for multiscale stochastic reaction-diffusion equations in multiple dimensions with multiplicative noise.

    In the work [7] done in the year 2018, I considered the long-time behavior of random perturbations of a degenerate system as an extension of the classical Freidlin-Wentzell theory.

    Papers.

    [7] Hu, W., On the long time behavior of a perturbed conservative system with degeneracy. Journal of Theoretical Probability, Volume 33, pp.1266-1295, 2020. (Published online on 11, May 2019.) [arXiv] [journal paper]

    [6] Hu, W., Salins, M., Spiliopoulos, K., Large deviations and averaging for systems of slow-fast stochastic reaction-diffusion equations. Stochastics and Partial Differential Equations: Analysis and Computations, December 2019, Volume 7, Issue 4, pp. 808-874. [arXiv] [journal paper]

    [5] Freidlin, M., Hu, W., On second order elliptic equations with a small parameter. Communications in Partial Differential Equations, 38, 10, 2013, pp. 1712-1736. [arXiv] [journal paper]

    [4] Hu, W., Tcheuko, L., Random perturbations of dynamical systems with reflecting boundary and corresponding PDE with a small parameter. Asymptotic Analysis, 87, 1-2, 2014, pp. 43-56. [arXiv] [journal paper]

    [3] Hu, W., On metastability in nearly-elastic systems. Asymptotic Analysis, 79, 1-2, 2012, pp. 65-86. [arXiv] [journal paper]

    [2] Freidlin, M., Hu, W., On perturbations of generalized Landau-Lifshitz dynamics. Journal of Statistical Physics, 144, 2011, pp. 978-1008. [arXiv] [journal paper]

    [1] Freidlin, M., Hu, W., On stochasticity in nearly-elastic systems. Stochastics and Dynamics, 12, 3, 2012. [arXiv] [journal paper]

    Slides.

    [5] On the long time behavior of a perturbed conservative system with degeneracy.

    [4] Hypoelliptic multiscale Langevin diffusions and Slow-fast stochastic reaction-diffusion equations.

    [3] Large deviations and averaging for systems of slow-fast reaction-diffusion equations.

    [2] Second order elliptic equations with a small parameter.

    [1] Stochastic behavior in nearly-elastic billiard systems.


Last updated: 02/2026