-
Skelet #17 and the fifth Busy Beaver number
Authors:
Chris Xu
Abstract:
We prove nonhalting of the Turing machine dubbed "Skelet #17", known to be one of the toughest 5-state, 2-symbol Turing machines to analyze. Combined with the efforts of The Busy Beaver Challenge, we are therefore able to show that BB(5), the fifth Busy Beaver number, equals 47,176,870.
We prove nonhalting of the Turing machine dubbed "Skelet #17", known to be one of the toughest 5-state, 2-symbol Turing machines to analyze. Combined with the efforts of The Busy Beaver Challenge, we are therefore able to show that BB(5), the fifth Busy Beaver number, equals 47,176,870.
△ Less
Submitted 3 July, 2024; v1 submitted 2 July, 2024;
originally announced July 2024.
-
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Authors:
Tim Tsz-Kit Lau,
Weijian Li,
Chenwei Xu,
Han Liu,
Mladen Kolar
Abstract:
Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite under…
▽ More
Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite understanding their convergence in i.i.d. and heterogeneous settings and knowing the importance of batch sizes for efficiency and generalization, optimal local batch sizes are difficult to determine. We introduce adaptive batch size strategies for local gradient methods that increase batch sizes adaptively to reduce minibatch gradient variance. We provide convergence guarantees under homogeneous data conditions and support our claims with image classification experiments, demonstrating the effectiveness of our strategies in training and generalization.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Analytic smoothing effect of the Cauchy problem for a class of ultra-parabolic equations
Authors:
Xiao-Dong Cao,
Chao-Jiang Xu
Abstract:
In this paper, we study a class of strongly degenerate ultraparabolic equations with analytic coefficients. We demonstrate that the Cauchy problem exhibits an analytic smoothing effect. This means that, with an initial datum belonging to the Sobolev space $H^s$ (of real index s), the associated Cauchy problem admits a unique solution that is analytic in all spatial variables for any strictly posit…
▽ More
In this paper, we study a class of strongly degenerate ultraparabolic equations with analytic coefficients. We demonstrate that the Cauchy problem exhibits an analytic smoothing effect. This means that, with an initial datum belonging to the Sobolev space $H^s$ (of real index s), the associated Cauchy problem admits a unique solution that is analytic in all spatial variables for any strictly positive time. This smoothing effect property is similar to that of the Cauchy problem for uniformly parabolic equations with analytic coefficients.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Dynamics of the non-radial energy-critical inhomogeneous NLS
Authors:
Carlos M. Guzmán,
Chenbgin Xu
Abstract:
We consider the focusing inhomogeneous nonlinear Schrödinger equation \[ i\partial_t u + Δu + |x|^{-b}|u|^αu = 0\qtq{on}\R\times\R^N, \] with $α=\tfrac{4-2b}{N-2}$, $N=\{3,4,5\}$ and $0<b\leq \min\Big\{\tfrac{6-N}{2},\tfrac{4}{N}$\Big\}. This paper establishes global well-posedness and scattering for the non-radial energy-critical case in $\dot{H}^1(\R^N)$. It extends the previous research by Murp…
▽ More
We consider the focusing inhomogeneous nonlinear Schrödinger equation \[ i\partial_t u + Δu + |x|^{-b}|u|^αu = 0\qtq{on}\R\times\R^N, \] with $α=\tfrac{4-2b}{N-2}$, $N=\{3,4,5\}$ and $0<b\leq \min\Big\{\tfrac{6-N}{2},\tfrac{4}{N}$\Big\}. This paper establishes global well-posedness and scattering for the non-radial energy-critical case in $\dot{H}^1(\R^N)$. It extends the previous research by Murphy and the first author \cite{GM}, which focused on the case $(N,α,b)=(3,2,1)$. The novelty here, beyond considering higher dimensions, lies in our assumption of the condition $\sup_{t\in I}\|\nabla u(t)\|_{L^2}<\|\nabla Q\|_{L^2}$, which is weaker than the condition stated in \cite{Guzman}. Consequently, if a solution has energy and kinetic energy less than the ground state $Q$ at some point, then the solution is global and scatters. Moreover, we show scattering for the defocusing case. On the other hand, in this work, we also investigate the blow-up issue with nonradial data for $N\geq 3$ in $H^1(\mathbb{R}^N)$. This implies that our result holds without classical assumptions such as spherically symmetric data or $|x|u_0 \in L^2(\mathbb{R}^N)$.
\
\noindent Mathematics Subject Classification. 35A01, 35QA55, 35P25.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes
Authors:
Zhenwei Lin,
Chenyu Xue,
Qi Deng,
Yinyu Ye
Abstract:
Robust Markov Decision Processes (RMDPs) have recently been recognized as a valuable and promising approach to discovering a policy with creditable performance, particularly in the presence of a dynamic environment and estimation errors in the transition matrix due to limited data. Despite extensive exploration of dynamic programming algorithms for solving RMDPs, there has been a notable upswing i…
▽ More
Robust Markov Decision Processes (RMDPs) have recently been recognized as a valuable and promising approach to discovering a policy with creditable performance, particularly in the presence of a dynamic environment and estimation errors in the transition matrix due to limited data. Despite extensive exploration of dynamic programming algorithms for solving RMDPs, there has been a notable upswing in interest in developing efficient algorithms using the policy gradient method. In this paper, we propose the first single-loop robust policy gradient (SRPG) method with the global optimality guarantee for solving RMDPs through its minimax formulation. Moreover, we complement the convergence analysis of the nonconvex-nonconcave min-max optimization problem with the objective function's gradient dominance property, which is not explored in the prior literature. Numerical experiments validate the efficacy of SRPG, demonstrating its faster and more robust convergence behavior compared to its nested-loop counterpart.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
A note on Kollár valuations
Authors:
Yuchen Liu,
Chenyang Xu
Abstract:
We prove the set of Kollár valuations in the dual complex of a klt singularity with a fixed complement is path connected. We also classify the case when the dual complex is one dimensional.
We prove the set of Kollár valuations in the dual complex of a klt singularity with a fixed complement is path connected. We also classify the case when the dual complex is one dimensional.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
Kernel-based optimally weighted conformal prediction intervals
Authors:
Jonghyeok Lee,
Chen Xu,
Yao Xie
Abstract:
Conformal prediction has been a popular distribution-free framework for uncertainty quantification. In this paper, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learn…
▽ More
Conformal prediction has been a popular distribution-free framework for uncertainty quantification. In this paper, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learns optimal data-adaptive weights. Theoretically, we tackle the challenge of establishing a conditional coverage guarantee for non-exchangeable data under strong mixing conditions on the non-conformity scores. We demonstrate the superior performance of KOWCPI on real time-series against state-of-the-art methods, where KOWCPI achieves narrower confidence intervals without losing coverage.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Integral action feedback design for conservative abstract systems in the presence of input nonlinearities
Authors:
Ling Ma,
Vincent Andrieu,
Daniele Astolfi,
Mathieu Bajodek,
Cheng-Zhong Xu,
Xuyang Lou
Abstract:
In this article, we present a stabilization feedback law with integral action for conservative abstract linear systems subjected to actuator nonlinearity. Based on the designed control law, we first prove the well-posedness and global asymptotic stability of the origin of the closed-loop system by constructing a weak Lyapunov functional. Secondly, as an illustration, we apply the results to a wav…
▽ More
In this article, we present a stabilization feedback law with integral action for conservative abstract linear systems subjected to actuator nonlinearity. Based on the designed control law, we first prove the well-posedness and global asymptotic stability of the origin of the closed-loop system by constructing a weak Lyapunov functional. Secondly, as an illustration, we apply the results to a wave equation coupled with an ordinary differential equation (ODE) at the boundary. Finally, we give the simulation results to illustrate the effectiveness of our method.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Boundedness of log Fano cone singularities and discreteness of local volumes
Authors:
Chenyang Xu,
Ziquan Zhuang
Abstract:
We prove that in any fixed dimension, K-semistable log Fano cone singularities whose volumes are bounded from below by a fixed positive number form a bounded set. As a consequence, we show that the set of local volumes of klt singularities of a fixed dimension has zero as the only accumulation point.
We prove that in any fixed dimension, K-semistable log Fano cone singularities whose volumes are bounded from below by a fixed positive number form a bounded set. As a consequence, we show that the set of local volumes of klt singularities of a fixed dimension has zero as the only accumulation point.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Gamma positivity of variations of $(α,t)$-Eulerian polynomials
Authors:
Chao Xu,
Jiang Zeng
Abstract:
In 1977 Carlitz and Scoville introduced the cycle $(α,t)$-Eulerian polynomials $A^{\mathrm{cyc}}_n(x,y, t\,|\,α)$ by enumerating permutations with respect to the number of excedances, drops, fixed points and cycles. In this paper, we introduce a nine-variable generalization of the Eulerian polynomials $A_n(u_1,u_2,u_3,u_4, f, g, t\,|\,α, β)$ in terms of descent based statistics of permutations and…
▽ More
In 1977 Carlitz and Scoville introduced the cycle $(α,t)$-Eulerian polynomials $A^{\mathrm{cyc}}_n(x,y, t\,|\,α)$ by enumerating permutations with respect to the number of excedances, drops, fixed points and cycles. In this paper, we introduce a nine-variable generalization of the Eulerian polynomials $A_n(u_1,u_2,u_3,u_4, f, g, t\,|\,α, β)$ in terms of descent based statistics of permutations and prove a connection formula between these two kinds of generalized Eulerian polynomials. By exploring the connection formula, we derive plainly the exponential generating function of the latter polynomials and various $γ$-positive formulas for variants of Eulerian polynomials. In particular, our results unify and strengthen the recent results by Ji and Ji-Lin. In related work to the transition matrix between the Specht and web bases, Hwang, Jang and Oh recently introduced the web permutations, which can be characterised by cycle André permutations. We show that enumerating the latter permutations with respect to the number of drops, fixed points and cycles gives rise to the normalised $γ$-vectors of the $(α,t)$-Eulerian polynomials. Our result generalizes and unifies several known results in the literature.
△ Less
Submitted 25 June, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Mneimneh-type Binomial Sums of Multiple Harmonic-type Sums
Authors:
Ende Pan,
Ce Xu
Abstract:
In this paper, we establish some expressions of Mneimneh-type binomial sums involving multiple harmonic-type sums in terms of finite sums of Stirling numbers, Bell numbers and some related variables. In particular, we present some new formulas of Mneimneh-type binomial sums involving generalized (alternating) harmonic numbers. Further, we establish a new identity relating the multiple zeta star va…
▽ More
In this paper, we establish some expressions of Mneimneh-type binomial sums involving multiple harmonic-type sums in terms of finite sums of Stirling numbers, Bell numbers and some related variables. In particular, we present some new formulas of Mneimneh-type binomial sums involving generalized (alternating) harmonic numbers. Further, we establish a new identity relating the multiple zeta star values $ζ^\star(m+2,\{1\}_{r-1})$ and specific multiple polylogarithms by applying the Toeplitz principle. Furthermore, we present some interesting consequences and illustrative examples.
△ Less
Submitted 28 March, 2024; v1 submitted 9 March, 2024;
originally announced March 2024.
-
General Mneimneh-type Binomial Sum involving Harmonic Numbers
Authors:
Ende Pan,
Ce Xu
Abstract:
Recently, Mneimneh proved the remarkable identity \begin{align*} \sum_{k=0}^n H_k\binom{n}{k} p^k(1-p)^{n-k}=\sum_{i=1}^n \frac{1-(1-p)^i}{i}\quad (p\in [0,1]) \end{align*} as the main result of a 2023 \emph{Discrete Mathematics} paper, where $H_k:=\sum\nolimits_{i=1}^k 1/i$ is the classical $k$-th harmonic number. Thereafter, Campbell provided several other proofs of Mneimneh's formula as above i…
▽ More
Recently, Mneimneh proved the remarkable identity \begin{align*} \sum_{k=0}^n H_k\binom{n}{k} p^k(1-p)^{n-k}=\sum_{i=1}^n \frac{1-(1-p)^i}{i}\quad (p\in [0,1]) \end{align*} as the main result of a 2023 \emph{Discrete Mathematics} paper, where $H_k:=\sum\nolimits_{i=1}^k 1/i$ is the classical $k$-th harmonic number. Thereafter, Campbell provided several other proofs of Mneimneh's formula as above in a note published in \emph{Discrete Mathematics} in 2023. Moreover, Campbell also considered how Mneimneh's identity may be proved and generalized using the \emph{Mathematica package Sigma}. In particular, he found the generalized Mneimneh's identity \begin{align*} \sum_{k=0}^n x^k y^{n-k} \binom{n}{k}H_k =(x+y)^n \left(H_n-\sum_{i=1}^n \frac{y^i (x+y)^{-i}}{i}\right). \end{align*} In this paper, we will prove a more generalization of Mneimneh's identity involving Bell numbers and some Mneimneh-type identities involving (alternating) harmonic numbers by using a few results of our previous papers.
△ Less
Submitted 9 March, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Decoupling Learning and Decision-Making: Breaking the $\mathcal{O}(\sqrt{T})$ Barrier in Online Resource Allocation with First-Order Methods
Authors:
Wenzhi Gao,
Chunlin Sun,
Chenyu Xue,
Dongdong Ge,
Yinyu Ye
Abstract:
Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of first-order methods, they typically achieve a regret no better than $\mathcal{O}(\sqrt{T})$, which is suboptimal compared to the $\mathcal{O}(\log T)$ bound guaranteed b…
▽ More
Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of first-order methods, they typically achieve a regret no better than $\mathcal{O}(\sqrt{T})$, which is suboptimal compared to the $\mathcal{O}(\log T)$ bound guaranteed by the state-of-the-art linear programming (LP)-based online algorithms. This paper establishes several important facts about online linear programming, which unveils the challenge for first-order-method-based online algorithms to achieve beyond $\mathcal{O}(\sqrt{T})$ regret. To address the challenge, we introduce a new algorithmic framework that decouples learning from decision-making. For the first time, we show that first-order methods can attain regret $\mathcal{O}(T^{1/3})$ with this new framework.
△ Less
Submitted 28 May, 2024; v1 submitted 11 February, 2024;
originally announced February 2024.
-
A dynamic model to study the potential TB infections and assessment of control strategies in China
Authors:
Chuanqing Xu,
Kedeng Cheng,
Songbai Guo,
Dehui Yuan,
Xiaoyu Zhao
Abstract:
China is one of the countries with a high burden of tuberculosis, and although the number of new cases of tuberculosis has been decreasing year by year, the number of new infections per year has remained high and the diagnosis rate of tuberculosis-infected patients has remained low. Based on the analysis of TB infection data, we develop a model of TB transmission dynamics that include potentially…
▽ More
China is one of the countries with a high burden of tuberculosis, and although the number of new cases of tuberculosis has been decreasing year by year, the number of new infections per year has remained high and the diagnosis rate of tuberculosis-infected patients has remained low. Based on the analysis of TB infection data, we develop a model of TB transmission dynamics that include potentially infected individuals and BCG vaccination, fit the model parameters to the data on new TB cases, calculate the basic reproduction number \mathcal{R}_v= 0.4442. A parametric sensitivity analysis of \mathcal{R}_v is performed, and we obtained the correlation coefficients of BCG vaccination rate and effectiveness rate with \mathcal{R}_v as -0.810, -0.825. According to the model, we estimate that there are 614,186 (95% CI [562,631,665,741]) potentially infected TB cases in China, accounting for about 39.5% of the total number of TB cases. We assess the feasibility of achieving the goals of the WHO strategy to end tuberculosis in China and find that reducing the number of new cases by 90 per cent by 2035 is very difficult with the current tuberculosis control measures. However, with an effective combination of control measures such as increased detection of potentially infected persons, improved drug treatment, and reduction of overall exposure to tuberculosis patients, it is feasible to reach the WHO strategic goal of ending tuberculosis by 2035.
△ Less
Submitted 25 January, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
General Berndt-Type Integrals and Series Associated with Jacobi Elliptic Functions
Authors:
Ce Xu,
Jianqiang Zhao
Abstract:
In this paper, we prove two structural theorems on the general Berndt-type integrals with the denominator having arbitrary positive degrees by contour integrations involving hyperbolic and trigonometric functions, and hyperbolic sums associated with Jacobi elliptic functions. We first establish explicit relations between these integrals and four classes of hyperbolic sums. Then, using our previous…
▽ More
In this paper, we prove two structural theorems on the general Berndt-type integrals with the denominator having arbitrary positive degrees by contour integrations involving hyperbolic and trigonometric functions, and hyperbolic sums associated with Jacobi elliptic functions. We first establish explicit relations between these integrals and four classes of hyperbolic sums. Then, using our previous results on hyperbolic series and applying the matrix method from linear algebra, we compute explicitly several general hyperbolic sums and their higher derivatives. These enable us to express two families of general Berndt-type integrals as polynomials in $Γ^4(1/4)$ and $π^{-1}$ with rational coefficients, where $Γ$ is the Euler gamma function. At the end of the paper, we provide some conjectures of general Berndt-type integrals.
△ Less
Submitted 18 January, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
Stationary measures of continuous time Markov chains with applications to stochastic reaction networks
Authors:
Mads Chr Hansen,
Carsten Wiuf,
Chuang Xu
Abstract:
We study continuous-time Markov chains on the non-negative integers under mild regularity conditions (in particular, the set of jump vectors is finite and both forward and backward jumps are possible). Based on the so-called flux balance equation, we derive an iterative formula for calculating stationary measures. Specifically, a stationary measure $π(x)$ evaluated at $x\in\mathbb{N}_0$ is represe…
▽ More
We study continuous-time Markov chains on the non-negative integers under mild regularity conditions (in particular, the set of jump vectors is finite and both forward and backward jumps are possible). Based on the so-called flux balance equation, we derive an iterative formula for calculating stationary measures. Specifically, a stationary measure $π(x)$ evaluated at $x\in\mathbb{N}_0$ is represented as a linear combination of a few generating terms, similarly to the characterization of a stationary measure of a birth-death process, where there is only one generating term, $π(0)$. The coefficients of the linear combination are recursively determined in terms of the transition rates of the Markov chain. For the class of Markov chains we consider, there is always at least one stationary measure (up to a scaling constant). We give various results pertaining to uniqueness and non-uniqueness of stationary measures, and show that the dimension of the linear space of signed invariant measures is equal to the number of generating terms. A minimization problem is constructed in order to compute stationary measures numerically. Moreover, a heuristic linear approximation scheme is suggested for the same purpose by first approximating the generating terms. The correctness of the linear approximation scheme is justified in some special cases. Furthermore, a decomposition of the state space into different types of states (open and closed irreducible classes, and trapping, escaping and neutral states) is presented. Applications to stochastic reaction networks are well illustrated.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Berndt-Type Integrals of Order Three and Series Associated with Jacobi Elliptic Functions
Authors:
Hongyuan Rui,
Ce Xu,
Jianqiang Zhao
Abstract:
In this paper, we first establish explicit evaluations of six classes of hyperbolic sums by special values of the Gamma function by using the tools of the Fourier series expansions and the Maclaurin series expansions of a few Jacobi elliptic functions developed in our previous paper. Then, using the method of contour integrations involving hyperbolic and trigonometric functions, we establish expli…
▽ More
In this paper, we first establish explicit evaluations of six classes of hyperbolic sums by special values of the Gamma function by using the tools of the Fourier series expansions and the Maclaurin series expansions of a few Jacobi elliptic functions developed in our previous paper. Then, using the method of contour integrations involving hyperbolic and trigonometric functions, we establish explicit evaluations of two families of Berndt-type integrals of order three by special values of the Gamma function. Furthermore, we present some interesting consequences and illustrative examples.
△ Less
Submitted 28 November, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
On the Congruency-Constrained Matroid Base
Authors:
Siyue Liu,
Chao Xu
Abstract:
Consider a matroid where all elements are labeled with an element in $\mathbb{Z}$. We are interested in finding a base where the sum of the labels is congruent to $g \pmod m$. We show that this problem can be solved in $\tilde{O}(2^{4m} n r^{5/6})$ time for a matroid with $n$ elements and rank $r$, when $m$ is either the product of two primes or a prime power. The algorithm can be generalized to a…
▽ More
Consider a matroid where all elements are labeled with an element in $\mathbb{Z}$. We are interested in finding a base where the sum of the labels is congruent to $g \pmod m$. We show that this problem can be solved in $\tilde{O}(2^{4m} n r^{5/6})$ time for a matroid with $n$ elements and rank $r$, when $m$ is either the product of two primes or a prime power. The algorithm can be generalized to all moduli and, in fact, to all abelian groups if a classic additive combinatorics conjecture by Schrijver and Seymour holds true. We also discuss the optimization version of the problem.
△ Less
Submitted 20 March, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
The analytic Gelfand-Shilov smoothing effect of the Landau equation with hard potential
Authors:
Chao-Jiang Xu,
Yan Xu
Abstract:
Inthispaper,westudytheCauchyproblemoftheinhomogeneous Landau equation with hard potentials under the perturbation framework to global equilibrium. We prove that the solution to the Cauchy problem enjoys the analytic Gelfand-Shilov regularizing effect with a Sobolev initial datum for positive time.
Inthispaper,westudytheCauchyproblemoftheinhomogeneous Landau equation with hard potentials under the perturbation framework to global equilibrium. We prove that the solution to the Cauchy problem enjoys the analytic Gelfand-Shilov regularizing effect with a Sobolev initial datum for positive time.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
The Non-zonal Rossby-Haurwitz Solutions of the 2D Euler Equations on a Rotating Ellipsoid
Authors:
Chenghao Xu
Abstract:
In this article, we investigate the incompressible 2D Euler equations on a rotating biaxial ellipsoid, which model the dynamics of the atmosphere of a Jovian planet. We study the non-zonal Rossby-Haurwitz solutions of the Euler equations on an ellipsoid, while previous works only considered the case of a sphere. Our main results include: the existence and uniqueness of the stationary Rossby-Haurwi…
▽ More
In this article, we investigate the incompressible 2D Euler equations on a rotating biaxial ellipsoid, which model the dynamics of the atmosphere of a Jovian planet. We study the non-zonal Rossby-Haurwitz solutions of the Euler equations on an ellipsoid, while previous works only considered the case of a sphere. Our main results include: the existence and uniqueness of the stationary Rossby-Haurwitz solutions; the construction of the traveling-wave solutions; and the demonstration of the Lyapunov instability of both the stationary and the traveling-wave solutions.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Ramified and Unramified Motivic Multiple $t$-, $T$- and $S$-Values
Authors:
Ce Xu,
Jianqiang Zhao
Abstract:
In this paper we shall consider a few variants of the motivic multiple zeta values of level two by restricting the summation indices in the definition of multiple zeta values to some fixed parity patterns. These include Hoffman's multiple $t$-values, Kaneko and Tsumura's multiple $T$-values, and the multiple $S$-values studied previously by the authors. By applying Brown and Glanois's descent theo…
▽ More
In this paper we shall consider a few variants of the motivic multiple zeta values of level two by restricting the summation indices in the definition of multiple zeta values to some fixed parity patterns. These include Hoffman's multiple $t$-values, Kaneko and Tsumura's multiple $T$-values, and the multiple $S$-values studied previously by the authors. By applying Brown and Glanois's descent theory on the motivic versions of these values we shall derive some criterion for when these values are ramified and unramified. Assuming Grothendieck's period conjecture, our results partially confirms a conjecture of Kaneko and Tsumura about when multiple $T$-values can be expressed as a rational linear combination of multiple zeta values (i.e., unramified) if their depth is less than four. Similar results are obtained for motivic multiple $S$-values. Further, we are able to generalize a result of Charlton to more families of unramified multiple $t$-values with unit components (i.e. components equal to 1). We propose some more unsolved problems at the end of the paper.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
On Weighted Generalized Gauss Quadratures for Müntz Systems
Authors:
Huaijin Wang,
Chuanju Xu
Abstract:
A novel recurrence formula for moments with respect to Müntz-Legendre polynomials is proposed and applied to construct a numerical method for solving generalized Gauss quadratures with power function weight for Müntz systems. These quadrature rules exhibit several properties similar to the classical Gaussian quadratures for polynomial systems, including positive weights, rapid convergence, and oth…
▽ More
A novel recurrence formula for moments with respect to Müntz-Legendre polynomials is proposed and applied to construct a numerical method for solving generalized Gauss quadratures with power function weight for Müntz systems. These quadrature rules exhibit several properties similar to the classical Gaussian quadratures for polynomial systems, including positive weights, rapid convergence, and others. They are applicable to a wide range of functions, including smooth functions and functions with endpoint singularities, commonly found in integral equations with singular kernels, complex analysis, potential theory, and other areas.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
An efficient numerical method for the anisotropic phase field dendritic crystal growth model
Authors:
Yayu Guo,
Mejdi Azaiez,
Chuanju Xu
Abstract:
In this paper, we propose and analyze an efficient numerical method for the anisotropic phase field dendritic crystal growth model, which is challenging because we are facing the nonlinear coupling and anisotropic coefficient in the model. The proposed method is a two-step scheme. In the first step, an intermediate solution is computed by using BDF schemes of order up to three for both the phase-f…
▽ More
In this paper, we propose and analyze an efficient numerical method for the anisotropic phase field dendritic crystal growth model, which is challenging because we are facing the nonlinear coupling and anisotropic coefficient in the model. The proposed method is a two-step scheme. In the first step, an intermediate solution is computed by using BDF schemes of order up to three for both the phase-field and heat equations. In the second step the intermediate solution is stabilized by multiplying an auxiliary variable. The key of the second step is to stabilize the overall scheme while maintaining the convergence order of the stabilized solution. In order to overcome the difficulty caused by the gradient-dependent anisotropic coefficient and the nonlinear terms, some stabilization terms are added to the BDF schemes in the first step. The second step makes use of a generalized auxiliary variable approach with relaxation. The Fourier spectral method is applied for the spatial discretization. Our analysis shows that the proposed scheme is unconditionally stable and has accuracy in time up to third order. We also provide a sophisticated implementation showing that the computational complexity of our schemes is equivalent to solving two linear equations and some algebraic equations. To the best of our knowledge, this is the cheapest unconditionally stable schemes reported in the literature. Some numerical examples are given to verify the efficiency of the proposed method.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Solving Inverse Problems with Reinforcement Learning
Authors:
Chen Xu,
Zhipeng Lu,
Ye Zhang
Abstract:
In this paper, we formally introduce, with rigorous derivations, the use of reinforcement learning to the field of inverse problems by designing an iterative algorithm, called REINFORCE-IP, for solving a general type of non-linear inverse problem. By choosing specific probability models for the action-selection rule, we connect our approach to the conventional regularization methods of Tikhonov re…
▽ More
In this paper, we formally introduce, with rigorous derivations, the use of reinforcement learning to the field of inverse problems by designing an iterative algorithm, called REINFORCE-IP, for solving a general type of non-linear inverse problem. By choosing specific probability models for the action-selection rule, we connect our approach to the conventional regularization methods of Tikhonov regularization and iterative regularization. For the numerical implementation of our approach, we parameterize the solution-searching rule with the help of neural networks and iteratively improve the parameter using a reinforcement-learning algorithm~-- REINFORCE. Under standard assumptions we prove the almost sure convergence of the parameter to a locally optimal value. Our work provides two typical examples (non-linear integral equations and parameter-identification problems in partial differential equations) of how reinforcement learning can be applied in solving non-linear inverse problems. Our numerical experiments show that REINFORCE-IP is an efficient algorithm that can escape from local minimums and identify multi-solutions for inverse problems with non-uniqueness.
△ Less
Submitted 13 November, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
On two conjectural series involving Riemann zeta function
Authors:
Chuanan Wei,
Ce Xu
Abstract:
Riemann zeta function is important in a lot of branches of number theory. With the help of the operator method and several transformation formulas for hypergeometric series, we prove four series involving Riemann zeta function. Two of them are series expansions for $ζ(7)$ and $ζ(3)^2$ recently conjectured by Z.-W. Sun.
Riemann zeta function is important in a lot of branches of number theory. With the help of the operator method and several transformation formulas for hypergeometric series, we prove four series involving Riemann zeta function. Two of them are series expansions for $ζ(7)$ and $ζ(3)^2$ recently conjectured by Z.-W. Sun.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
Refinement of Interval Approximations for Fully Commutative Quivers
Authors:
Yasuaki Hiraoka,
Ken Nakashima,
Ippei Obayashi,
Chenguang Xu
Abstract:
A fundamental challenge in multiparameter persistent homology is the absence of a complete and discrete invariant. To address this issue, we propose an enhanced framework that realizes a holistic understanding of a fully commutative quiver's representation via synthesizing interpretations obtained from intervals. Additionally, it provides a mechanism to tune the balance between approximation resol…
▽ More
A fundamental challenge in multiparameter persistent homology is the absence of a complete and discrete invariant. To address this issue, we propose an enhanced framework that realizes a holistic understanding of a fully commutative quiver's representation via synthesizing interpretations obtained from intervals. Additionally, it provides a mechanism to tune the balance between approximation resolution and computational complexity. This framework is evaluated on commutative ladders of both finite-type and infinite-type. For the former, we discover an efficient method for the indecomposable decomposition leveraging solely one-parameter persistent homology. For the latter, we introduce a new invariant that reveals persistence in the second parameter by connecting two standard persistence diagrams using interval approximations. We subsequently present several models for constructing commutative ladder filtrations, offering fresh insights into random filtrations and demonstrating our toolkit's effectiveness in analyzing the topology of materials.
△ Less
Submitted 12 November, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Analytic smoothing effect of Cauchy problem for a class of Kolmogorov-Fokker-Planck equations
Authors:
Xiao-Dong Cao,
Chao-Jiang Xu,
Yan Xu
Abstract:
We study the Cauchy problem of the Kolmogorov-Fokker-Planck equations and show that the solution enjoys an analytic smoothing effect with L2 initial datum for positive time.
We study the Cauchy problem of the Kolmogorov-Fokker-Planck equations and show that the solution enjoys an analytic smoothing effect with L2 initial datum for positive time.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
On Some Unramified Families of Motivic Euler Sums
Authors:
Ce Xu,
Jianqiang Zhao
Abstract:
It is well known that sometimes Euler sums (i.e., alternating multiple zeta values) can be expressed as $\Q$-linear combinations of multiple zeta values (MZVs). In her thesis Glanois presented a criterion for motivic Euler sums to be unramified, namely, expressible as $\Q$-linear combinations of motivic MZVs. By applying this criterion we present a few families of such unramified motivic Euler sum…
▽ More
It is well known that sometimes Euler sums (i.e., alternating multiple zeta values) can be expressed as $\Q$-linear combinations of multiple zeta values (MZVs). In her thesis Glanois presented a criterion for motivic Euler sums to be unramified, namely, expressible as $\Q$-linear combinations of motivic MZVs. By applying this criterion we present a few families of such unramified motivic Euler sums in two groups. In one such group we can further prove the concrete identities relating the motivic Euler sums to the motivic MZVs, determined up to rational multiple of a motivic Riemann zeta value by a result of Brown, under the assumption that the analytic version of such identities hold.
△ Less
Submitted 24 January, 2024; v1 submitted 13 September, 2023;
originally announced September 2023.
-
A Homogenization Approach for Gradient-Dominated Stochastic Optimization
Authors:
Jiyuan Tan,
Chenyu Xue,
Chuwen Zhang,
Qi Deng,
Dongdong Ge,
Yinyu Ye
Abstract:
Gradient dominance property is a condition weaker than strong convexity, yet sufficiently ensures global convergence even in non-convex optimization. This property finds wide applications in machine learning, reinforcement learning (RL), and operations management. In this paper, we propose the stochastic homogeneous second-order descent method (SHSODM) for stochastic functions enjoying gradient do…
▽ More
Gradient dominance property is a condition weaker than strong convexity, yet sufficiently ensures global convergence even in non-convex optimization. This property finds wide applications in machine learning, reinforcement learning (RL), and operations management. In this paper, we propose the stochastic homogeneous second-order descent method (SHSODM) for stochastic functions enjoying gradient dominance property based on a recently proposed homogenization approach. Theoretically, we provide its sample complexity analysis, and further present an enhanced result by incorporating variance reduction techniques. Our findings show that SHSODM matches the best-known sample complexity achieved by other second-order methods for gradient-dominated stochastic optimization but without cubic regularization. Empirically, since the homogenization approach only relies on solving extremal eigenvector problem at each iteration instead of Newton-type system, our methods gain the advantage of cheaper computational cost and robustness in ill-conditioned problems. Numerical experiments on several RL tasks demonstrate the better performance of SHSODM compared to other off-the-shelf methods.
△ Less
Submitted 29 May, 2024; v1 submitted 21 August, 2023;
originally announced August 2023.
-
On some conjectural series containing harmonic numbers of 3-order
Authors:
Chuanan Wei,
Ce Xu
Abstract:
Harmonic numbers are important in a lot of branches of number theory. By means of the derivative operator, the integral operator, and several summation and transformation formulas for hypergeometric series, we prove four series containing harmonic numbers of 3-order. Three of them are conjectures which were recently proposed by Z.-W. Sun.
Harmonic numbers are important in a lot of branches of number theory. By means of the derivative operator, the integral operator, and several summation and transformation formulas for hypergeometric series, we prove four series containing harmonic numbers of 3-order. Three of them are conjectures which were recently proposed by Z.-W. Sun.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Operator Norm Bounds on the Correlation Matrix of the SK Model at High Temperature
Authors:
Christian Brennecke,
Changji Xu,
Horng-Tzer Yau
Abstract:
We prove that the two point correlation matrix $ \textbf{M}= (\langle σ_i ; σ_j\rangle)_{1\leq i,j\leq N} \in \mathbb{R}^{N\times N}$ of the Sherrington-Kirkpatrick model has the property that for every $ε>0$ there exists $K_ε>0$, that is independent of $N$, such that
\[ \mathbb{P}\big( \| \textbf{M} \|_{\text{op}} \leq K_ε\big) \geq 1- ε\] for $N$ large enough, for suitable interaction and exte…
▽ More
We prove that the two point correlation matrix $ \textbf{M}= (\langle σ_i ; σ_j\rangle)_{1\leq i,j\leq N} \in \mathbb{R}^{N\times N}$ of the Sherrington-Kirkpatrick model has the property that for every $ε>0$ there exists $K_ε>0$, that is independent of $N$, such that
\[ \mathbb{P}\big( \| \textbf{M} \|_{\text{op}} \leq K_ε\big) \geq 1- ε\] for $N$ large enough, for suitable interaction and external field parameters $(β,h)$ in the replica symmetric region. In other words, the operator norm of $\textbf{M}$ is of order one with high probability. Our results are in particular valid for all $ (β,h)\in (0,1)\times (0,\infty) $ and thus complement recently obtained results in \cite{EAG,BSXY} that imply the operator norm boundedness of $\textbf{M}$ for all $β<1$ in the special case of vanishing external field.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Cooperative coloring of some graph families
Authors:
Xuqing Bai,
Bi Li,
Chuandong Xu,
Xin Zhang
Abstract:
In a family ${G_1, G_2, \ldots, G_m}$ of graphs sharing the same vertex set $V$, a cooperative coloring involves selecting one independent set $I_i$ from $G_i$ for each $i\in \{1,2,\ldots,m\}$ such that $\bigcup_{i=1}^m I_i = V$. For a graph class $\mathcal{G}$, let $m_{\mathcal{G}}(d)$ denote the minimum $m$ required to ensure that any graph family ${G_1, G_2, \ldots, G_m}$ on the same vertex set…
▽ More
In a family ${G_1, G_2, \ldots, G_m}$ of graphs sharing the same vertex set $V$, a cooperative coloring involves selecting one independent set $I_i$ from $G_i$ for each $i\in \{1,2,\ldots,m\}$ such that $\bigcup_{i=1}^m I_i = V$. For a graph class $\mathcal{G}$, let $m_{\mathcal{G}}(d)$ denote the minimum $m$ required to ensure that any graph family ${G_1, G_2, \ldots, G_m}$ on the same vertex set, where $G_i\in\mathcal{G}$ and $Δ(G_i)\leq d$ for each $i\in \{1,2,\ldots,m\}$, admits a cooperative coloring. For the graph classes $\mathcal{T}$ (trees) and $\mathcal{W}$ (wheels), we find that $m_\mathcal{T}(3)=4$ and $m_\mathcal{W}(4)=5$. Also, we prove that $m_{\mathcal{B}^*}(d)=O(\log_2 d)$ and $m_{\mathcal{L}}(d)=O\left(\frac{\log d}{\log\log d}\right)$, where $\mathcal{B}^*$ represents the class of graphs whose components are balanced complete bipartite graphs, and $\mathcal{L}$ represents the class of graphs whose components are generalized theta graphs.
△ Less
Submitted 15 February, 2024; v1 submitted 14 July, 2023;
originally announced July 2023.
-
Design of the Reverse Logistics System for Medical Waste Recycling Part II: Route Optimization with Case Study under COVID-19 Pandemic
Authors:
Chaozhong Xue,
Yongqi Dong,
Jiaqi Liu,
Yijun Liao,
Lingbo Li
Abstract:
Medical waste recycling and treatment has gradually drawn concerns from the whole society, as the amount of medical waste generated is increasing dramatically, especially during the pandemic of COVID-19. To tackle the emerging challenges, this study designs a reverse logistics system architecture with three modules, i.e., medical waste classification & monitoring module, temporary storage & dispos…
▽ More
Medical waste recycling and treatment has gradually drawn concerns from the whole society, as the amount of medical waste generated is increasing dramatically, especially during the pandemic of COVID-19. To tackle the emerging challenges, this study designs a reverse logistics system architecture with three modules, i.e., medical waste classification & monitoring module, temporary storage & disposal site (disposal site for short) selection module, as well as route optimization module. This overall solution design won the Grand Prize of the "YUNFENG CUP" China National Contest on Green Supply and Reverse Logistics Design ranking 1st. This paper focuses on the design of the route optimization module. In this module, a route optimization problem is designed considering transportation costs and multiple risk costs (e.g., environment risk, population risk, property risk, and other accident-related risks). The Analytic Hierarchy Process is employed to determine the weights for each risk element, and a customized genetic algorithm is developed to solve the route optimization problem. A case study under the COVID-19 pandemic is further provided to verify the proposed model. Limited by length, detailed descriptions of the whole system and the other modules can be found at https://shorturl.at/cdY59.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
One Forward is Enough for Neural Network Training via Likelihood Ratio Method
Authors:
Jinyang Jiang,
Zeliang Zhang,
Chenliang Xu,
Zhaofei Yu,
Yijie Peng
Abstract:
While backpropagation (BP) is the mainstream approach for gradient computation in neural network training, its heavy reliance on the chain rule of differentiation constrains the designing flexibility of network architecture and training pipelines. We avoid the recursive computation in BP and develop a unified likelihood ratio (ULR) method for gradient estimation with just one forward propagation.…
▽ More
While backpropagation (BP) is the mainstream approach for gradient computation in neural network training, its heavy reliance on the chain rule of differentiation constrains the designing flexibility of network architecture and training pipelines. We avoid the recursive computation in BP and develop a unified likelihood ratio (ULR) method for gradient estimation with just one forward propagation. Not only can ULR be extended to train a wide variety of neural network architectures, but the computation flow in BP can also be rearranged by ULR for better device adaptation. Moreover, we propose several variance reduction techniques to further accelerate the training process. Our experiments offer numerical results across diverse aspects, including various neural network training scenarios, computation flow rearrangement, and fine-tuning of pre-trained models. All findings demonstrate that ULR effectively enhances the flexibility of neural network training by permitting localized module training without compromising the global objective and significantly boosts the network robustness.
△ Less
Submitted 13 October, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Analytic Gelfand-Shilov smoothing effect of fractional Kramers-Fokker-Planck equation
Authors:
Chao-Jiang Xu,
Yan Xu
Abstract:
We study the Cauchy problem of the fractional Kramers-Fokker- Planck equation and show that the solution to the Cauchy problem enjoys an analytic Gelfand-Shilov regularizing effect for positive time.
We study the Cauchy problem of the fractional Kramers-Fokker- Planck equation and show that the solution to the Cauchy problem enjoys an analytic Gelfand-Shilov regularizing effect for positive time.
△ Less
Submitted 13 May, 2023;
originally announced May 2023.
-
To AI or not to AI, to Buy Local or not to Buy Local: A Mathematical Theory of Real Price
Authors:
Huan Cai,
Catherine Xu,
Weiyu Xu
Abstract:
In the past several decades, the world's economy has become increasingly globalized. On the other hand, there are also ideas advocating the practice of ``buy local'', by which people buy locally produced goods and services rather than those produced farther away. In this paper, we establish a mathematical theory of real price that determines the optimal global versus local spending of an agent whi…
▽ More
In the past several decades, the world's economy has become increasingly globalized. On the other hand, there are also ideas advocating the practice of ``buy local'', by which people buy locally produced goods and services rather than those produced farther away. In this paper, we establish a mathematical theory of real price that determines the optimal global versus local spending of an agent which achieves the agent's optimal tradeoff between spending and obtained utility. Our theory of real price depends on the asymptotic analysis of a Markov chain transition probability matrix related to the network of producers and consumers. We show that the real price of a product or service can be determined from the involved Markov chain matrix, and can be dramatically different from the product's label price. In particular, we show that the label prices of products and services are often not ``real'' or directly ``useful'': given two products offering the same myopic utility, the one with lower label price may not necessarily offer better asymptotic utility. This theory shows that the globality or locality of the products and services does have different impacts on the spending-utility tradeoff of a customer. The established mathematical theory of real price can be used to determine whether to adopt or not to adopt certain artificial intelligence (AI) technologies from an economic perspective.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
On Sombor Index of Graphs
Authors:
Batmend Horoldagva,
Chunlei Xu
Abstract:
Recently, Gutman defined a new vertex-degree-based graph invariant, named the Sombor index $SO$ of a graph $G$, and is defined by $$SO(G)=\sum_{uv\in E(G)}\sqrt{d_G(u)^2+d_G(v)^2},$$ where $d_G(v)$ is the degree of the vertex $v$ of $G$. In this paper, we obtain the sharp lower and upper bounds on $SO(G)$ of a connected graph, and characterize graphs for which these bounds are attained.
Recently, Gutman defined a new vertex-degree-based graph invariant, named the Sombor index $SO$ of a graph $G$, and is defined by $$SO(G)=\sum_{uv\in E(G)}\sqrt{d_G(u)^2+d_G(v)^2},$$ where $d_G(v)$ is the degree of the vertex $v$ of $G$. In this paper, we obtain the sharp lower and upper bounds on $SO(G)$ of a connected graph, and characterize graphs for which these bounds are attained.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Sharp regularization effect for the non-cutoff Boltzmann equation with hard potentials
Authors:
Jun-Ling Chen,
Wei-Xi Li,
Chao-Jiang Xu
Abstract:
For the Maxwellian molecules or hard potentials case, we verify the smoothing effect for the spatially inhomogeneous Boltzmann equation without angular cutoff. Given initial data with low regularity, we prove its solutions at any positive time are analytic for strong angular singularity, and in Gevrey class with optimal index for mild angular singularity. To overcome the degeneracy in the spatial…
▽ More
For the Maxwellian molecules or hard potentials case, we verify the smoothing effect for the spatially inhomogeneous Boltzmann equation without angular cutoff. Given initial data with low regularity, we prove its solutions at any positive time are analytic for strong angular singularity, and in Gevrey class with optimal index for mild angular singularity. To overcome the degeneracy in the spatial variable, a family of well-chosen vector fields with time-dependent coefficients will play a crucial role, and the sharp regularization effect of weak solutions relies on a quantitative estimate on directional derivatives in these vector fields.
△ Less
Submitted 18 January, 2024; v1 submitted 4 May, 2023;
originally announced May 2023.
-
The energy-critical inhomogeneous generalized Hartree equation in 3D
Authors:
Carlos M. Guzmán,
Chengbin Xu
Abstract:
The purpose of this work is to study the $3D$ energy-critical inhomogeneous generalized Hartree equation $$ i\pa_tu+Δu+|x|^{-b}(I_α\ast|\cdot|^{-b}|u|^{p})|u|^{p-2}u=0,\;\ x\in\R^3, $$ where $p=3+α-2b$. We establish global well-posedness and scattering below the ground state threshold with non-radial initial data in $\dot{H}^1$. To this end, we exploit the decay of the nonlinearity, which together…
▽ More
The purpose of this work is to study the $3D$ energy-critical inhomogeneous generalized Hartree equation $$ i\pa_tu+Δu+|x|^{-b}(I_α\ast|\cdot|^{-b}|u|^{p})|u|^{p-2}u=0,\;\ x\in\R^3, $$ where $p=3+α-2b$. We establish global well-posedness and scattering below the ground state threshold with non-radial initial data in $\dot{H}^1$. To this end, we exploit the decay of the nonlinearity, which together with the Kenig-Merle roadmap, allows us to treat the non-radial case as the radial case. In this paper are introduced new techniques to overcome the challenges posed by the presence of the potential and the nonlocal nonlinear term of convolution type. In particular, we also show scattering for the classical generalized Hartree equation ($b=0$) assuming radial data. Additionally, in the defocusing case, we show scattering with general data. We believe that the ideas developed here are robust and can be applicable to other types of nonlinear Hartree equations. In the introduction, we discuss some open problems.
△ Less
Submitted 3 August, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Generalized generalized linear models: Convex estimation and online bounds
Authors:
Anatoli Juditsky,
Arkadi Nemirovski,
Yao Xie,
Chen Xu
Abstract:
We introduce a new computational framework for estimating parameters in generalized generalized linear models (GGLM), a class of models that extends the popular generalized linear models (GLM) to account for dependencies among observations in spatio-temporal data. The proposed approach uses a monotone operator-based variational inequality method to overcome non-convexity in parameter estimation an…
▽ More
We introduce a new computational framework for estimating parameters in generalized generalized linear models (GGLM), a class of models that extends the popular generalized linear models (GLM) to account for dependencies among observations in spatio-temporal data. The proposed approach uses a monotone operator-based variational inequality method to overcome non-convexity in parameter estimation and provide guarantees for parameter recovery. The results can be applied to GLM and GGLM, focusing on spatio-temporal models. We also present online instance-based bounds using martingale concentrations inequalities. Finally, we demonstrate the performance of the algorithm using numerical simulations and a real data example for wildfire incidents.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Accelerated Distributed Aggregative Optimization
Authors:
Jiaxu Liu,
Song Chen,
Shengze Cai,
Chao Xu
Abstract:
In this paper, we investigate a distributed aggregative optimization problem in a network, where each agent has its own local cost function which depends not only on the local state variable but also on an aggregated function of state variables from all agents. To accelerate the optimization process, we combine heavy ball and Nesterov's accelerated methods with distributed aggregative gradient tra…
▽ More
In this paper, we investigate a distributed aggregative optimization problem in a network, where each agent has its own local cost function which depends not only on the local state variable but also on an aggregated function of state variables from all agents. To accelerate the optimization process, we combine heavy ball and Nesterov's accelerated methods with distributed aggregative gradient tracking, and propose two novel algorithms named DAGT-HB and DAGT-NES for solving the distributed aggregative optimization problem. We analyse that the DAGT-HB and DAGT-NES algorithms can converge to an optimal solution at a global $\mathbf{R}-$linear convergence rate when the objective function is smooth and strongly convex, and when the parameters (e.g., step size and momentum coefficients) are selected within certain ranges. A numerical experiment on the optimal placement problem is given to verify the effectiveness and superiority of our proposed algorithms.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
The Novel Adaptive Fractional Order Gradient Decent Algorithms Design via Robust Control
Authors:
Jiaxu Liu,
Song Chen,
Shengze Cai,
Chao Xu
Abstract:
The vanilla fractional order gradient descent may oscillatively converge to a region around the global minimum instead of converging to the exact minimum point, or even diverge, in the case where the objective function is strongly convex. To address this problem, a novel adaptive fractional order gradient descent (AFOGD) method and a novel adaptive fractional order accelerated gradient descent (AF…
▽ More
The vanilla fractional order gradient descent may oscillatively converge to a region around the global minimum instead of converging to the exact minimum point, or even diverge, in the case where the objective function is strongly convex. To address this problem, a novel adaptive fractional order gradient descent (AFOGD) method and a novel adaptive fractional order accelerated gradient descent (AFOAGD) method are proposed in this paper. Inspired by the quadratic constraints and Lyapunov stability analysis from robust control theory, we establish a linear matrix inequality to analyse the convergence of our proposed algorithms. We prove that the proposed algorithms can achieve R-linear convergence when the objective function is $\textbf{L-}$smooth and $\textbf{m-}$strongly-convex. Several numerical simulations are demonstrated to verify the effectiveness and superiority of our proposed algorithms.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Geometric Barriers for Stable and Online Algorithms for Discrepancy Minimization
Authors:
David Gamarnik,
Eren C. Kızıldağ,
Will Perkins,
Changji Xu
Abstract:
For many computational problems involving randomness, intricate geometric features of the solution space have been used to rigorously rule out powerful classes of algorithms. This is often accomplished through the lens of the multi Overlap Gap Property ($m$-OGP), a rigorous barrier against algorithms exhibiting input stability. In this paper, we focus on the algorithmic tractability of two models:…
▽ More
For many computational problems involving randomness, intricate geometric features of the solution space have been used to rigorously rule out powerful classes of algorithms. This is often accomplished through the lens of the multi Overlap Gap Property ($m$-OGP), a rigorous barrier against algorithms exhibiting input stability. In this paper, we focus on the algorithmic tractability of two models: (i) discrepancy minimization, and (ii) the symmetric binary perceptron (\texttt{SBP}), a random constraint satisfaction problem as well as a toy model of a single-layer neural network.
Our first focus is on the limits of online algorithms. By establishing and leveraging a novel geometrical barrier, we obtain sharp hardness guarantees against online algorithms for both the \texttt{SBP} and discrepancy minimization. Our results match the best known algorithmic guarantees, up to constant factors. Our second focus is on efficiently finding a constant discrepancy solution, given a random matrix $\mathcal{M}\in\mathbb{R}^{M\times n}$. In a smooth setting, where the entries of $\mathcal{M}$ are i.i.d. standard normal, we establish the presence of $m$-OGP for $n=Θ(M\log M)$. Consequently, we rule out the class of stable algorithms at this value. These results give the first rigorous evidence towards a conjecture of Altschuler and Niles-Weed~\cite[Conjecture~1]{altschuler2021discrepancy}.
Our methods use the intricate geometry of the solution space to prove tight hardness results for online algorithms. The barrier we establish is a novel variant of the $m$-OGP. Furthermore, it regards $m$-tuples of solutions with respect to correlated instances, with growing values of $m$, $m=ω(1)$. Importantly, our results rule out online algorithms succeeding even with an exponentially small probability.
△ Less
Submitted 13 February, 2023;
originally announced February 2023.
-
Efficient numerical methods for the Navier-Stokes-Nernst-Planck-Poisson equations
Authors:
Xiaolan Zhou,
Chuanju Xu
Abstract:
We propose in this paper efficient first/second-order time-stepping schemes for the evolutional Navier-Stokes-Nernst-Planck-Poisson equations. The proposed schemes are constructed using an auxiliary variable reformulation and sophisticated treatment of the terms coupling different equations. By introducing a dynamic equation for the auxiliary variable and reformulating the original equations into…
▽ More
We propose in this paper efficient first/second-order time-stepping schemes for the evolutional Navier-Stokes-Nernst-Planck-Poisson equations. The proposed schemes are constructed using an auxiliary variable reformulation and sophisticated treatment of the terms coupling different equations. By introducing a dynamic equation for the auxiliary variable and reformulating the original equations into an equivalent system, we construct first- and second-order semi-implicit linearized schemes for the underlying problem. The main advantages of the proposed method are: (1) the schemes are unconditionally stable in the sense that a discrete energy keeps decay during the time stepping; (2) the concentration components of the discrete solution preserve positivity and mass conservation; (3) the delicate implementation shows that the proposed schemes can be very efficiently realized, with computational complexity close to a semi-implicit scheme. Some numerical examples are presented to demonstrate the accuracy and performance of the proposed method. As far as the best we know, this is the first second-order method which satisfies all the above properties for the Navier-Stokes-Nernst-Planck-Poisson equations.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Eigenstate Thermalization Hypothesis for Generalized Wigner Matrices
Authors:
Arka Adhikari,
Sofiia Dubova,
Changji Xu,
Jun Yin
Abstract:
In this paper, we extend results of Eigenvector Thermalization to the case of generalized Wigner matrices. Analytically, the central quantity of interest here are multiresolvent traces, such as $Λ_A:= \frac{1}{N} \text{Tr }{ GAGA}$. In the case of Wigner matrices, as in \cite{cipolloni-erdos-schroder-2021}, one can form a self-consistent equation for a single $Λ_A$. There are multiple difficulties…
▽ More
In this paper, we extend results of Eigenvector Thermalization to the case of generalized Wigner matrices. Analytically, the central quantity of interest here are multiresolvent traces, such as $Λ_A:= \frac{1}{N} \text{Tr }{ GAGA}$. In the case of Wigner matrices, as in \cite{cipolloni-erdos-schroder-2021}, one can form a self-consistent equation for a single $Λ_A$. There are multiple difficulties extending this logic to the case of general covariances. The correlation structure prevents us from deriving a self-consistent equation for a single matrix $A$; this is due to the introduction of new terms that are quite distinct from the form of $Λ_A$. We find a way around this by carefully splitting these new terms and writing them as sums of $Λ_B$, for matrices $B$ obtained by modifying $A$ using the covariance matrix. The result is a system of self-consistent equations relating families of deterministic matrices. Our main effort in this work is to derive and analyze this system of self-consistent equations.
△ Less
Submitted 15 February, 2023; v1 submitted 31 January, 2023;
originally announced February 2023.
-
Apéry-Like Sums and Colored Multiple Zeta Values
Authors:
Ce Xu,
Jianqiang Zhao
Abstract:
In this article we shall survey some recent progress on the study of Apéry-like sums which are multiple variable generalizations of the two sums Apéry used in his famous proof of the irrationality of $ζ(2)$ and $ζ(3)$. We only allow the central binomial coefficients to appear in these infinite sums but they can appear either on the numerator or on the denominator. Special values of both types are…
▽ More
In this article we shall survey some recent progress on the study of Apéry-like sums which are multiple variable generalizations of the two sums Apéry used in his famous proof of the irrationality of $ζ(2)$ and $ζ(3)$. We only allow the central binomial coefficients to appear in these infinite sums but they can appear either on the numerator or on the denominator. Special values of both types are closely related to the colored multiple zeta values and have played important roles in the calculations of the $\eps$-expansion of multiloop Feynman diagrams. We will summarize several different approaches to computing these sums and prove a few conjectural identities of Z.-W. Sun as corollaries along the way.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Set-Theoretic and Type-Theoretic Ordinals Coincide
Authors:
Tom de Jong,
Nicolai Kraus,
Fredrik Nordvall Forsberg,
Chuangjie Xu
Abstract:
In constructive set theory, an ordinal is a hereditarily transitive set. In homotopy type theory (HoTT), an ordinal is a type with a transitive, wellfounded, and extensional binary relation. We show that the two definitions are equivalent if we use (the HoTT refinement of) Aczel's interpretation of constructive set theory into type theory. Following this, we generalize the notion of a type-theoret…
▽ More
In constructive set theory, an ordinal is a hereditarily transitive set. In homotopy type theory (HoTT), an ordinal is a type with a transitive, wellfounded, and extensional binary relation. We show that the two definitions are equivalent if we use (the HoTT refinement of) Aczel's interpretation of constructive set theory into type theory. Following this, we generalize the notion of a type-theoretic ordinal to capture all sets in Aczel's interpretation rather than only the ordinals. This leads to a natural class of ordered structures which contains the type-theoretic ordinals and realizes the higher inductive interpretation of set theory. All our results are formalized in Agda.
△ Less
Submitted 12 June, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Berndt-Type Integrals and Series Associated with Ramanujan and Jacobi Elliptic Functions
Authors:
Ce Xu,
Jianqiang Zhao
Abstract:
In this paper, we evaluate in closed forms two families of infinite integrals containing hyperbolic and trigonometric functions in their integrands. We call them Berndt-type integrals since he initiated the study of similar integrals. We first establish explicit evaluations of four classes of hyperbolic sums by special values of the Gamma function, by two completely different approaches, which ext…
▽ More
In this paper, we evaluate in closed forms two families of infinite integrals containing hyperbolic and trigonometric functions in their integrands. We call them Berndt-type integrals since he initiated the study of similar integrals. We first establish explicit evaluations of four classes of hyperbolic sums by special values of the Gamma function, by two completely different approaches, which extend those sums considered by Ramanujan and Zucker previously. We discover the first by refining two results of Ramanujan concerning some $q$-series. For the second we compare both the Fourier series expansions and the Maclaurin series expansions of a few Jacobi elliptic functions. Next, by contour integrations we convert two families of Berndt-type integrals to the above hyperbolic sums, all of which can be evaluated in closed forms. We then discover explicit formulas for one of the two families. Throughout the paper we present many examples which enable us to formulate a conjectural explicit formula for the other family of the Berndt-type integrals at the end.
△ Less
Submitted 19 April, 2024; v1 submitted 19 January, 2023;
originally announced January 2023.
-
The Two Point Function of the SK Model without External Field at High Temperature
Authors:
Christian Brennecke,
Adrien Schertzer,
Changji Xu,
Horng-Tzer Yau
Abstract:
We show that the two point correlation matrix $ \textbf{M}= (\langle σ_i σ_j\rangle)_{1\leq i,j\leq N} $ of the Sherrington-Kirkpatrick model with zero external field satisfies
\[ \lim_{N\to\infty} \| \textbf{M} - ( 1+β^2 - β\textbf{G})^{-1} \|_{\text{op}} =0 \] in probability, in the full high temperature regime $β< 1$. Here, $\textbf{G}$ denotes the GOE interaction matrix of the model.
We show that the two point correlation matrix $ \textbf{M}= (\langle σ_i σ_j\rangle)_{1\leq i,j\leq N} $ of the Sherrington-Kirkpatrick model with zero external field satisfies
\[ \lim_{N\to\infty} \| \textbf{M} - ( 1+β^2 - β\textbf{G})^{-1} \|_{\text{op}} =0 \] in probability, in the full high temperature regime $β< 1$. Here, $\textbf{G}$ denotes the GOE interaction matrix of the model.
△ Less
Submitted 8 November, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Regularity of the spatially homogenous fractional Kramers-Fokker-Planck equation
Authors:
Chao-Jiang Xu,
Yan Xu
Abstract:
We study the Cauchy problem of the spatially homogenous fractional Kramers-Fokker-Planck equation and show that the solution enjoys Gevrey regularity and decay estimation with an L2 initial datum for positive time.
We study the Cauchy problem of the spatially homogenous fractional Kramers-Fokker-Planck equation and show that the solution enjoys Gevrey regularity and decay estimation with an L2 initial datum for positive time.
△ Less
Submitted 13 May, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.