Low-Degree Approximation of Random Polynomials

Diatta, Daouda Niang; Lerario, Antonio

doi:10.1007/s10208-021-09506-y

Low-Degree Approximation of Random Polynomials

Open access
Published: 25 March 2021

Volume 22, pages 77–97, (2022)
Cite this article

Download PDF

You have full access to this open access article

Foundations of Computational Mathematics Aims and scope Submit manuscript

Low-Degree Approximation of Random Polynomials

Download PDF

Daouda Niang Diatta¹ &
Antonio Lerario²

1982 Accesses
7 Citations
Explore all metrics

Abstract

We prove that with “high probability” a random Kostlan polynomial in $n+1$ many variables and of degree d can be approximated by a polynomial of “low degree” without changing the topology of its zero set on the sphere $\mathbb {S}^n$. The dependence between the “low degree” of the approximation and the “high probability” is quantitative: for example, with overwhelming probability, the zero set of a Kostlan polynomial of degree d is isotopic to the zero set of a polynomial of degree $O(\sqrt{d \log d})$. The proof is based on a probabilistic study of the size of $C^1$-stable neighborhoods of Kostlan polynomials. As a corollary, we prove that certain topological types (e.g., curves with deep nests of ovals or hypersurfaces with rich topology) have exponentially small probability of appearing as zero sets of random Kostlan polynomials.

Quasi-random multilinear polynomials

Article 01 March 2019

Hall’s Conjecture on Extremal Sets for Random Triangles

Article 10 May 2019

Zero distribution of random polynomials

Article 01 February 2018

1 Introduction

Over the past few years, there has been an intense activity around the field of Random Algebraic Geometry, whose main interest has been studying topological properties of the zero set of random real algebraic equations.

This approach goes back to the classical work of Kac [15], who studied the expected number of real zeroes of a random polynomial in one variable whose coefficients are Gaussian random variables, and was later extended and generalized in the 1990s to systems of equations in a sequence of influential papers by Edelman, Kostlan, Shub, and Smale [6, 7, 17, 31,32,33]. More recently, in 2011, Sarnak [28] suggested to look at the connected components of a real algebraic curve from the random point of view, proposing a random version of Hilbert’s Sixteenth Problem (to investigate the “number, shape, and position” of the connected components of a real algebraic hypersurface [35]). Since then, the area has seen much progress [9, 11,12,13, 18,19,20,21, 24, 25, 28, 29], with a focus on the expectation of topological quantities such as the Betti numbers of random algebraic hypersurfaces [9, 12, 13].

In this paper, we concentrate on the so-called Kostlan model: we sample a random polynomial according to the rule

$$\begin{aligned} P(x)=\sum _{|\alpha |=d}\xi _\alpha \cdot \left( \left( \frac{d!}{\alpha _0!\cdots \alpha _n!}\right) ^{1/2}x_0^{\alpha _0}\cdots x_n^{\alpha _n}\right) , \end{aligned}$$

with $\{\xi _\alpha \}_{|\alpha |=d}$ a family of independent, standard Gaussian variables (see Sect. 3 for more details). A main feature of this probabilistic model, in the univariate case, is that the expectation of the number of real zeroes of a Kostlan polynomial equals $\sqrt{d}$ [6]. This phenomenon is called “square-root law”: essentially the Kostlan polynomial seems to behave as if its degree is $\sqrt{d}$ rather than d. In higher dimensions, a similar phenomenon happens to the Betti numbers of its zero set: their expectation is of the order $O(d^{n/2})$, while the deterministic upper bound is $O(d^n).$ In this paper, we give a further contribution in this direction, by proving the following theorem (see Theorem 7).

Theorem A

(Low-degree approximation) Let P be a random Kostlan polynomial in $n+1$ many variables and of degree d. Denote by $p=P|_{\mathbb {S}^n}$ its restriction to the unit sphere $\mathbb {S}^{n}\subset \mathbb {R}^{n+1}$ and by $Z(p)\subset \mathbb {S}^n$ its zero set on the sphere. With probability that goes to one as $d\rightarrow \infty $, the pair $(\mathbb {S}^n,Z(p))$ is diffeomorphic to a pair $(\mathbb {S}^n, Z(q))$ where q is the restriction to the sphere of a polynomial of degree $O(\sqrt{d\log d})$.

Remark 1

Let us comment on the meaning of this theorem, by looking at the example of real algebraic curves on the sphere $\mathbb {S}^2.$ As d goes to infinity, the number D(d) of rigid isotopy classes of smooth real curves of degree d on $\mathbb {S}^2$ growths as ${\mathrm{e}}^{\Theta (d^2)}$ [26]. This means that in the space of curves (i.e., in the space of homogeneous polynomials of degree d) the real discriminant (i.e., the set of curves which are singular) separates the space into super-exponentially many connected components (i.e., rigid isotopy classes, also called chambers). Inside the space of curves of degree d, there are the curves of degree $O(\sqrt{d\log d})$; the number of rigid isotopy classes of these curves is much smaller, “only” ${\mathrm{e}}^{\Theta (d\log d)}$. Let us denote by $\mathcal {C}_d=\{c_i\}_{i=1}^{D(d)}$ the set of rigid isotopy classes of smooth curves of degree d on $\mathbb {S}^2$. If we put on $\mathcal {C}_d$ the uniform probability distribution (i.e., we set $\mathbb {P}(c_i)=D(d)^{-1}$ for all i), then with probability that goes to one as $d\rightarrow \infty $, curves are not isotopic to curves of smaller degree. However, if we put on $\mathcal {C}_d$ the probability measure such that $\mathbb {P}(c_i)$ equals the Kostlan probability of the corresponding chamber, then most of the mass comes from $\mathcal {C}_{O(\sqrt{d\log d})}\subset \mathcal {C}_d$ and with probability that goes to one as $d\rightarrow \infty $, curves are isotopic to curves of smaller degree. From the point of view of random Kostlan polynomials, curves which are not isotopic to curves of smaller degree are inaccessible.

The idea of the proof of the previous theorem is the following. Thom’s First Isotopy Lemma implies that, given a function $p:\mathbb {S}^n\rightarrow \mathbb {R}$ whose zero set $Z(p)\subset \mathbb {S}^n$ is nonsingular, there is a small $C^1$ neighborhood (we call it a “stable neighborhood”) such that all functions in this neighborhood have zero sets diffeomorphic to Z(p). However, how large this neighborhood can be depends on p and in Proposition 3 we prove that it contains a $C^1$-ball:

$$\begin{aligned} \left\{ \Vert f-p\Vert _{C^1}<\frac{\delta (p)}{2}\right\} \implies (\mathbb {S}^n, Z(p))\sim (\mathbb {S}^n, Z(f)), \end{aligned}$$

(1.1)

where $\delta (p)$ denotes the distance, in the Bombieri–Weyl norm, from p to the set of polynomials with a singular zero set (the “discriminant,” see Sect. 4). In order to produce a low-degree approximation of p, we first write it as $p=\sum _{\ell }p_\ell $, where each $p_\ell $ denotes the projection of p to the space of spherical harmonics of degree $\ell $, and then take only the part of degree smaller than L of this expansion:

$$\begin{aligned} p|_{L}=\sum _{\ell \le L}p_\ell .\end{aligned}$$

We will prove that, choosing $L=O(\sqrt{d\log d}),$ with probability that goes to one as $d\rightarrow \infty $, the difference $p-p|_{L}$ has small enough $C^1$-norm to be contained in the above stable neighborhood.

From the technical point of view, this last step requires three estimates: we first bound the $C^1$-norm of $p-p|_{L}$ with its Sobolev norm (Proposition 1), then the Sobolev norm with the Bombieri–Weyl norm of the original polynomial (which is the norm endowing the space of polynomials with the Kostlan Gaussian measure, Proposition 2) and finally we estimate the size (i.e., the probability) of a small neighborhood of the discriminant (Proposition 4).

1.1 Consequences

All the previous estimates are quantitative and produce different outcomes for different choices of the degree L to which we truncate the expansion of p. The most general bound that we obtain is the following (Theorem 5): there exists $c_5(n)>0$ such that for every $L, \sigma >1$ we have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1-\left( c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}\sigma ^2+\frac{1}{\sigma }\right) .\end{aligned}$$

For example, choosing L to be a fraction of $d=\deg (p)$, the above $\sigma $ can be tuned so that the probability from the statement of Theorem 1 goes exponentially fast to one as $d\rightarrow \infty $.

We use this idea to constraint the typical topology of $(\mathbb {S}^n, Z(p))$ as follows: (i) we identify a “family” of topological types (e.g., hypersurfaces of the sphere $\mathbb {S}^n$ with more than $\alpha d^n$ many components); (ii) we show that we need at least degree $L_d$ to realize this topological type (e.g., we need degree at least $c(\alpha )d$ to have $\alpha d^n$ many components); (iii) we prove that with “high probability” p can be stably approximated by a polynomial of degree smaller than $L_d$ (which implies its zero set cannot have that topological type). Here are two examples of the application of this strategy.

(Theorem 8) The probability that the zero set on $\mathbb {S}^n$ of a Kostlan polynomial of degree d has total Betti number larger than $\alpha d^n$ is bounded by $ \gamma _1(\alpha ){\mathrm{e}}^{-\gamma _2(\alpha )d}$ for some constants $\gamma _1(\alpha ),\gamma _2(\alpha )>0$. This was known for the case $n=1$ (points on $\mathbb {S}^1$) and for the case $n=2$ (algebraic curves) [10], but only in the case of maximal curves, see Remark 10.

(Theorem 9) The probability that the zero set on $\mathbb {S}^n$ of a Kostlan polynomial of degree d contains a nest of depth $\alpha d^n$ is bounded by $ \gamma _1(\alpha ){\mathrm{e}}^{-\gamma _2(\alpha )d}$ for some constants $\gamma _1(\alpha ),\gamma _2(\alpha )>0$, see Fig. 1. For example, hyperbolic hypersurfaces of degree d (whose isotopy class is the one of a nest of spheres of depth d) have exponentially small probability.

Remark 2

Gichev has proved a result with a flavor similar to Theorem 1: in [14, Theorem 5.3] he shows that with probability that goes to one as $d\rightarrow \infty $, a random Kostlan polynomial of degree d can be approximated in the Sobolev norm with a polynomial of degree $O(\sqrt{d\log d}).$ Unfortunately, we cannot use directly Gichev’s result, essentially because how close should the approximation be in order to guarantee that the zero sets are diffeomorphic depends on p (note that in (1.1) the distance of p from the discriminant is involved). The precise point where the current paper gets close to [14] is in Proposition 2, which is responsible for the similarities. The difference with Gichev’s result relies on the fact that we are not showing only that lower-degree polynomials are “near,” but they are “near enough” so that the zero sets are diffeomorphic. The main practical difference is that Gichev compares the Sobolev norm of the tail with the Sobolev norm of p, while in Proposition 2 we compare the Sobolev norm of the tail with the Bombieri–Weyl norm of p. This difference is crucial for the final estimate that we will use in Proposition 4. (However, the use of the Sobolev norm is often practical for us too, because it is quite easy to handle it.)

Remark 3

Since the low-degree approximation from Theorem 1 is the projection of p to low-degree harmonics, this could be used, in the case $n=1$ and with high probability, to improve the complexity of a certain class of algorithms in real algebraic geometry (e.g., adaptive algorithms for real root isolation), essentially showing that “for most polynomials” the bound on the complexity of these algorithms is better than the absolute deterministic bound. We plan to elaborate on this idea in a forthcoming work.

Remark 4

After this paper was written, building on our results, there was some development on the low-degree approximation theme. In [2], Breiding, Keneshlou and the second named author of the current paper generalized Theorem 5 from the case of zero sets on the sphere to more general “singularities” of polynomial maps from the sphere to $\mathbb {R}^k$; in [1], Ancona extended our method to the case of zero sets of Gaussian random sections of high tensor power of real line bundles on a general real algebraic manifold.

2 Spaces of Polynomials and Norms

We denote by $\mathcal {P}_{n,d}=\mathbb {R}[x_0, \ldots , x_n]_{(d)}$ the space of real homogeneous polynomials of degree d. We endow $ \mathcal {P}_{n,d}$ with the Bombieri–Weyl norm, which is defined as follows: writing a homogeneous polynomial in the monomial basis we set:

$$\begin{aligned} \left\| \sum _{|\alpha |=d}\gamma _\alpha x_0^{\alpha _0}\cdots x_n^{\alpha _n}\right\| _{\mathrm {BW}}=\left( \sum _{|\alpha |=d}\gamma _\alpha ^2\frac{\alpha _0!\cdots \alpha _n!}{d!}\right) ^{1/2}.\end{aligned}$$

For every $\ell =0, \ldots , d$, we will also consider the space $\mathcal {H}_{n,\ell }\subset \mathcal {P}_{n, \ell }$ of homogeneous harmonic polynomials, i.e., polynomials H such that $\Delta _{\mathbb {R}^{n+1}}H=0$. It turns out that the space $\mathcal {P}_{n,d}$ can be decomposed as:

$$\begin{aligned} \mathcal {P}_{n,d}=\bigoplus _{d-\ell \in 2\mathbb {N}}\Vert x\Vert ^{d-\ell }\mathcal {H}_{n,\ell }.\end{aligned}$$

(2.1)

The decomposition (2.1) has two important properties (see [16]):

(i)
Given a scalar product which is invariant under the action of $O(n+1)$ on $\mathcal {P}_{n,d}$ by change of variables, the decomposition (2.1) is orthogonal for this scalar product.
(ii)
The action of $O(n+1)$ on $\mathcal {P}_{n,d}$ preserves each $\mathcal {H}_{n,\ell }$ and the induced representation on the space of harmonic polynomials is irreducible. In particular, there exists a unique, up to multiples, scalar product on $\mathcal {H}_{n,\ell }$ which is $O(n+1)$-invariant.

The space $\mathcal {P}_{n,d}$ injects (by taking restrictions of polynomials) into the space $C^{\infty }(\mathbb {S}^n, \mathbb {R})$ of smooth functions on the unit sphere $\mathbb {S}^n\subset \mathbb {R}^{n+1}$. We denote by

$$\begin{aligned} \mathcal {S}_{n,d}=\{p:\mathbb {S}^n\rightarrow \mathbb {R}| p=P|_{\mathbb {S}^n} \text {with} P\in \mathcal {P}_{n,d}\}=\mathcal {P}_{n,d}|_{\mathbb {S}^n}.\end{aligned}$$

the image of such injection. In particular, the two vector spaces $\mathcal {P}_{n,d}$ and $\mathcal {S}_{n,d}$ are isomorphic:

$$\begin{aligned} \mathcal {P}_{n,d}\simeq \mathcal {S}_{n, d} \simeq \mathbb {R}^{N}\quad \text {where}\quad N={n+d\atopwithdelims ()d}.\end{aligned}$$

We introduce the following convention: given $P\in \mathcal {P}_{n,d}$, we denote by $p=P|_{\mathbb {S}^n}$ (i.e., we will use capital letters for polynomials in $\mathcal {P}_{n,d}$ and small letters for their restrictions in $\mathcal {S}_{n,d}$). Restricting polynomials in $\mathcal {H}_{n,\ell }$ to the unit sphere, we obtain exactly eigenfunctions of the spherical Laplacian:

$$\begin{aligned} V_{n,\ell }=\{h:\mathbb {S}^n\rightarrow \mathbb {R}| \Delta _{\mathbb {S}^n}h=-\ell (\ell +n-1)h\}=\mathcal {H}_{n, \ell }|_{\mathbb {S}^n}.\end{aligned}$$

We will consider various norms on $\mathcal {S}_{n,d}:$

(1)
The Bombieri–Weyl norm, simply defined for $p=P|_{\mathbb {S}^n}$ as $\Vert p\Vert _{\mathrm {BW}}=\Vert P\Vert _{\mathrm {BW}}.$ Note that the same $p:\mathbb {S}^n\rightarrow \mathbb {R}$ can be the restriction of two different $P_1\in \mathcal {P}_{n, d_1}$ and $P_2\in \mathcal {P}_{n, d_2}$ (for example: take $P_2(x)=\Vert x\Vert ^{2}P_1(x)$), it is therefore important for the computation of the Bombieri–Weyl norm to specify the space where p comes from, i.e., its original homogeneous degree.
(2)
The $C^1$-norm defined for $p\in \mathcal {S}_{n,d}$ as:
$$\begin{aligned} \Vert p\Vert _{C^1}=\max _{\theta \in \mathbb {S}^n}|p(\theta )|+\max _{\varphi \in \mathbb {S}^n}\Vert \nabla _{\mathbb {S}^n} p(\varphi )\Vert ,\end{aligned}$$
where $\nabla _{\mathbb {S}^n}p$ denotes the spherical gradient, i.e., the orthogonal projection on the unit sphere of the gradient of p.
(3)
The $L^2$-norm, defined for $p\in \mathcal {S}_{n,d}$ as:
$$\begin{aligned} \Vert p\Vert _{L^2}=\left( \int _{\mathbb {S}^n} p(\theta )^2\,\mathrm {d}\theta \right) ^{1/2},\end{aligned}$$
where “$\mathrm {d}\theta $” denotes integration with respect to the standard volume form of the sphere. In the sequel, we will denote by $\{y_{\ell , j}\}_{j\in J_\ell }$ a chosen $L^2$-orthonormal basis of $V_{n, \ell }$.
(4)
The Sobolev q-norm, defined for $p=\sum _{\ell }p_\ell $ (decomposed as in (2.1)) by:
$$\begin{aligned} \Vert p\Vert _{H^q}=\left( \Vert p_0\Vert ^2+\sum _{d-\ell \in 2\mathbb {N}}\ell ^{2q}\Vert p_\ell \Vert _{L^2}^2\right) ^{1/2}. \end{aligned}$$
Note that $\Vert p_0\Vert ^2=0$ when d is odd; moreover, $\Vert \cdot \Vert _{H^0}=\Vert \cdot \Vert _{L^2}$.

The decomposition (2.1) induces a decomposition:

$$\begin{aligned} \mathcal {S}_{n,d}=\bigoplus _{d-\ell \in 2\mathbb {N}}V_{n,\ell }.\end{aligned}$$

(2.2)

By property (i) above, this decomposition is orthogonal, at the same time, for the Bombieri–Weyl, the $L^2$ and the Sobolev scalar products. Moreover, because of property (ii) above, the Bombieri–Weyl scalar product, the $L^2$ and the Sobolev one are one multiple of the others on $V_{n, \ell }$ (viewed as a subspace of $\mathcal {P}_{n,d}$):

$$\begin{aligned} \Vert h_{n, \ell }\Vert _{L^2}&=w_{n,d}(\ell )\Vert h_{n, \ell }\Vert _{\text {BW}}, \quad \Vert h_{n, \ell }\Vert _{H^q}\\&=\ell ^q w_{n,d}(\ell )\Vert h_{n, \ell }\Vert _{\text {BW}}\quad \forall h_{n, \ell }\in V_{n, \ell }\subset \mathcal {S}_{n, \ell }.\end{aligned}$$

In particular, $\Vert h_{n, \ell }\Vert _{H^q}=\ell ^q\Vert h_{n, \ell }\Vert _{L^2}$.

The rescaling weights are given by (see [9, Example 1]):

$$\begin{aligned} {w_{n,d}(\ell )} = \left( \text {vol}(\mathbb {S}^n)\Gamma \left( \frac{n+1}{2} \right) \frac{ \Gamma \left( \frac{d+\ell }{2} + 1 \right) }{ \Gamma \left( \frac{n+1}{2} + \frac{d+\ell }{2} \right) } \frac{ 1}{2^{d} } \left( {\begin{array}{c}d\\ \frac{d-\ell }{2}\end{array}}\right) \right) ^{1/2}. \end{aligned}$$

(2.3)

We observe also the following important fact: writing $P=\sum _{\ell }P_\ell $ with each $P_\ell \in \Vert x\Vert ^{d-\ell }\mathcal {H}_{n,\ell }$ as in (2.1), when taking restrictions to the unit sphere we have $p=\sum _{\ell }p_\ell $ with each $p_\ell $ the restriction to $\mathbb {S}^n$ of a polynomial of degree $\ell $: in other words, the restriction to the unit sphere “does not see” the $\Vert x\Vert ^{d-\ell }$ factor, which is constant on the unit sphere.

Proposition 1

There exists $c_1(n)>0$ such that for every $q\ge \frac{n+1}{2}$ and for every $p\in \mathcal {S}_{n,d}$ we have:

$$\begin{aligned} \Vert p\Vert _{C^1}\le c_1(n)d^{\frac{1}{2}} \Vert p\Vert _{H^{q}}.\end{aligned}$$

Proof

Given $p\in \mathcal {S}_{n,d}$, we write $p=\sum _\ell h_\ell $ with each $h_\ell \in V_{n,\ell }$, as in (2.2). We can estimate for $\theta , \varphi \in \mathbb {S}^n$:

$$\begin{aligned} |p(\theta )|+\Vert \nabla _{\mathbb {S}^n}p(\varphi )\Vert&\le \sum _{d-\ell \in 2\mathbb {N}}\left( |h_\ell (\theta )|+\Vert \nabla _{\mathbb {S}^n}h_\ell (\varphi )\Vert \right) \\&\le C_5(n)\sum _{d-\ell \in 2\mathbb {N}}\ell ^{\frac{n+1}{2}}\Vert h_\ell \Vert _{L^2}\quad ({\text {by [30, Theorem 4]}})\\&\le C_6(n)\left( \sum _{d-\ell \in 2\mathbb {N}}\ell ^{n+1}\Vert h_\ell \Vert _{L^2}^2\right) ^{1/2} \left( \frac{d}{2}\right) ^{1/2}\\&\qquad \text {(by Cauchy--Schwartz inequality)}\\&\le c_1(n)\sqrt{d}\Vert p\Vert _{H^{q}}\quad \text {for} \quad q\ge \frac{n+1}{2}. \end{aligned}$$

By taking the supremum over $\theta \in \mathbb {S}^n$ then over $\varphi \in \mathbb {S}^n$, the proof concludes.

3 Gaussian Measures and Random Polynomials

The space $\mathcal {P}_{n,d}$ can be turned into a Gaussian space by sampling a random polynomial according to the rule:

$$\begin{aligned} P(x)=\sum _{|\alpha |=d}\xi _\alpha \cdot \left( \left( \frac{d!}{\alpha _0!\cdots \alpha _n!}\right) ^{1/2}x_0^{\alpha _0}\cdots x_n^{\alpha _n}\right) ,\end{aligned}$$

with $\{\xi _\alpha \}_{|\alpha |=d}$ a family of independent, standard Gaussian variables. A random polynomial defined in this way is called a Kostlan polynomial. An alternative way for writing a random Kostlan polynomial is to expand it in the spherical harmonic basis:

$$\begin{aligned} P(x)=\sum _{d-\ell \in 2\mathbb {N}}\sum _{j\in J_\ell }\xi _{\ell ,j} \cdot \left( w_{n,d}(\ell )\Vert x\Vert ^{d-\ell }y_{\ell ,j}\left( \frac{x}{\Vert x\Vert }\right) \right) ,\end{aligned}$$

where $\{\xi _{\ell ,j}\}_{\ell , j}$ is a family of independent, standard Gaussian variables and $\{w_{n,d}(\ell )\}_{d-\ell \in 2\mathbb {N}}$ are given by (2.3).

Remark 5

Observe that both

$$\begin{aligned}&\left\{ \left( \frac{d!}{\alpha _0!\cdots \alpha _n!}\right) ^{1/2}x_0^{\alpha _0}\cdots x_n^{\alpha _n}\right\} _{|\alpha |=d}\quad \text {and}\quad \\&\left\{ w_{n,d}(\ell )\Vert x\Vert ^{d-\ell }y_{\ell ,j}\left( \frac{x}{\Vert x\Vert }\right) \right\} _{d-\ell \in 2\mathbb {N}, j\in J_\ell }\end{aligned}$$

are Bombieri–Weyl orthonormal bases for $\mathcal {P}_{n,d}.$ More generally, given a basis $\{F_k\}_{k=1}^{N}$ for $\mathcal {P}_{n,d}$ which is orthonormal for the Bombieri–Weyl scalar product, a random Kostlan polynomial can be defined by:

$$\begin{aligned} F(x)=\sum _{k=1}^{N}\xi _k F_k(x),\end{aligned}$$

where $\{\xi _k\}_{k=1}^N$ is a family of independent, standard Gaussian variables.

Given $L\in \{0, \ldots , d\}$, we consider the projection $\mathcal {S}_{n,d}\rightarrow \mathcal {S}_{n, L}$ defined by expanding p in spherical harmonics and taking only the terms of degree at most L of this expansion given by:

$$\begin{aligned} p:=\sum _{d-\ell \in 2\mathbb N}p_\ell \quad \mapsto \quad p|_{L}:=\sum _{d-\ell \in 2\mathbb N, \ell \le L}p_\ell .\end{aligned}$$

Proposition 2

There exists $c_2(n)>0$ such that for all $t,q\ge 0$ and for every $L\in \{0, \ldots , d\}$ we have:

$$\begin{aligned} \mathbb {P}\bigg \{\left\| p-p|_{L}\right\| _{H^q}\le t \Vert p\Vert _{\mathrm {BW}}\bigg \}\ge 1-c_2(n)\frac{d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}}}{t^2}. \end{aligned}$$

(3.1)

Remark 6

As already noted, this proposition is similar to [14, Theorem 5.3], which would provide a lower bound for the probability of the event $\{\left\| p-p|_{L}\right\| _{H^q}\le t \Vert p\Vert _{H^q}\}$, with an estimate which has a shape similar to (3.1).

Proof

First observe that, since $\{\left\| p-p|_{L}\right\| _{H^q}\le t \Vert p\Vert _{\mathrm {BW}}\}\subset \mathcal {P}_{n,d}$ is a cone, denoting by $\mathbb {S}^{N-1}$ the unit sphere in the Bombieri–Weyl norm, the required probability equals:

$$\begin{aligned} \mathbb {P}\bigg \{\left\| p-p|_{L}\right\| _{H^q}\le t \Vert p\Vert _{\mathrm {BW}}\bigg \}&=\frac{\mathrm {vol}\left( \{\left\| p-p|_{L}\right\| _{H^q}\le t \Vert p\Vert _{\mathrm {BW}}\}\cap \mathbb {S}^{N-1}\right) }{\text {vol}\left( \mathbb {S}^{N-1}\right) }\\&=\frac{\mathrm {vol}\left( \{\left\| p-p|_{L}\right\| _{H^q}\le t \}\cap \mathbb {S}^{N-1}\right) }{\text {vol}\left( \mathbb {S}^{N-1}\right) }\\&=1-\frac{\mathrm {vol}\left( \{\left\| p-p|_{L}\right\| _{H^q}> t \}\cap \mathbb {S}^{N-1}\right) }{\text {vol}\left( \mathbb {S}^{N-1}\right) }. \end{aligned}$$

We will estimate the quantity

$$\begin{aligned} Q(t)=\frac{\mathrm {vol}\left( \{\left\| p-p|_{L}\right\| _{H^q}> t \}\cap \mathbb {S}^{N-1}\right) }{\text {vol}\left( \mathbb {S}^{N-1}\right) } \end{aligned}$$

from above using Markov’s inequality:

$$\begin{aligned} Q(t)\le \frac{\mathbb {E}_{p\in \mathbb {S}^{N-1}}\Vert p-p|_{L}\Vert ^2_{H^q}}{t^2},\end{aligned}$$

(3.2)

where the expectation is computed sampling a polynomial p uniformly from the unit Bombieri–Weyl sphere.

More precisely, expanding p in an $L^2$-orthonormal basis $\{y_{\ell , j}\}$ (so that $\{w_{n,d}(\ell )y_{\ell , j}\}$ is a Bombieri–Weyl orthonormal basis)

$$\begin{aligned} p=\sum _{d-\ell \in 2\mathbb N}\sum _{j\in J_\ell }\gamma _{\ell , j}w_{n,d}(\ell )y_{\ell , j},\end{aligned}$$

the condition that $p\in \mathbb {S}^{N-1}$ writes $\sum _{\ell , j}\gamma _{\ell , j}^2=1$. Consequently, denoting as before “$\mathrm {d}\theta $” the integration with respect to the standard volume form of the sphere, we obtain by moments computation [8]:

$$\begin{aligned} \mathbb {E}_{p\in \mathbb {S}^{N-1}}\Vert p-p|_{L}\Vert ^2_{H^q}&=\frac{1}{\mathrm {vol}(\mathbb {S}^{N-1})}\int _{\mathbb {S}^{N-1}}\sum _{\ell>L}\sum _{j\in J_\ell }\ell ^{2q}w_{n, d}(\ell )^2 \gamma _{\ell , j}(\theta )^2 \mathrm {d}\theta \\&=\sum _{\ell>L}\sum _{j\in J_\ell }\ell ^{2q}w_{n, d}(\ell )^2 \frac{1}{\mathrm {vol}(\mathbb {S}^{N-1})}\int _{\mathbb {S}^{N-1}} \gamma _{\ell , j}(\theta )^2 \mathrm {d}\theta \\&=\sum _{\ell >L}\sum _{j\in J_\ell }\ell ^{2q}w_{n, d}(\ell )^2 N^{-1}=(*). \end{aligned}$$

We use now the fact that the cardinality of $J_\ell $ is $O(\ell ^{n-1})$ and that $N\le (2d)^n$, obtaining the estimate:

$$\begin{aligned} (*)\le C_1(n)d^{-n}\sum _{\ell >L}\ell ^{2q+n-1}w_{n,d}(\ell )^2.\end{aligned}$$

(3.3)

Moreover, from (2.3) we easily get:

$$\begin{aligned} w_{n,d}(\ell )^2\le C_2(n)d^{-\frac{n}{2}} \frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\ell }{2}}.\end{aligned}$$

(3.4)

Substituting (3.4) into (3.3), we get:

$$\begin{aligned} (*)\le C_3(n)d^{-\frac{3n}{2}}\sum _{\ell >L}\ell ^{2q+n-1}\frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\ell }{2}}=(**).\end{aligned}$$

For $y\in \mathbb {R}$, let us denote now by $\{y\}$ the nearest integer to y with the same parity as d. Then, we can rewrite:

$$\begin{aligned} (**)=C_3(n)d^{-\frac{3n}{2}}\int _L^\infty \{y\}^{2q+n-1}\frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\{y\}}{2}} \mathrm {d}y.\end{aligned}$$

We apply now the change of variable $y=x\sqrt{d}$ in the above integral, and obtain:

$$\begin{aligned} (**)&=C_3(n)d^{-\frac{3n}{2}}\int _{\frac{L}{\sqrt{d}}}^\infty \{x\sqrt{d}\}^{2q+n-1}\frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\{x\sqrt{d}\}}{2}} \sqrt{d}\,\mathrm {d}x\\&\le C_4(n)d^{-\frac{3n}{2}+\frac{2q+n}{2}}\int _{\frac{L}{\sqrt{d}}}^\infty x^{2q+n-1}\frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\{x\sqrt{d}\}}{2}}\,\mathrm {d}x\\&\le C_4(n)d^{-n+q}\int _{\frac{L}{\sqrt{d}}}^\infty x^{2q+n-1}\frac{d^{\frac{1}{2}}}{2^{d-1}}{d\atopwithdelims ()\frac{d-\{x\sqrt{d}\}}{2}}\,\mathrm {d}x.\\ \end{aligned}$$

We need now to estimate the function:

$$\begin{aligned} g_d(x)=x^{2q+n-1}\frac{d^{1/2}}{2^{d-1}}{d\atopwithdelims (){\frac{d-\{x\sqrt{d}\}}{2}}}.\end{aligned}$$

Observe first that $g_d(x)=0$ whenever $x>\sqrt{d}$, therefore our estimate needs to be done only for $0\le x\le \sqrt{d}.$

To this end, we assume that d is even (the odd case work in an analogous way) and establish the bound:

$$\begin{aligned} {d\atopwithdelims ()\frac{d-k}{2}}\le {d\atopwithdelims ()\frac{d}{2}}{\mathrm{e}}^{-\frac{k^2}{4d}}\quad \quad \forall 0\le k\le d (k\,\, \text {with the same parity as}\,\, d).\end{aligned}$$

(3.5)

Let us set

$$\begin{aligned} B:=\frac{{d\atopwithdelims ()\frac{d-k}{2}}}{ {d\atopwithdelims ()\frac{d}{2}}}&=\frac{\left( \frac{d}{2}\right) !}{\left( \frac{d+k}{2}\right) !}\frac{\left( \frac{d}{2}\right) !}{\left( \frac{d-k}{2}\right) !}\\&=\prod _{j=1}^{\frac{k}{2}}\frac{\frac{d}{2}+1-j}{\frac{d}{2}+j}. \end{aligned}$$

From this, it follows that:

$$\begin{aligned} \log (B)&=\sum _{j=1}^{\frac{k}{2}}\log \left( \frac{\frac{d}{2}+1-j}{\frac{d}{2}+j}\right)&\\&=\sum _{j=1}^{\frac{k}{2}}\log \left( 1-2\cdot \frac{2j-1}{d+2j}\right)&\\&\le \sum _{j=1}^{\frac{k}{2}}-2\cdot \frac{2j-1}{d+2j}&\quad (\text {since} \,\,\log (1-x)\le -x)\\&\le -\frac{2}{d+k}\sum _{j=1}^{\frac{k}{2}}(2j-1)&\quad (\text {since} \,\, j\le k/2)\\&=-\frac{2}{d+k}\frac{k^2}{4}&\\&\le -\frac{k^2}{4d}&\quad (\text {since}\,\, k\le d). \end{aligned}$$

In particular:

$$\begin{aligned} B={\mathrm{e}}^{\log (B)}\le {\mathrm{e}}^{-\frac{k^2}{4d}}, \end{aligned}$$

which gives the desired claim (3.5).

We also record the asymptotic:

$$\begin{aligned} {d\atopwithdelims ()\frac{d}{2}}=\sqrt{\frac{2}{\pi }}\frac{2^d}{\sqrt{d}}\left( 1+o(1)\right) ,\end{aligned}$$

which gives:

$$\begin{aligned} \frac{d^{1/2}}{2^{d-1}}{d\atopwithdelims ()\frac{d}{2}}\le A \end{aligned}$$

(3.6)

for some constant $A>0$.

We finally turn to the estimate of our function $ g_d$:

$$\begin{aligned} g_d(x)&\le x^{2q+n-1}\frac{d^{1/2}}{2^{d-1}}{d\atopwithdelims ()\frac{d}{2}}{\mathrm{e}}^{-\frac{x^2}{4}}&{\text {by}}~ (3.5) \nonumber \\&\le A\cdot x^{2q+n-1}{\mathrm{e}}^{-\frac{x^2}{4}}&{\text {by}}~ (3.6). \end{aligned}$$

(3.7)

Using the upper bound (3.7), we have:

$$\begin{aligned}&C_4(n)d^{-n+q}\int _{\frac{L}{\sqrt{d}}}^{\infty }x^{2q+n-1}\frac{d^{1/2}}{2^{d-1}}{d\atopwithdelims (){\frac{d-\{x\sqrt{d}\}}{2}}}\mathrm {d}x\nonumber \\&\quad =C_4(n)d^{-n+q}\int _{\frac{L}{\sqrt{d}}}^{\sqrt{d}}g_d(x)\mathrm {d}x\nonumber \\&\quad \le C_5(n)d^{-n+q}\int _{\frac{L}{\sqrt{d}}}^{\sqrt{d}}x^{2q+n-1}{\mathrm{e}}^{-\frac{x^2}{4}}\mathrm {d}x&{\text {by}}~ (3.7) \nonumber \\&\quad \le C_6(n)d^{-n+q}\int _{\frac{L^2}{4d}}^{\frac{d}{4}}t^{\frac{2q+n}{2}-1}{\mathrm{e}}^{-t}\mathrm {d}t\nonumber \\&\quad \le C_7(n)d^{2q -\frac{n}{2}-1}\int _{\frac{L^2}{4d}}^{\infty }{\mathrm{e}}^{-t}\mathrm {d}t\nonumber \\&\quad \le C_8(n)d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}}. \end{aligned}$$

(3.8)

Finally, using the estimate (3.8) into (3.2) gives the desired inequality.

Remark 7

The final estimate (3.8) from Proposition 2 takes the following interesting shapes:

If $L=b \sqrt{d}$ with $b>0$, then:
$$\begin{aligned} d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}} \le d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{b^2}{4}}.\end{aligned}$$
If $L=\sqrt{b d\log d}$ with $b>0$, then:
$$\begin{aligned} d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}} \le d^{2q-\frac{n}{2}-\frac{b}{4}-1}.\end{aligned}$$
If $L=d^{b}$ with $b\in (\frac{1}{2}, 1)$, then there exist $c_1$ (depending on b, q and n), $c_2>0$ (depending exclusively on b) such that:
$$\begin{aligned} d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}} \le c_1{\mathrm{e}}^{- \frac{d^{c_{2}}}{4}}.\end{aligned}$$
If $L=b d$ with $b\in (0, 1)$, then there exist $c_1$(depending on b, q and n), $c_2>0$ (depending exclusively on b) such that:
$$\begin{aligned} d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}} \le c_1{\mathrm{e}}^{-c_2d}. \end{aligned}$$
(3.9)

4 Stability

Let us consider the discriminant set $\Sigma _{n,d}\subset \mathcal {S}_{n,d}$ consisting of all those polynomials whose zero set on the sphere is singular:

$$\begin{aligned} \Sigma _{n,d}=\{p\in \mathcal {S}_{n,d} \text {such that there exists} x\in \mathbb {S}^n \text {with} p(x)=0 \text {and} \nabla _{\mathbb {S}^n}p(x)=0\}.\end{aligned}$$

Given $p\in \mathcal {S}_{n,d}$, we denote by $\delta (p)$ its distance, in the Bombieri–Weyl norm, to $\Sigma _{n,d}:$

$$\begin{aligned} \delta (p)=\min _{s\in \Sigma _{n,d}}\Vert s-p\Vert _{\text {BW}}.\end{aligned}$$

If $Z_1, Z_2\subset \mathbb {S}^n$ are two smooth hypersurfaces, we will write $(\mathbb {S}^n, Z_1)\sim (\mathbb {S}^n, Z_2)$ to denote that the two pairs $(\mathbb {S}^n, Z_1)$ and $(\mathbb {S}^n, Z_2)$ are diffeomorphic. Given $f\in C^1(\mathbb {S}^n, \mathbb {R})$ we denote by $Z(f)\subset \mathbb {S}^n$ its zero set. A small perturbation in the $C^1$-norm of a function $f\in C^1(\mathbb {S}^n, \mathbb {R})$ whose zero set Z(f) is nondegenerate does not change the class of the pair $(\mathbb {S}^n, Z(f))$; the next proposition makes this more quantitative.

Proposition 3

Let $p\in \mathcal {S}_{n,d}\backslash \Sigma _{n,d}$. Given $f\in C^1(\mathbb {S}^n, \mathbb {R})$ | $\Vert f-p\Vert _{C^1}< \frac{\delta (p)}{2}$, we have:

$$\begin{aligned} (\mathbb {S}^n, Z(p))\sim (\mathbb {S}^n, Z(f)).\end{aligned}$$

Proof

For $t\in [0,1]$, let us consider now the function $f_t=p+t(f-p)$. Since $\Vert f-p\Vert _{C^1}<\frac{\delta (p)}{2}$, for all $\theta \in \mathbb {S}^n$ we have:

$$\begin{aligned} |f_t(\theta )- p(\theta )|<\frac{\delta }{2}.\end{aligned}$$

Moreover, since $d\ge 1$, from $\Vert f-p\Vert _{C^1}<\frac{\delta (p)}{2}$ we also deduce $\frac{\Vert f-p\Vert _{C^1}}{\sqrt{d}}<\frac{\delta (p)}{2},$ which in turn implies for every $t\in [0,1]$ and $\theta \in \mathbb {S}^{n}$:

$$\begin{aligned} \left| \frac{\Vert \nabla _{\mathbb {S}^{n}}f_t(\theta )\Vert }{\sqrt{d}}- \frac{\Vert \nabla _{\mathbb {S}^{n}}p(\theta )\Vert }{\sqrt{d}}\right| <\frac{\delta (p)}{2}.\end{aligned}$$

(4.1)

Recall from [27, Theorem 5.1] the following explicit expression^{Footnote 1} for $\delta (p)$:

$$\begin{aligned} \delta (p)=\min _{\theta \in \mathbb {S}^n}\left( |p(\theta )|^2+\frac{\Vert \nabla _{\mathbb {S}^n}p(\theta )\Vert ^2}{d}\right) ^{1/2}.\end{aligned}$$

(4.2)

Note that $\left( |p(\theta )|^2+\frac{\Vert \nabla _{\mathbb {S}^n}p(\theta )\Vert ^2}{d}\right) ^{1/2}$ equals the distance in $\mathbb {R}^2$ between the two vectors $v_1(\theta )=(|p(\theta )|, 0)$ and $v_2(\theta )=\left( 0, \frac{\Vert \nabla _{\mathbb {S}^{n}}p(\theta )\Vert }{\sqrt{d}}\right) .$ Observe also that the two vectors $w_1(t,\theta )=(|f_t(\theta )|, 0)$ and $w_2(t,\theta )=\left( 0, \frac{\Vert \nabla _{\mathbb {S}^{n}}f_t(\theta )\Vert }{\sqrt{d}}\right) $, in virtue of (4.1) and (4.2), satisfy:

$$\begin{aligned} w_1(t,\theta )\in B_1(\theta )&=B_{\mathbb {R}^2}\left( v_1(\theta ), \frac{\delta (p)}{2}\right) \quad \\&\text {and}\quad w_2(t,\theta )\in B_2(\theta )=B_{\mathbb {R}^2}\left( v_2(\theta ), \frac{\delta (p)}{2}\right) .\end{aligned}$$

In particular:

$$\begin{aligned} \left( |f_t(\theta )|^2+\frac{\Vert \nabla _{\mathbb {S}^n}f_t(\theta )\Vert ^2}{d}\right) ^{1/2}&=\Vert w_1(t,\theta )-w_2(t,\theta )\Vert \\&> d_{\mathbb {R}^2}\left( B_1(\theta ), B_2(\theta )\right) \\&=\Vert v_1(\theta )-v_2(\theta )\Vert -\delta (p), \end{aligned}$$

where the strict inequality comes from the fact that $w_1$ and $w_2$ belong to the interior of the balls.

Taking the minimum over $\theta \in \mathbb {S}^n$ in the above expression gives:

$$\begin{aligned} \min _{\theta \in \mathbb {S}^n}\left( |f_t(\theta )|^2+\frac{\Vert \nabla _{\mathbb {S}^n}f_t(\theta )\Vert ^2}{d}\right) ^{1/2}>0\quad \forall t\in [0,1]. \end{aligned}$$

(4.3)

In particular, the equation $\{f_t=0\}$ on $\mathbb {S}^n$ is regular for all $t\in [0,1]$: whenever $f_t(\theta )=0$, then $\nabla _{\mathbb {S}^n}f_t(\theta )$ cannot vanish because of the strict inequality in (4.3). The result follows now from Thom’s First Isotopy Lemma [22, Proposition 11.1] (see also [34, Théorème 2.D.2]).

Next proposition quantifies how large is the set of stable polynomials in the Bombieri–Weyl norm. This is a special case of [3, Theorem 21.1] (see also [5, Theorem 5.1]), applied to the case of the real discriminant, which has degree $(n+1)(d-1)^n$.

Proposition 4

There exist $c_3(n), c_4(n)>0$ such that for every $s\ge c_4(n)d^{2n}$ and for $p\in \mathcal {P}_{n,d}$:

$$\begin{aligned} \mathbb {P}\bigg \{\Vert p\Vert _{\mathrm {BW}}\le s \delta (p)\bigg \}\ge 1-c_3(n)\frac{d^{2n}}{s}.\end{aligned}$$

5 Low-Degree Approximation

Theorem 5

There exists $c_5(n)>0$ such that for every $L, \sigma >1$ we have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1-\left( c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}\sigma ^2+\frac{1}{\sigma }\right) .\end{aligned}$$

Moreover, denoting by $\alpha (d,L)= c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}$, there exist $c_6(n),c_7(n)>0$ such that for every L, d satisfying $\alpha (d, L)<\frac{1}{2}$, we have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1-c_6(n)d^{c_7(n)}{\mathrm{e}}^{-\frac{L^2}{12d}}.\end{aligned}$$

Remark 8

Of course, the previous statement is interesting if we can choose $L, \sigma >0$ in such a way that $\frac{1}{\sigma }$ goes to zero, but not too fast, and L is significantly smaller than d, but not too small, because we still want the exponential term ${\mathrm{e}}^{-\frac{L^2}{4d}}$ to kill the other factors and make the probability go to one.

Proof

Let $p\in \mathcal {S}_{n,d}$ and $L\in \{0, \ldots , d\}$. We have the following chain of inequalities:

$$\begin{aligned} \left\| p-p|_{L}\right\| _{C^1}&\le c_1(n)d^{\frac{1}{2}} \Vert p-p|_{L}\Vert _{H^q}&({\text {Proposition}}~{} 1)\nonumber \\&\le c_1(n) d^{\frac{1}{2}}t\Vert p\Vert _{\text {BW}}&({\text {Proposition}}~{} 2)\nonumber \\&\le c_1(n) d^{\frac{1}{2}}t s\delta (p)&({\text {Proposition}}~{} 4) \end{aligned}$$

(5.1)

which hold for every $q \ge \frac{n+1}{2}$, $t>0$ and $s \ge c_4(n)d^{2n}$, with probability

$$\begin{aligned}\mathbb {P}\ge 1-\left( c_2(n)\frac{d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}}}{t^2}+ c_3(n)\frac{d^{2n}}{s}\right) .\end{aligned}$$

We now make the choices:

$$\begin{aligned} s=c_4(n)d^{2n}\sigma ,\quad t=\frac{1}{3c_1(n)c_4(n)d^{2n+1/2}\sigma }\quad \text {and}\quad q=\frac{n+1}{2}.\end{aligned}$$

With these choices, we have:

$$\begin{aligned}&s \ge c_4(n)d^{2n} \end{aligned}$$

(5.2)

$$\begin{aligned}&c_1(n) d^{\frac{1}{2}}t s<\frac{1}{2} \end{aligned}$$

(5.3)

$$\begin{aligned}&c_2(n)\frac{d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}}}{t^2}+ c_3(n)\frac{d^{2n}}{s}\le c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}\sigma ^2+\frac{1}{\sigma }, \end{aligned}$$

(5.4)

where we have set $c_5(n)=c_2(n)(3c_1(n)c_4(n))^2.$

Because of (5.2), we can apply the estimate in (5.1) which, using (5.3), becomes:

$$\begin{aligned} \left\| p-p|_{L}\right\| _{C^1} \le c_1(n) d^{\frac{1}{2}}t s\delta (p)<\frac{\delta (p)}{2}.\end{aligned}$$

Using (5.4), the last chain of inequalities holds with probability:

$$\begin{aligned} \mathbb {P}\ge 1-\left( c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}\sigma ^2+\frac{1}{\sigma }\right) .\end{aligned}$$

Now denote $\alpha (d,L)= c_5(n)d^{\frac{5n}{2}+2}{\mathrm{e}}^{-\frac{L^2}{4d}}$. We have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1-\left( \alpha (d,L)\sigma ^2+\frac{1}{\sigma }\right) \end{aligned}$$

for all $\sigma > 1.$ In particular,

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1 - \inf _{\sigma >1}\left( \alpha (d,L)\sigma ^2+\frac{1}{\sigma }\right) .\end{aligned}$$

It is not hard to check that this infimum is a minimum. It is attained at $\sigma _0=\left( \frac{1}{2\alpha (d, L)}\right) ^{1/3}$ and is equal to $\frac{2}{3}(2\alpha (d,L))^{\frac{1}{3}}$. Hence, if $\sigma _0>1$ (i.e., $\alpha (d, L)<\frac{1}{2}$), we have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{L}\right\| _{C^1}<\frac{\delta (p)}{2}\right\} \ge 1 - c_6(n)d^{c_7(n)}{\mathrm{e}}^{-\frac{L^2}{12d}},\end{aligned}$$

where we have set $c_6(n)= \frac{2}{3}(2c_5(n))^{\frac{1}{3}}$ and $c_7(n)= \frac{5n}{6}+\frac{2}{3}.$

Proposition 6

For every $a>0$, there exists $b>0$ such that for sufficiently large d:

$$\begin{aligned} \left\| p-p|_{\sqrt{bd\log d}}\right\| _{C^1}< \frac{\delta (p)}{2}\end{aligned}$$

(5.5)

with probability greater than $1-O({d^{-a}}).$

Proof

Let $a > 0$ and $L=\sqrt{bd \log d}$. Then, we have:

$$\begin{aligned} c_6(n)d^{c_7(n)}{\mathrm{e}}^{-\frac{L^2}{12d}}\le d^{c_7(n)-\frac{b}{12}}\le O(d^{-a}),\end{aligned}$$

where the last inequality holds for $b>0$ large enough. We apply now Theorem 5 with this choice and have:

$$\begin{aligned} \mathbb {P}\left\{ \left\| p-p|_{\sqrt{bd\log d}}\right\| _{C^1}< \frac{\delta (p)}{2}\right\} \ge 1-\left( d^{c_7(n)-\frac{b}{12}}\right) \ge 1-O(d^{-a}). \end{aligned}$$

6 Applications to Random Topology

In this section, we show how the previous results can be used to put constraints on the topological type of the pair $(\mathbb {S}^n, Z(p))$ for p a random Kostlan polynomial. The first result is the following more detailed version of Theorem 1 from the Introduction, which we reformulate here making the constants and the quantifiers more precise.

Theorem 7

For every $d\in \mathbb {N}$ and $b>0$, consider the event $E_d(b)\subset \mathcal {P}_{n,d}$ consisting of the set of Kostlan polynomials p such that $(\mathbb {S}^n,Z(p))\sim (\mathbb {S}^n, Z(q))$, with q of degree $\sqrt{b d\log d}$. For every $a>0$, there exist $b>0$ and $c>0$ such that $E_d(b)$ holds with probability at least $1-\frac{c}{d^{a}}$.

Proof

Let $a>0$ and consider the $b>0$ given by Proposition 6. Then, Proposition 3, (5.5) implies that the pairs $(\mathbb {S}^n, Z(p))$ and $(\mathbb {S}^n, Z(p|_{\sqrt{bd\log d}}))$ are diffeomorphic and the conclusion follows from Proposition 6.

6.1 Hypersurfaces with Rich Topology

For a topological space X, we denote by b(X) the sum of its $\mathbb {Z}_2$-Betti numbers (sometimes also called the homological complexity of X). Recall by [23] that if $P\in \mathbb {R}[x_0, \ldots , x_n]_{d}$, then the zero set of $p=P|_{\mathbb {S}^n}$ has homological complexity bounded by $b(Z(p))\le O(d^{n}).$

Theorem 8

For $\alpha >0$, let $M_{\alpha ,d}\subset \mathcal {S}_{n,d}$ be the set:

$$\begin{aligned} M_{\alpha , d}:=\{p\in \mathcal {S}_{n,d} | b(Z(p))\ge \alpha d^n\}.\end{aligned}$$

Then, there exist $\gamma _1(\alpha ), \gamma _2(\alpha )>0$ such that:

$$\begin{aligned} \mathbb {P}(M_{\alpha ,d})\le \gamma _1(\alpha ){\mathrm{e}}^{-\gamma _2(\alpha )d}.\end{aligned}$$

Proof

Observe first that if $q\in \mathbb {R}[x_0, \ldots , x_n]_L$ (q is just a polynomial of degree L, not necessarily homogeneous), then $b(Z(q))\le cL^n$ for some $c>0$, again by [23]. Hence, if we want $b(Z(q))>\alpha d^n$ we must have:

$$\begin{aligned} L>\left( \frac{\alpha }{c}\right) ^{\frac{1}{n}} d.\end{aligned}$$

Arguing as in the proof of Theorem 5, where now we take the projection $\lambda =\lambda _L:\mathcal {S}_{n, d}\rightarrow \mathcal {S}_{n, L}$ choosing the value $L=\left( \frac{\alpha }{c}\right) ^{\frac{1}{n}} d$, we see that for every $t>0$ and $s\ge c_4(n)d^{2n}$:

$$\begin{aligned} \Vert p-p|_{L}\Vert _{C^1}\le c_1(n) d^{\frac{1}{2}}t s\delta (p)\end{aligned}$$

with probability

$$\begin{aligned} \mathbb {P}&\ge 1-\left( c_2(n)\frac{d^{2q-\frac{n}{2}-1}{\mathrm{e}}^{-\frac{L^2}{4d}}}{t^2}+ c_3(n)\frac{d^{2n}}{s}\right)&({\text {Propositions}}~{} 1, ~{} 2,~{} 4)\\&\ge 1-\left( c_5(n,\alpha )\frac{{\mathrm{e}}^{-c_6(\alpha )d}}{t^2}+ c_3(n)\frac{d^{2n}}{s}\right)&{\text {by}}~ (3.9). \end{aligned}$$

Observe now that:

$$\begin{aligned} t=c_5(n,\alpha )^{1/2}{\mathrm{e}}^{-\frac{c_6(\alpha )d}{4}}\implies c_5(n,\alpha )\frac{{\mathrm{e}}^{-c_6(\alpha )d}}{t^2}\le \gamma _{3}(\alpha ){\mathrm{e}}^{-\gamma _{4}(\alpha )d}\end{aligned}$$

(6.1)

for some constants $\gamma _3(\alpha ), \gamma _4(\alpha )>0$, and

$$\begin{aligned} s= \frac{{\mathrm{e}}^{\frac{c_6(\alpha )d}{4}}}{3c_1(n) d^{1/2}c_5(n,\alpha )^{1/2}}\implies c_3(n)\frac{d^{2n}}{s} \le \gamma _5(\alpha ){\mathrm{e}}^{-\gamma _6(\alpha )d} \end{aligned}$$

(6.2)

for some constants $\gamma _5(\alpha ), \gamma _6(\alpha )>0$.

Choosing s as in (6.2) and t as in (6.1), for $d>0$ large enough we have $s\ge c_4(n)d^{2n}$, and $c_1(n)d^{1/2}ts<\frac{1}{2}$; it follows that there exist constants $\gamma _1(\alpha ), \gamma _2(\alpha )>0$ such that

$$\begin{aligned} \Vert p-p|_{L}\Vert _{C^1}<\frac{\delta (p)}{2}\quad \text {with probability}\quad \mathbb {P}\ge 1-\gamma _1(\alpha ){\mathrm{e}}^{-\gamma _2(\alpha )d}.\end{aligned}$$

(6.3)

The condition $b(Z(p))>\alpha d^n$ implies that with the choice of $L<\left( \frac{\alpha }{c}\right) ^{\frac{1}{n}} d$ we must have $\Vert p-p|_{L}\Vert _{C^1}\ge \frac{\delta (p)}{2},$ for otherwise the zero set of p would be diffeomorphic to the zero set of $p-p|_{L}$ which, since $\deg (p-p|_{L})<L$, has homological complexity bounded by $b(Z(p-p|_{L}))<cL^n<\alpha d^n.$ In particular:

$$\begin{aligned} \bigg \{b(Z(p))>\alpha d^n\bigg \}\subset \left\{ \Vert p-p|_{L}\Vert _{C^1}\ge \frac{\delta (p)}{2}\right\} ,\end{aligned}$$

which combined with (6.3) implies the statement.

Remark 9

Notice that, using Theorem 5 and Remark 7, similar rarefaction estimates can be produced for the set of hypersurface with less rich topology. For instance, those with $b(Z(p))\ge (d^{1/2+\epsilon })^n$ would have probability smaller than $c_1{\mathrm{e}}^{-d^{c_2}/2}$, for some constant $c_1(\epsilon ), c_2(\epsilon )>0.$

Remark 10

It is not difficult to derive from Theorem 8 a similar result for random zero projective sets $Z(p)\subset \mathbb {R}\mathrm {P}^n$. In this context, the previous result should be compared with [10, Theorem 1], where the authors prove that the Kostlan measure of the set of curves $C\subset \mathbb {R}\mathrm {P}^2$ of degree d whose number of components is more than $\frac{(d-1)(d-2)}{2}+1-ad$ is $O({\mathrm{e}}^{-c_2d})$. Theorem 8 is stronger in two senses: it applies to the general case of hypersurfaces in $\mathbb {R}\mathrm {P}^n$ and it gives exponential rarefaction for all sets of the form $\{b_0(Z(p))\ge \alpha d^n\}$ (i.e., not necessarily a linear correction from the maximal bound).

6.2 Depth of a Nest

Given $p\in \mathcal {S}_{n,d}\backslash \Sigma _{n,d}$, its zero set $Z(p)\subset \mathbb {S}^n$ consists of a finite union of connected, smooth and compact hypersurfaces. Fixing a point $y_\infty \in \mathbb {S}^n$ (with $\mathbb {P}=1$, this point does not belong to Z(p)), every such component of Z(p) separates the sphere $\mathbb {S}^n$ into two open sets: a “bounded” one (the open set which does not contain $y_\infty $) and an “unbounded” one (the open set which contains $y_\infty $). The nesting graph of Z(p) (with respect to $y_\infty $) is a graph whose vertices are the components of Z(p) and there is an edge between two components if and only if one is contained in the bounded component of the other. The resulting graph is a forest (a union of trees) and we say that $(\mathbb {S}^n, Z(p))$ has a nest of depth m if this forest contains a tree of depth m.

Theorem 9

For $\alpha >0$, let $N_{\alpha d}\subset \mathcal {S}_{n,d}$ be the set:

$$\begin{aligned} N_{\alpha , d}:=\{p\in \mathcal {S}_{n,d} | Z(p) \text {has a nest of depth} \ge \alpha d\}.\end{aligned}$$

Then, there exist $c_1(\alpha ), c_2(\alpha )>0$ such that:

$$\begin{aligned} \mathbb {P}(N_{\alpha , d})\le c_1(\alpha ){\mathrm{e}}^{-c_2(\alpha )d}.\end{aligned}$$

Proof

The proof is essentially the same as the proof of Theorem 8, after observing that the depth of every nest of the zero set of a polynomial of degree L is smaller than L.

Notes

This expression can also be derived from [4, Theorem 19.3], where it is proved that the distance from the real discriminant equals the reciprocal of the condition number. We prefer to quote directly [27] because it seems that this nice work of Raffalli has been forgotten from the literature on the subject.

References

Michele Ancona. Exponential rarefaction of maximal real algebraic hypersurfaces, 2020.
Paul Breiding, Hanieh Keneshlou, and Antonio Lerario. Quantitative Singularity Theory for Random Polynomials. International Mathematics Research Notices, 10 2020. rnaa274.
Peter Bürgisser and Felipe Cucker. Condition, volume 349 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer, Heidelberg, 2013. The geometry of numerical algorithms.
Peter Bürgisser and Felipe Cucker. Condition: The geometry of numerical algorithms, volume 349 of Grundlehren der Mathematischen Wissenschaften. Springer, Heidelberg, 2013.
Felipe Cucker, Teresa Krick, and Michael Shub. Computing the homology of real projective sets. Found. Comput. Math., 18(4):929–970, 2018.
Article MathSciNet Google Scholar
Alan Edelman and Eric Kostlan. How many zeros of a random polynomial are real? Bull. Amer. Math. Soc. (N.S.), 32(1):1–37, 1995.
Alan Edelman, Eric Kostlan, and Michael Shub. How many eigenvalues of a random matrix are real? J. Amer. Math. Soc., 7(1):247–267, 1994.
Article MathSciNet Google Scholar
Gerald B. Folland. How to integrate a polynomial over a sphere. Am. Math. Mon., 108(5):446–448, 2001.
Article MathSciNet Google Scholar
Y. V. Fyodorov, A. Lerario, and E. Lundberg. On the number of connected components of random algebraic hypersurfaces. J. Geom. Phys., 95:1–20, 2015.
Article MathSciNet Google Scholar
Damien Gayet and Jean-Yves Welschinger. Exponential rarefaction of real curves with many components. Publ. Math. Inst. Hautes Études Sci., (113):69–96, 2011.
Article MathSciNet Google Scholar
Damien Gayet and Jean-Yves Welschinger. Lower estimates for the expected Betti numbers of random real hypersurfaces. J. Lond. Math. Soc. (2), 90(1):105–120, 2014.
Damien Gayet and Jean-Yves Welschinger. Expected topology of random real algebraic submanifolds. J. Inst. Math. Jussieu, 14(4):673–702, 2015.
Article MathSciNet Google Scholar
Damien Gayet and Jean-Yves Welschinger. Betti numbers of random real hypersurfaces and determinants of random symmetric matrices. J. Eur. Math. Soc. (JEMS), 18(4):733–772, 2016.
Article MathSciNet Google Scholar
V. Gichev. Decomposition of the Kostlan-Shub-Smale model for random polynomials. In Complex analysis and dynamical systems VII, volume 699 of Contemp. Math., pages 103–120. Amer. Math. Soc., Providence, RI, 2017.
M. Kac. On the average number of real roots of a random algebraic equation. Bull. Amer. Math. Soc., 49:314–320, 1943.
Article MathSciNet Google Scholar
E. Kostlan. On the distribution of roots of random polynomials. In From Topology to Computation: Proceedings of the Smalefest (Berkeley, CA, 1990), pages 419–431. Springer, New York, 1993.
Eric Kostlan. On the expected number of real roots of a system of random polynomial equations. In Foundations of computational mathematics (Hong Kong, 2000), pages 149–188. World Sci. Publ., River Edge, NJ, 2002.
Antonio Lerario. Random matrices and the average topology of the intersection of two quadrics. Proc. Amer. Math. Soc., 143(8):3239–3251, 2015.
Article MathSciNet Google Scholar
Antonio Lerario and Erik Lundberg. Statistics on Hilbert’s 16th problem. Int. Math. Res. Not. IMRN, (12):4293–4321, 2015.
MathSciNet MATH Google Scholar
Antonio Lerario and Erik Lundberg. Gap probabilities and Betti numbers of a random intersection of quadrics. Discrete Comput. Geom., 55(2):462–496, 2016.
Article MathSciNet Google Scholar
Antonio Lerario and Erik Lundberg. On the geometry of random lemniscates. Proc. Lond. Math. Soc. (3), 113(5):649–673, 2016.
John Mather. Notes on topological stability john mather, 1970.
J. Milnor. On the Betti numbers of real varieties. Proc. Amer. Math. Soc., 15:275–280, 1964.
Article MathSciNet Google Scholar
F. Nazarov and M. Sodin. Asymptotic laws for the spatial distribution and the number of connected components of zero sets of Gaussian random functions. Zh. Mat. Fiz. Anal. Geom., 12(3):205–278, 2016.
Article MathSciNet Google Scholar
Fedor Nazarov and Mikhail Sodin. On the number of nodal domains of random spherical harmonics. Amer. J. Math., 131(5):1337–1357, 2009.
Article MathSciNet Google Scholar
S. Yu. Orevkov and V. M. Kharlamov. Growth order of the number of classes of real plane algebraic curves as the degree grows. Zap. Nauchn. Sem. S.-Peterburg. Otdel. Mat. Inst. Steklov. (POMI), 266(Teor. Predst. Din. Sist. Komb. i Algoritm. Metody. 5):218–233, 339, 2000.
Christophe Raffalli. Distance to the discriminant. preprint on arXiv, 2014.https://arxiv.org/abs/1404.7253
Peter Sarnak. Letter to b. gross and j. harris on ovals of random planes curve. handwritten letter, 2011. available at http://publications.ias.edu/sarnak/section/515
Peter Sarnak and Igor Wigman. Topologies of nodal sets of random band limited functions. In Advances in the theory of automorphic forms and their$L$-functions, volume 664 of Contemp. Math., pages 351–365. Amer. Math. Soc., Providence, RI, 2016.
R. T. Seeley. Spherical harmonics. Amer. Math. Monthly, 73(4, part II):115–121, 1966.
M. Shub and S. Smale. Complexity of Bezout’s theorem. II. Volumes and probabilities. In Computational algebraic geometry (Nice, 1992), volume 109 of Progr. Math., pages 267–285. Birkhäuser Boston, Boston, MA, 1993.
Michael Shub and Steve Smale. Complexity of Bézout’s theorem. I. Geometric aspects. J. Amer. Math. Soc., 6(2):459–501, 1993.
Michael Shub and Steve Smale. Complexity of Bezout’s theorem. III. Condition number and packing. J. Complexity, 9(1):4–14, 1993. Festschrift for Joseph F. Traub, Part I.
R. Thom. Ensembles et morphismes stratifiés. Bull. Amer. Math. Soc., 75:240–284, 1969.
Article MathSciNet Google Scholar
George Wilson. Hilbert’s sixteenth problem. Topology, 17(1):53–73, 1978.
Article MathSciNet Google Scholar

Download references

Acknowledgements

We are indebted to Marie-Françoise Roy, who has played a crucial role for the existence of this paper. We also wish to thank the anonymous referees for their constructive comments which helped improving the presentation of the paper.

Funding

Open access funding provided by Scuola Internazionale Superiore di Studi Avanzati - SISSA within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

LMA, University Assane Seck of Ziguinchor, BP 523, Ziguinchor, Senegal
Daouda Niang Diatta
SISSA, Mathematics area, via Bonomea 265, 34131, Trieste, Italy
Antonio Lerario

Authors

Daouda Niang Diatta
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Lerario
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Antonio Lerario.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Communicated by Felipe Cucker.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Diatta, D.N., Lerario, A. Low-Degree Approximation of Random Polynomials. Found Comput Math 22, 77–97 (2022). https://doi.org/10.1007/s10208-021-09506-y

Download citation

Received: 14 February 2019
Revised: 30 January 2021
Accepted: 11 February 2021
Published: 25 March 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s10208-021-09506-y

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Low-Degree Approximation of Random Polynomials

Abstract

Similar content being viewed by others

Quasi-random multilinear polynomials

Hall’s Conjecture on Extremal Sets for Random Triangles

Zero distribution of random polynomials

1 Introduction

Theorem A

Remark 1

1.1 Consequences

Remark 2

Remark 3

Remark 4

2 Spaces of Polynomials and Norms

Proposition 1

Proof

3 Gaussian Measures and Random Polynomials

Remark 5

Proposition 2

Remark 6

Proof

Remark 7

4 Stability

Proposition 3

Proof

Proposition 4

5 Low-Degree Approximation

Theorem 5

Remark 8

Proof

Proposition 6

Proof

6 Applications to Random Topology

Theorem 7

Proof

6.1 Hypersurfaces with Rich Topology

Theorem 8

Proof

Remark 9

Remark 10

6.2 Depth of a Nest

Theorem 9

Proof

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation