1 Introduction

Let \(\mathrm{Sym}_{n}(\mathbb{R})\) and \(\mathrm{Sym}_{n}(\mathbb{C})\) denote the spaces of \(n\times n\) real symmetric (correspondingly, complex Hermitian) matrices. When referring to both spaces at once, we will use the term “self-adjoint matrices” and use the notation \(\mathrm{Sym}_{n}\). The eigenvalues \(\{\widehat{\lambda}_{i}(A)\}_{i=1}^{n}\) of a matrix \(A\in \mathrm{Sym}_{n}\) are real and will be numbered in the increasing order,

$$ \widehat{\lambda}_{1}(A) \leq \widehat{\lambda}_{2}(A) \leq \cdots \leq \widehat{\lambda}_{n}(A). $$
(1.1)

Further, let \(M\) be a smooth (i.e. \(C^{\infty}\)) compact \(d\)-dimensional manifold. A smooth \(d\)-parametric family of self-adjoint matrices (on \(M\)) is a smooth map \(\mathcal{F}: M \to \mathrm{Sym}_{n}\).

The aim of this paper is to develop the Morse theory for the \(k\)-th ordered eigenvalue

$$ \lambda _{k}:=\widehat{\lambda}_{k}\circ \mathcal{F} $$

viewed as a function on \(M\). This question is motivated by numerous problems in mathematical physics. The boundaries between isolating and conducting regimes in a periodic (crystalline) structure are determined by the extrema of eigenvalues of an operatorFootnote 1 family defined on a \(d\)-dimensional torus \(M\) (for an introduction to the mathematics of this subject, see [49]). Other critical points of the eigenvalues give rise to special physically observable features of the density of states, the van Hove singularities [63]. Classifying all critical points of an eigenvalue (also on a torus) by their degree is used to study oscillation of eigenfunctions via the nodal–magnetic theorem [3, 4, 10, 20]. More broadly, the area of eigenvalue optimization encompasses questions from understanding the charge distribution in an atomic nucleus [28], configuration of atoms in a polyatomic molecule [24, 50], to shape optimization [40, 41] and optimal partition of domains and networks [8, 39]. The dimension of the manifold \(M\) in these applications can be very high or even infinite.

Morse theory is a natural tool for connecting statistics of the critical points with the topology of the underlying manifold. However, the classical Morse theory is formulated for functions that are sufficiently smooth, whereas the function \(\lambda _{k}\) is generically non-smooth at the points where \(\lambda _{k}(x)\) is a repeated eigenvalue of the matrix \(\mathcal{F}(x)\). And it is these points of non-smoothness that play an outsized role in the applications [17, 24].

By Bronstein’s theorem [15], each \(\lambda _{k}\) is Lipschitz. Furthermore, by classical perturbation theory [48], the function \(\lambda _{k}\) is smooth along a submanifold \(N \subset M\) if the multiplicity of \(\lambda _{k}(x)\) is constant on \(N\); the latter property induces a stratification of \(M\). There exist generalizations of Morse theory to Lipschitz functions [2], continuous functions [33, §45], as well as to stratified spaces [35]. These generalizations will provide the general foundation for our work, but the principal thrust of this paper is to leverage the properties of \(\mathrm{Sym}_{n}\) and to get explicit — and beautiful — answers for the Morse data in terms of the local behavior of ℱ at a discrete set of points we will identify as “critical”. One of the surprising findings is that the Morse data attributable to the non-smooth directions at a critical point does not depend on the particulars of the family ℱ.

To set the stage for our results we now review informally the main ideas of Morse theory, which links the topological invariants of the manifold \(M\) to the number and the indices of the critical points of a function \(\phi \) on \(M\).

In more detail, if \(\phi \) is smooth, a point \(x\in M\) is called a critical point if the differential of \(\phi \) vanishes at \(x\). The Hessian (second differential) of \(\phi \) at \(x\) is a quadratic form on the tangent space \(T_{x} M\). In local coordinates it is represented by the matrix of second derivatives, the Hessian matrix. The Morse index \(\mu (x)\) is defined as the negative index of this quadratic form or, equivalently, the number of negative eigenvalues of the Hessian matrix. It is assumed that the second differential at every critical point of \(\phi \) is non-singular, i.e. the Hessian matrix has no zero eigenvalues; such critical points are called non-degenerate. Non-degenerate critical points are isolated and therefore there are only finitely many of them on \(M\). A smooth function \(\phi \) is called a Morse function if all its critical points are non-degenerate.

The main result of the classical Morse theory quantifies the change in the topology of the level curves of \(\phi \) around a critical point. To be precise, for a point \(x\in M\) and its neighborhood \(U\) (which we will always assume to be homeomorphic to a ball) define the local sublevel sets,

$$ U_{x}^{-\varepsilon }(\phi ) := \{y\in U \colon \phi (y) \leq \phi (x)- \varepsilon \} \quad \text{and} \quad U_{x}^{+\varepsilon }(\phi ) := \{y\in U \colon \phi (y) \leq \phi (x)+\varepsilon \}. $$

If \(x\) is a non-degenerate critical point of index \(\mu =\mu (x)\), then, for a sufficiently small neighborhood \(U\) of \(x\) and sufficiently small \(\varepsilon >0\), the quotient space \(U_{x}^{+\varepsilon }(\phi )/U_{x}^{-\varepsilon }(\phi )\) is homotopy equivalent to the \(\mu \)-dimensional sphere \(\mathbb{S}^{\mu}\). The global consequences of this are the Morse inequalities: given a Morse function \(\phi \), denote by \(c_{q}\), \(q=0,\ldots ,d\), the number of its critical points of index \(q\). Then there exist \(d\) integers \(r_{q} \geq 0\) such that

$$ \begin{aligned} c_{0} &= \beta _{0} + r_{1}, \\ c_{1} &= \beta _{1} + r_{1} + r_{2}, \\ c_{2} &= \beta _{2} + r_{2} + r_{3}, \\ &\ldots \\ c_{d-1} &= \beta _{d-1}+r_{d-1}+r_{d}, \\ c_{d} &= \beta _{d} + r_{d}, \end{aligned} $$
(1.2)

where \(\beta _{q}\) is the \(q\)-th Betti number of the manifold \(M\), defined as the rank of the homology groupFootnote 2\(H_{q}(M)=H_{q}(M;\mathbb{Z})\). To put it another way, the Betti numbers \(\beta _{q}\) give the lower bound for the number of critical points of index \(q\); extra critical points can only be created in pairs of adjacent index.

Equations (1.2) can be expressed concisely in terms of generating functions: one defines the Morse polynomial \(P_{\phi}(t)\) of a Morse function \(\phi \) as the sum of \(t^{\mu (x)}\) over all critical points \(x \in M\) of \(\phi \). On the topological side, the Poincaré polynomial \(P_{M}(t)\) of the manifold \(M\) is the sum of \(\beta _{q}t^{q}\). Then the Morse inequalities are equivalent to the identity

$$ \left ( P_{\phi}(t) - P_{M}(t) \right ) / (1+t) = R(t), $$
(1.3)

where \(R(t)\) is a polynomial with nonnegative coefficients.

Now assume that \(\phi \) is just continuous; the local sublevel sets \(U^{\pm \varepsilon }_{x}(\phi )\) are still well-defined. Mimicking the classical Morse theory of smooth functions we adopt the following definitions (cf. [33, §45, Def. 1, 2 and 3], the critical points are called bifurcation points there):

Definition 1.1

A point \(x\in M\) is a topologically regular point of a continuous function \(\phi \) if there exists a small enough neighborhood \(U\) of \(x\) in \(M\) and \(\varepsilon >0\) such that \(U_{x}^{-\varepsilon }(\phi )\) is a strong deformation retract of \(U_{x}^{+\varepsilon }(\phi )\). We say that a point is topologically critical if it is not topologically regular.

Remark 1.2

If \(\phi \) is smooth, a topologically critical point \(x\) is also critical in the usual (differential) sense. The converse is, in general, not true: for example, if \(M=\mathbb{R}\) and \(\phi (x)=x^{3}\), then \(x=0\) is critical but not topologically critical. On the other hand, by the aforementioned main result of the classical Morse theory, if \(x\) is a non-degenerate critical point then it is also topologically critical.

Definition 1.3

Given a continuous function \(\varphi \) with a finite set of topologically critical points, the Morse polynomial \(P_{\phi}\) is the sum, over the topologically critical points \(x\), of the Poincaré polynomials of the relative homology groups \(H_{*}\big(U_{x}^{+\varepsilon }(\phi ),\, U_{x}^{-\varepsilon }( \phi )\big)\), where \(U\) is a small neighborhood of \(x\) and \(\varepsilon >0\) is sufficiently small.

Remark 1.4

If \(\phi \) is a smooth Morse function, Definition 1.3 reduces to the classical one as the relative homology groups \(H_{*}\bigl(U_{x}^{+\varepsilon }(\phi ),\, U_{x}^{-\varepsilon }( \phi )\bigr)\) coincide with the reduced homology groups of the \(\mu (x)\)-dimensional sphere \(\mathbb{S}^{\mu (x)}\), where \(\mu (x)\) is the Morse index of \(x\), and so the contribution of \(x\) to the Morse polynomial \(P_{\phi}(t)\) is equal to \(t^{\mu (x)}\).

With Definitions 1.1 and 1.3, the Morse inequalities (1.3) hold true for continuous functions \(\phi \) with finite number of topologically critical points (see, e.g., [33, §45, Theorem. 1]). The proof is essentially the same as the proof of the classical Morse inequality given in [51, §5] and is based on the exact sequence of pairs: the latter implies the subadditivity of relative Betti numbers and, more generally, of the partial alternating sums of relative Betti numbers, which implies the required Morse inequalities.

It is thus our goal to give a prescription for computing the Morse polynomial \(P_{\lambda _{k}}\) in terms ofand its derivatives, under some natural assumptions on. To that end we will need to:

  1. (1)

    Provide an explicit characterization of non-smooth topologically critical and topologically regular points of \(\lambda _{k}\);

  2. (2)

    Give a natural definition of a non-degenerate non-smooth topologically critical point;

  3. (3)

    For a non-degenerate topologically critical point \(x\) of \(\lambda _{k}\), find the relative homology

    $$ H_{q}(U_{x}^{\varepsilon }(\lambda _{k}),\, U_{x}^{-\varepsilon }( \lambda _{k})) $$

    for a sufficiently small neighborhood \(U\) of \(x\) and sufficiently small \(\varepsilon >0\). As a by-product, this will determine the correct contribution from \(x\) to the Morse polynomial \(P_{\lambda _{k}}(t)\) of \(\lambda _{k}\).

We remark that these questions are local in nature and we do not need to enforce compactness of \(M\) while answering them.

In this work, we completely implement the above objectives in the case of generic smooth families; additionally, our sufficient condition for a regular point is obtained for arbitrary families. The first objective is accomplished in the form of a “first derivative test”, with the derivative being applied to the smooth object: the family ℱ (see equation (1.5) and Theorems 1.5 and 1.12 for details).

The Morse contribution of a critical point (third objective) will consist of two parts: the classical index of the Hessian of \(\lambda _{k}\) in the directions of smoothness of \(\lambda _{k}\) and a contribution from the non-smooth directions which, remarkably, turns out to depend only on the size of the eigenvalue multiplicity and the relative position of the eigenvalue of interest and not on the particulars of the operator family. Theorem 1.14 expresses this contribution in terms of homologies of suitable Grassmannians; explicit formulas for the Poincaré polynomial are provided in Theorem 1.12. In Sect. 2.2 we mention some simple practical corollaries of our results as well as pose further problems.

1.1 A differential characterization of a topologically critical point

Our primary focus is on the points \(x\in M\) where the eigenvalue \(\lambda _{k}\) has multiplicity and is not differentiable. However, simple examples (for instance, Example 2.1 below) show that not every point of eigenvalue multiplicity is topologically critical.

Denote by \(\mathbf{E}_{k}\) the eigenspace of \(\lambda _{k}\) at a point \(x\in M\) of multiplicity \(\nu = \dim \mathbf{E}_{k}\). The compression of a matrix \(X \in \mathrm{Sym}_{n}\) to the space \(\mathbf{E}_{k}\) is the linear operator \(X_{\mathbf{E}_{k}}: \mathbf{E}_{k} \to \mathbf{E}_{k}\) acting as \(v \mapsto P_{\mathbf{E}_{k}} X v\), where \(P_{\mathbf{E}_{k}}\) is the orthogonal projector onto \(\mathbf{E}_{k}\). The matrix representation of \(X_{\mathbf{E}_{k}}\) can be computed as

$$ X _{\mathbf{E}_{k}} := \mathcal{U}^{*} X \mathcal{U}, $$
(1.4)

where \(\mathcal{U}: \mathbb{F}^{\nu}\to \mathbb{F}^{n}\) is a linear isometry such that \(\operatorname{Ran}(\mathcal{U}) = \mathbf{E}_{k}\) (explicitly, the columns of \(\mathcal{U}\) are an orthonormal basis of \(\mathbf{E}_{k}\)). Introduce the linear operator \(\mathcal{H}_{x}: T_{x}M \to \mathrm{Sym}_{\nu}\) acting as

$$ {\mathcal {H}}_{x} \colon v \mapsto \big( d\mathcal{F}(x)v \big)_{ \mathbf{E}_{k}}. $$
(1.5)

While the operator \({\mathcal {H}}_{x}\) depends on the choice of the isometry \(\mathcal{U}\) in (1.4) (or, equivalently, the choice of basis in \(\mathbf{E}_{k}\)), we will only use its properties that are invariant under unitary conjugation.

We recall that a matrix \(A\in \mathrm{Sym}_{\nu}\) is positive semidefinite (notation: \(A\in \mathrm{Sym}_{\nu}^{+}\)) if all of its eigenvalues are non-negative, positive definite (notation: \(A\in \mathrm{Sym}_{\nu}^{++}\)) if all eigenvalues are strictly positive. We denote by \(S^{\perp}\) the orthogonal complement of a space \(S\) in \(\mathrm{Sym}_{\nu}\) with respect to the Frobenius inner product \(\langle X, Y\rangle :=\operatorname{Tr}(XY)\). For future reference we note that if \(X\in \mathrm{Sym}_{\nu}^{++}\) and \(Y\in \mathrm{Sym}_{\nu}^{+}\), \(Y\neq 0\), then \(\langle X, Y\rangle >0\) (see, e.g., [14, Example 2.24]).

Our first main result gives a sufficient condition for a point of eigenvalue multiplicity to be topologically regular.

Theorem 1.5

Let \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be a smooth family whose eigenvalue \(\lambda _{k}\) has multiplicity \(\nu \geq 1\) at the point \(x\in M\). If \(\operatorname{Ran}{\mathcal {H}}_{x}\) contains a positive definite matrix or, equivalently,Footnote 3

$$ \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }\cap \mathrm{Sym}_{\nu}^{+} = 0, $$
(1.6)

then \(x\) is topologically regular for \(\lambda _{k}\).

This theorem is proved in Sect. 3 by studying the Clarke subdifferential at the point \(x\). We formulate the conditions in terms of both \(\operatorname{Ran}{\mathcal {H}}_{x}\) and \(\left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp}\) because the former emerges naturally from the proof while the latter is simpler in practical computations: generically it is one- or zero-dimensional as we will see in Sect. 4.

Remark 1.6

Condition (1.6) should be viewed as being analogous to the “non-vanishing gradient” in the smooth Morse theory. By what is sometimes called Hellmann–Feynman theorem (see Appendix A and references therein), the eigenvalues of \({\mathcal {H}}_{x}v\in \mathrm{Sym}_{\nu}\) give the slopes of the branches splitting off from the multiple eigenvalue \(\lambda _{k}(\mathcal{F}(x))\) when we leave \(x\) in the direction \(v\). The regularity condition of Theorem 1.5 is equivalent to having a direction in which all eigenvalues are increasing.

To further illustrate this point, consider the special case \(\nu =1\) when the eigenvalue \(\lambda _{k}\) is smooth. Let \(\psi \) be the eigenvector corresponding to \(\lambda _{k}\) at the point \(x\). The operator \({\mathcal {H}}_{x}: T_{x}M \to \mathbb{R}\) in this case maps \(v\) to \(\left < \psi , \big(d\mathcal{F}(x)v\big) \psi \right >_{\mathbb{F}^{n}}\) which is equal to the directional derivative of \(\lambda _{k}(x)\) in the direction \(v\). The condition of Theorem 1.5 is precisely that this derivative is non-zero in some direction, i.e. the gradient does not vanish.

Due to the topological nature of Definition 1.1, one cannot expect that a zero gradient-type condition alone would be sufficient for topological criticality (cf. Remark 1.2). To formulate a sufficient condition we need some notion of “non-degeneracy”, which will have a smooth (S) and non-smooth (N) parts.

Definition 1.7

Let \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be a smooth family whose eigenvalue \(\lambda _{k}\) has multiplicity \(\nu \geq 1\) at the point \(x\in M\). We say that ℱ satisfies the non-degenerate criticality condition (N) at the point \(x\) if

$$ \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }= \operatorname{span}\{B\}, \quad B \in \mathrm{Sym}_{n}^{++}. $$
(1.7)

Remark 1.8

Condition (1.7) ensures non-degenerate criticality in the directions in which \(\lambda _{k}\) is non-smooth (hence “N”); a single condition plays two roles:

  • it ensures that (1.6) is violated (intuitively, “the gradient is zero”), and

  • it ensures that \(\operatorname{Ran}{\mathcal {H}}_{x}\) has codimension 1, which will be interpreted in Sect. 4 as a type of transversality condition (intuitively, “non-degeneracy in the directions in which \(\lambda _{k}\) is non-smooth”).

Once condition (N) is satisfied at a point \(x\), we need to pay special attention to a submanifold \(S\) where the multiplicity of \(\lambda _{k}\) remains the same.

Proposition 1.9

Let \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be a smooth family whose eigenvalue \(\lambda _{k}\) has multiplicity \(\nu \geq 1\) at the point \(x\in M\). Ifsatisfies the non-degenerate criticality condition (N) at the point \(x\), then there exists a submanifold \(S\subset M\) such that for any \(y\) in a small neighborhood of \(x\) in \(M\), the multiplicity of \(\lambda _{k}(y)\) is equal to \(\nu \) if and only if \(y\in S\).

This submanifold, which we will call the (local) constant multiplicity stratum attached to \(x\), has the following properties:

  1. (1)

    \(S\) has codimension \(s(\nu ) := \dim \mathrm{Sym}_{\nu}(\mathbb{F}) - 1\) in \(M\),

  2. (2)

    the restriction \(\lambda _{k} \big|_{S}\) is a smooth function which has a critical point at \(x\), i.e. \(d\left (\lambda _{k}\big|_{S}\right )(x)=0\).

The proof of the above Proposition is in Sect. 4.

Definition 1.10

Assume ℱ satisfies the non-degenerate criticality condition (N) at the point \(x\) for the eigenvalue \(\lambda _{k}\) (in particular, x is a critical point of \(\lambda _{k} \big|_{S}\)). We will say ℱ satisfies the non-degenerate criticality condition (S) if \(x\) is a non-degenerate critical point of \(\lambda _{k} \big|_{S}\).

Naturally, “S” stands for smooth criticality. It turns out that, together, conditions (N) and (S) are sufficient for topological criticality. To quantify the topological change in the sublevel sets we need additional terminology. The relative index of the \(k\)-th eigenvalue at point \(x\) is

$$ i(x) = \#\left \{\lambda \in \operatorname{spec}\bigl(\mathcal{F}(x)\bigr) \colon \lambda \leq \lambda _{k}(x)\right \} - k + 1. $$
(1.8)

In other words, \(i(x)\) is the sequential number of \(\lambda _{k}\) among the eigenvalues equal to it, but counting from the top. It is an integer between 1 and \(\nu (x)\), the multiplicity of the eigenvalue \(\lambda _{k}(x)\) of the matrix \(\mathcal{F}(x)\). We will need the quantity \(s(i) := \dim \mathrm{Sym}_{i}(\mathbb{F}) - 1\), which already appeared in a different role in Proposition 1.9. It is given explicitly by

$$ s(i) := \dim \mathrm{Sym}_{i}(\mathbb{F}) - 1 = \textstyle\begin{cases} \frac{1}{2}i(i+1)-1, &\mathbb{F}=\mathbb{R}, \\ i^{2}-1, & \mathbb{F}=\mathbb{C}. \end{cases} $$
(1.9)

Finally, we denote by \(\binom{n}{k}_{q}\) the \(q\)-binomial coefficient,

$$ \binom{n}{k}_{q} := \cfrac{\prod _{i=1}^{n} (1-q^{i})}{{\prod _{i=1}^{k}} (1-q^{i})\prod _{i=1}^{n-k} (1-q^{i})}, $$

which is well known ([47, Corollary 2.6]) to be a polynomial in \(q\).

Definition 1.11

A smooth family \(\mathcal{F}:M \to \mathrm{Sym}_{n}({\mathbb{F}})\) is called generalized Morse if, at every point \(x \in M\), ℱ either satisfies the regularity condition (1.6) or satisfies the non-degenerate criticality conditions (N) and (S).

Theorem 1.12

Consider the eigenvalue \(\lambda _{k}\) of a smooth family \(\mathcal{F}: M \to \mathrm{Sym}_{n}\).

  1. (1)

    Ifsatisfies non-degenerate criticality conditions (N) and (S) at \(x\), then \(x\) is a topologically critical point of \(\lambda _{k}\). The set of points \(x\) where conditions (N) and (S) are satisfied is discrete.

  2. (2)

    If \(M\) is a compact manifold and the familyis generalized Morse, Morse inequalities (1.3) hold for the function \(\lambda _{k}:M\to \mathbb{R}\) with the Morse polynomial \(P_{\phi}(t) := P_{\lambda _{k}}(t)\) given by

    $$ P_{\lambda _{k}}(t) := \sum _{x\in \mathrm{CP}(\mathcal{F})} P_{ \lambda _{k}} (t; x), $$

    where the summation is over all topologically critical points \(x\) ofand, denoting by \(\nu (x)\) the multiplicity of the eigenvalue \(\lambda _{k}\) of \(\mathcal{F}(x)\), by \(i(x)\) its relative index, and by \(\mu (x)\) the Morse index of the restriction \(\lambda _{k}\big|_{S}\),

    $$ P_{\lambda _{k}}(t; x):= t^{\mu}\,\mathfrak{T}_{\nu}^{i} = t^{\mu +s(i)} \textstyle\begin{cases} {\binom{\lfloor (\nu -1)/2\rfloor }{(i-1)/2}}_{t^{4}}, & \textit{${\mathbb{F}}=\mathbb{R}$ and $i$ is odd}, \\ 0, & \textit{${\mathbb{F}}=\mathbb{R}$, $i$ is even, and $\nu $ is odd}, \\ t^{\nu -i}{\binom{\nu /2-1}{i/2-1}}_{t^{4}}, & \textit{${\mathbb{F}}=\mathbb{R}$, $i$ is even, and $\nu $ is even}, \\ {\binom{\nu -1}{i-1}}_{t^{2}}, & \textit{${\mathbb{F}}=\mathbb{C}$}. \end{cases} $$
    (1.10)

The topological criticality claimed in part (1) follows immediately whenever the Poincaré polynomial of the relative ℤ-homology — which is given in (1.10) — is non-zero. The case in the second line of (1.10) is more complicated, because that Poincaré polynomial is zero. To handle this case, in the final stages of the proof in Sect. 7 we will additionally calculate the Poincaré polynomial of the relative \(\mathbb{Z}_{2}\)-homology, see (7.3).

We also note that in equation (1.10) we introduced the notation \(\mathfrak{T}_{\nu}^{i}\) for the family-independent Morse contribution to \(P_{\lambda _{k}} (t; x)\). This contribution arises from the “non-smooth” directions transverse to the “smooth” constant multiplicity stratum \(S\). The index \(\mu = \mu (x)\) along the stratum \(S\) depends on the particulars of the family ℱ.

Now we state a result showing that for a “typical” ℱ, either Theorem 1.5 or Theorem 1.12 holds at every point \(x\in M\).

Theorem 1.13

The set of generalized Morse families is open and dense in the Whitney topology in \(C^{r} (M,\mathrm{Sym}_{n})\) for \(2\leq r\leq \infty \).

This result will be established in Sect. 4 as a part of Theorem 4.11. We will use transversality arguments similar to those in the proof of genericity of classical Morse functions (see, for example, [43, Chap. 4, Theorem 1.2]) via the strong (or jet) Thom transversality theorem for stratified spaces.

1.2 Geometrical description of the relative homology groups

In this subsection we provide some idea of what goes into the proof of Theorem 1.12, describing some geometric objects whose integer homology is quantified in (1.10).

First we introduce some notation. We denote by \(\operatorname{Gr}_{{\mathbb{F}}}(k,n)\) the Grassmannian of (non-oriented) \(k\)-dimensional subspaces in \({\mathbb{F}}^{n}\). Theorem 1.14 below uses certain homologies of \(\operatorname{Gr}_{\mathbb{R}}(k,n)\) with local coefficients, namely \(H_{*}\bigl(\operatorname{Gr}_{\mathbb{R}}(k, n); \widetilde{\mathbb{Z}}\bigr)\). The construction of this homology can be found, for instance, in [38, Sec. 3H] or [22, Chap. 5]; it will also be briefly summarized in Sect. 8.

Recall that for a given topological space \(Y\), the cone of \(Y\) is \(\mathcal{C}Y := Y\times [0,1] / \bigl(Y \times \{0\}\bigr)\), and the suspension of \(Y\) is \(\mathcal {S} Y := \mathcal{C}Y / \bigl(Y \times \{1\}\bigr)\). For example, \(\mathcal {S} \mathbb{S}^{\mu }= \mathbb{S}^{\mu +1}\).

Theorem 1.14

In the context of Theorem 1.12, we have the following equivalent descriptions of the relative homology \(H_{r}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{- \varepsilon }(\lambda _{k})\bigr)\),

  1. (1)
    $$ H_{r}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{- \varepsilon }(\lambda _{k})\bigr) = \textstyle\begin{cases} \mathbb{Z}, & \textit{if}\quad i(x)=1,\ r=\mu (x), \\ 0, & \textit{if} \quad i(x)=1,\ r\neq \mu (x), \\ \widetilde{H}_{r-\mu (x)}\left (\mathcal{S} \mathcal{R}_{\nu (x)}^{i(x)} \right ), & \textit{if} \quad 1 < i(x) \leq \nu (x), \end{cases} $$
    (1.11)

    where

    $$ \mathcal{R}_{\nu}^{i} := \{R\in \mathrm{Sym}^{+}_{\nu }\colon \operatorname{Tr}R=1, \operatorname{rank}R < i\}, $$
    (1.12)

    and \(\widetilde{H}_{q}\) denotes the \(q\)-th reduced homology group.

  2. (2)
    $$ \begin{aligned}[t] &H_{r}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{- \varepsilon }(\lambda _{k})\bigr) \\ &\qquad = \textstyle\begin{cases} H_{r-\mu -s(i)}\big(\operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1)\big), & \textit{${\mathbb{F}}=\mathbb{R}$ and $i$ is odd}, \\ H_{r-\mu -s(i)}\bigl(\operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1);\, \widetilde{\mathbb{Z}}\bigr), & \textit{${\mathbb{F}}=\mathbb{R}$ and $i$ is even}, \\ H_{r-\mu -s(i)}\big(\operatorname{Gr}_{\mathbb{C}}(i-1, \nu -1)\big), & \textit{${\mathbb{F}}=\mathbb{C}$}. \end{cases}\displaystyle \end{aligned} $$
    (1.13)

To prove Theorem 1.14, in Sect. 5 we will first separate out the contribution to the relative homology \(H_{r}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{- \varepsilon }(\lambda _{k})\bigr)\) of the local constant multiplicity stratum \(S\) and reduce the computation to the case when \(S\) is a single point. In the latter case, it will be shown that \(H_{r}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{- \varepsilon }(\lambda _{k})\bigr)\) reduces to the homology of the space \(\mathcal{S}\mathcal{R}_{\nu}^{i}\). In the next step, we will see that \(\mathcal{S}\mathcal{R}_{\nu}^{i}\) is homotopy equivalent to the Thom space of a real bundle of rank \(s(i)\) over the Grassmannian \(\operatorname{Gr}_{{\mathbb{F}}}(i-1, \nu -1)\). The difference between the odd and the even \(i\) when \({\mathbb{F}}=\mathbb{R}\) is that this real bundle is orientable in the former case and non-orientable in the latter. So, part (2) of the Theorem follows from the Thom isomorphism theorem in the oriented bundle case and more general tools such as the usual/twisted version of Poincaré–Lefschetz duality in the non-orientable bundle case [22, 32, 38].

The study of \(\mathbb{Z}_{2}\) and integer homology groups of the complex and real Grassmannians was at the heart of the development of algebraic topology and, in particular, the characteristic classes. Starting from the classical works of Ehresmann [26, 27], the answers appearing in (1.10) were explicitly calculated using the Schubert cell decomposition and combinatorics of the corresponding Young diagrams [46, Theorem IV, p. 108], [5]. The calculation of twisted homologies of real Grassmannians is less well-known but can be deduced from the classical work [18] and incorporated into a unified algorithm [16], or computed by means of the general theory of de Rham cohomologies of homogeneous spaces, see [36, chapter XI, pp. 494–496].

Examples of local contributions to the Morse polynomial for topologically critical points of multiplicities up to 8 are presented in Table 1 in the real case. The possible contribution from the smooth directions is ignored because those are specific to the family ℱ. In other words, we set \(\mu (x)=0\) in equation (1.10). In the cases when the second line of (1.10) applies, the contribution of 0 does not mean that the point is regular; 0 appears because the polynomial ignores the torsion part of the corresponding homologies. We also observe that the contribution of the top eigenvalue (\(i=1\)) is always \(t^{0}\); the contribution of the bottom eigenvalue (\(i=\nu \)) is always \(t^{s(\nu )}\). By analogy with smooth Morse theory one can guess that the top eigenvalue always experiences a minimum, while the bottom eigenvalue always experiences a maximum (\(s(\nu )\) being the dimension of the space of non-smooth directions). This guess is rigorously established in Corollary 2.4 and its proof in Sect. 8.

Table 1 Non-smooth contributions \(\mathfrak{T}_{\nu}^{i}(t)\) to the Morse polynomial from a topologically critical point of \(\lambda _{k}(x)\) in the real case (\({\mathbb{F}}=\mathbb{R}\), first three cases of equation (1.10))

2 Examples, applications and an open question

2.1 Examples

In this section we collect examples illustrating our criteria for regularity and criticality.

Example 2.1

Consider the two families

$$ \mathcal{F}_{1}(x) = \begin{pmatrix} x_{1} & x_{2} \\ x_{2} & -x_{1} \end{pmatrix} , \quad \text{and}\quad \mathcal{F}_{2}(x) = \begin{pmatrix} x_{1} & x_{2} \\ x_{2} & 2x_{1} \end{pmatrix} , \quad x = (x_{1},x_{2}) \in \mathbb{R}^{2}. $$
(2.1)

Both families \(\mathcal{F}_{1}\) and \(\mathcal{F}_{2}\) have an isolated point of multiplicity 2 at \((x_{1},x_{2})=(0,0)\). Focusing on the lower eigenvalue \(\lambda _{1}\), its level curves in the case of \(\mathcal{F}_{1}\) undergo a significant change at the value 0 — they change from circles to empty, see Fig. 1(top). Therefore, the point \((0,0)\) is topologically critical and, visually, \(\lambda _{1}\) of \(\mathcal{F}_{1}\) has a maximum at \((0,0)\). In contrast, the level curves and the sublevel sets of \(\mathcal{F}_{2}\) remain homotopically equivalent, see Fig. 1(bottom). The point \((0,0)\) is not topologically critical for \(\lambda _{1}\) of \(\mathcal{F}_{2}\).

Fig. 1
figure 1

Eigenvalue surfaces (left) and contours of the first eigenvalue (right) for the families \(\mathcal{F}_{1}\) (top) and \(\mathcal{F}_{2}\) (bottom) from equation (2.1)

We now check the condition of Theorem 1.5 for the families \(\mathcal{F}_{1}\) and \(\mathcal{F}_{2}\). At the point \(x=(0,0)\) the eigenspace \(\mathbf{E}_{k}\) is the whole of \(\mathbb{R}^{2}\) and no restriction is needed. For the family \(\mathcal{F}_{1}\), the mapping \({\mathcal {H}}_{x}\) from (1.5) is

$$ {\mathcal {H}}_{x} : v= \begin{pmatrix} v_{1} \\ v_{2} \end{pmatrix} \mapsto v_{1} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} + v_{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} , $$

and therefore

$$ \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }= \operatorname{span}\left \{ \begin{pmatrix} 1&0 \\ 0&1\end{pmatrix} \right \}, $$

satisfying non-degenerate criticality condition (N), (1.7). Since the point \(x=(0,0)\) of eigenvalue multiplicity 2 is isolated, the criticality condition (S) is vacuously true. Theorem 1.12 applies at \(x=(0,0)\) and the Morse data for the two sheets is given by the \(\nu =2\) row of Table 1.

Proceeding to the family \(\mathcal{F}_{2}\), a similar calculation yields

$$ \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }= \operatorname{span}\left \{ \begin{pmatrix} 2&0 \\ 0&-1\end{pmatrix} \right \} $$

contains no positive semidefinite matrices. Hence \(x\) is a topologically regular point for \(\mathcal{F}_{2}\) by Theorem 1.5.

Example 2.2

The case of \(\left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp}\) being spanned by a semidefinite matrix which satisfies neither (1.6) nor condition (1.7), is borderline. As an example, consider the family

$$ \mathcal{F}(x) = \begin{pmatrix} x_{1} & x_{2} \\ x_{2} & x_{1}x_{2} + x_{1}^{2} \end{pmatrix} . $$
(2.2)

For the point \(x=(0,0)\) of multiplicity 2 we have

$$ \operatorname{Ran}{\mathcal {H}}_{x} = \operatorname{span}\left \{ \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} , \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \right \}, \qquad \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }= \operatorname{span}\left \{ \begin{pmatrix} 0&0 \\ 0&1 \end{pmatrix} \right \}. $$

Regularity condition (1.6) is violated and so is criticality condition (1.7). However, the constant multiplicity stratum \(S\) is well-defined: it is the isolated point \(\{x\}\). As can be seen in Fig. 2, we have both behaviors (regular and critical) at once: the lower eigenvalue has a topologically regular point at \(x\) while the upper has a topologically critical point there.

Fig. 2
figure 2

Eigenvalue surfaces (left) and contours for the family ℱ from equation (2.2). The point \(x=(0,0)\) is topologically regular for the bottom eigenvalue and topologically critical (non-smooth minimum) for the top one

Example 2.3

We now explore in detail the regularity and criticality conditions of Theorems 1.5 and 1.12 for families of \(2\times 2\) matrices. We parametrize \(\mathrm{Sym}_{2}(\mathbb{R})\) using \(\mathbb{R}^{3}\) via the mapping

$$ (x,y,z) \mapsto \begin{pmatrix} x+y & z \\ z & x-y \end{pmatrix} . $$
(2.3)

In this parameterization, the Frobenius inner product (normalized by \(1/2\)) becomes the Euclidean inner product, making orthogonality visual. In the \((x,y,z)\) space, the sign definite matrices form the interior of the cone, \(x^{2} < y^{2}+z^{2}\).

Assume that ℱ depends on two parameters and satisfies \(\mathcal{F}(0)=0\), with \(x=0\) being the only point of multiplicity 2. Figure 3 shows two such families. The regularity condition of Theorem 1.5 is equivalent to the tangent space at 0 to the image of ℱ intersecting the sign definite cone, Fig. 3 (left). Similarly, criticality condition (N), is equivalent to the tangent space to the image of ℱ having dimension 2 and lying outside the cone, which puts the normal to ℱ at 0 inside the cone, see Fig. 3 (right).

Fig. 3
figure 3

The cone whose interior consists of sign definite \(2\times 2\) matrices is visualized in the 3-dimensional space parametrizing \(\mathrm{Sym}_{2}(\mathbb{R})\) via (2.3). The monochrome surfaces represent the images of the families ℱ satisfying (on the left) the regularity condition and (on the right) criticality condition (N) at \(\mathcal{F}(0)=0\). The normal to the surface, representing a matrix in \(\left (\operatorname{Ran}{\mathcal {H}}_{0}\right )^{\perp}\), lies outside \(\mathrm{Sym}^{+}_{2}\) in the left graph and inside \(\mathrm{Sym}^{++}_{2}\) (up to an overall sign) in the right graph

2.2 Some applications

Now we give some consequences of our main Theorems. We start with the observation that a maximum of an eigenvalue \(\lambda _{k}\) cannot occur at a point of multiplicity where \(\lambda _{k}\) coincides with an eigenvalue below it (the proof of Corollary 2.4 is given at the end of Sect. 8).

Corollary 2.4

Let \(x\) be a non-degenerate topologically critical point of the eigenvalue \(\lambda _{k}\) of a generalized Morse family (generic by Theorem 1.13) \(\mathcal{F}:M \rightarrow \mathrm{Sym}_{n}({\mathbb{F}})\), where \({\mathbb{F}}= \mathbb{R}\) or ℂ. Then \(x\) is a local maximum (resp. minimum) of \(\lambda _{k}\) if and only if the following two conditions hold simultaneously:

  1. (1)

    the eigenvalue \(\lambda _{k}\) is the bottom (resp. top) eigenvalue among those coinciding with \(\lambda _{k}(x)\) at \(x\); equivalently, the relative index \(i(x, k) = \nu (x)\) (resp. \(i(x) = 1\)).

  2. (2)

    the restriction of \(\lambda _{k}\) to the local constant multiplicity stratum attached to \(x\) has a local maximum (resp. minimum) at \(x\).

Consequently, for a generalized Morse familyover a compact manifold \(M\), we have the strict inequalities

$$\begin{aligned} &\max _{x\in M} \lambda _{k-1}(x) < \max _{x\in M} \lambda _{k}(x), \qquad k=2,\ldots, n; \\ &\min _{x\in M} \lambda _{k-1}(x) < \min _{x\in M} \lambda _{k}(x), \qquad k=2,\ldots, n. \end{aligned}$$

SimilarlyFootnote 4 to the classical Morse theory, Theorem 1.12 can be used to obtain lower bounds on the number of critical points of a particular type, smooth or non-smooth. Our particular example is motivated by condensed matter physics, where the density of states (either quantum or vibrational) of a periodic structure has singularities caused by critical points [54, 60] in the “dispersion relation” — the eigenvalue spectrum as a function of the wave vector ranging over the reciprocal space. Van Hove [63] classified the singularities (which are now known as “Van Hove singularities”) and pointed out that they are unavoidably present due to Morse theory applied to the reciprocal space, which is a torus due to periodicity of the structure.

Of primary interest is to estimate the number of smooth critical points which produce stronger singularities. Below we make the results of [63] rigorous, sharpening the estimates in \(d=3\) dimensions. We also mention that higher dimensions, now open to analysis using Theorem 1.12 are not a mere mathematical curiosity: they are accessible to physics experiments through techniques such as periodic forcing or synthetic dimensions [57].

Corollary 2.5

Assume that \(M\) is the torus \(\mathbb{T}^{d}\), \(d=2\) or 3. Let \(\mathcal{F}: M \rightarrow \mathrm{Sym}_{n}\) be a generalized Morse family (generic by Theorem 1.13). Then the number \(c_{\mu}(k)\) of smooth critical points of \(\lambda _{k}\) of Morse index \(\mu = 0,\ldots ,d\) satisfied the following lower bounds.

  1. (1)

    In \(d=2\) any ordered eigenvalue has at least two smooth saddle points, i.e.

    $$ c_{1}(k) \geq 2, \qquad k=1,\ldots ,n. $$
  2. (2)

    In \(d=3\),

    $$\begin{aligned} &c_{1}(1) \geq 3, \end{aligned}$$
    (2.4)
    $$\begin{aligned} &c_{1}(k) + c_{2}(k-1) \geq 4, \qquad k=2,\ldots ,n, \end{aligned}$$
    (2.5)
    $$\begin{aligned} &c_{2}(n) \geq 3. \end{aligned}$$
    (2.6)

Remark 2.6

Only the simpler estimates (2.4) and (2.6) for the bottom and top eigenvalue appear in [63] for \(d=3\); the guaranteed existence of smooth critical points in the intermediate eigenvalues (2.5) is a new result. The intuition behind this result is as follows: when a point of eigenvalue multiplicity affects the count of smooth critical points of \(\lambda _{k}\), it also affects the count of smooth critical point for neighboring ordered eigenvalues, such as \(\lambda _{k-1}\), and it does so in a strictly controllable fashion since the Morse data depends little on the particulars of the family ℱ. Carefully tracing these contributions across different ordered eigenvalues leads to sharper estimates.

Proof of Corollary 2.5

It follows from Proposition 1.9(1) that the maximal multiplicity of the eigenvalue is 2 (otherwise the codimension of \(S\) is larger than the dimension \(d=2\) or 3 of the manifold).

In the case \(d=2\), the non-smooth critical points are isolated. According to the first row of Table 1, such points do not contribute any \(t^{1}\) terms. Therefore, the coefficient of \(t\) in \(P_{\lambda _{k}}\) is \(c_{1}(k)\) and, by Morse inequalities (1.3), it is greater or equal than the first Betti number of \(\mathbb{T}^{2}\), which is 2.

In the case \(d=3\) we need a more detailed analysis of the Morse inequalities (1.3) for \(\lambda _{k}\). We write them as

$$ \sum _{p=0}^{3} \big(c_{p}(k)+d_{p}(k)\big)t^{p} = (1+t)^{3} + (1+t) \big(\alpha _{0}(k) + \alpha _{1}(k)t + \alpha _{2}(k)t^{2}\big), $$

where \(d_{p}(k)\) is the contribution to the polynomial \(P_{\lambda _{k}}\) coming from the points of multiplicity 2, \((1+t)^{3}\) is the Poincaré polynomial of \(\mathbb{T}^{3}\), and where \(\alpha _{p}(k)\) are the nonnegative coefficients of the remainder term \(R(t)\) in (1.3). Then similarly to (1.2), we have

$$\begin{aligned} &c_{0}(k) + d_{0}(k) = 1 + \alpha _{0}(k) \geq 1, \end{aligned}$$
(2.7)
$$\begin{aligned} &c_{1}(k) + d_{1}(k) = 3 + \alpha _{0}(k) + \alpha _{1}(k) \geq 2 + c_{0}(k) + d_{0}(k), \end{aligned}$$
(2.8)
$$\begin{aligned} &c_{2}(k) + d_{2}(k) = 3 + \alpha _{1}(k) + \alpha _{2}(k) \geq 2 + c_{3}(k) + d_{3}(k), \end{aligned}$$
(2.9)
$$\begin{aligned} &c_{3}(k) + d_{3}(k) = 1 + \alpha _{2}(k) \geq 1. \end{aligned}$$
(2.10)

We also observe that if \(\lambda _{k}\) has a non-smooth critical point \(x\) counted in \(d_{0}(k)\), then \(\nu (x)=2\) with \(\mu (x)=0\) and \(i(x)=1\) (since this is the only way to obtain \(t^{0}\) in (1.10) for \(\nu =2\)). This implies that \(\lambda _{k-1}(x) = \lambda _{k}(x)\) with the same constant multiplicity curve \(S\) and the same point \(x\) is a critical point of \(\lambda _{k-1}\) with \(\nu =2\), \(\mu =0\) and \(i=2\). From Table 1 we have \(P_{\lambda _{k-1}}(t;x) = t^{2}\), namely \(x\) contributes to \(d_{2}(k-1)\). This argument can be done in reverse and also extended to points contributing to \(d_{1}(k)\) (with \(\nu =2\), \(\mu =1\) and \(i=1\)), resulting in

$$\begin{aligned} &d_{0}(k) = d_{2}(k-1), \qquad d_{1}(k) = d_{3}(k-1), \qquad k=2, \ldots ,n, \end{aligned}$$
(2.11)
$$\begin{aligned} &d_{0}(1)=d_{1}(1)=0, \qquad d_{2}(n)=d_{3}(n)=0. \end{aligned}$$
(2.12)

The boundary values in (2.12) are obtained by noting that we cannot have \(\lambda _{1}(x) = \lambda _{0}(x)\) or \(\lambda _{n}(x) = \lambda _{n+1}(x)\) since eigenvalues \(\lambda _{0}\) and \(\lambda _{n+1}\) do not exist.

For \(k=1\), (2.12) substituted into (2.7) and (2.8) gives \(c_{0}(1) \geq 1\) and \(c_{1}(1) \geq 2+c_{0}(1) \geq 3\), establishing (2.4). Estimate (2.6) is similarly established from (2.12), (2.10) and (2.9).

Replacing \(k\) with \(k-1\) in estimate (2.9) and using (2.11) gives

$$ c_{2}(k-1) + d_{0}(k) \geq 2 + c_{3}(k-1) + d_{1}(k) \geq 2 + d_{1}(k). $$

Adding this last inequality to line (2.8) results in (2.5) after cancellations and the trivial estimate \(c_{0}(k)\geq 0\). □

Remark 2.7

It is straightforward to extend (2.4)–(2.6) to an arbitrary compact 3-dimensional manifold \(M\) with Betti numbers \(\beta _{r}\), obtaining

$$\begin{aligned} &c_{1}(1) \geq \beta _{1}, \\ &c_{1}(k) + c_{2}(k-1) \geq \beta _{1}+\beta _{2}-\beta _{0}-\beta _{3}, \qquad k=2,\ldots ,n, \\ &c_{2}(n) \geq \beta _{2}. \end{aligned}$$

These inequalities extend to \(d=3\) the results of Valero [62] who studied critical points of principal curvature functions (eigenvalues of the second fundamental form) of a smooth closed orientable surface.

Independence of the transverse Morse contributions from the particulars of the family ℱ also allows one to sort the terms in the Morse polynomial. This is illustrated by the next simple result.

Let \(\mathrm{Conseq}_{k,n}\) be the set of all subsets of \(\{1,\ldots , n\}\) containing \(k\) and consisting of consecutive numbers, i.e. subsets of the form \(\{j_{1}, j_{1}+1, \ldots , j_{2}\} \ni k\). Given \(J \in \mathrm{Conseq}_{k,n}\), let \(i(k; J)\) be the sequential number of \(k\) in the set \(J\) but counting from the top (cf. (1.8)). As usual, \(|J|\) will denote the cardinality of \(J\).

Let \(\mathcal{F}:M\rightarrow \mathrm{Sym}_{n}(\mathbb{F})\) be a generalized Morse family. For any set \(J\in \mathrm {Conseq}_{k,n}\), let

$$ S(k, J) = \big\{ x\in M: \lambda _{j}(x)=\lambda _{k}(x) \textrm{ if and only if } j\in J\big\} . $$

By our assumptions, \(S(k, J)\) are smooth embedded submanifolds of \(M\) and the restrictions \(\lambda _{k}|_{S(k, J)}\) of the eigenvalue \(\lambda _{k}\) to \(S(k, J)\) are smooth.

Corollary 2.8

Given a generalized Morse family \(\mathcal{F}:M\rightarrow \mathrm{Sym}_{n}(\mathbb{F})\) the following inequality holds

$$ \sum _{J\in \mathrm {Conseq}_{k,n}} \mathfrak{T}_{|J|}^{i(k;J)}(t) P_{ \lambda _{k}|_{S(k, J)}}(t) \succeq P_{\lambda _{k}}(t) \succeq P_{M}(t), $$
(2.13)

where \(P(t)\succeq Q(t)\) if and only if the all coefficients of the polynomials \(P(t)-Q(t)\) are nonnegative, the polynomials \(\mathfrak{T}_{|J|}^{i(k;J)}\) are defined in (1.10), and \(P_{\lambda _{k}|_{S(k,J)}}(t)\) are the Morse polynomials of the smooth functions \(\lambda _{k}|_{S(k,J)}\) on \(S(k, J)\). In particular \(P_{\lambda _{k}|_{S(k,\{k\})}}\) is the total contribution of all smooth critical points of \(\lambda _{k}\).

Proof

We only need to prove the left inequality in (2.13). By (1.10), the contribution of a topologically critical point \(x \in S(k,J)\) to \(P_{\lambda _{k}}(t)\) is \(t^{\mu (x)}\, \mathfrak{T}_{|J|}^{i(k;J)}(t)\). Therefore, the left-hand side of (2.13) is different from \(P_{\lambda _{k}}(t)\) in that the former also includes contributions from smooth critical points of \(\lambda _{k}|_{S(k,J)}\) that do not give rise to a topologically critical point of \(\lambda _{k}\). However, those contributions are polynomials with non-negative coefficients, producing the inequality. □

We demonstrate Corollary 2.8 in a simple example involving an intermediate eigenvalue. Letting \(n=3\), \(k=2\), and using the first two rows of Table 1, inequality (2.13) reads:

$$ P_{\lambda _{2}|_{S(2, \{2\})}} (t) +t^{2} P_{\lambda _{2}|_{S(2,\{2,3 \})}}(t) +P_{\lambda _{2}|_{S(2,\{1,2\})}}(t) \succeq P_{\lambda _{2}}(t) \succeq P_{M}(t). $$
(2.14)

Note that the term with \(P_{\lambda _{2}|_{S(2,\{1,2,3\})}}(t)\) does not appear in (2.14) because \(\mathfrak{T}_{3}^{2}(t)=0\) according to the second row of Table 1. Further simplifications of inequalities (2.14) are possible if it is known a priori that \(\lambda \) is a perfect Morse function when restricted to the connected components of the constant multiplicity strata \(S_{k, \{k-1,k\}}(\mathcal{F})\) and \(S_{k, \{k,k+1\}}(\mathcal{F})\).

2.3 An open question

Finally, we mention an open question which naturally follows from our work: to classify Morse contributions from points where the multiplicity \(\nu \) is higher than what is suggested by the codimension calculation in the von Neumann–Wigner theorem [64]. Such points often arise in physical problems due to presence of a discrete symmetry; for an example, see [11, 29]. At a point of “excessive multiplicity”, the transversality condition (4.1) is not satisfied because \(d < s(\nu )\), but one can still define an analogue of the “non-degeneracy in the non-smooth direction” (cf. Remark 1.8). It appears that the Morse indices are independent of the particulars of the family ℱ when the “excess” \(s(\nu )-d\) is equal to 1, but whether this persists for higher values of \(s(\nu )-d\) is still unclear.

3 Regularity condition: proof of Theorem 1.5

In this section we establish Theorem 1.5, namely the sufficient condition for a point to be regular (see Definition 1.1).

Recall the definition of the Clarke directional derivativeFootnote 5 of a locally Lipschitz function \(f:M\rightarrow \mathbb{R}\) (for details, see, for example, [19, 55]). Given \(v\in T_{x} M\), let \(\widehat{V}\) be a vector field in a neighborhood of \(x\) such that \(\widehat{V} (x)=v\) and let \(e^{t\widehat{V}}\) denote the local flow generated by the vector field \(\widehat{V}\). Then the Clarke generalized directional derivative of \(f\) at \(x\) in the direction \(v\) is

$$ f^{\circ}(x, v) = \limsup _{\substack {y\to x \\ t\to 0+}} \cfrac{f(e^{t\widehat{V}} y) - f(y)}{t}. $$

Independence of this definition of the choice of \(\widehat{V}\) follows from the flow-box theorem and the chain rule for the Clark subdifferential, see [55, Thm 1.2(i) and Prop 1.4(ii)].

Definition 3.1

The point \(x\) is called a critical point of \(f\) in the Clarke sense, if

$$ 0 \leq f^{\circ}(x, v) \quad \text{for all}\quad v\in T_{x} M. $$

Otherwise, the point \(x\) is said to be regular in the Clarke sense.

The assumptions of Theorem 1.5 will be shown to imply that the point \(x\) is regular in the Clarke sense, whereupon we will use the following result.

Theorem 3.2

[2, Proposition 1.2] A point regular in the Clarke sense is topologically regular in the sense of Definition 1.1.

Proof of Theorem 1.5

We first establish that condition (1.6), namely

$$ \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp }\cap \mathrm{Sym}_{\nu}^{+} = 0, $$

is equivalent to existence of a matrix \(C \in \operatorname{Ran}{\mathcal {H}}_{x}\) which is (strictly) positive definite. Despite being intuitively clear, the proof of this fact is not immediate and we provide it for completeness; a similar result is known as Fundamental Theorem of Asset Pricing in mathematical finance [25]. Assume the contrary,

$$ \operatorname{Ran}{\mathcal {H}}_{x} \cap \mathrm{Sym}_{\nu}^{++} = \emptyset . $$

The set \(\mathrm{Sym}_{\nu}^{++}\) is open and convex (the latter can be seen by Weyl’s inequality for eigenvalues). A suitable version of the Helly–Hahn–Banach separation theorem (for example, [56, Thm 7.7.4]) implies existence of a functional vanishing on \(\operatorname{Ran}{\mathcal {H}}_{x}\) and positive on \(\mathrm{Sym}_{\nu}^{++}\). By Riesz Representation Theorem, this functional is \(\langle D, \cdot \rangle \) for some \(D\in \mathrm{Sym}_{\nu}\), for which we now have \(D \in \left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp}\) and \(\langle D, P \rangle > 0\) for all \(P \in \mathrm{Sym}_{\nu}^{++}\). In particular, \(D\) is non-zero and belongs to the dual cone of \(\mathrm{Sym}_{\nu}^{++}\), namely to \(\mathrm{Sym}_{\nu}^{+}\) [14], contradicting condition (1.6).

Secondly, results of [21, Theorem 4.2] (see alsoFootnote 6 [42, Sect. 6]) show that

$$\begin{aligned} \lambda ^{\circ}_{k}(x,v) &\leq \max \left \{ \Big\langle u, \big(d \mathcal{F}(x)v \big)u\Big\rangle \colon u \in \mathbf{E}_{k}, \|u\|=1 \right \} \\ &= \lambda ^{\max}\left (\big(d\mathcal{F}(x)v \big)_{\mathbf{E}_{k}} \right ) = \lambda ^{\max}\big(\mathcal{H}_{x}(v)\big), \end{aligned}$$

where \(\mathbf{E}_{k}\) is the eigenspace of the eigenvalue \(\lambda _{k}(x)\) of \(\mathcal{F}(x)\); the middle equality is by the variational characterization of the eigenvalues and the last by the definition (1.5) of \(\mathcal{H}_{x}\).

We already established that there exists \(v\) such that \(\mathcal{H}_{x}(v) \in \mathrm{Sym}^{++}_{\nu}\). Then \(\mathcal{H}_{x}(-v)\) is negative definite and we have

$$ \lambda ^{\circ}_{k}(x,v) \leq \lambda ^{\max}\big(\mathcal{H}_{x}(-v) \big) < 0. $$

The point \(x\) is regular in the Clarke sense and, therefore, regular in the sense of Definition 1.1. □

4 Transversality and its consequences

Definition 4.1

We say that a family ℱ is transverse (with respect to eigenvalue \(\lambda _{k}\)) at a point \(x\) if

$$ \mathcal{I}_{\nu }+ \operatorname{Ran}{\mathcal {H}}_{x} = \mathrm{Sym}_{\nu}, $$
(4.1)

where \(\nu \) is the multiplicity of \(\lambda _{k}\) at the point \(x\) and \(\mathcal{I}_{\nu }:= \operatorname{span}(I_{\nu}) \subset \mathrm{Sym}_{\nu}\) is the space of multiples of the identity matrix.

In this section we explore the consequences of the transversality condition, equation (4.1). In particular, in Lemma 4.2 we interpret condition (4.1) as transversality of the family ℱ and the subvariety of \(\mathrm{Sym}_{n}(\mathbb{F})\) of matrices with multiplicity. We then show that transversality at a non-degenerate topologically critical point allows us to work separately in the smooth and non-smooth directions. In particular, we establish that a non-degenerate topologically critical point satisfies the sufficient conditions of Goresky–MacPherson’s stratified Morse theory. The latter allows us to separate the Morse data at a topologically critical point into a smooth part and a transverse part; the latter will be shown in Sect. 5 to be independent of the particulars of the family ℱ.

Let \(Q^{n}_{k,\nu}\) be the subset of \(\mathrm{Sym}_{n}(\mathbb{F})\), where \(\mathbb{F}\) is ℝ or ℂ, consisting of the matrices whose eigenvalue \(\lambda _{k}\) has multiplicity \(\nu \). It is well-known [6] that the set \(Q^{n}_{k,\nu}\) is a semialgebraic submanifold of \(\mathrm{Sym}_{n}\) of codimensionFootnote 7

$$ s(\nu ) := \dim \mathrm{Sym}_{\nu}(\mathbb{F}) - 1 = \textstyle\begin{cases} \frac{1}{2}\nu (\nu +1)-1, &\mathbb{F}=\mathbb{R}, \\ \nu ^{2}-1, & \mathbb{F}=\mathbb{C}. \end{cases} $$
(4.2)

In particular, if \(\nu >1\) (the eigenvalue \(\lambda _{k}\) is not simple), then \(\operatorname{codim}\, Q^{n}_{k,\nu}\geq 2\), if \(\mathbb{F}=\mathbb{R}\) and \(\operatorname{codim}\, Q^{n}_{k,\nu}\geq 3\), if \(\mathbb{F}=\mathbb{C}\). We remark that we use real dimension in all (co)dimension calculations.

Lemma 4.2

Let \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be a smooth family whose eigenvalue \(\lambda _{k}\) has multiplicity \(\nu \) at the point \(x\in M\) (i.e. \(\mathcal{F}(x) \in Q^{n}_{k,\nu}\)). Thenis transverse at \(x\) in the sense of (4.1) if and only if

$$ \operatorname{Ran}d\mathcal{F}(x) +T_{\mathcal{F}(x)} Q^{n}_{k, \nu}= T_{ \mathcal{F}(x)} \mathrm{Sym}_{n} \quad (\ \cong \mathrm{Sym}_{n}\ ). $$
(4.3)

Remark 4.3

It is easy to see that when \(\nu =1\), both conditions (4.1) and (4.3) are satisfied independently of ℱ. When \(\nu =n\), conditions (4.1) and (4.3) become identical. The transversality condition in the case \(\nu =n=2\) is illustrated in Fig. 4.

Fig. 4
figure 4

Examples of two families visualized in the 3-dimensional space parametrizing \(\mathrm{Sym}_{2}(\mathbb{R})\) via (2.3). The subspace \(\mathcal{I}_{2}\) is drawn as a line. The monochrome surfaces represent the images of the families ℱ, satisfying \(\mathcal{F}(0)=0\). The family on the left is transverse at \(x=0\) and the family on the right is not. The cones are drawn for comparison with Fig. 3

Proof of Lemma 4.2

Consider the linear mapping \(h : \mathrm{Sym}_{n} \to \mathrm{Sym}_{\nu}\) acting as a compression to \(\mathbf{E}_{k}\). Namely, \(A \mapsto A_{\mathbf{E}_{k}} = \mathcal{U}^{*} A \mathcal{U}\), where \(\mathcal{U}\) is a linear isometry \({\mathbb{F}}^{\nu }\to \mathbf{E}_{k}\), see (1.4). The mapping \(h\) is onto: for any \(B \in \mathrm{Sym}_{\nu}\), choosing \(\widetilde{B} = \mathcal{U}B \mathcal{U}^{*} \in \mathrm{Sym}_{n}\) yields \(h\big(\widetilde{B}\big) = \mathcal{U}^{*}\mathcal{U}B \mathcal{U}^{*} \mathcal{U}= I_{\nu }B I_{\nu }= B\). Furthermore, by Hellmann–Feynman theorem (Theorem A.1), \(A \in T_{\mathcal{F}(x)} Q_{k,\nu}^{n}\) if and only if \(h(A) \in \mathcal{I}_{\nu}\) (informally, a direction is tangent to \(Q_{k,\nu}^{n}\) if and only if the eigenvalues remain equal to first order). Finally, by definition of \({\mathcal {H}}_{x}\) we have \(h\big(\operatorname{Ran}d\mathcal{F}(x)\big) = \operatorname{Ran}{\mathcal {H}}_{x}\).

Assuming condition (4.3) and applying to it the mapping \(h\), we get

$$ \mathrm{Sym}_{\nu }= h\big(\mathrm{Sym}_{n}\big) =h\big(\operatorname{Ran}d \mathcal{F}(x)\big) + h\Big(T_{\mathcal{F}(x)} Q_{k,\nu}^{n}\Big) = \operatorname{Ran}{\mathcal {H}}_{x} + \mathcal{I}_{\nu}, $$

establishing (4.1). Conversely, assume ℱ violates condition (4.3), meaning that

$$ \operatorname{Ran}d\mathcal{F}(x) + T_{\mathcal{F}(x)} Q^{n}_{k, \nu} = G + T_{ \mathcal{F}(x)} Q^{n}_{k, \nu} $$

for some linear subspace \(G\) of \(\dim G < \operatorname{codim}T_{\mathcal{F}(x)} Q^{n}_{k, \nu} = s(\nu )\). Applying \(h\) to both sides we get

$$ \operatorname{Ran}\mathcal{H}_{x} + \mathcal{I}_{\nu }= h\left (\operatorname{Ran}d\mathcal{F}(x) + T_{\mathcal{F}(x)} Q^{n}_{k, \nu}\right ) = h\left (G + T_{ \mathcal{F}(x)} Q^{n}_{k, \nu}\right ) = h(G) + \mathcal{I}_{\nu}. $$

Counting dimensions, we arrive to \(\dim \left (\operatorname{Ran}\mathcal{H}_{x} + \mathcal{I}_{\nu }\right ) < s( \nu ) + 1 = \dim (\mathrm{Sym}_{\nu})\), and therefore (4.1) cannot hold. □

Corollary 4.4

  1. (1)

    If \(x\) satisfies non-degenerate criticality condition (N), see Definition 1.7, thenis transverse at \(x\).

  2. (2)

    Ifis transverse at a point \(x\), the constant multiplicity stratum \(S\) of \(x\) is a submanifold of \(M\) of codimension \(s(\nu )\) and the function \(\lambda _{k}\) restricted to \(S\) is smooth.

Proof

Recall that non-degenerate criticality condition (N) states that \(\left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp}\) is spanned by a positive definite matrix. In particular, the codimension of \(\operatorname{Ran}{\mathcal {H}}_{x}\) is 1. Furthermore, the identity matrix \(I_{\nu}\) is not in \(\operatorname{Ran}{\mathcal {H}}_{x}\) because the identity cannot be orthogonal to a positive definite matrix. Therefore condition (4.1) holds.

Now let ℱ be transverse at \(x\) and let \(\nu =\nu (x)\) be the multiplicity of the eigenvalue \(\lambda _{k}\) at \(x\). Then \(S\) is the connected component of \(\mathcal{F}^{-1}\left (Q^{n}_{k, \nu}\right )\) containing \(x\). Transversality implies \(S\) is a submanifold of codimension \(\operatorname{codim}Q^{n}_{k, \nu} = s(\nu )\). The smoothness of \(\lambda _{k}\) restricted to \(S\) is a standard result of perturbation theory for linear operators (see, for example, [48, Section II.1.4 or Theorem II.5.4]). To see it, one uses the Cauchy integral formula for the total eigenprojector (or Riesz projector), i.e. the projector onto the span of eigenspaces of the eigenvalues lying in a small neighborhood around \(\lambda _{k}(x)\). It follows that the total eigenprojector is smooth in a sufficiently small neighborhood of \(x\in M\) (the neighborhood on \(M\) needs to be small enough so that no eigenvalues cross the contour of integration). Once restricted to \(y\in S\), the eigenprojector is simply \(\lambda _{k}(y) I_{\nu}\), therefore \(\lambda _{k}(y)\) is also smooth. □

Lemma 4.5

If \(x\) satisfies non-degenerate criticality condition (N), then \(\operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}S} = 0\) and \(x\) is a critical point of the locally smooth function \(\lambda _{k}\big|_{S}\).

Proof

By Hellmann–Feynman theorem, see Appendix A, the eigenvalues of \({\mathcal {H}}_{x}v\in \mathrm{Sym}_{\nu}\) give the slopes of the eigenvalues splitting off from the multiple eigenvalue \(\lambda _{k}\big(\mathcal{F}(x)\big)\) when we leave \(x\) in the direction \(v\). Leaving in the direction \(v \in T_{x} S\), where \(S\) is the constant multiplicity stratum attached to \(x\), must produce equal slopes, i.e. \({ \mathcal {H}}_{x}(v)\) is a multiple of the identity matrix \(I_{\nu}\) for every \(v\in T_{x}S\). But a non-zero multiple of the identity cannot be orthogonal to \(B\in \mathrm{Sym}^{++}_{\nu}\), therefore \(\operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}S} = 0\). In other words, the slopes of the branches splitting off from the multiple eigenvalue \(\lambda _{k}(x)\) are all zero. □

Proof of Proposition 1.9

Corollary 4.4 and Lemma 4.5, combined, give the conclusions of Proposition 1.9. □

The next step is to enable ourselves to focus on the directions transverse to \(S\).

Corollary 4.6

Let \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be a smooth family whose eigenvalue \(\lambda _{k}\) satisfies conditions (N) and (S) (Definitions 1.7and 1.10) at the point \(x\in M\). Let \(S\) be the constant multiplicity stratum at \(x\) and let \(N\) be a submanifold of \(M\) of dimension \(\dim N = \operatorname{codim}_{M} S = s(\nu )\) which intersects \(S\) transversely at \(x\).

Then the eigenvalue \(\lambda _{k}\) of the restriction \(\mathcal{F}\big|_{N}\) also satisfies conditions (N) and (S).

Proof

Transversality and dimension count imply that the constant multiplicity stratum of \(\mathcal{F}\big|_{N}\) is the isolated point \(x\). Therefore condition (S) for \(\mathcal{F}\big|_{N}\) is vacuously true at \(x\).

Condition (N) for ℱ, combined with Lemma 4.5, yields \(\operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}S} = 0\). We thus obtain

$$ \operatorname{Ran}\mathcal{H}_{x} = \operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}N} + \operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}S} = \operatorname{Ran}\mathcal{H}_{x}\big|_{T_{x}N}. $$

In other words, the space \(\operatorname{Ran}\mathcal{H}_{x}\) remains unchanged after restricting ℱ to \(N\), therefore condition (N) holds for \(\mathcal{F}\big|_{N}\). □

The next step is to separate the Morse data at a critical point \(x\) into a smooth part (along \(S\)) and a transverse part (along \(N\)). For this purpose we will use the stratified Morse theory of Goresky and MacPherson [35]. We now show that a point satisfying non-degenerate criticality conditions (N) and (S) is nondepraved in the sense of [35, definition in Sec. I.2.3]. The setting of [35] calls for a smooth function on a certain manifold which is then restricted to a stratified subspace of that manifold. To that end we consider the graph of the function \(\lambda _{k}\) on \(M\), i.e. the set \(Z_{k} := \big\{\big(x, \lambda _{k}(x)\big)\colon x\in M\big\}\) as a stratified subspace of \(\widetilde{M}:= M\times \mathbb{R}\) and the (smooth) function \(\pi : \widetilde{M}\to \mathbb{R}\) which is the projection to the second component of \(\widetilde{M}\). As before, the stratification (both on \(M\) and on \(Z_{k}\)) is induced by the multiplicity of the eigenvalue \(\lambda _{k}(x)\).

Recall that a subspace \(Q\) of \(T_{z} \widetilde{M}\) is called a generalized tangent space to a stratified subspace \(Z\subset \widetilde{M}\) at the point \(z\in Z\), if there exists a stratum ℛ of \(Z\) with \(z\in \overline{\mathcal {R}}\), and a sequence of points \(\{z_{i}\} \subset \mathcal {R}\) converging to \(z\) such that

$$ Q = \lim _{i\to \infty} T_{z_{i}} \mathcal{R}. $$
(4.4)

Proposition 4.7

Let the family \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be transverse and the point \(x\in M\) satisfy conditions of Theorem 1.12. Let \(z := \big(x, \lambda _{k}(x)\big)\) be the corresponding point on the stratified subspace \(Z_{k} \subset \widetilde{M}\) defined above and \(\widetilde{S}\) be the stratum of \(Z_{k}\) containing \(z\). Then the following two statements hold:

  1. (1)

    For each generalized tangent space \(Q\) at \(z\) we have \(d\pi (z)\big|_{Q} \neq 0\) except when \(Q=T_{z}\widetilde{S}\).

  2. (2)

    \(x\) is isolated in the set of all points \(y\) that are critical for \(\lambda _{k}\big|_{S_{y}}\), where \(S_{y}\) is the constant multiplicity stratum attached to \(y\).

Remark 4.8

A point \(z := \big(x, \lambda _{k}(x)\big)\) satisfying conditions (1)–(2) of Proposition 4.7 and such that \(x\) is non-degenerate as a smooth critical point of \(\lambda _{k}\big|_{S}\) is a nondepraved point of the map \(\pi |_{Z_{k}}\) in the sense of Goresky–MacPherson [35, Sec. I.2.3].Footnote 8,Footnote 9

Corollary 4.9

Let the family \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) be transverse and the point \(x\in M\) satisfy conditions of item (1) of Theorem 1.12. Let \(z := \big(x, \lambda _{k}(x)\big)\) be the corresponding point on the stratified subspace \(Z_{k} \subset \widetilde{M}\) defined above and \(\pi : M\times \mathbb{R}\to \mathbb{R}\) be the projection to the second component of \(\widetilde{M}\). Then \(z\) is a nondepraved point of the map \(\pi |_{Z_{k}}\) in the sense of Goresky–MacPherson [35, Sec. I.2.3].

Remark 4.10

Let us discuss informally the idea behind part (1) of Proposition 4.7, the proof of which is fairly technical. When we leave \(x\) in the direction not tangent to \(S\), the multiplicity of eigenvalue \(\lambda _{k}\) is reduced as other eigenvalues split off. Part (1) stipulates that among the directions in which the multiplicity splits in a prescribed manner, there is at least one direction in which the slope of \(\lambda _{k}\) is not equal to zero. This is again a consequence of transversality: the space of directions is too rich to produce only zero slopes.

Proof of Proposition 4.7, part (1)

First, if \(p: M\times \mathbb{R}\to M\) denotes the projection to the first component of \(\widetilde{M}= M \times \mathbb{R}\), then \(dp(\widetilde{x}): T_{\widetilde{x}}\widetilde{M}\to T_{x} M\) is the corresponding projection to the first component of \(T_{\widetilde{x}}\widetilde{M}\cong T_{x} M \times \mathbb{R}\); here \(\widetilde{x}= (x,\lambda )\) for some \(\lambda \in \mathbb{R}\).

Since \(d\pi \big|_{T_{z}\widetilde{S}} = d\Big(\lambda _{k}\big|_{S}\Big) \circ dp\big|_{T_{z}\widetilde{S}}\), where \(S = p(\widetilde{S})\) is the constant multiplicity stratum of \(x\), we conclude from Lemma 4.5 that \(d\pi (z)\big|_{Q} = 0\) when \(Q=T_{z}\widetilde{S}\).

Let now \(Q\neq T_{z}\widetilde{S}\) and assume that

$$ d\pi \left (z\right )\big|_{Q} = 0. $$
(4.5)

Let \(\nu \) be the multiplicity of the eigenvalue \(\lambda _{k}(x)\) of \(\mathcal{F}(x)\) and \(\mathbf{E}_{k}\), \(\dim \mathbf{E}_{k} = \nu \), be the corresponding eigenspace. Let ℛ be the stratum used for the definition of \(Q\) in (4.4) and \(\nu _{\mathcal{R}}<\nu \) be the multiplicity of \(\lambda _{k}\) on \(p(\mathcal{R})\).

Let \((z_{i}) \subset \mathcal{R}\) be the sequence defining \(Q\) and let \(x_{i} = p(z_{i})\). Let \(\mathbf{E}_{k}(x_{i}) \subset {\mathbb{F}}^{n}\) denote the \(\nu _{\mathcal{R}}\)-dimensional eigenspace of the eigenvalue \(\lambda _{k}\) of \(\mathcal{F}(x_{i})\) and let \(\mathcal{U}_{i}\) be a choice of linear isometry from \({\mathbb{F}}^{\nu _{\mathcal{R}}}\) to \(\mathbf{E}_{k}(x_{i})\). Finally, let \(W_{i} \subset T_{x_{i}} M\) denote the first component of the tangent space at \(z_{i}\) to ℛ, namely \(W_{i} = dp(z_{i})\big(T_{z_{i}} \mathcal{R}\big)\).

We would like to use Hellmann–Feynman theorem at \(x_{i}\). In the directions from \(W_{i}\), the eigenvalue \(\lambda _{k}\) retains multiplicity \(\nu _{\mathcal{R}}\) in the linear approximation. In other words, directional derivatives of the eigenvalue group of \(\lambda _{k}\) are all equal. Formally,

$$ \mathcal{U}_{i}^{*} \big(d\mathcal{F}(x_{i}) w\big) \mathcal{U}_{i} = D_{w} \lambda _{k}(x_{i}) \, I_{\nu _{\mathcal{R}}}, \qquad \text{for all } w \in W_{i}; $$
(4.6)

here \(D_{w}\lambda _{k}\) is the directional derivative of \(\lambda _{k}\). This expression is invariant with respect to the choice of isometry \(\mathcal{U}_{i}\).

Using compactness of the Grassmannians and, if necessary, passing to a subsequence, the spaces \(\mathbf{E}_{k}(x_{i})\) converge to a subspace \(\mathbf{E}^{\mathcal{R}}_{k}\) of the \(\nu \)-dimensional eigenspace \(\mathbf{E}_{k}\) of the matrix \(\mathcal{F}(x)\). The isometries \(\mathcal{U}_{i}\) (adjusted if necessary) converge to a linear isometry \(\mathcal{U}_{\mathcal{R}}\) from \({\mathbb{F}}^{\nu _{\mathcal{R}}}\) to \(\mathbf{E}^{\mathcal{R}}_{k}\). Tangent subspaces \(W_{i}\) also converge to the subspace \(W_{0} := dp(z)Q\). Passing to the limit in (4.6), the derivative on the right-hand side of (4.6) must tend to 0 due to (4.5). Recalling the definition of ℋ in (1.5), we get

$$ \mathcal{U}_{\mathcal{R}}^{*} \big(d\mathcal{F}(x) w\big) \mathcal{U}_{ \mathcal{R}} = \mathcal{U}_{\mathcal{R}}^{*} \mathcal{U}\big({ \mathcal {H}}_{x} w\big) \mathcal{U}^{*} \mathcal{U}_{\mathcal{R}} = 0, \qquad \text{for all } w \in W_{0}. $$

In other words, the matrix \({\mathcal {H}}_{x}w\) with \(w\) restricted to \(W_{0}\) maps vectors from \(V = \operatorname{Ran}(\mathcal{U}^{*}\mathcal{U}_{\mathcal{R}}) \subset { \mathbb{F}}^{\nu}\) to vectors orthogonal to \(V\). We can express this as

$$ \operatorname{Ran}{\mathcal {H}}_{x}\big|_{W_{0}} \subset \mathrm{Sym}_{\nu} \left (V, V^{\perp}\right ), $$
(4.7)

where \(\mathrm{Sym}_{\nu}(X, Y)\) denotes the set of all \(\nu \times \nu \) self-adjoint matrices that map \(X\) to \(Y\). The space \(V\) is \(\nu _{\mathcal{R}}\)-dimensionalFootnote 10 and, in a suitable choice of basis, a \(\nu _{\mathcal{R}}\times \nu _{\mathcal{R}}\) subblock of \({\mathcal {H}}_{x}w\) is identically zero. Therefore, the dimension of \(\mathrm{Sym}_{\nu} \left (V, V^{\perp}\right )\) is

$$ \dim \mathrm{Sym}_{\nu} \left (V, V^{\perp}\right ) = \dim \mathrm{Sym}_{\nu} - \dim \mathrm{Sym}_{\nu _{\mathcal{R}}} = s(\nu ) - s(\nu _{\mathcal{R}}). $$
(4.8)

On the other hand, we have the following equalities,

$$ \operatorname{codim} \operatorname{Ker}{\mathcal {H}}_{x} = \dim \operatorname{Ran}{\mathcal {H}}_{x} = \dim \mathrm{Sym}_{\nu }-1 = s(\nu ) = \operatorname{codim}T_{x} S. $$

The first is the rank-nullity theorem, the second is because \(\operatorname{Ran}{\mathcal {H}}_{x}\) has codimension 1 (by condition (N)), the third is the definition of \(s(\nu )\) and the last is from the properties of \(S\). Using \(T_{x}S\subset \operatorname{Ker}{\mathcal {H}}_{x}\) (Lemma 4.5) and counting dimensions, we conclude

$$ \operatorname{Ker}{\mathcal {H}}_{x} = T_{x}S. $$

Now we want to show that the stratification on \(M\) induced by the multiplicity of the eigenvalue \(\lambda _{k}(x)\) satisfies Whitney condition A: any generalized tangent space at \(z\) contains the tangent space of the stratum containing \(z\). For this note that the discriminant variety of \(\mathrm{Sym}_{n}\),

$$ \mathrm{Discr}_{n}:=\bigcup _{1\leq k\leq n,\ \nu >1} Q^{n}_{k,\nu}, $$

is an algebraic variety. Therefore, by classical results of Whitney [65], \(\mathrm{Discr}_{n}\) admits a stratification satisfying Whitney condition A. Consequently, if \(\mathcal{F}: M \to \mathrm{Sym}_{n}\) is a transverse family then the fact that \(\mathrm{Discr}_{n}\) satisfies Whitney condition A implies that the stratifications on \(M\) induced by the multiplicity of the eigenvalue \(\lambda _{k}(x)\) satisfies Whitney condition A as well.

Whitney condition A gives the inclusion \(T_{x} S \subset W_{0}\) and therefore \(\operatorname{Ker}{\mathcal {H}}_{x}\subset W_{0}\). Using the rank-nullity theorem again, we get

dim Ran H x | W 0 = codim W 0 Ker H x | W 0 = codim W 0 Ker H x = dim W 0 dim Ker H x = codim T x M Ker H x codim T x M W 0 = s ( ν ) s ( ν R ) .
(4.9)

Comparing (4.7), (4.8) and (4.9) we conclude that

$$ \mathrm{Sym}_{\nu}\left (V, V^{\perp}\right ) = \operatorname{Ran}{\mathcal {H}}_{x} \big|_{W_{0}}. $$

Consequently,Footnote 11

$$ \left ( \operatorname{Ran}{\mathcal {H}}_{x} \right )^{\perp }\subset \left ( \operatorname{Ran}{ \mathcal {H}}_{x}\big|_{W_{0}} \right )^{\perp }= \mathrm{Sym}_{\nu} \left (V, V^{\perp}\right )^{\perp }= \mathrm{Sym}_{\nu}\big(V^{\perp}, 0\big), $$
(4.10)

i.e. \(V^{\perp}\) is in the kernel of the matrices from \(\left ( \operatorname{Ran}{\mathcal {H}}_{x} \right )^{\perp}\) which contradicts condition (N), see (1.7). □

Proof of Proposition 4.7, part (2)

Assume, by contradiction, that \(x\) is an accumulation point of a sequence \((x_{i})\) of points which are critical on their respective strata. Passing to a subsequence if necessary, we can assume that all \(z_{i} := \big(x_{i}, \lambda _{k}(x_{i})\big)\) belong to the same stratum ℛ and that the sequence of spaces \(T_{z_{i}}\mathcal {R}\) converges to a space \(Q\).

Note that \(Q\) is a nontrivial generalized tangent space to \(Z_{k}\) at \(x\). Since \(x_{i}\) are critical for \(\lambda _{k}\) restricted to the stratum \(p(\mathcal{R})\), we have \(d\pi (z_{i})\big|_{T_{z_{i}}\mathcal {R}}=d\lambda _{k}\big|_{p( \mathcal{R})}(x_{i}) = 0\) and finally \(d\pi (z)\big|_{Q} = 0\), which is a contradiction to part (1) of the Proposition. □

We finish the section with establishing the comfortingFootnote 12 result of Theorem 1.13: the set of generalized Morse families is open and dense. We restate Theorem 1.13 in an expanded form.

Theorem 4.11

Theorem 1.13

The set of familieshaving the below properties for every \(\lambda _{k}\) is open and dense in the Whitney topology of \(C^{r} (M,\mathrm{Sym}_{n})\), \(2\leq r\leq \infty \):

  1. (1)

    at every point \(x\), ℱ is transverse in the sense of Definition 4.1,

  2. (2)

    at every point \(x\), either \(\operatorname{Ran}{\mathcal {H}}_{x}\) or \(\left (\operatorname{Ran}{\mathcal {H}}_{x}\right )^{\perp}\) contains a positive definite matrix,

  3. (3)

    in the latter case, \(\lambda _{k}\) restricted to the constant multiplicity stratum of \(x\) has a non-degenerate critical point at \(x\).

In particular, a familysatisfying the above properties is generalized Morse (Definition 1.11).

Remark 4.12

Observe that property (1) of Theorem 4.11 does not imply property (2): a counter-example is provided by Example 2.2. Furthermore, when (1) and the second case of (2) hold — and thus non-degenerate criticality condition (N) is fulfilled — Lemma 4.5 shows that \(\lambda _{k} \big|_{S}\) has a critical point at \(x\). Property (3) posits non-degeneracy of this point, strengthening the conclusion to non-degenerate criticality condition (S).

Proof of Theorem 4.11

Lemma 4.2 showed that the transversality in the sense of Definition 4.1 is equivalent to the transversality between ℱ and the submanifold \(Q^{n}_{k, \nu}\) at \(x\).

As was mentioned before the discriminant variety \(\mathrm{Discr}_{n}\) admits a stratification satisfying Whitney condition A. For such stratifications,Footnote 13 we have the stratified version of the weakFootnote 14 Thom transversality theorem (see [30, Proposition 3.6] or informal discussions in [7, Sect. 2.3]). Namely, for any \(1\leq r\leq \infty \), the set of maps in \(C^{r}(M,\mathrm{Sym}_{n})\) that are transverse to \(\mathrm{Discr}_{n}\) is open and dense in the Whitney topology in \(C^{r}(M,\mathrm{Sym}_{n})\).

This establishes that property 1 holds for families ℱ from an open and dense set in the Whitney topology of \(C^{r} (M,\mathrm{Sym}_{n})\).

Properties 2–3 are more challenging because they involve properties of the derivatives of ℱ. Let \(J^{1}(M, \mathrm{Sym}_{n})\) denote the space of the 1-jets of smooth families of self-adjoint matrices and let \(\Gamma ^{1}(\mathcal{F}) \subset J^{1}(M, \mathrm{Sym}_{n})\) denote the graph of the 1-jet extension of a smooth family \(\mathcal{F}\colon M\to \mathrm{Sym}_{n}\),

$$ \Gamma ^{1}(\mathcal{F}) := \left \{\big(x,\mathcal{F}(x), d \mathcal{F}(x)\big) \colon x\in M \right \}. $$

We will show that our conclusions follows from the transversality (in the differential topological sense) of \(\Gamma ^{1}\left (\mathcal{F}\right )\) to certain stratified subspaces of \(J^{1}(M, \mathrm{Sym}_{n})\). Then the proposition will follow from a stratified version of the strong (or jet) Thom transversality theorem (see [7, p. 38 and p. 42] as well as [30, Proposition 3.6]): the set of families whose 1-jet extension graph is transverse to a closed stratified subspace is open and dense in the Whitney topology of \(C^{r} (M,\mathrm{Sym}_{n})\) with \(2\leq r\leq \infty \). The theorem holds if the stratified subspace satisfies Whitney condition A.

The jet space \(J^{1}(M, \mathrm{Sym}_{n})\) is the space of triples \((x,A,L)\) such that

$$ x \in M, \qquad A \in \mathrm{Sym}_{n}, \qquad L\in \operatorname{Hom}(T_{x}M, T_{A} \mathrm{Sym}_{n}). $$

Given an integer \(k\), \(1\leq k \leq n\), a matrix \(A \in Q^{n}_{k,\nu}\) and a “differential” \(L \in \operatorname{Hom}(T_{x}M, T_{A} \mathrm{Sym}_{n})\), introduce the linear subspace

$$ \operatorname{Ran}L_{k,A} := \Big\{ \mathcal{U}_{k,A}^{*} (Lv) \mathcal{U}_{k,A} \colon v\in T_{x}M \Big\} \ \subset \ \mathrm{Sym}_{\nu}({\mathbb{F}}), $$
(4.11)

where \(\mathcal{U}_{k,A}\) is a linear isometry from \({\mathbb{F}}^{\nu}\) to the \(\nu \)-dimensional eigenspace \(\mathbf{E}_{k}(A)\) of the eigenvalue \(\lambda _{k}\) of \(A\).

We define the following subsets of \(J^{1}(M, \mathrm{Sym}_{n})\).

$$\begin{aligned} T^{c} &:= \bigcup _{1\leq k\leq n,\ \nu \geq 1} \Big\{ (x,A,L) \colon A \in Q^{n}_{k,\nu},\ (\operatorname{Ran}L_{k,A})^{\perp }\neq 0 \Big\} . \\ T^{c}_{0} &:= \bigcup _{1\leq k\leq n,\ \nu \geq 1} \Big\{ (x,A,L) \colon A \in Q^{n}_{k,\nu},\ \exists B\in (\operatorname{Ran}L_{k,A})^{\perp}\setminus \{0\},\ \det B=0 \Big\} . \end{aligned}$$

We note that the subspace \(\operatorname{Ran}L_{k,A}\) does not depend on the base point \(x\) but it depends the particular choice of the isometry \(\mathcal{U}_{k,A}\). Nevertheless, the properties of \(\operatorname{Ran}L_{k,A}\) used in the definitions of \(T^{c}\) and \(T^{c}_{0}\) above are independent of the choice of the isometry \(\mathcal{U}_{k,A}\).

Lemma 4.13

\(T^{c}\) and \(T^{c}_{0}\) are stratified spaces satisfying Whitney condition A. Every stratum of \(T^{c}\) has codimension at least \(d\) in \(J^{1}(M, \mathrm{Sym}_{n})\), where \(d=\dim M\); every stratum of \(T^{c}_{0}\) has codimension at least \(d+1\).

Proof

Obviously the sets \(T^{c}\) and \(T^{c}_{0}\) are closed, with stratification induced by \(\nu \) and the dimension of \((\operatorname{Ran}L_{k,A})^{\perp}\). Besides, they are smoothFootnote 15 fiber bundles over \(M\) with semialgebraic fibers and therefore satisfy Whitney condition A.

Semialgebraicity of the fibers of \(T^{c}\) and \(T^{c}_{0}\) follows from the Tarski–Seidenberg theorem stating that semialgebraicity is preserved under projections ([13, Theorem 2.2.1], [53, Theorem 8.6.6]). Indeed, let \(\Pi :J^{1}(M, \mathrm{Sym}_{n})\rightarrow M\) be the canonical projection. For each \(x\in M\), we view \(\Pi ^{-1}(x)\cong \mathrm{Sym}_{n}\times \mathrm{Hom}(T_{x}M, \mathrm{Sym}_{n})\) as a vector space by canonically identifying \(T_{A}\mathrm{Sym}_{n}\) with \(\mathrm{Sym}_{n}\). Focusing on \(T^{c}\), the set

$$ \left \{(A,L,\lambda ) \colon \det (A-\lambda I)=0, \ (\operatorname{Ran}L_{k,A})^{ \perp }\neq 0\ \text{for some }k\right \} $$

is semialgebraic in the vector space \(\Pi ^{-1}(x) \times \mathbb{R}\). Its projection on \(\Pi ^{-1}(x)\) is exactly the fiber \(T^{c} \cap \Pi ^{-1}(x)\) and it is semialgebraic by the Tarski–Seidenberg theorem. The argument for \(T^{c}_{0}\) is identical.

Now we prove that every stratum of \(T^{c}\) has codimension at least \(d\). Let \(\Pi _{1}: J^{1}(M, \mathrm{Sym}_{n})\rightarrow M\times \mathrm{Sym}_{n}\) be the canonical projection. Recall that the codimension of \(Q_{k, \nu}\) in \(\mathrm{Sym}_{n}\) is \(s(\nu ) := \dim \mathrm{Sym}_{\nu }- 1\).

We consider two cases. If \(\nu \) is such that \(d\leq s(\nu )\), then \(\dim \operatorname{Ran}L_{k, A} \leq d < \dim \mathrm{Sym}_{\nu}\) and therefore \((\operatorname{Ran}L_{k,A})^{\perp }\neq 0\) for every \(L\). We get \(\Pi _{1}^{-1}(M\times Q_{k,\nu}) = T^{c}\) and has codimension \(s(\nu )\geq d\).

Assume now that \(\nu \) is such that \(d> s(\nu )\). Then for an \(A\in Q_{k, \nu}\) the codimension of the top stratum of \(\Pi _{1}^{-1}(x, A)\cap T^{c}\) in \(\Pi _{1}^{-1}(x, A)\) is equal to the codimension of the subset of matrices of the rank \(\dim \mathrm{Sym}_{\nu}-1 = s(\nu )\) in the space of all \(\bigl(\dim \mathrm{Sym}_{\nu}\bigr)\times d\) matrices, i.e. it is equal toFootnote 16\(d-s(\nu )\). Hence, the codimension in \(J^{1}(M, \mathrm{Sym}_{n})\) of the top stratum of \(T^{c}\) is at least \(d-s(\nu )\) plus \(s(\nu )\), the codimension of \(\Pi _{1}^{-1}(M\times Q_{k,\nu})\) in \(J^{1}(M, \mathrm{Sym}_{n})\).

To estimate the codimension of the strata of \(T^{c}_{0}\), we note that on the top strata of \(T^{c}\), \((\operatorname{Ran}L_{k, A})^{\perp}\) must be one-dimensional. This implies that the codimension of the intersection of \(T^{c}_{0}\) with such strata is at least \(d+1\), while the codimension of intersections of \(T^{c}_{0}\) with the lower strata of \(T^{c}\) is automatically not less than \(d+1\). □

We continue the proof of Theorem 4.11. Let \(\mathcal{F}: M\rightarrow \mathrm{Sym_{n}}\) be a transverse family in the sense of Definition 4.1 so that the graph \(\Gamma ^{1}\left (\mathcal{F}\right )\) of its 1-jet extension is transverse to \(T^{c}\) and \(T^{c}_{0}\). As we mentioned above the set of such maps is open and dense in the required topology. From the transversality of \(T^{c}_{0}\) to the \(d\)-dimensional \(\Gamma ^{1}\left (\mathcal{F}\right )\) we immediately get

$$ \Gamma ^{1}\left (\mathcal{F}\right ) \cap T^{c}_{0} = \emptyset . $$
(4.12)

Choose an arbitrary point \(z\) and eigenvalue \(\lambda _{k}\) (of multiplicity \(\nu \)). If the corresponding \(\operatorname{Ran}{\mathcal {H}}_{z}\) contains a positive definite matrix, properties 2–3 hold trivially. We therefore focus on the opposite case: \(\operatorname{Ran}{\mathcal {H}}_{z} \cap \mathrm{Sym}_{\nu}^{++} = \emptyset \). In the proof of Theorem 1.5 in Sect. 3 we saw that this means \(\left (\operatorname{Ran}{\mathcal {H}}_{z}\right )^{\perp}\) contains a positive semidefinite matrix \(B\). We want to show that \(B\) is actually positive definite.

Assume the contrary, namely \(\det B = 0\); we will work locally in \(J^{1}(M,\mathrm{Sym}_{n})\) around the point in the graph \(\Gamma ^{1}\left (\mathcal{F}\right )\),

$$ Z = (z, A, L) := \big(z,\mathcal{F}(z), d\mathcal{F}(z)\big). $$

We first observe that \(\operatorname{Ran}L_{k,A}\) defined in (4.11) coincides with \(\operatorname{Ran}{\mathcal {H}}_{z}\) defined via (1.5). Since \(B \in \left (\operatorname{Ran}{\mathcal {H}}_{z}\right )^{\perp}\setminus \{0\}\) and \(\det B=0\), we conclude that \(Z \in \Gamma ^{1}\left (\mathcal{F}\right ) \cap T^{c}_{0}\), contradicting (4.12). Property 2 is now verified.

We now verify property 3. We have a positive definite \(B \in \left (\operatorname{Ran}{\mathcal {H}}_{z}\right )^{\perp}\), therefore, by Lemma 4.5, \(z\) is a smooth critical point along its constant multiplicity stratum \(S = S_{z}\). Also from the existence of \(B\), we have

$$ Z \in \Gamma ^{1}\left (\mathcal{F}\right ) \cap T^{c}. $$

Denote by \(T^{c}_{k,\nu}\) the stratum of \(T^{c}\) containing the point \(Z\). By definition of transversality to a stratified space, \(\Gamma ^{1}\left (\mathcal{F}\right )\) is transverse to \(T^{c}_{k,\nu}\) in \(J^{1}(M,\mathrm{Sym}_{n})\). By dimension counting and transversality, \(\left (\operatorname{Ran}L_{k,A}\right )^{\perp}\) is 1-dimensional along \(T^{c}_{k,\nu}\).

Define two submanifolds of \(J^{1}(M,\mathrm{Sym}_{n})\),

$$\begin{aligned} J_{k,\nu} &:= \left \{ (x,A,L) \in J^{1}(M,\mathrm{Sym}_{n}) \colon A \in Q^{n}_{k,\nu} \right \}, \\ J_{S} &:= \left \{ (x,A,L) \in J^{1}(M,\mathrm{Sym}_{n}) \colon x \in S,\, A \in Q^{n}_{k,\nu},\, L(T_{x}S) \subset T_{A} Q^{n}_{k,\nu} \right \}\ \subset \ J_{k,\nu}. \end{aligned}$$

To see that \(J_{S}\) is a manifold, we note that for each fixed \((x, A)\in S \times Q^{n}_{k,\nu}\), the set of admissible \(L\) in \(J_{S}\) is a vector space smoothly depending on \((x, A)\). In other words, \(J_{S}\) is a smooth vector bundle over \(S\times Q_{k, \nu}\).

We now use the following simple fact (twice): If \(U\), \(V\), and \(W\) are submanifolds of \(M\) such that \(W\) is transverse to \(U\) in \(M\) and \(U \subset V\), then \(W \cap V\) is transverse to \(U\) in \(V\). Since \(T^{c}_{k,\nu} \subset J_{k,\nu}\), we conclude that \(\Gamma ^{1}\left (\mathcal{F}\right ) \cap J_{k,\nu}\) is transverse to \(T^{c}_{k,\nu}\) in \(J_{k,\nu}\). And now, since

$$ \Gamma ^{1}\left (\mathcal{F}\right ) \cap J_{k,\nu} = \left \{\big(x, \mathcal{F}(x), d\mathcal{F}(x)\big) \colon x\in S \right \} \quad \subset \quad J_{S}, $$
(4.13)

we conclude that \(T^{c}_{k,\nu} \cap J_{S}\) is transverse to \(\Gamma ^{1}\left (\mathcal{F}\right ) \cap J_{k,\nu}\) in \(J_{S}\).

We have successfully localized our \(x\) to \(S\). The space (4.13) looks similar to the graph of the 1-jet extension of \(\mathcal{F}\big|_{S}\), except that the differential \(d\mathcal{F}(x)\) is defined on \(T_{x} M\) and not on \(T_{x} S\). Consider the map \(\Psi : J_{S} \to J^{1}(S,\mathbb{R})\),

$$ \Psi (x, A, L) := \Bigl(x,\, \widehat{\lambda}_{k}(A), \, d\bigl( \widehat{\lambda}_{k}|_{Q_{k, \nu}^{n}}\bigr)(A) \circ L|_{T_{x}S} \Bigr), $$

which is well-defined and smooth because \(\widehat{\lambda}_{k}\) (defined in (1.1)) is smooth when restricted to \(Q_{k, \nu}^{n}\), \(d\bigl(\widehat{\lambda}_{k}|_{Q_{k, \nu}^{n}}\bigr)(A) : T_{A} Q_{k, \nu}^{n} \to \mathbb{R}\) and \(L|_{T_{x}S} : T_{x}S \to T_{A} Q_{k, \nu}^{n}\) by definition of \(J_{S}\).

We want to show that \(\Psi \) is a submersion and therefore preserves transversality. To prove submersivity of a map it is enough to prove that, for any point \(q\) in the domain, any smooth curve in the codomain of the map passing through the image of \(q\) is the image of a smooth curve in the domain passing through \(q\).

Let \((x_{0},A_{0},L_{0})\) be an arbitrary point on \(J_{S}\). We will work in a local chart around \(x_{0}\in M\) in which \(S\) is a subspace. Let \(P\) denote the projection in \(T_{x}M\) onto \(T_{x}S\), which now does not depend on the point \(x\in S\). Consider a smooth curve \((x_{t},f_{t},g_{t})\) in \(J^{1}(S,R)\) such that \(\Psi (x_{0},A_{0},L_{0}) = (x_{0},f_{0},g_{0})\). Then the smooth curve

$$ \Big( x_{t},\, A_{0} + (f_{t}-f_{0})I,\, L_{0} + I (g_{t} - g_{0}) P \Big), $$

is in \(J_{S}\) and is mapped to \((x_{t},f_{t},g_{t})\) by \(\Psi \). To see this, observe that all sets \(Q_{k, \nu}^{n}\) are invariant under the addition of a multiple of the identity matrix and also that \(\widehat{\lambda}_{k}(A+\mu I) = \widehat{\lambda}_{k}(A) + \mu \) and therefore \(d\bigl(\widehat{\lambda}_{k}|_{Q_{k, \nu}^{n}}\bigr)(A)I = 1\).

We now have that \(\Psi \big(T^{c}_{k,\nu} \cap J_{S}\big)\) is transverse to \(\Psi \left (\Gamma ^{1}\left (\mathcal{F}\right ) \cap J_{k,\nu} \right )\) in \(J^{1}(S,\mathbb{R})\). It is immediate that

$$ \Psi \left (\Gamma ^{1}\left (\mathcal{F}\right ) \cap J_{k,\nu} \right ) = \Gamma ^{1}\left ( \widehat{\lambda}_{k} \circ \mathcal{F}|_{S} \right ). $$

We now argue that

$$ \Psi \big(T^{c}_{k,\nu} \cap J_{S}\big) = \left \{(x, \widehat{\lambda}_{k}(A), 0) \colon x\in S,\, A\in Q_{k, \nu}^{n} \right \}. $$
(4.14)

Indeed, at \(Z\), the space \(\left (\operatorname{Ran}L_{k,A}\right )^{\perp}\) is spanned by a positive definite matrix and this property holds in a small neighborhood of \(Z\) in \(T^{c}_{k,\nu}\). By Lemma 4.5, \(L_{k,A} |_{T_{x}S} = 0\), while by Hellmann-Feynman theorem,

$$ d\bigl(\widehat{\lambda}_{k}|_{Q_{k, \nu}^{n}}\bigr)(A) \circ L|_{T_{x}S} = \frac{1}{\nu }\operatorname{Tr}\left (\mathcal{U}^{*}_{k,A} L|_{T_{x}S} \mathcal{U}_{k,A} \right ) = \frac{1}{\nu }\operatorname{Tr}\left ( L_{k,A} |_{T_{x}S} \right ) = 0. $$

Finally, it is well known that the transversality of the graph \(\Gamma ^{1}\left ( \widehat{\lambda}_{k} \circ \mathcal{F}|_{S} \right )\) to the 0-section space (4.14) is equivalent to the non-degeneracy of the critical point \(z\) of \(\widehat{\lambda}_{k} \circ \mathcal{F}|_{S}\), see [43, Sect. 6.1] or [9, Lem 5.23]. Property 3 is now established.  □

5 Topological change in the sublevel sets: part (1) of Theorem 1.14

In this section we describe the change in the sublevel sets of the eigenvalue \(\lambda _{k}\) when passing through a non-degenerate topologically critical point \(x\). It will be expressed in terms of the data introduced in Theorem 1.12, namely the Morse index \(\mu (x)\) of \(\lambda _{k}\) restricted to the local constant multiplicity stratum \(S\) attached to the point \(x\), the multiplicity \(\nu \), and the relative index \(i=i(x)\) introduced in equation (1.8). As a result of the section, we will establish part (1) of Theorem 1.14.

First we reduce our considerations to the directions transverse to the constant multiplicity stratum \(S\) at a critical point \(x\).

Lemma 5.1

Let \(N\) be a submanifold of \(M\) of dimension \(\dim N = \operatorname{codim}_{M} S = s(\nu )\) which intersects \(S\) transversely at \(x\). Then, for small enough \(U\) and \(\varepsilon >0\),

$$ H_{r}\Big(U^{+\varepsilon }_{x}(\lambda _{k}), U^{-\varepsilon }_{x}( \lambda _{k})\Big) \cong H_{r-\mu (x)}\Big( U^{+\varepsilon }_{x} \left (\lambda _{k}\big|_{N}\right ), U^{-\varepsilon }_{x}\left ( \lambda _{k}\big|_{N}\right )\Big). $$

Proof

By definition, see [35, Sec I.3.5], the local Morse data is

$$ \Big(U^{-\varepsilon ,+\varepsilon }_{x}\left (\lambda _{k}\right ), \partial U^{-\varepsilon }_{x}\left (\lambda _{k}\right )\Big), $$

where

$$ U^{-\varepsilon ,+\varepsilon }_{x} := U^{+\varepsilon }_{x} \setminus U^{-\varepsilon }_{x}. $$
(5.1)

The normal data and the tangential data is simply the data of \(\lambda _{k}\) restricted to the submanifolds \(N\) and \(S\), respectively, see [35, Sec I.3.6]. The normal data is

$$ (J,K) \cong \Big(U_{x}^{-\varepsilon ,+\varepsilon }\left (\lambda _{k} \big|_{N}\right ), \partial U_{x}^{-\varepsilon }\left (\lambda _{k} \big|_{N}\right )\Big), $$
(5.2)

and, by the local version of the main theorem of the classical Morse theory [51, Theorem 3.2], the tangential data is

$$ (P, Q) \cong \left (\mathbb{B}^{\mu (x)}, \partial \mathbb{B}^{\mu (x)} \right ), $$

where \(\mathbb{B}^{\mu (x)}\) denotes the \(\mu (x)\)-dimensional ball.

We already established in Corollary 4.9 that \(\Bigl(x,\lambda _{k}(x)\Bigr)\) is a nondepraved point of the corresponding map. We can now use [35, Thm I.3.7] to decompose the local Morse data into a product of tangential and normal data. More precisely, if the tangential data is \((P,Q)\) and the normal data is \((J,K)\), the local Morse data is homotopy equivalent to \(\bigl(P\times J, (P\times K) \cup (Q\times J)\bigr)\).

We want to compute

$$\begin{aligned} H_{r}\Big(U^{+\varepsilon }_{x}(\lambda _{k}), U^{-\varepsilon }_{x}( \lambda _{k})\Big) &\cong H_{r}\Big(U^{-\varepsilon ,+\varepsilon }_{x} \left (\lambda _{k}\right ), \partial U^{-\varepsilon }_{x}\left ( \lambda _{k}\right )\Big) \\ &\cong H_{r}\Big(P\times J, (P\times K) \cup (Q\times J) \Big), \end{aligned}$$
(5.3)

the first equality being by Excision Theorem and the second by [35, Thm I.3.7] (and homotopy invariance). By the relative version of the Künneth theorem, see [23, Proposition 12.10], we have the following short exact sequence

$$\begin{aligned} 0 \rightarrow \bigoplus _{j+k=r} H_{j}(P,Q) \otimes H_{k}(J,K) & \rightarrow H_{r}\big(P\times J, (P\times K) \cup (Q\times J)\big) \\ &\rightarrow \bigoplus _{j+k=r-1} \operatorname{Tor}_{1}\big(H_{j}(P,Q), H_{k}(J,K) \big) \rightarrow 0. \end{aligned}$$
(5.4)

Since

$$ H_{j}(P, Q) = H_{j} \left (\mathbb{B}^{\mu (x)}, \partial \mathbb{B}^{ \mu (x)}\right ) = \widetilde{H}_{j}(\mathbb{S}^{\mu (x)}) = \textstyle\begin{cases} 0,& j\neq \mu (x), \\ \mathbb{Z}, & j=\mu (x), \end{cases} $$
(5.5)

where \(\widetilde{H}_{*}\) stands for the reduced homology, are free, the torsion product terms in (5.4) are all 0. We therefore get

$$\begin{aligned} H_{r}\big(P\times J, (P\times K) \cup (Q\times J)\big) &\cong \bigoplus _{j+k=r} H_{j}(P,Q) \otimes H_{k}(J,K) \\ &= \mathbb{Z}\otimes H_{r-\mu (x)}(J,K) = H_{r-\mu (x)}(J,K), \end{aligned}$$
(5.6)

where we used (5.5) again. Combining (5.3) with (5.6) and (5.2), we obtain the result. □

The preceding lemma tells us that we can restrict our attention to the case \(M=N\). In this case the constant multiplicity stratum attached to \(x\) is the isolated point itself and \(\dim M =\dim N = s(\nu )\) (see equation (4.2) for the formula defining \(s(\nu )\) and some explanations). Since the considerations are purely local, we can assume that \(M= \mathbb{R}^{s(\nu )}\), \(x=0\), and \(\mathcal{F}(x)=0\).

Lemma 5.2

Let \(\mathcal{F}:\mathbb{R}^{s(\nu )} \rightarrow \mathrm{Sym}_{\nu}( \mathbb{F})\), \(\mathcal{F}(0)=0\), be a smooth family satisfying at \(x=0\) non-degenerate criticality conditions (N) and (S). Then there exists a neighborhood \(U\) of 0 in \(\mathbb{R}^{s(\nu )}\), such that for sufficiently small \(\varepsilon >0\) the sublevel set \(U_{0}^{+\varepsilon }(\lambda _{k})\) deformation retracts to the set \(D_{k,\varepsilon }\cup U_{0}^{-\varepsilon }(\lambda _{k})\), where

$$ D_{k,\varepsilon }:= \Big\{ x\in U \colon -\varepsilon \leq \lambda _{1} \big(\mathcal{F}(x)\big) = \cdots = \lambda _{k}\big(\mathcal{F}(x) \big) \leq 0 \Big\} . $$
(5.7)

Remark 5.3

It is instructive to consider what happens in the boundary cases \(k=1\) and \(k=\nu \). We will see that condition (N) implies that ℱ is injective and \(\mathcal{F}(U)\) does not contain any semidefinite matrices except 0 (for a suitably small \(U\)). Therefore, when \(k=1\),

$$ U_{0}^{\varepsilon }(\lambda _{1}) = U = D_{1,\varepsilon } \cup U_{0}^{- \varepsilon }(\lambda _{1}) $$

and no retraction is needed.

Similarly, when \(k=\nu \), we have \(U_{0}^{-\varepsilon }(\lambda _{\nu}) = \emptyset \) and

$$ D_{\nu ,\varepsilon } = \{x \in U \colon \mathcal{F}(x)=0\} = 0, $$

and the Lemma reduces to the claim that \(U_{0}^{\varepsilon }(\lambda _{\nu})\) deformation retracts to a point. Furthermore, the set defined in (5.1) is \(U_{0}^{-\varepsilon ,+\varepsilon }(\lambda _{\nu})= U_{0}^{ \varepsilon }(\lambda _{\nu})\). In the proof of Lemma 5.2, we will see that \(U_{0}^{\varepsilon }(\lambda _{\nu})\) is diffeomorphic to the intersection of a smooth \(s(\nu )\)-dimensional manifold through 0 with a small ball around 0. Therefore, we obtain that the normal Morse data in (5.2) is homeomorphic to the pair

$$ (J,K) = (\mathbb{B}^{s(\nu )}, \emptyset ). $$

Proof of Lemma 5.2

Since \(\mathcal{F}(0)=0 \in \mathrm{Sym}_{\nu}\), the eigenspace \(\mathbf{E}_{k}\) in (1.5) is the whole space \(\mathbb{R}^{\nu}\) and \({\mathcal {H}}_{0} = d\mathcal{F}(0)\). From condition (N) we get that \(\operatorname{Ran}d\mathcal{F}(0) = \operatorname{span}\{B\}^{\perp}\) with \(B\in \mathrm{Sym}_{\nu}^{++}\). By the definition of \(s(\nu )\) and dimension counting we conclude that \(d\mathcal{F}(0) \colon \mathbb{R}^{s(\nu )} \to \mathrm{Sym}_{\nu}\) is injective (since ℱ is a map between vector spaces, we can consider its differential to be a map between the same vector spaces).

Choose a neighborhood \(W\) of 0 such that \(d\mathcal{F}(x)\) remains close to \(d\mathcal{F}(0)\) for all \(x\in W\) (and, in particular, injective) and the suitably scaled normal to \(\operatorname{Ran}d\mathcal{F}(x)\) remains close to \(B\) (and, in particular, positive definite). For future reference we note that, under these smallness conditions, ℱ is a diffeomorphism from \(W\) to \(\mathcal{F}(W)\) and the latter set contains no positive or negative semidefinite matrices except 0.

Denote by \(\mathfrak{B}_{\delta}\) the open ball in \(\mathrm{Sym}_{\nu}\) of radius \(\delta \) around the origin in the operator norm. Choose \(\delta \) sufficiently small so that \(\partial \mathfrak{B}_{\delta }\cap \mathcal{F}(\partial W) = \emptyset \). This is possible because \(d\mathcal{F}(0)\) is injective and the operator norm on \(\mathcal{F}(\partial W)\) is bounded from below. Now we take \(U = \mathcal{F}^{-1}\big(\mathfrak{B}_{\delta }\cap \mathcal{F}(W) \big)\). This set is non-empty because it contains 0; it has the useful property that the operator norm (equivalently, spectral radius) of \(\mathcal{F}(x)\) is equal to \(\delta \) for \(x\in \partial U\) and is strictly smaller than \(\delta \) on \(U\).

Given a matrix \(F_{0} \in \mathcal{F}\left ( U_{0}^{\varepsilon }(\lambda _{k}) \right )\) we will describe the retraction trajectory \(\Gamma _{F_{0}}(t)\), \(t\in [0,1]\), starting at \(F_{0}\). The trajectory will be piecewise smooth, with the pieces described recursively. Define, for \(m \leq k\),

$$ G^{k}_{m} := \left \{F \in \mathrm{Sym}_{\nu }\colon \cdots < \lambda _{m}(F) = \cdots = \lambda _{k}(F) \leq \cdots \right \}, $$

which is the set of matrices with a gap below the eigenvalue \(\lambda _{m}(F)\) but no gap between \(\lambda _{m}(F)\) and \(\lambda _{k}(F)\). It is easy to see that

$$ \overline{G^{k}_{m}} \setminus G^{k}_{m} = \bigcup _{1\leq m' < m} G^{k}_{m'}, $$
(5.8)

which we will call the egress set of \(G^{k}_{m}\).

Assume that \(\lambda _{k}(F_{0}) > -\varepsilon \) and that \(F_{0} \in G^{k}_{m_{0}}\) for some \(m_{0}>1\); let \(t_{0}=0\). Define two complementary spectral (Riesz) projectors corresponding to \(F_{0}\),

$$ P_{-} := P\big(\{\lambda < \lambda _{k}(F_{0})\}\big), \qquad P_{+} := P\big(\{\lambda \geq \lambda _{k}(F_{0})\}\big), $$

and consider the affine plane in \(\mathrm{Sym}_{\nu}\) defined by

$$ \{F_{0} - 2\varepsilon s P_{+} + r P_{-} \colon s,r \in \mathbb{R}\}. $$
(5.9)

Since the projector \(P_{+} \in \mathrm{Sym}_{\nu}^{+}\) is non-zero and the normal to \(d\mathcal{F}(x)\) is positive definite locally around \(x=0\), this affine plane is transverse to \(\mathcal{F}(U)\) in \(\mathrm{Sym}_{\nu}\). Their intersection is nonempty because it contains \(F_{0}\) and thus, by the Implicit Function Theorem, it is a 1-dimensional embedded submanifold of \(\mathcal{F}(U)\). Denote by \(\gamma _{F_{0}}^{m_{0}}\) the connected component of the intersection that contains \(F_{0}\).

Furthermore, implicit differentiation of the equation \(F_{0} - 2\varepsilon s P_{+} + r P_{-} = \mathcal{F}(x)\) at a point \(\mathcal{F}(x) \in \gamma _{F_{0}}^{m_{0}}\) shows that

$$ \frac{dr}{ds} = 2\varepsilon \frac{\langle B_{x}, P_{+}\rangle}{\langle B_{x}, P_{-}\rangle}, \qquad r(0)=0, $$
(5.10)

where \(B_{x}\) is either positive or negative definite symmetric matrix that spans the orthogonal complement to the differential \(d\mathcal{F}(x)\). Since \(\langle B_{x}, P_{-}\rangle >0\) for any \(x\in U\), the set of points of \(\gamma _{F_{0}}^{m_{0}}\) where \(\gamma _{F_{0}}^{m_{0}}\) can be locally represented as a function of \(s\) is both open and closed in the subspace topology of \(\gamma _{F_{0}}^{m_{0}}\). We conclude that \(\gamma _{F_{0}}^{m_{0}}\) can be represented by a function of \(s\) globally, i.e. as long as the closure of \(\gamma _{F_{0}}^{m_{0}}\) in \(\mathrm {Sym}_{\nu}\) does not hit the boundary of \(\mathcal{F}(U)\). In a slight abuse of notation, we will refer to this function as \(\gamma _{F_{0}}^{m_{0}}\). Figure 5(left) shows examples of the curves \(\gamma _{F_{0}}^{1}(s)\) for the family \(\mathcal{F}_{1}\) from equation (2.1) and two different initial points \(F_{0}\).

Fig. 5
figure 5

Left: the curves \(\gamma _{F_{0}}^{2}(t)\) for \(k=2\) and \(\mathcal{F}_{1}\) from equation (2.1), for a pair of initial points \(F_{0}\). The curves are shown in the 2-dimensional plane \(\mathcal{F}_{1}(U)\). The egress set for \(G_{2}^{2}\) is the point \((0,0)\). Note that the curves intersect on the egress set, which is the reason we chose to specify the flow rather than the vector field. Middle and right: evolution of the eigenvalues of \(\Gamma _{F_{0}}(t)\) for a pair of \(F_{0}\) with \(k=3\) and the family \(\mathcal{F}(U)=\{F\in \mathrm{Sym}_{4}: \operatorname{Tr}(F)=0\}\). Egress points correspond to points where \(\lambda _{k}\) increases its multiplicity (the latter is shown with thicker lines)

The matrices on the curve \(\gamma _{F_{0}}^{m_{0}}(s)\) have fixed eigenspaces but their eigenvalues change with \(s\). For small positive \(s\) the eigenvalues \(\lambda _{m_{0}} = \lambda _{k}\) and above decrease with the constant speed \(2\varepsilon \) while the eigenvalues below \(\lambda _{m_{0}}\) increase because the derivative in (5.10) is positive. This closes the gap below the eigenvalue \(\lambda _{m_{0}}\) and decreases the spectral radius (operator norm) of \(\gamma _{F_{0}}^{m}(s)\). Therefore, the curve will intersect the egress set (5.8) at some time \(\hat{s}>0\) before it reaches the boundary \(\mathcal{F}(\partial U)\).

Setting \(F_{1} := \gamma _{F_{0}}^{m_{0}}(\hat{s})\) and \(t_{1}=t_{0}+\hat{s}\), we determine \(m_{1} < m_{0}\) such that \(F_{1}\in G^{k}_{m_{1}}\) and repeat the process starting at \((t_{1},F_{1})\). We then join the pieces together,

$$ \Gamma _{F_{0}}(t) = \gamma _{F_{j}}^{m_{j}}(t-t_{j}), \qquad t_{j} \leq t \leq t_{j+1}. $$

There are two ways in which we will terminate this recursive process. If an egress point \(F_{n} \in G^{k}_{1}\) is reached (which has no eigenvalues strictly smaller than \(\lambda _{k}(F_{n})\) and equation (5.10) becomes undefined due to \(P_{-}=0\)), we continue \(\Gamma _{F_{0}}\) as a constant, \(\Gamma _{F_{0}}(t) = F_{n}\) for \(t\geq t_{n}\). An example of this is shown in Fig. 5(middle). The case \(m_{0}=1\) which we previously excluded can now be absorbed into this rule.

Alternatively, since \(\lambda _{k}\) decreases from an initial value below \(\varepsilon \) at the constant rate \(2\varepsilon \), we will reach a point in \(U_{0}^{-\varepsilon }(\lambda _{k})\) at some \(\tilde{t}\leq 1\). In this case we also continue \(F(t)\) as a constant for \(t>\tilde{t}\), see Fig. 5(right) for an example (with \(\tilde{t}=2/3\) in this particular case). The case \(\lambda _{k}(F_{0}) \leq -\varepsilon \) can now be absorbed into the above description by setting \(\tilde{t} = 0\).

The preceding paragraphs show that the final values \(\mathcal{F}^{-1}\left (\Gamma _{F_{0}}(1)\right )\) belong to the set \(D_{k,\varepsilon }\cup U_{0}^{-\varepsilon }(\lambda _{k})\), see equation (5.7), and that \(x \mapsto \Gamma _{\mathcal{F}(x)}(t)\) acts as identity on \(D_{k,\varepsilon }\cup U_{0}^{-\varepsilon }(\lambda _{k})\) for all \(t\). This suggest that we have a deformation retraction

$$ (x,t) \mapsto \mathcal{F}^{-1} \left (\Gamma _{\mathcal{F}(x)}(t) \right ), $$

if we establish that the trajectories \(\Gamma _{F}(t)\) define a continuous mapping \(\mathcal{F}(U) \times [0,1] \to \mathcal{F}(U)\).

We first note that each trajectory is continuous in \(t\) by construction. Therefore, we need to show that starting at a point \(F'\) which is near \(F\) will result in \(\Gamma _{F'}(t)\) being near \(\Gamma _{F}(t)\). A perturbation of arbitrarily small norm may split multiple eigenvalues, therefore if \(F \in G_{m}^{k}\) with \(m < k\), then, in general, \(F' \in G^{k}_{m'}\) with \(m\leq m'\) (in fact, generically, \(m'=k\)). However,

$$\begin{aligned} \left |\lambda _{m}(F') - \lambda _{k}(F')\right | &= \left |\lambda _{m}(F') - \lambda _{m}(F)\right | + \left |\lambda _{k}(F) - \lambda _{k}(F') \right | \\ &\leq C |F - F'|, \end{aligned}$$

with some \(F\)-independentFootnote 17 constant \(C\), and therefore after a time of order \(C |F - F'| / 2\varepsilon \), the \(k\)-th eigenvalue \(\Gamma _{F'}\) will collide with \(m\)-th eigenvalue. To put it more precisely, there is \(\tau \), \(0 < \tau \leq C |F - F'| / 2\varepsilon \), such that \(\Gamma _{F'}(\tau ) \in G^{k}_{m}\). By choosing \(|F - F'|\) to be sufficiently small (while \(\varepsilon \) is small but fixed), we ensure that \(\Gamma _{F}(\tau )\) is still in \(G^{k}_{m}\). By noting that the trajectories \(\Gamma _{F'}(t)\) are continuous in \(t\) uniformly with respect to \(F'\), we conclude that \(\Gamma _{F'}(\tau )\) is close to \(\Gamma _{F}(\tau )\).

For two initial points \(F\) and \(F'\) in the same set \(G_{m}^{k}\), the curves \(\gamma _{F}^{m}(s)\) and \(\gamma _{F'}^{m}(s)\) will remain nearby for any bounded time \(s<1\). This can be seen, for example, as stability of the transverse intersection of the manifold \(\mathcal{F}(U)\) and the manifold (5.9). The stability is with respect to the parameters \(F\), \(P_{+}\) and \(P_{-}\) and the spectral projections are continuous in \(F\) precisely because \(F'\) belongs the same set \(G_{m}^{k}\).

We now chain the two argument in the alternating fashion: short time to bring two points to the same set \(G^{k}_{m}\), long time along smooth trajectories until one of the trajectories reaches an egress point, then short time to bring them to the same set \(G^{k}_{m_{1}}\) and so on. Since we iterate a bounded number of times, the composition is a continuous mapping. □

As before (see the paragraph before the formulation of Theorem 1.14), let \(\mathcal{C}Y\) and \(\mathcal {S} Y\) be the cone and the suspension of a topological space \(Y\). Also let \(\Sigma Y=\mathcal {S} Y/ \bigl(\{y_{0}\}\times I\bigr)\) be the reduced suspension of \(Y\), where \(y_{0}\in Y\). Note that if \(Y\) is a CW-complex, then \(\Sigma Y\) is homotopy equivalent to \(\mathcal {S} Y\). In Lemma 5.2 we saw that \(U_{0}^{\varepsilon }(\lambda _{k})\) is homotopy equivalent to the union of \(U_{0}^{-\varepsilon }(\lambda _{k})\) and the space \(D_{k,\varepsilon }\) which we aim to understand further. We will now show that \(D_{k,\varepsilon }\) is a cone of the space \(\mathcal{R}_{\nu}^{i}\) introduced in equation (1.12).

In the present setting (namely, \(\mathcal{F}(0)=0 \in \mathrm{Sym}_{\nu}(\mathbb{F})\)), the relative index \(i\) is related to \(k\) via \(i = \nu - k + 1\), cf. (1.8). Notationally, it will be more convenient to use \(k\) instead of \(i\), so we introduce a slight change in the notation,

$$ \mathcal{R}_{\nu}^{i} = \mathcal {R}_{k, \nu} := \{R\in \mathrm{Sym}^{+}_{ \nu }\colon \operatorname{Tr}R=1, \dim \operatorname{Ker}R \geq k\}. $$
(5.11)

Lemma 5.4

Let ℱ, \(U\) and \(D_{k,\varepsilon }\) be as in Lemma 5.2. Then, for sufficiently small \(\varepsilon >0\), the topological space \(D_{k,\varepsilon }\) is homeomorphic to \(\mathcal{C} \mathcal{R}_{k,\nu}\) and the topological space

$$ \left (U_{0}^{-\varepsilon }(\lambda _{k}) \cup D_{k,\varepsilon } \right ) / U_{0}^{-\varepsilon }(\lambda _{k}), \qquad 1 \leq k < \nu , $$
(5.12)

is homeomorphic to \(\mathcal{S} \mathcal{R}_{k,\nu}\).

Proof

The choice of \(U\) ensured that ℱ is a homeomorphism from \(D_{k,\varepsilon }\subset U\) to

$$ \mathcal{F}(D_{k,\varepsilon }) := \big\{ F \in \mathcal{F}(U): - \varepsilon \leq \lambda _{1}(F) = \cdots = \lambda _{k}(F) \leq 0 \big\} . $$

We will now describe the homeomorphism from \(\mathcal{C} \mathcal{R}_{k,\nu}\) to \(\mathcal{F}(D_{k,\varepsilon })\).

Given a point \(R \in \mathcal{R}_{k,\nu}\), consider the intersection of \(\mathcal{F}(U)\) with the plane

$$ \left \{ -\varepsilon t I + r R \colon t, r \in \mathbb{R}\right \}. $$
(5.13)

Mimicking the proof of Lemma 5.2, we conclude that the intersection is a 1-dimensional submanifold which has a connected component \(\phi _{R}\) containing the matrix 0. Moreover, implicit differentiation at \(\mathcal{F}(x) \in \phi _{R}\) yields

$$ \frac{dr}{dt} = \varepsilon \frac{\langle B_{x}, I\rangle}{\langle B_{x},R\rangle}, $$
(5.14)

therefore the submanifold can be represented by a function of \(t\),

$$ \Phi (R,t) = -\varepsilon t I + r(t) R, \qquad r(0)=0. $$

When \(t\in [0,1]\), we also have \(\Phi (R,t) \in \mathcal{F}(D_{k,\varepsilon })\) because equation (5.14) implies \(r(t) \geq 0\). We remark that \(\langle B_{x},R\rangle \) is bounded away from zero uniformly in \(x\in U\) and \(R \in \mathcal{R}_{k,\nu}\), therefore, when \(\varepsilon \) is sufficiently small, \(\Phi (R,t)\) will remain in \(\mathcal{F}(U)\) until \(t>1\). Thus the function \(\Phi \) is a well-definedFootnote 18 mapping from \(\mathcal{C} \mathcal{R}_{k,\nu}\) to \(\mathcal{F}(D_{k,\varepsilon })\). It is evidently continuous.

The properties of ℱ imply that \(\mathcal{F}(U)\) contains no multiples of identity and no positive semidefinite matrices except for the zero matrix. Therefore, for every \(F\in \mathcal{F}(D_{k,\varepsilon })\), \(F\neq 0\),

$$ R_{F} := \frac{F - \lambda _{1}(F)I}{\operatorname{Tr}\big(F - \lambda _{1}(F)I\big)} \in \mathcal{R}_{k,\nu}, $$
(5.15)

is well-defined, and we also have \(-\varepsilon \leq \lambda _{1}(F) < 0\). Thus

$$ \Phi ' \colon F \mapsto \textstyle\begin{cases} \left ( R_{F}, -\frac{\lambda _{1}(F)}{\varepsilon } \right ), & \text{if } F \neq 0, \\ (*, 0), & \text{if } F=0, \end{cases} $$

is a well-defined continuous mapping from \(\mathcal{F}(D_{k,\varepsilon })\) to \(\mathcal{C} \mathcal{R}_{k,\nu}\). It remains to verify that \(\Phi '\) is the inverse of \(\Phi \). It is immediate that \(\Phi ' \circ \Phi = \mathrm{id}\). To prove that \(\Phi \circ \Phi ' = \mathrm{id}\) we observe that the intersection \(\phi _{R_{F}}\), corresponding to \(R_{F}\) of equation (5.15), contains \(F\); we only need to show that \(F\) and 0 belong to the same connected component of \(\phi _{R}\).

The point \(F\) on the plane (5.13) corresponds to \(t = -\lambda _{1}(F)/\varepsilon > 0\) and some \(r=r'\). Decreasing \(t\) from this point decreases \(r(t)\) and therefore decreases the operator norm of \(\Phi \). Thus we will not hit the boundary of \(\mathcal{F}(D_{k,\varepsilon })\) as long as \(t\geq 0\). Therefore, we will arrive at the matrix 0 while staying on the same connected component.

We have established the first part of the lemma. To understand the quotient in (5.12), we note that

$$\begin{aligned} &\left (U_{0}^{-\varepsilon }(\lambda _{k}) \cup D_{k,\varepsilon } \right ) / U_{0}^{-\varepsilon }(\lambda _{k}) \\ &\qquad = D_{k,\varepsilon }/ \left (U_{0}^{-\varepsilon }(\lambda _{k}) \cap D_{k,\varepsilon } \right ) \\ &\qquad \cong \mathcal{F}(D_{k,\varepsilon }) / \big\{ F \in \mathcal{F}(D_{k, \varepsilon }): -\varepsilon = \lambda _{1}(F) = \cdots = \lambda _{k}(F) \big\} \\ &\qquad = \mathcal{F}(D_{k,\varepsilon }) / \big\{ \Phi (R,1) \colon R \in \mathcal{R}_{k,\nu} \} \cong \mathcal{S} \mathcal{R}_{k,\nu}, \end{aligned}$$

completing the proof. □

Proof of Theorem 1.14, part (1)

We review how the preceding lemmas link together to give the proof of the theorem. Lemma 5.1 shows that the smooth part \(\mathcal{F}\big|_{S}\) gives the classical contribution to the sublevel set quotient and we can focus on understanding the transverse part \(\mathcal{F}\big|_{N}\). We remark that by Corollary 4.6 the point \(x\) remains non-degenerate topologically critical when we replace ℱ with \(\mathcal{F}\big|_{N}\).

Combining Lemmas 5.1, 5.2, and 5.4, we compute the \(r\)-th homology group

$$\begin{aligned} H_{r}\Big(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{-\varepsilon }( \lambda _{k})\Big) &\cong H_{r-\mu (x)}\Big(U_{x}^{+\varepsilon }( \lambda _{k}|_{N}),\, U_{x}^{-\varepsilon }(\lambda _{k}|_{N})\Big) \\ &\cong H_{r-\mu (x)}\Big(U_{x}^{-\varepsilon }(\lambda _{k}|_{N}) \cup D_{k,\varepsilon },\, U_{x}^{-\varepsilon }(\lambda _{k}|_{N}) \Big) \\ &\cong \widetilde{H}_{r-\mu (x)}\Big( \big(U_{x}^{-\varepsilon }( \lambda _{k}|_{N}) \cup D_{k,\varepsilon }\big) \big/ U_{x}^{- \varepsilon }(\lambda _{k}|_{N})\Big) \\ &\cong \widetilde{H}_{r-\mu (x)}\Big(\mathcal{S}\mathcal{R}_{k,\nu} \Big). \end{aligned}$$

Taking into account (5.11), we obtain the claim for \(k < \nu \). The answer for \(k=\nu \) (equivalently, \(i=1\)) was already established in Remark 5.3. □

6 Topological change in the sublevel sets: part (2) of Theorem 1.14

We will go from part (1) to part (2) of Theorem 1.14 by relating the space \(\mathcal {R}_{k, \nu}\) to the Thom space of a particular vector bundle. Recall [52] that the Thom space \(\mathcal{T}(E)\) of a real vector bundle \(E\) over a manifold is the quotient of the unit ball bundle \(\mathcal{B}(E)\) of \(E\) by the unit sphere bundle of \(E\) with respect to some Euclidean metric on \(E\). If the base manifold of the bundle \(E\) is compact, then the Thom space of \(E\) is the Alexandroff (one point) compactification of the total space of \(E\). As before, we denote by \(\operatorname{Gr}_{{\mathbb{F}}}(k,n)\) the Grassmannian of (non-oriented) \(k\)-dimensional subspaces in \({\mathbb{F}}^{n}\).

Given a vector bundle \(E\) denote by \(S^{2} E\) the symmetric tensor product of \(E\). Namely, \(S^{2} E\) is the vector bundle over the same base as \(E\); the fiber of \(S^{2} E\) over a point is equal to the symmetric tensor product with itself of the fiber of \(E\) over the same point. Choosing a Euclidean metric on \(E\) we can identify \(S^{2} E\) with the bundle whose fiber over a point is the space of all self-adjoint isomorphisms of the fiber of \(E\) over the same point. Then by \(S^{2}_{0} E\) we denote the bundle of traceless elements of \(S^{2} E\). Obviously

$$ S^{2} E\cong S^{2}_{0} E \oplus \theta ^{1}, $$
(6.1)

where \(\theta ^{1}\) is the trivial rank 1 bundle over the base of \(E\). Finally, let \(\operatorname{Taut}_{{\mathbb{F}}}(k,n)\) be the tautological bundle over the Grassmannian \(\operatorname{Gr}_{{\mathbb{F}}}(k,n)\): the fiber of this bundle over \(\Lambda \in \operatorname{Gr}_{{\mathbb{F}}}(k,n)\) is the vector space \(\Lambda \) itself.

Lemma 6.1

Recall the relative index \(i\) of the eigenvalue, equation (1.8), which in the present situation is equal to \(i=\nu -k+1\). Then the space

$$ \mathcal {R}_{k, \nu} := \{R\in \mathrm{Sym}^{+}_{\nu }\colon \operatorname{Tr}R=1, \dim \operatorname{Ker}R \geq k\} $$

with \(1\leq k < \nu \) is homotopy equivalent to the Thom space of the real vector bundle over the Grassmannian \(\operatorname{Gr}_{{\mathbb{F}}}(i-1,\nu -1)\),

$$ E_{i,\nu} := S^{2}_{0} \operatorname{Taut}_{{\mathbb{F}}}(i-1,\nu -1) \oplus \operatorname{Taut}_{{\mathbb{F}}}(i-1,\nu -1). $$
(6.2)

The rank of the bundle is \(s(i)-1\), where \(s(i)\) is given by (1.9). The bundle is non-orientable if \({\mathbb{F}}=\mathbb{R}\) and \(i\) is even, and orientable otherwise.

Remark 6.2

Let us consider the boundary case \(k=1\) or, equivalently, \(i=\nu \). The Grassmannian \(\operatorname{Gr}_{{\mathbb{F}}}(i-1,\nu -1)\) is a single point, so the vector bundle \(E_{\nu ,\nu}\) is simply a real vector space of dimension \(s(\nu )-1\). Its Thom space is the one-point compactification of \(\mathbb{R}^{s(\nu )-1}\), namely the sphere \(\mathbb{S}^{s(\nu )-1}\). Correspondingly, the cone \(\mathcal{C}\mathcal{R}_{1, \nu}\) is homotopy equivalent to the ball \(\mathbb{B}^{s(\nu )}\). Therefore, we get that the normal data at the bottom eigenvalue is homotopy equivalent to the pair

$$ (J,K) = \left (\mathbb{B}^{s(\nu )}, \partial \mathbb{B}^{s(\nu )} \right ). $$

Proof of Lemma 6.1

The homotopy equivalence has been established in [1, Theorem 1] and the proof thereof. For completeness we review the main steps here.

Fixing an arbitrary unit vector \(e \in {\mathbb{F}}^{\nu}\) we define

$$ \begin{aligned} \mathcal{P}_{k,\nu} &:= \left \{\frac{1}{\nu -k}P \in \mathrm{Sym}_{ \nu }\colon P^{2}=P,\ \dim \operatorname{Ker}P = k, \ e \in \operatorname{Ker}P \right \} \\ &\cong \operatorname{Gr}_{{ \mathbb{F}}}(\nu -k,\nu -1). \end{aligned} $$

One can show that \(\mathcal{R}_{k,\nu} \setminus \mathcal{P}_{k,\nu}\) is contractible: if \(P_{e} = ee^{*}\) is the projection onto \(e\), consider

$$ (A,t) \mapsto \phi _{k}\big( (1-t)A + tP_{e} \big), \qquad A \in \mathcal{R}_{k,\nu} \setminus \mathcal{P}_{k,\nu}, \quad t\in [0,1] $$
(6.3)

where \(\phi _{k}(M)\) acts on the eigenvalues of \(M\) as

$$ \lambda _{j}(M) \mapsto \max \big[0, \lambda _{j}(M)-\lambda _{k}(M) \big], $$
(6.4)

followed by a normalization to get unit trace. Using interlacing inequalitiesFootnote 19 for the rank one perturbation (up to rescaling) of \(A\) by \(P\), one can show that (6.3) is a well-defined retraction. In particular, (6.4) does not produce a zero matrix (which cannot be trace-normalized) and the result of (6.3) is not in \(\mathcal{P}_{k,\nu}\) for any \(t\).

We now obtain that \(\mathcal{R}_{k,\nu}\) is homotopy equivalent to the Thom space of the normal bundle of \(\mathcal{P}_{k,\nu}\) in \(\mathcal{R}_{k,\nu}\). Indeed, a tubular neighborhood \(T\) of \(\mathcal{P}_{k,\nu}\) in \(\mathcal{R}_{k,\nu}\) is diffeomorphic to the normal bundle of \(\mathcal{P}_{k,\nu}\), while the above retraction allows one to show \(\mathcal{R}_{k,\nu}\) is homotopy equivalent to \(\mathcal{R}_{k,\nu} \big/ \left (\mathcal{R}_{k,\nu} \setminus T \right )\).

The normal bundle of \(\mathcal{P}_{k,\nu}\) in \(\mathcal{R}_{k,\nu}\) is a Whitney sum of the normal bundle of \(\mathcal{P}_{k,\nu}\) in

$$ \widehat{\mathcal{P}}_{k,\nu} := \left \{\frac{1}{\nu -k}P \in \mathrm{Sym}_{\nu }\colon P^{2}=P,\ \dim \operatorname{Ker}P = k \right \}, $$

and the normal bundle of \(\widehat{\mathcal{P}}_{k,\nu}\) in \(\mathcal{R}_{k,\nu}\). The fiber in the former bundle is \((\operatorname{Ker}P)^{\perp}\): it consists of the directions in which \(e\) can rotate out of \(\operatorname{Ker}P\). Therefore the former bundle is \(\operatorname{Taut}_{{\mathbb{F}}}(\nu -k, \nu -1)\). The fiber in the normal bundle of \(\widehat{\mathcal{P}}_{k,\nu}\) in \(\mathcal{R}_{k,\nu}\) consists of all self-adjoint perturbations to the operator \(\frac{1}{\nu -k}P\) that preserve its kernel and unit trace. Identifying these with the space of traceless self-adjoint operators on \((\operatorname{Ker}P)^{\perp}\), we get \(S_{0}^{2} \operatorname{Taut}_{{\mathbb{F}}}(\nu -k, \nu -1)\). We get (6.2) after recalling that \(\nu -k=i-1\).

To calculate the rank we use

$$ \operatorname{rank}\big(\operatorname{Taut}_{{\mathbb{F}}}(i-1, \nu -1)\big) = \textstyle\begin{cases} i-1, & {\mathbb{F}}=\mathbb{R}, \\ 2(i-1) & {\mathbb{F}}=\mathbb{C}, \end{cases} $$

and

$$ \operatorname{rank}\big(S_{0}^{2}\operatorname{Taut}_{{\mathbb{F}}}(i-1, \nu -1) \big) = \textstyle\begin{cases} \frac{1}{2}(i-1)i - 1, & {\mathbb{F}}=\mathbb{R}, \\ (i-1)^{2}-1 & {\mathbb{F}}=\mathbb{C}, \end{cases} $$

giving \(\frac{1}{2}(i-1)(i+2)-1\) in total in the real case and \(i^{2}-2\) in the complex case.

Recall that a real vector bundle \(E\) is orientable if and only if its first Stiefel–Whitney class \(w_{1}(E) \in H^{1}(B, \mathbb{Z}_{2})\) vanishes (here \(B\) is the base of the bundle). The first Stiefel–Whitney class is additive with respect to the Whitney sum, therefore

$$ w_{1}(E_{i,\nu}) = w_{1}(S^{2}_{0} \mathcal{E}) + w_{1}(\mathcal{E}), \qquad \mathcal{E}= \operatorname{Taut}_{\mathbb{R}}(i-1, \nu -1). $$

Using additivity on equation (6.1) gives \(w_{1}(S^{2}_{0} \mathcal{E})=w_{1}(S^{2} \mathcal{E})\) because \(w_{1}\) is zero for the trivial bundle. The classical formulas for the Stiefel–Whitney classes of symmetric tensor power (see, for example, [32, Sect. 19.5.C, Theorem 3]) yield \(w_{1}(S^{2} \mathcal{E})=(\operatorname{rank}\mathcal{E}+ 1) w_{1}(\mathcal{E})\) and, finally,

$$ w_{1}(E_{i,\nu}) = (\operatorname{rank}\mathcal{E}+2) w_{1}(\mathcal{E}), \qquad \mathcal{E}= \operatorname{Taut}_{\mathbb{R}}(i-1, \nu -1). $$

Since the real tautological bundle ℰ is not orientable and has rank \(i-1\), \(w_{1}(E_{i,\nu})\) vanishes if and only if \(i+1\) is zero modulo 2, completing the proof of the lemma. □

Recall that the oriented Grassmannian \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(k,n)\) consisting of the oriented \(k\)-dimensional subspaces in \(\mathbb{R}^{n}\) is a double cover of \(\operatorname{Gr}_{\mathbb{R}}(k,n)\). Let \(\tau \) denote the orientation-reversing involution on \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(k,n)\). In the space of \(q\)-chains of \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(k,n)\) over the ring ℤ we distinguish the subspace of chains which are skew-symmetric with respect to \(\tau \): \(\tau (\alpha )=-\alpha \), where \(\alpha \) is a chain. The subspaces of skew-symmetric \(q\)-chains are invariant under the boundary operator and therefore define a complex. The homology groups of this complex will be denoted \(H_{q}\big(\operatorname{Gr}_{\mathbb{R}}(k,n); \widetilde{\mathbb{Z}})\). In the sequel we refer to them as twisted homologies, as they are homologies with local coefficients in the module of twisted integers \(\widetilde{\mathbb{Z}}\), i.e. ℤ considered as the module corresponding to the nontrivial action of \(\mathbb{Z}_{2}\) on ℤ.

Proof of Theorem 1.14, part (2)

In the case \(1 < i(x) \leq \nu (x)\), we start from equation (1.11),

$$\begin{aligned} H_{r}\Big(U_{x}^{+\varepsilon }(\lambda _{k}),\, U_{x}^{-\varepsilon }( \lambda _{k})\Big) &\cong \widetilde{H}_{r-\mu (x)}\Big(\mathcal{S} \mathcal{R}_{k,\nu}\Big) \cong \widetilde{H}_{r-\mu (x)}\Big(\Sigma \mathcal{R}_{k,\nu}\Big) \\ &\cong \widetilde{H}_{r-\mu (x)}\Big(\Sigma \big(\mathcal{T}\left (E_{i, \nu}\right )\big) \Big), \end{aligned}$$

where \(E_{i,\nu}\) is given by (6.2).

Recall [44, Cor. 16.1.6] that the reduced suspension of a Thom space of a vector bundle is homeomorphic to the Thom space of the Whitney sum of this bundle with the trivial rank 1 bundle \(\theta ^{1}\), i.e.

$$ \Sigma \big(\mathcal{T}\left (E_{i,\nu}\right )\big) \cong \mathcal{T}\left (\widehat{E}_{i,\nu}\right ), \qquad \widehat{E}_{i, \nu} := E_{i, \nu}\oplus \theta ^{1} $$
(6.5)

The bundle \(\widehat{E}_{i,\nu}\) is orientable if and only if \(E_{i,\nu}\) is orientable; its rank is one plus the rank of \(E_{i,\nu}\). Lemma 6.1 supplies both pieces of information.

The bundle \(\widehat{E}_{i,\nu}\) is orientable if \({\mathbb{F}}=\mathbb{C}\) or if \({\mathbb{F}}=\mathbb{R}\) and \(i\) is odd, and we can use the homological version of the Thom isomorphism theorem [52, Lemma 18.2], which gives

$$ \widetilde{H}_{r-\mu (x)}\left (\mathcal{T}(\widehat{E}_{i,\nu})\right ) = H_{r-\mu (x)-s(i)}\big(\operatorname{Gr}_{{\mathbb{F}}}(i-1,\nu -1) \big), $$

which is the right-hand side of (1.13) in the corresponding cases.

When \(\mathbb{F}=\mathbb{R}\) and \(i\) is even, the bundle \(\widehat{E}_{i, \nu}\) is nonorientable (1.13) results from the Thom isomorphism for nonorientable bundles [59, Theorem 3.10],Footnote 20

$$ \widetilde{H}_{r-\mu (x)}\left (\mathcal{T}(\widehat{E}_{i,\nu})\right ) = H_{r-\mu (x)-s(i)}\big(\operatorname{Gr}_{{\mathbb{F}}}(i-1,\nu -1); \widetilde {\mathbb{Z}}\big). $$
(6.6)

In the special case \(k=\nu \), not covered by Lemma 6.1, we compute directly using Lemma 5.1 and Remark 5.3,

$$\begin{aligned} H_{r}\Big(U^{\lambda ^{c}+\varepsilon }(\lambda _{\nu}),\, U^{ \lambda ^{c}-\varepsilon }(\lambda _{\nu})\Big) &\cong H_{r-\mu (x)} \Big(U^{\lambda ^{c}+\varepsilon }(\lambda _{\nu}|_{N}),\, U^{ \lambda ^{c}-\varepsilon }(\lambda _{\nu}|_{N})\Big) \\ &\cong H_{r-\mu (x)}\Big(\mathbb{B}^{s(\nu )}, \emptyset \Big) \cong H_{r- \mu (x)}\Big(\{x\}\Big) \\ &\cong H_{r-\mu (x)-s(i)}\Big(\operatorname{Gr}_{{\mathbb{F}}}(i-1, \nu -1)\Big), \end{aligned}$$

since \(i=1\), \(s(i)=0\) and \(\operatorname{Gr}_{{\mathbb{F}}}(0,\nu -1)\) is a single point. □

Remark 6.3

One can also derive (6.6), using Poincaré and Poincaré–Lefschetz dualities in their usual and skew form, mimicking the proof of [52, Lemma 18.2]. This alternative derivation is included as Appendix B.

7 Proof of part (1) of Theorem 1.12: criticality

\(\mathbb{F} = \mathbb{C}\). In the setting of Theorem 1.14, the Poincaré polynomials of the relative homology groups \(H_{*}\left (U^{\lambda _{k}(x)+\varepsilon }(\lambda _{k}), U^{ \lambda _{k}(x)-\varepsilon }(\lambda _{k})\right )\) is equal to

$$ t^{\mu (x)+ s(i)} P_{\operatorname{Gr}_{\mathbb{C}}(i-1, \nu -1)}(t). $$
(7.1)

Betti numbers for complex Grassmannians were established by Ehresmann, see [26, Theorem on p. 409, section II.7]. The \(r\)-th Betti number is zero if \(r\) is odd and is equal to the number of Young diagrams with \(r/2\) cells that fit inside the \(k\times (n-k)\) rectangle, if \(r\) is even. The Poincaré polynomial \(P_{\operatorname{Gr}_{\mathbb{C}}(k,n)}\) is nothing but the generating function for this restricted partition problem. The latter is well known to be of the form

$$ P_{\operatorname{Gr}_{\mathbb{C}}(k,n)}(t) = {\binom{n}{k}}_{t^{2}}, $$

see [5, Theorem 3.1, p. 33]. By (7.1) and (7.1), the Poincaré polynomials of the relative homology groups \(H_{*}\left (U^{\lambda _{k}(x)+\varepsilon }(\lambda _{k}), U^{ \lambda _{k}(x)-\varepsilon }(\lambda _{k})\right )\) does not vanish, so \(x\) is a critical points.

\(\mathbb{F} = \mathbb{R}\). The calculation of the Poincaré polynomials of integer homologies will be done in the next section, but it is equal to zero in some cases and so does not lead to the proof of criticality. Instead, we show that \(\mathbb{Z}_{2}\)-homology groups \(H_{*}\bigl(U_{x}^{+\varepsilon }(\lambda _{k}), U_{x}^{- \varepsilon }(\lambda _{k});\mathbb{Z}_{2})\) are nontrivial.

Let \(P_{Y, \mathbb{Z}_{2}}(t)\) be the Poincaré polynomial of \(\mathbb{Z}_{2}\)-homologies of a topological space \(Y\), i.e. the coefficient of \(t^{i}\) in \(P_{Y, \mathbb{Z}_{2}}(t)\) is equal to the number of copies of \(\mathbb{Z}_{2}\)s in \(H_{i}(Y, \mathbb{Z}_{2})\). The Poincaré polynomial of \(\mathbb{Z}_{2}\)-homologies of the Grassmannian \(\operatorname{Gr}_{\mathbb{R}}(k,n)\) is well known ([5, Theorem 3.1, p. 33], [52, §7]) to be

$$ P_{\operatorname{Gr}_{\mathbb{R}}(k,n), \mathbb{Z}_{2}}(t) = \binom{n}{k}_{t}. $$
(7.2)

Moreover, when the coefficients are \(\mathbb{Z}_{2}\), there is no difference between symmetric and skew-symmetric chains, therefore if \(P_{Y, \widetilde{\mathbb{Z}_{2}}}(t)\) is the Poincaré polynomial of the twisted \(\mathbb{Z}_{2}\)-homologies of Y, then \(P_{Y, \widetilde{\mathbb{Z}_{2}}}(t) = P_{Y, \mathbb{Z}_{2}}(t)\). Based on this and (7.2), in the setting of Theorem 1.14, the Poincaré polynomial of the relative \(\mathbb{Z}_{2}\)-homology groups \(H_{*}\left (U^{\lambda _{k}(x)+\varepsilon }(\lambda _{k}), U^{ \lambda _{k}(x)-\varepsilon }(\lambda _{k});\mathbb{Z}_{2} \right )\) is equal to

$$ t^{\mu (x)+ s(i)}\binom{\nu -1}{i-1}_{t}, $$
(7.3)

which is not zero. The proof of critically in the case of \(\mathbb{F}=\mathbb{R}\) is complete.

8 Proof of Theorem 1.12, part (2): computing the Poincaré polynomials

Theorem 1.12 will be obtained as a combination of the next two lemmas. Lemma 8.1 provides an expression for the Poincaré polynomial of twisted homologies \(H_{*}\big(\operatorname{Gr}_{\mathbb{R}}(i-1,\nu -1); \widetilde {\mathbb{Z}}\big)\) by relating it to the Poincaré polynomial of the oriented Grassmannian \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}\). Lemma 8.2 below collates known expressions for the Poincaré polynomials of Grassmannians and oriented Grassmannians.

Lemma 8.1

In the setting of Theorem 1.14, the Poincaré polynomials of the relative homology groups \(H_{*}\left (U^{\lambda _{k}(x)+\varepsilon }(\lambda _{k}), U^{ \lambda _{k}(x)-\varepsilon }(\lambda _{k})\right )\) is equal to

$$ t^{\mu (x)+ s(i)} \textstyle\begin{cases} P_{\operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1)}(t), & \textit{${\mathbb{F}}=\mathbb{R}$ and $i$ is odd}, \\ P_{\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1, \nu -1)}(t)- P_{ \operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1)}(t), & \textit{${\mathbb{F}}=\mathbb{R}$ and $i$ is even}, \\ P_{\operatorname{Gr}_{\mathbb{C}}(i-1, \nu -1)}(t), & \textit{${\mathbb{F}}=\mathbb{C}$}. \end{cases} $$
(8.1)

where \(P_{Y}(t)\) denotes the Poincaré polynomial of the manifold \(Y\).

Proof

Since we already established part (2) of Theorem 1.14, we only need to show that the Poincaré polynomial of the homology groups \(H_{*}(\operatorname{Gr}_{\mathbb{R}}(i-1,\nu -1); \widetilde {\mathbb{Z}})\) is equal to \(P_{\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1, \nu -1)}(t)- P_{ \operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1)}(t)\).

We will use homologies with coefficients in ℚ (or ℝ). Indeed, since the Betti numbers ignore the torsion part of \(H_{r}(\cdot ; \mathbb{Z})\), the Universal Coefficients Theorem (see, e.g. [38, Sect. 3.A]) implies they can be calculated as the rank of \(H_{r}(\cdot ; G)\) with any torsion-free abelian group \(G\). The benefit of using ℚ is that now any chain \(c\) in \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1,\nu -1)\) can be uniquely represented as a sum of a symmetric and a skew-symmetric chains with coefficients in ℚ,

$$ c=\cfrac{1}{2}\bigl(c+\tau (c)\bigr)+\cfrac{1}{2}\bigl(c-\tau (c) \bigr), $$

where \(\tau \) is the orientation reversing involution of \(\widetilde {\mathrm{Gr}}(i-1,\nu -1)\) (viewed as a double cover of \(\mathrm{Gr}(i-1,\nu -1)\)). The analogous statement is of course wrong in integer coefficients, as \(\cfrac{1}{2}\notin \mathbb{Z}\).

Since the boundary operator preserves the parity of a chain, the homology \(H_{r}\big(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1,\nu -1); \mathbb{Q}\big)\) decomposes into the direct sum of homologies of \(\tau \)-symmetric and \(\tau \)-skew-symmetric chains on \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1,\nu -1)\). The former homology coincides with the usual homology of \(\operatorname{Gr}_{\mathbb{R}}(i-1,\nu -1)\). The latter yields, by definition, the twisted ℚ-homology of \(\operatorname{Gr}_{\mathbb{R}}(i-1,\nu -1)\). To summarize, we obtain

$$ \begin{aligned}[t] &H_{r}(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(i-1,\nu -1); \mathbb{Q}) \\ &\qquad = H_{r}\big(\operatorname{Gr}_{\mathbb{R}}(i-1,\nu -1); \mathbb{Q}\big) \oplus H_{r}\big( \operatorname{Gr}_{\mathbb{R}}(i-1, \nu -1); \widetilde{\mathbb{Q}}\big). \end{aligned} $$
(8.2)

The sum in (8.2) translates into the sum of Poincaré polynomials, yielding the middle line in (8.1). □

Lemma 8.2

Let \(k,n \in \mathbb{N}\), \(k \leq n\). The Poincaré polynomials of the Grassmannians \(\operatorname{Gr}_{\mathbb{C}}(k,n)\), \(\operatorname{Gr}_{\mathbb{R}}(k,n)\) and \(\widetilde{\operatorname{Gr}}_{\mathbb{R}}(k,n)\) are given by

$$\begin{aligned} P_{\operatorname{Gr}_{\mathbb{C}}(k,n)}(t) &= {\binom{n}{k}}_{t^{2}}, \end{aligned}$$
(8.3)
$$\begin{aligned} P_{\operatorname{Gr}_{\mathbb{R}}(k,n)}(t) &= \textstyle\begin{cases} {\dbinom{\lfloor n/2\rfloor}{\lfloor k/2 \lfloor}}_{t^{4}}, & \textit{if } k(n-k) \textit{ is even}, \\ (1+t^{n-1})\dbinom{n/2-1}{(k-1)/2}_{t^{4}}, & \textit{if } k(n-k) \textit{ is odd}, \end{cases}\displaystyle \end{aligned}$$
(8.4)
$$\begin{aligned} P_{\widetilde {\operatorname{Gr}}_{\mathbb{R}}(k, n)}(t) &= \textstyle\begin{cases} (1+t^{n-k})\dbinom{(n-1)/2}{(k-1)/2}_{t^{4}} & \textit{if $k$ is odd, $n$ is odd}, \\ (1+t^{n-1})\dbinom{n/2-1}{(k-1)/2}_{t^{4}}, & \textit{if $k$ is odd, $n$ is even}, \\ \dfrac{(1+t^{k})(1+t^{n-k})}{1+t^{n}} \dbinom{n/2}{k/2}_{t^{4}}, & \textit{if $k$ is even, $n$ is even}. \end{cases}\displaystyle \end{aligned}$$
(8.5)

Remark 8.3

We will not need the Poincaré polynomial in the last line of (8.5) and we include it for completeness only. The case of even \(k\) and odd \(n\) is covered by the first line of (8.5) since \(\widetilde {\operatorname{Gr}}_{\mathbb{R}}(k, n) = \widetilde {\operatorname{Gr}}_{\mathbb{R}}(n-k, n)\).

Proof

The complex case was already discussed in the beginning of Sect. 7.

The \(r\)-th Betti number of the real Grassmannian has a similar combinatorial description [46, Theorem IV, p. 108]: it is equal to the number of Young diagrams of \(r\) cells that fit inside the \(k\times (n-k)\) rectangle and have even length differences for each pair of columns and for each pair of rows. From this it can be shown that the Poincaré polynomial \(P_{\operatorname{Gr}_{\mathbb{R}}(k,n)}\) satisfies (8.4) (see also [16, Theorem 5.1]). We remark that for \(k\) and \(n\) both even, (8.4) is a consequence of (8.3) because the corresponding Young diagrams must be made up from \(2\times 2\) squares.

Finally, the oriented Grassmannian is a homogeneous space, namely

$$ \widetilde {\operatorname{Gr}}(k,n)\cong SO(n)/\bigl( SO(k)\times SO(n-k) \bigr). $$

The corresponding Poincaré polynomial has been computed within the general theory of de Rham cohomologies of homogeneous spaces, see, for example, [36, Chap. XI]. Up to notation, the first line of (8.5) corresponds to [36, Lines 2–3, col. 3 of Table II on p. 494], the second line of (8.5) corresponds to [36, Lines 2–3, col. 1 of Table III on p. 496] and the third line of (8.5) corresponds to [36, Lines 2–3, col. 2 of Table III on p. 495]. □

Proof of Theorem 1.12, part (2)

We first establish equation (1.10). Using Lemma 8.1 as well as Lemma 8.2 with

$$ k:=i-1 \qquad \text{and}\qquad n:=\nu -1, $$
  1. (1)

    The first line of (1.10) is obtained directly from the first line of (8.1) and the first line of (8.4).

  2. (2)

    The second line of (1.10) is obtained from the second line of (8.1) by combining the second lines of (8.4) and (8.5).

  3. (3)

    The third line of (1.10) is obtained from the second line of (8.1) by combining the first lines of (8.4) and (8.5).

  4. (4)

    Finally, the last line of (1.10) is obtained directly from the last line of (8.1) and (8.3).

Finally, Morse inequalities (1.3) when \(M\) is compact are established by [33, § 45] using the tools identical to [51, § 5]. □

Proof of Corollary 2.4

A critical point is a point of local maximum if and only if the local Morse data is homotopy equivalent to \((\mathbb{B}^{d}, \partial \mathbb{B}^{d})\).

If \(x\) is a maximum, its contribution to the Poincaré polynomial is equal to \(t^{d}\), which occurs only in the cases described by Corollary 2.4.

To establish sufficiency, we compute the local Morse data at \(x\). If condition (1) is satisfied, the normal data at the point \(x\) has been computed in Remark 6.2, \((J,K) = \left (\mathbb{B}^{s(\nu )}, \partial \mathbb{B}^{s(\nu )} \right )\). From condition (2) we get the tangential data

$$ (P,Q) = \left (\mathbb{B}^{d-s(\nu )}, \partial \mathbb{B}^{d-s(\nu )} \right ). $$

By [35, Thm I.3.7], the local Morse data is then

$$ \bigl(P\times J, (P\times K) \cup (Q\times J)\bigr) \cong \left ( \mathbb{B}^{d}, \partial \mathbb{B}^{d} \right ), $$

implying the point is a maximum.

Similarly, a critical point is a point of local minimum if and only if the local Morse data is homotopy equivalent to \((\mathbb{B}^{d}, \emptyset )\). If \(x\) is a minimum, its contribution to the Poincaré polynomial is equal to 1, which occurs only in the cases described by Corollary 2.4.

Conversely, condition (1) implies the normal data is

$$ (J,K) = \left (\mathbb{B}^{s(\nu )}, \emptyset \right ), $$

see Remark 5.3. From condition (2), the tangential data is

$$ (P,Q) = \left (\mathbb{B}^{d-s(\nu )}, \emptyset \right ). $$

Combining these using [35, Thm I.3.7] gives the required result. □