On Minimal Subspaces in Tensor Representations

Falcó, Antonio; Hackbusch, Wolfgang

doi:10.1007/s10208-012-9136-6

On Minimal Subspaces in Tensor Representations

Published: 09 October 2012

Volume 12, pages 765–803, (2012)
Cite this article

Download PDF

Foundations of Computational Mathematics Aims and scope Submit manuscript

On Minimal Subspaces in Tensor Representations

Download PDF

Antonio Falcó¹ &
Wolfgang Hackbusch²

757 Accesses
31 Citations
Explore all metrics

Abstract

In this paper we introduce and develop the notion of minimal subspaces in the framework of algebraic and topological tensor product spaces. This mathematical structure arises in a natural way in the study of tensor representations. We use minimal subspaces to prove the existence of a best approximation, for any element in a Banach tensor space, by means of a tensor given in a typical representation format (Tucker, hierarchical, or tensor train). We show that this result holds in a tensor Banach space with a norm stronger than the injective norm and in an intersection of finitely many Banach tensor spaces satisfying some additional conditions. Examples using topological tensor products of standard Sobolev spaces are given.

Tree-based tensor formats

Article 10 October 2018

Antonio Falcó, Wolfgang Hackbusch & Anthony Nouy

The Set of Orthogonal Tensor Trains

Article 08 April 2022

Pardis Semnani & Elina Robeva

The minimal and maximal operator ideals associated to $$(n+1)$$ -tensor norms of Michor’s type

Article 16 February 2018

J. A. López Molina

1 Introduction

Recently, there has been an increased interest in numerical methods which make use of tensors. In particular, for high spatial dimensions one must take care that the numerical cost (in time and storage) is linear in the space dimension and does not increase exponentially. For three spatial dimensions, these methods can be applied with great success.

A first family of applications using tensor decompositions concerns the extraction of information from complex data. It has been used in many areas such as psychometrics [6, 26], chemometrics [2], analysis of turbulent flows [3], image analysis and pattern recognition [28], and data mining. Another family of applications concerns the compression of complex data (for storage or transmission), also introduced in many areas such as signal processing [19] or computer vision [30]. A survey of tensor decompositions in multilinear algebra and an overview of possible applications can be found in the review paper [18]. In these applications, the aim is to compress the information as much as possible or to extract a few modes representing some features to be analysed. The use of tensor product approximations is also of growing interest in numerical analysis for the solution of problems defined in high-dimensional tensor spaces, such as partial differential equations (PDEs) arising in stochastic calculus [1, 5, 11] (e.g., the Fokker-Planck equation), stochastic parametric PDEs arising in uncertainty quantification with spectral approaches [9, 22, 23], and quantum chemistry (cf., e.g., [29]). For details, we refer to [14].

Let d vector spaces V _j be given (assume, e.g., that $V_{j} =\mathbb{R}^{n_{j}}$). The generated tensor space is denoted by $\mathbf{V} ={ }_{a}\bigotimes_{j=1}^{d}V_{j}$, where ${}_{a}\bigotimes_{j=1}^{d}V_{j} = \operatorname{span} \{ \bigotimes_{j=1}^{d} v_{j}: v_{j} \in V_{j} \text{ and } 1 \le j \le d\ \} $ (assume, e.g., that ⨂ represents the Kronecker product). A typical representation format is the tensor subspace or Tucker format

$$ \mathbf{u}=\sum_{\mathbf{i}\in\mathbf{I}}\mathbf{a}_{\mathbf{i}}\bigotimes_{j=1}^{d}b_{i_{j}}^{(j)}, $$

(1.1)

where I=I ₁×⋯×I _d is a multi-index set with I _j={1,…,r _j}, r _j≤dim(V _j), $b_{i_{j}}^{(j)}\in V_{j}$ (i _j∈I _j) are basis vectors, and a _i∈ℝ. Here, i _j are the components of i=(i ₁,…,i _d). The data size is determined by the numbers r _j collected in the tuple r:=(r ₁,…,r _d). The set of all tensors representable by (1.1) with fixed r is

$$ \mathcal{T}_{\mathbf{r}}:=\left \{ \mathbf{v}\in\mathbf{V}:\begin{array}{l} \text{there are subspaces}\ U_{j}\subset V_{j}\ \text{such that}\\ \dim(U_{j})=r_{j}\text{ and }\mathbf{v}\in\mathbf{U}:={}_{a}\bigotimes_{j=1}^{d}U_{j} \end{array} \right \}. $$

(1.2)

Here, it is important that the description (1.1) with the vectors $b_{i}^{(j)}$ can be replaced by the generated subspace $U_{j}=\operatorname*{span}\{b_{i}^{(j)}:i\in I_{j}\}$. Note that $\mathcal{T}_{\mathbf{r}}$ is neither a subspace of V nor a convex set.

A question about minimal subspaces arises naturally from (1.2): Given a tensor v∈V, what are the subspaces U _j⊂V _j with minimal dimension r _j such that $\mathbf{v}\in\bigotimes_{j=1}^{d}U_{j}$?

Another natural question is the approximation of some v∈V by $\mathbf{u}\in\mathcal{T}_{\mathbf{r}}$ for a fixed r: Find $\mathbf{u}_{\mathrm{best}}\in\mathcal{T}_{\mathbf{r}}$ such that ∥v−u _best∥ equals

$$ \inf \bigl\{ \Vert \mathbf{v}-\mathbf{u}\Vert :\mathbf{u}\in \mathcal{T}_{\mathbf{r}} \bigr\} $$

(1.3)

for a suitable norm. In the finite-dimensional case, compactness arguments show the existence of a best approximation. In this paper we discuss this question in the infinite-dimensional case (i.e., dim(V _j)=∞, while still dim(U _j)=r _j<∞).

Here, one should note that tensors have properties which are unexpected compared with matrix theory. For instance, one can define another tensor format (r-term or canonical format) as follows. Fix an integer r∈ℕ₀ and set

$$\mathcal{R}_{r}:= \Biggl\{ \mathbf{v}=\sum _{i=1}^{r}\bigotimes_{j=1}^{d}u_{i}^{(j)}:u_{i}^{(j)}\in V_{j}\text{ for }1\leq i\leq r \Biggr\} . $$

For d=2, $\mathcal{R}_{r}$ corresponds to matrices of $\operatorname{rank}\leq r$. Seeking a solution of $\inf \{ \Vert \mathbf{v}-\mathbf{u}\Vert :\mathbf{u}\in\mathcal{R}_{r} \} $, one finds examples of v∈V even for finite-dimensional V, but d≥3, such that there is no minimiser $\mathbf{u}_{\mathrm{best}}\in\mathcal{R}_{r}$ (cf. [7]).

There are other formats with even better properties than (1.2) (cf. [15, 24]), which are again related to subspaces. In these cases, further subspaces like, e.g., U ₁₂⊂U ₁⊗U ₂ appear. The representation using the hierarchical format from [15] uses subspaces U ₁₂ with dimension not exceeding a given bound. For these formats, the results of this paper also apply, e.g., they ensure the existence of best approximations.

There are practical reasons for the interest in the existence of a best approximation. Truncation of a tensor v to a certain format tries to minimise ∥v−u∥. If a best approximation does not exist, one has to expect a numerical instability as ∥v−u∥ approaches the infimum. Even if V is finite dimensional, it is often a discrete version of a function space. If the infinite-dimensional function space allows a best approximation, one can expect uniform stability (i.e., independent of the discretisation parameters).

The hierarchical format of [15] is connected with a certain dimension partition tree. In particular, the approach in [24] using a linear tree corresponding to the matrix product systems is applied in quantum chemistry (cf. [29]). There are approaches using a general graph structure (cf. [17]); however, as soon as loops are contained in a graph, the parameters of its representation cannot be described by dimensions of certain subspaces, and the results of this paper do not apply.

In the sequel, we define minimal subspaces $U_{j}^{\min}(\mathbf{v})$ for algebraic tensors $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}$ (cf. Theorem 2.17) as well as for topological tensors $\mathbf{v}\in{}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}V_{j}$ (cf. Definition 3.11). The main result is given in Theorem 3.15, where we show that for weakly convergent sequences v _n⇀v (see Definition 3.12), the dimension of the limiting minimal subspace is bounded by

$$\dim U_{j}^{\min}(\mathbf{v})\leq\liminf_{n\rightarrow\infty}\dim U_{j}^{\min }(\mathbf{v}_{n})\quad\text{for all }1 \leq j\leq d. $$

This is the key property which allows us to derive the desired properties.

Finally, we discuss the nature of the closed subspace ${}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}\overline{U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j}}$. In the algebraic case, we have by definition that $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v})$. This property does not seem obvious for a general topological tensor $\mathbf{v}\in{}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}V_{j}$, but we give sufficient conditions for this property. In particular, it holds for Hilbert tensor spaces.

The paper is organised as follows. In Sect. 2, we introduce the concept of minimal subspaces of an algebraic tensor and describe a characterisation. In Sect. 3, minimal subspaces are defined and characterised for Banach tensor spaces. Finally, Sect. 4 is devoted to the proof of the existence of best approximation tensors in $\mathcal{T}_{\mathbf{r}}$ in a Banach tensor space.

2 Minimal Subspaces in an Algebraic Tensor Space

In the following, X is a Banach space with norm ∥⋅∥=∥⋅∥_X. While X′ denotes the algebraic dual, X ^∗ is the dual space of functionals with bounded dual norm $\Vert \cdot \Vert ^{\ast}=\Vert \cdot \Vert _{X^{\ast}}$:

$$ \Vert \varphi \Vert _{X^{\ast}}=\sup \bigl\{ \bigl \vert \varphi(x)\bigr \vert :x\in X\text{ with }\Vert x\Vert _{X}\leq1 \bigr\} =\sup \bigl\{ \bigl \vert \varphi(x)\bigr \vert /\Vert x \Vert _{X}:0\neq x\in X \bigr\} . $$

(2.1)

This implies that we recover the ∥⋅∥_X norm from the dual norm via

$$ \Vert x\Vert _{X}=\max \bigl\{ \bigl \vert \varphi(x)\bigr \vert :\Vert \varphi \Vert _{X^{\ast}}=1 \bigr\} =\max \bigl\{ \bigl \vert \varphi(x)\bigr \vert /\Vert \varphi \Vert _{X^{\ast}}:0\neq \varphi\in X^{\ast} \bigr\} . $$

(2.2)

By $\mathcal{L}(X,Y)$ we denote the space of continuous linear mapping from X into Y. The corresponding operator norm is written as ∥⋅∥_Y←X. $\mathcal{L}(X,Y)$ is a subspace of the space L(X,Y) of all linear mappings (without topology).

Remark 2.1

Let {x _ν∈X:1≤ν≤n} be linearly independent. Then there are functionals φ _ν∈X ^∗ such that φ _ν(x _μ)=δ _νμ. The functionals $( \varphi_{\nu} )_{\nu=1}^{n}$ are called dual to $( x_{\nu} )_{\nu=1}^{n}$.

The following result is known as the Lemma of Auerbach and is proved, e.g., in Meise–Vogt [21, Lemma 10.5].

Lemma 2.2

For any n-dimensional subspace of a Banach space X, there exists a basis {x _ν:1≤ν≤n} and a corresponding dual basis {φ _ν:1≤ν≤n}⊂X ^∗ such that ∥x _ν∥=∥φ _ν∥^∗=1 (1≤ν≤n).

2.1 Algebraic Tensor Spaces

2.1.1 Definitions and Elementary Facts

Concerning the definition of the algebraic tensor space ${}_{a}\bigotimes_{j=1}^{d}V_{j}$ generated from vector spaces V _j (1≤j≤d), we refer to Greub [12]. As the underlying field we choose ℝ, but the results hold also for ℂ. The suffix ‘a’ in ${}_{a}\bigotimes_{j=1}^{d}V_{j}$ refers to the ‘algebraic’ nature. By definition, all elements of

$$\mathbf{V}:={}_{a}\bigotimes_{j=1}^{d}V_{j} $$

are finite linear combinations of elementary tensors $\mathbf{v}=\bigotimes_{j=1}^{d}v_{j}$ (v _j∈V _j). In Sect. 3, we shall discuss the Banach space obtained as the completion of ${}_{a}\bigotimes_{j=1}^{d}V_{j}$.

Consider a tensor product $\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j}$ of vector spaces and a fixed tensor v∈V. Among the subspaces U _j⊂V _j with

$$ \mathbf{v}\in\mathbf{U}:={}_{a}\bigotimes _{j=1}^{d}U_{j} $$

(2.3)

we are looking for the smallest ones. We have to show that a minimal subspace U _j exists and that these minimal subspaces can be obtained simultaneously in (2.3) for all 1≤j≤d. We approach the problem in Sect. 2.2.1 for the matrix case d=2. In Sect. 3.1 we replace the tensor product of vector spaces by a tensor product of Banach spaces. The interesting question is how these minimal subspaces behave as a function of v.

An obvious advantage of the formulation (2.3) is the fact that the U _j are of finite dimension even if dim(V _j)=∞, as stated below.

Lemma 2.3

For $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}$ there are always finite-dimensional subspaces U _j⊂V _j satisfying (2.3).

Proof

By definition of the algebraic tensor space, $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}$ means that there is a finite linear combination

$$\mathbf{v}=\sum_{\nu=1}^{n}\bigotimes _{j=1}^{d}v_{j}^{(\nu)}$$

for some n∈ℕ₀ and $v_{j}^{(\nu)}\in V_{j}$. Define

$$U_{j}:=\operatorname*{span}\bigl\{v_{j}^{(\nu)}:1 \leq\nu\leq n\bigr\}\quad\text{for}\ 1\leq j\leq d. $$

Then $\mathbf{v}\in\mathbf{U}:={}_{a}\bigotimes_{j=1}^{d}U_{j}$ proves (2.3) with subspaces of dimension dim(U _j)≤n. □

The following well-known result is formulated for d=2.

Lemma 2.4

For any tensor v∈V⊗_a W there is an r∈ℕ₀ and a representation

$$ \mathbf{v}=\sum_{i=1}^{r}v_{i} \otimes w_{i} $$

(2.4)

with linearly independent vectors {v _i:1≤i≤r}⊂V and {w _i:1≤i≤r}⊂W.

Proof

Take any representation $\mathbf{v}=\sum_{i=1}^{n}v_{i}\otimes w_{i}$. If, e.g., the {v _i:1≤i≤n} are not linearly independent, one v _i can be expressed by the others. Without loss of generality assume $v_{n}=\sum_{i=1}^{n-1}\alpha_{i}v_{i}$. Then

$$v_{n}\otimes w_{n}= \Biggl( \sum _{i=1}^{n-1}\alpha_{i}v_{i} \Biggr) \otimes w_{n}=\sum_{i=1}^{n-1}v_{i} \otimes ( \alpha_{i}w_{n} ) $$

shows that x possesses a representation with only n−1 terms:

$$\mathbf{v}= \Biggl( \sum_{i=1}^{n-1}v_{i} \otimes w_{i} \Biggr) +v_{n}\otimes w_{n}=\sum _{i=1}^{n-1}v_{i}\otimes w_{i}^{\prime}\quad \text{with}\ w_{i}^{\prime}:=w_{i}+\alpha_{i}w_{n}. $$

Since each reduction step decreases the number of terms by one, this process terminates; i.e., we obtain a representation with linearly independent v _i and w _i. □

In accordance with the usual matrix rank we introduce the following definition.

Definition 2.5

The number r appearing in Lemma 2.4 will be called the rank of the tensor v and denoted by $\operatorname{rank}(\mathbf{v})$.

The following notation and definitions will be useful. We recall that L(V,W) is the space of linear maps from V into W, while V′=L(V,ℝ) is the algebraic dual. For metric spaces, $\mathcal{L}(V,W)$ denotes the continuous linear maps, while V ^∗ is the topological dual.

Let $\mathcal{I}:=\{1,\ldots,d\}$ be the index set of the ‘spatial directions’. In the sequel, the index sets $\mathcal{I}\backslash\{j\}$ will appear. Here, we use the abbreviations

(2.5a)

(2.5b)

Similarly, elementary tensors ⨂_k≠j v ^(j) are denoted by v _[j].

For vector spaces V _j and W _j over ℝ, let linear mappings A _j:V _j→W _j (1≤j≤d) be given. Then the definition of the elementary tensor

$$\mathbf{A}=\bigotimes_{j=1}^{d}A_{j}: \;\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j} \longrightarrow\mathbf{W}={}_{a}\bigotimes _{j=1}^{d}W_{j}$$

is given by

$$ \mathbf{A} \Biggl( \bigotimes_{j=1}^{d}v^{(j)} \Biggr) :=\bigotimes_{j=1}^{d} \bigl( A_{j}v^{(j)} \bigr) . $$

(2.6)

Note that (2.6) extends uniquely to a linear mapping A:V→W.

Remark 2.6

(a)
Let $\mathbf{V}:={}_{a}\bigotimes_{j=1}^{d}V_{j}$ and $\mathbf{W}:= {}_{a}\bigotimes_{j=1}^{d}W_{j}$. Then the linear combinations of tensor products of linear mappings $\mathbf{A}=\bigotimes_{j=1}^{d}A_{j}$ defined by means of (2.6) form a subspace of L(V,W):
$${}_{a}\bigotimes_{j=1}^{d}L(V_{j},W_{j}) \subset L(\mathbf{V},\mathbf{W}). $$
(b)
The special case of W _j=ℝ for all j (implying W=ℝ) reads as ${}_{a\!}\bigotimes_{j=1}^{d}V_{j}^{\prime}\subset\mathbf{V}^{\prime}$.
(c)
If dim(V _j)<∞ and dim(W _j)<∞ for all j, the inclusion ‘⊂’ in (a) and (b) may be replaced by ‘=’. This can be easily verified by just checking the dimensions of the spaces involved.

Often, mappings $\mathbf{A}=\bigotimes_{j=1}^{d}A_{j}$ will appear, where most of the A _k are the identity (and therefore V _k=W _k). If A _j∈L(V _j,W _j) for one j, we use the following notation:

$$ \mathbf{id}_{[j]}\otimes A_{j}:=\underset{j-1\text{ factors}}{\underbrace {\mathit{id}\otimes\cdots\otimes \mathit{id}}}\otimes A_{j} \otimes\underset{d-j\text{ factors}}{\underbrace{\mathit{id}\otimes\cdots \otimes \mathit{id}}}\in L(\mathbf{V},\mathbf{V}_{[j]} \otimes_{a}W_{j}), $$

(2.7a)

provided that it is obvious what component j is meant. By the multiplication rule $( \bigotimes_{j=1}^{d}A_{j} ) \circ ( \bigotimes_{j=1}^{d}B_{j} ) =\bigotimes_{j=1}^{d} ( A_{j}\circ B_{j} ) $ and since id∘A _j=A _j∘id, the following identity^{Footnote 1} holds for j≠k:

$$ \begin{aligned}[b] &\mathit{id}\otimes\cdots\otimes \mathit{id}\otimes A_{j}\otimes \mathit{id}\otimes\cdots\otimes \mathit{id}\otimes A_{k}\otimes \mathit{id}\otimes\cdots\otimes \mathit{id}\\ &\quad =(\mathbf{id}_{[j]}\otimes A_{j})\circ(\mathbf{id}_{[k]}\otimes A_{k})\\ &\quad =(\mathbf{id}_{[k]}\otimes A_{k})\circ(\mathbf{id}_{[j]}\otimes A_{j}) \end{aligned} $$

(2.7b)

(in the first line we assume j<k). Proceeding inductively with this argument over all indices, we obtain

$$\mathbf{A}=\bigotimes_{j=1}^{d}A_{j}=( \mathbf{id}_{[1]}\otimes A_{1})\circ\cdots\circ( \mathbf{id}_{[d]}\otimes A_{d}). $$

If W _j=ℝ, i.e., if $A_{j}=\varphi_{j}\in V_{j}^{\prime}$ is a linear form, then id _[j]⊗φ _j∈L(V,V _[j]) is used to denote id⊗⋯⊗id⊗φ _j⊗id⊗⋯⊗id defined by

$$ (\mathbf{id}_{[j]}\otimes\varphi_{j}) \Biggl( \bigotimes _{k=1}^{d}v^{(k)} \Biggr) =\varphi_{j}\bigl(v^{(j)}\bigr)\cdot\bigotimes _{k\neq j}v^{(k)}. $$

(2.7c)

Thus, if $\boldsymbol{\varphi}=\otimes_{j=1}^{d}\varphi_{j}\in\bigotimes_{j=1}^{d}V_{j}^{\prime}$, we can also write

$$ \boldsymbol{\varphi}=\otimes_{j=1}^{d}\varphi_{j}=( \mathbf{id}_{[1]}\otimes\varphi_{1})\circ\cdots \circ(\mathbf{id}_{[d]}\otimes\varphi_{d}). $$

(2.7d)

Consider again the splitting of $\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j}$ into V=V _j⊗_a V _[j] with V _[j]:=_a⨂_k≠j V _k. For a linear form $\boldsymbol{\varphi}_{[j]}\in\mathbf{V}_{[j]}^{\prime}$, the notation id _j⊗φ _[j]∈L(V,V _j) is used for the mapping

$$ (\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}) \Biggl( \bigotimes_{k=1}^{d}v^{(k)} \Biggr) =\boldsymbol{\varphi}_{[j]}\biggl(\bigotimes _{k\neq j}v^{(k)}\biggr)\cdot v^{(j)}. $$

(2.7e)

If $\boldsymbol{\varphi}_{[j]}=\bigotimes_{k\neq j}\varphi_{k}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\prime}$ is an elementary tensor,^{Footnote 2} $\boldsymbol{\varphi}_{[j]} ( \bigotimes_{k=1}^{d}v^{(k)} ) =\prod_{k\neq j}\varphi_{k} ( v^{(k)} ) $ holds in (2.7e). Finally, we can write (2.7d) as

$$ \boldsymbol{\varphi}=\otimes_{j=1}^{d}\varphi_{j}= \varphi_{j}\circ (\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]})\quad\text{for}\ 1\leq j\leq d. $$

(2.7f)

2.1.2 Matricisation

Definition 2.7

For $j\in\mathcal{I}=\{1,\ldots,d\}$, the map $\mathcal{M}_{j}$ is defined as the isomorphism

$$\begin{aligned} \mathcal{M}_{j}{:} \quad & {}_{a}\bigotimes_{k\neq j}V_{k} \rightarrow V_{j}\otimes_{a}V_{[j]} \\ &\bigotimes_{k\neq j}v^{(k)} \mapsto v^{(j)}\otimes\mathbf{v}_{[j]}\quad \mbox{with}\ \mathbf{v}_{[j]}:= \bigotimes_{k\neq j}v^{(k)}. \end{aligned} $$

In the finite-dimensional case of $V_{k}=\mathbb{R}^{n_{k}}$, the tensor space V _j⊗_a V _[j] of order 2 may be considered as a matrix from $\mathbb{R}^{n_{j}\times n_{[j]}}$, where n _[j]=∏_k≠j n _k. Then, $\mathcal{M}_{j}$ maps a tensor entry v[i ₁,…,i _j,…,i _d] into the matrix entry $( \mathcal{M}_{j}(\mathbf{v}) ) [i_{j}, ( i_{1},\ldots,\allowbreak i_{j-1},i_{j+1},\ldots,i_{d} ) ]$. As long as we do not consider matrix properties which depend on the ordering of the index set, we need not introduce an ordering of the (d−1)-tuple (i ₁,…,i _j−1,i _j+1,…,i _d).

Example 2.8

Consider a tensor $\mathbf{v}=\sum_{i=1}^{3}\sum_{j=1}^{2}\sum_{k=1}^{3}a_{ijk}v_{i}\otimes w_{j}\otimes v_{k}\in\mathbb{R}^{3}\otimes \mathbb{R}^{2}\otimes\mathbb{R}^{3}$, where {v ₁,v ₂,v ₃} is a basis of ℝ³ and {w ₁,w ₂} a basis of ℝ². Then $\mathcal{M}_{2}(v_{i}\otimes w_{j}\otimes v_{k}) =w_{j}\otimes(v_{i}\otimes v_{k})\in\mathbb{R}^{2}\otimes\mathbb{R}^{9}$. The lexicographical ordering of (i,k) leads to the matrix

$$\mathcal{M}_{2}(\mathbf{v})=\left ( \begin{array}{c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c@{\quad}c}a_{1\mathbf{1}1} & a_{2\mathbf{1}1} & a_{3\mathbf{1}1} & a_{1\mathbf{1}2} & a_{2\mathbf{1}2} & a_{3\mathbf{1}2} & a_{1\mathbf{1}3} & a_{2\mathbf{1}3} & a_{3\mathbf{1}3}\\ a_{1\mathbf{2}1} & a_{2\mathbf{2}1} & a_{3\mathbf{2}1} & a_{1\mathbf{2}2} & a_{2\mathbf{2}2} & a_{3\mathbf{2}2} & a_{1\mathbf{2}3} & a_{2\mathbf{2}3} & a_{3\mathbf{2}3}\end{array} \right ) . $$

Next, we restrict the considerations to finite-dimensional V _k. Since tensor products of two vectors can be interpreted as matrices, the mapping $\mathcal{M}_{j}$ is named ‘matricisation’ (or ‘unfolding’). The interpretation of tensors v as matrices enables us to transfer the matrix terminology to v. In particular, we may define the rank of $\mathcal{M}_{j}(\mathbf{v})$ as a property of v.

Definition 2.9

Let dim(V _k)<∞ $( k\in\mathcal{I} ) $. For all $j\in\mathcal{I}$ we define

$$ \operatorname{rank}_{j}(\mathbf{v}):=\operatorname{rank}\bigl( \mathcal{M}_{j}(\mathbf{v})\bigr). $$

(2.8)

Hitchcock [16, p. 170] (1927) introduced $\operatorname{rank}_{j}(\mathbf{v})$ as ‘the rank on the jth index’. For infinite-dimensional vector spaces V _j, the generalisation is given by $\operatorname{rank}_{j}(\mathbf{v}):=\dim U_{j}^{\min}(\mathbf{v})$, where the minimal subspaces $U_{j}^{\min}(\mathbf{v})$ will be defined in Sect. 2.2.

The next result extends Lemma 2.4.

Lemma 2.10

Assume that $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}$ with $\operatorname{rank}(\mathbf{v})=r$ and $\mathbf{v}=\sum_{\nu=1}^{r}\bigotimes_{j=1}^{d}v_{\nu}^{(j)}$. Then for each 1≤j≤d, the elementary tensors

$$\mathbf{v}_{\nu}^{[j]}:=\bigotimes _{k\neq j}v_{\nu}^{(k)}\in{}_{a} \bigotimes_{k\neq j}V_{k} \quad ( 1\leq\nu\leq r ) $$

are linearly independent in _a⨂_k≠j V _k.

Proof

Consider, without loss of generality, the case j=1. If the tensors $\{\mathbf{v}_{\nu}^{[1]}:1\leq\nu\leq r\}$ are linearly dependent, we also may assume, without loss of generality, that $\mathbf{v}_{r}^{[1]}$ may be expressed as $\mathbf{v}_{r}^{[1]}=\sum_{\nu=1}^{r-1}\beta_{\nu}\mathbf{v}_{\nu}^{[1]}$. Then

$$\mathbf{v}=\sum_{\nu=1}^{r-1}v_{\nu}^{(1)} \otimes\mathbf{v}_{\nu}^{[1]}+v_{r}^{(1)} \otimes\mathbf{v}_{r}^{[1]}=\sum_{\nu=1}^{r-1} \bigl( v_{\nu }^{(1)}+\beta_{\nu}v_{r}^{(1)} \bigr) \otimes\mathbf{v}_{\nu}^{[1]}$$

implies that $\operatorname{rank}(\mathbf{v})<r$ in contradiction to the minimality of r. □

2.2 Minimal Subspaces

2.2.1 Case d=2

The matrix case d=2 will serve as the start of an induction. To ensure the existence of minimal subspaces U ₁,U ₂ with v∈U ₁⊗U ₂, we need a lattice structure, which is the subject of the next lemma.

Lemma 2.11

Assume that X _i and Y _i are subspaces of V _i for i=1,2. Then

$$(X_{1}\otimes_{a}X_{2})\cap(Y_{1} \otimes_{a}Y_{2})= ( X_{1}\cap Y_{1} ) \otimes_{a} ( X_{2}\cap Y_{2} ) . $$

Proof

It is clear that (X ₁∩Y ₁)⊗_a(X ₂∩Y ₂)⊂(X ₁⊗_a X ₂)∩(Y ₁⊗_a Y ₂). It remains to show that v∈X ₁⊗_a X ₂ and v∈Y ₁⊗_a Y ₂ imply that v∈(X ₁∩Y ₁)⊗_a(X ₂∩Y ₂). By assumption, v has the two representations

$$\mathbf{v}=\sum_{\nu=1}^{n_{x}}x_{1}^{(\nu)} \otimes x_{2}^{(\nu)}=\sum_{\nu =1}^{n_{y}}y_{1}^{(\nu)} \otimes y_{2}^{(\nu)}\quad\text{with}\ x_{i}^{(\nu )} \in X_{i},\ y_{i}^{(\nu)}\in Y_{i}. $$

From Lemma 2.4, we may assume that $\{x_{1}^{(\nu )}\}$, $\{x_{2}^{(\nu)}\}$, $\{y_{1}^{(\nu)}\}$, $\{y_{2}^{(\nu)}\}$ are linearly independent. The dual functionals $\xi_{2}^{(\nu)}\in X_{2}$ of $\{x_{2}^{(\nu)}\}$ satisfy $\xi_{2}^{(\nu)}(x_{2}^{(\mu)})=\delta_{\nu\mu}$ (cf. Remark 2.1). Application of $\mathit{id}_{1}\otimes\xi_{2}^{(\mu)}$ to the first representation yields $(\mathit{id}_{1}\otimes\xi_{2}^{(\mu )}) (\mathbf{v})=x_{1}^{(\mu)}$, while the second representation leads to $\sum_{\nu=1}^{n_{y}}\xi_{\mu}^{(2)}(y_{2}^{(\nu)}) y_{1}^{(\nu)}$. The resulting equation $x_{1}^{(\mu)}=\sum_{\nu=1}^{n_{y}}\xi_{2}^{(\mu)} (y_{2}^{(\nu)}) y_{1}^{(\nu)}$ shows that $x_{1}^{(\nu)}\in Y_{1}$. Using the dual functionals $\xi_{1}^{(\mu)}$ of $\{x_{1}^{(\nu)}\}$ and applying $\xi_{1}^{(\mu)}\otimes \mathit{id}_{2}$ to v prove $x_{2}^{(\nu)}\in Y_{2}$. Hence $x_{i}^{(\nu)}\in X_{i}\cap Y_{i}$ is shown, i.e., v∈(X ₁∩Y ₁)⊗_a(X ₂∩Y ₂). □

Definition 2.12

For a tensor v∈V ₁⊗_a V ₂, the minimal subspaces are denoted by $U_{j}^{\min}(\mathbf{v})$ (j=1,2) defined by the property that v∈U ₁⊗_a U ₂ implies $U_{i}^{\min}(\mathbf{v})\subset U_{j}$ (j=1,2), while $\mathbf{v}\in U_{1}^{\min}(\mathbf{v})\otimes_{a}U_{2}^{\min}(\mathbf{v})$.

For each v, we introduce the family $\mathcal{F}(\mathbf{v})$ as the set of pairs (U ₁,U ₂) of subspaces with the property v∈U ₁⊗_a U ₂⊂V ₁⊗_a V ₂. By Lemma 2.11, we can write

$$\bigcap_{ ( U_{1},U_{2} ) \in\mathcal{F}(\mathbf{v})}U_{1}\otimes_{a}U_{2}=\underset{U_{1}^{\min}( \mathbf{v})}{\underbrace{\biggl(\bigcap_{ ( U_{1},U_{2} ) \in\mathcal{F}(\mathbf{v})}U_{1}\biggr) }}\otimes_{a} \underset{U_{2}^{\min}(\mathbf{v})}{\underbrace{ \biggl( \bigcap _{ ( U_{1},U_{2} ) \in\mathcal{F}(\mathbf{v})}U_{2} \biggr) }}. $$

Hereby, the existence and uniqueness of minimal subspaces $U_{j}^{\min}(\mathbf{v})$ are guaranteed.

Lemma 2.13

Assume that $\mathbf{v}=\sum_{\nu=1}^{r}v_{1}^{(\nu)}\otimes w_{2}^{(\nu)}$ with linearly independent $\{v_{1}^{(\nu )}:1\leq\nu\leq r\}$ and $\{w_{2}^{(\nu)}:1\leq\nu\leq r\}$. Then these vectors span the minimal spaces:

$$U_{1}^{\min}(\mathbf{v})=\operatorname*{span} \bigl\{ v_{1}^{(\nu)}:1\leq \nu\leq r \bigr\} \quad\text{\textit{and}}\quad U_{2}^{\min}(\mathbf{v})=\operatorname*{span} \bigl \{ w_{2}^{(\nu)}:1\leq\nu\leq r \bigr\} . $$

Proof

Apply the proof of Lemma 2.11 to $X_{1}=\operatorname*{span}\{v_{2}^{(\nu)}:1\leq\nu\leq r\}$, $X_{2} =\operatorname*{span}\{w_{2}^{(\nu)}:1\leq\nu\leq r\}$ and $Y_{j}=U_{j}^{\min }(\mathbf{v})$. It shows that $X_{j}\subset U_{j}^{\min}(\mathbf{v})$. Since a strict inclusion is excluded, $X_{j}=U_{j}^{\min}(\mathbf{v})$ proves the assertion. □

Proposition 2.14

Let v∈V ₁⊗_a V ₂. Then the minimal subspaces $U_{1}^{\min}(\mathbf{v})$ and $U_{2}^{\min}(\mathbf{v})$ are characterised by

(2.9a)

(2.9b)

Proof

We use the characterisation from Lemma 2.13, $\mathbf{v}=\sum_{\nu=1}^{r}v_{1}^{(\nu)}\otimes w_{2}^{(\nu)}$ with linearly independent $\{v_{1}^{(\nu)}:1\leq\nu\leq r\}$ and $\{w_{2}^{(\nu)}:1\leq \nu\leq r\}$ spanning the minimal subspaces. Then for each $\varphi_{2}\in V_{2}^{\prime}$ we have

$$(\mathit{id}_{1}\otimes\varphi_{2}) (\mathbf{v})=\sum _{\mu=1}^{r}\varphi_{2}\bigl(w_{2}^{(\mu)} \bigr)v_{1}^{(\mu)}\in U_{1}^{\min}( \mathbf{v}). $$

From the proof of Lemma 2.11, there are mappings (id ₁⊗φ ₂) yielding $v_{1}^{(\mu)}$ for any 1≤μ≤r; thus,

$$\bigl\{ ( \mathit{id}_{1}\otimes\varphi_{2} ) ( \mathbf{v} ) : \varphi_{2}\in V_{2}^{\prime} \bigr\} =U_{1}^{\min}(\mathbf{v}). $$

Analogously, $\{ ( \varphi_{1}\otimes \mathit{id}_{2} ) ( \mathbf{v} ) :\varphi_{1}\in V_{1}^{\prime} \} =U_{2}^{\min }(\mathbf{v})$ is shown, proving (2.9a) and (2.9b). □

For $V_{1}=\mathbb{R}^{n_{1}}$ and $V_{2}=\mathbb{R}^{n_{2}}$, when V ₁⊗_a V ₂ is isomorphic to matrices from $\mathbb{R}^{n_{1}\times n_{2}}$, definition (2.9a) may be interpreted as $U_{1}^{\min}(\mathbf{v})=\operatorname{Col}\mathcal{M}_{1}(\mathbf{v})= \operatorname{span}\{\mathcal{M}_{1}(\mathbf{v})x:x\in V_{2}\}$ ($\mathcal{M}_{1}(\mathbf{v})$ is the matrix corresponding to $\mathbf{v,}$ cf. Definition 2.7). Similarly, (2.9b) becomes $U_{1}^{\min}(\mathbf{v})=\operatorname{Col}\mathcal{M}_{1}(\mathbf{v})^{T}=\operatorname{Col}\mathcal{M}_{2}(\mathbf{v})$.

Corollary 2.15

The following statements hold.

(a)
Once $U_{1}^{\min}(\mathbf{v})$ and $U_{2}^{\min}(\mathbf{v})$ are given, one may select any basis {v ^(ν):1≤ν≤r} of $U_{1}^{\min}(\mathbf{v})$ [respectively, {w ^(ν):1≤ν≤r} of $U_{2}^{\min}(\mathbf{v})$] and find a representation (2.4) with these v ^(ν) [respectively, w ^(ν)] and some other basis of $U_{2}^{\min}(\mathbf{v})$ [respectively, $U_{1}^{\min}(\mathbf{v})$]. Otherwise, if {v ^(ν):1≤ν≤r} is a basis of a subspace $U_{1}\supsetneqq U_{1}^{\min}(\mathbf{v})$ [respectively, {w ^(ν):1≤ν≤r} of $U_{2}\supsetneqq U_{2}^{\min}(\mathbf{v})$], a representation (2.4) still exists, but the v ^(ν) [respectively, w ^(ν)] are linearly dependent.
(b)
If we fix a basis {w ^(ν):1≤ν≤r} of a subspace U ₂⊂V ₂, there are mappings $\{\varphi^{(\nu)}:1\leq\nu\leq r\}\subset U_{2}^{\prime}$ such that $(\mathit{id}_{1}\otimes\varphi^{(\nu )})(\mathbf{w})\in U_{1}^{\min}(\mathbf{w})$ and
$$\mathbf{w}=\sum_{\nu=1}^{r} \bigl(\mathit{id}_{1}\otimes\varphi^{(\nu)}\bigr) (\mathbf{w})\otimes w^{(\nu)}\quad\text{\textit{for all}}\ \mathbf{w}\in V_{1} \otimes_{a}U_{2}. $$

Proof

For statement (a) consider the representation of v by (2.4) with bases $\{v_{i}\}_{i=1}^{r}$ and $\{w_{i}\}_{i=1}^{r}$. Applying a basis transformation $\{v_{i}\}_{i=1}^{r}\mapsto\{\hat{v}_{i}\}_{i=1}^{r}$, we obtain $\mathbf{v}=\sum_{i=1}^{r}\hat{v}_{i}\otimes\hat{w}_{i}$ with another basis $\{\hat{w}_{i}\}_{i=1}^{r}$.

To prove (b) take a basis $\{\varphi_{2}^{(\nu)}:1\leq\nu\leq r\}$ of $U_{2}^{\prime}$ dual to $\{w_{2}^{(\nu)}:1\leq\nu\leq r\}$ and set $\{\mathit{id}_{1}\otimes\varphi_{2}^{(\nu)}:1\leq\nu\leq r\}\subset L(V_{1}\otimes_{a}U_{2},V_{1})$. By statement (a), any w∈V ₁⊗U ₂ has a representation given by $\mathbf{w}=\sum_{\nu=1}^{r}v_{1}^{(\mu )}\otimes w_{2}^{(\mu)}$, and here $\{v_{1}^{(\mu)}:1\leq\nu\leq r\}$ is a basis of $U_{1}^{\min}(\mathbf{w})$. Then

$$\bigl(\mathit{id}_{1}\otimes\varphi_{2}^{(\nu)}\bigr) ( \mathbf{w})=\sum_{\mu=1}^{r} \varphi_{\nu }\bigl(w_{2}^{(\mu)}\bigr)\cdot v_{1}^{(\mu)}=\sum_{\mu=1}^{r} \delta_{\nu,\mu}v_{1}^{(\mu)}=v_{1}^{(\nu)}$$

holds, proving the assertion. □

2.2.2 Definition in the General Case

In the following, we assume that d≥3, and generalise some of the features of tensors of second order.

By Lemma 2.3, we may assume $\mathbf{v}\in\mathbf{U}:={}_{a}\bigotimes_{j=1}^{d}U_{j}$ with finite-dimensional subspaces U _j⊂V _j. The lattice structure from Lemma 2.11 generalises to higher order.

Lemma 2.16

Assume that X _i and Y _i are subspaces of V _i for i=1,…,d. Then the identity

$$\Biggl({}_{a}\bigotimes_{j=1}^{d}X_{j} \Biggr)\cap \Biggl({}_{a} \bigotimes_{j=1}^{d}Y_{j} \Biggr)= {}_{a}\bigotimes_{j=1}^{d} ( X_{j}\cap Y_{j} ) $$

holds.

Proof

For the start of the induction at d=2 use Lemma 2.11. Assume that the assertion holds for d−1 and write ${}_{a}\bigotimes_{j=1}^{d}X_{j}$ as X ₁⊗_a X _[1] with $X_{[1]}:={}_{a}\bigotimes_{j=2}^{d}X_{j}$. Similarly, use ${}_{a}\bigotimes_{j=1}^{d}Y_{j} =Y_{1}\otimes_{a} Y_{[1]}$. Lemma 2.11 states that v∈(X ₁∩Y ₁)⊗_a(X _[1]∩Y _[1]). By the induction hypothesis, $X_{[1]}\cap Y_{[1]}={}_{a}\bigotimes_{j=2}^{d} ( X_{j}\cap Y_{j} )$ is valid, proving the assertion. □

Again, the minimal subspaces $U_{j}^{\min}(\mathbf{v})$ are given by the intersection of all U _j satisfying $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}$.

The algebraic characterisation of $U_{j}^{\min}(\mathbf{v})$ is similar to that for d=2. To this end, we introduce the following two subspaces (recall (2.7e)):

(2.10a)

(2.10b)

In the case of a normed space V _j, we may consider the subspace

$$ U_{j}^{\mathrm{III}}(\mathbf{v}):= \biggl\{ (\mathit{id}_{j} \otimes\boldsymbol{\varphi}_{[j]}) (\mathbf{v}):\text{\ } \boldsymbol{\varphi}_{[j]}\in{}_{a}\bigotimes _{k\neq j}V_{k}^{\ast} \biggr\} . $$

(2.10c)

Finally, if V _[j]=_a⨂_k≠j V _k is a normed space, we can define

$$ U_{j}^{\mathrm{IV}}(\mathbf{v}):= \bigl\{ (\mathit{id}_{j} \otimes\boldsymbol{\varphi} _{[j]}) (\mathbf{v}):\text{\ } \boldsymbol{\varphi}_{[j]}\in\mathbf{V}_{[j]}^{\ast} \bigr\} . $$

(2.10d)

Note that, in general, the four spaces ${}_{a}\bigotimes_{k\neq j}V_{k}^{\prime}$, (_a⨂_k≠j V _k)′, ${}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$, $\mathbf{V}_{[j]}^{\ast}$ may differ.

Theorem 2.17

For any $\mathbf{v}\in\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j}$, there exist minimal subspaces $U_{j}^{\min }(\mathbf{v})$ (1≤j≤d), whose algebraic characterisation is given by

$$U_{j}^{\min}(\mathbf{v})=U_{j}^{\mathrm{I}}( \mathbf{v})=U_{j}^{\mathrm{II}}(\mathbf{v}). $$

Furthermore, if V _j and V _[j]=_a⨂_k≠j V _k are normed spaces for 1≤j≤d, then

$$U_{j}^{\min}(\mathbf{v})=U_{j}^{\mathrm{I}}( \mathbf{v})=U_{j}^{\mathrm{II}}(\mathbf{v})=U_{j}^{\mathrm{III}}(\mathbf{v})=U_{j}^{\mathrm{IV}}( \mathbf{v}). $$

Moreover, $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min }(\mathbf{v})$ and $\dim(U_{j}^{\min}(\mathbf{v}))=\operatorname{rank}_{j}(\mathbf{v})$ hold with $\operatorname{rank}_{j}$ from (2.8).

Proof

Since the mappings id _j⊗φ _[j] are applied to $\mathbf{v}\in\mathbf{U}:={}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min }(\mathbf{v})$, only the restrictions φ _[j] to ${}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v})^{\prime}$ and $({}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v}))^{\prime}$ are of interest. Since the subspace $U_{k}^{\min}(\mathbf{v})$, for all k, has finite dimension, Remark 2.6c states that ${}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v})^{\prime} =({}_{a}\bigotimes_{k\neq j}U_{j}^{\min}(\mathbf{v}))^{\prime}$. This proves $U_{j}^{\mathrm{I}}(\mathbf{v})=U_{j}^{\mathrm{II}}(\mathbf{v})$.

Again, the finite dimension of $U_{k}^{\min}(\mathbf{v})$ implies $U_{j}^{\mathrm{I}}(\mathbf{v})={}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v})^{\prime} ={}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v})^{\ast}$. By the Hahn–Banach theorem, $U_{k}^{\min }(\mathbf{v})^{\ast}$ can be extended to $V_{k}^{\ast}$. This proves $U_{j}^{\mathrm{III}}(\mathbf{v})\supset U_{j}^{\mathrm{I}}(\mathbf{v})$. The trivial inclusion $U_{j}^{\mathrm{III}}(\mathbf{v})\subset U_{j}^{\mathrm{I}}(\mathbf{v})$ proves equality.

Next, we show that $U_{j}^{\mathrm{II}}(\mathbf{v})=U_{j}^{\mathrm{IV}}(\mathbf{v})$. The inclusion (_a⨂_k≠j V _k)^∗⊂(_a⨂_k≠j V _k)′ implies $U_{j}^{\mathrm{IV}}(\mathbf{v})\subset U_{j}^{\mathrm{II}}(\mathbf{v})$. Consider $v_{j}:=(\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]})(\mathbf{v})\in U_{j}^{\mathrm{II}}(\mathbf{v})$ for some φ _[j]∈(_a⨂_k≠j V _k)′. Since $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v})$, we may restrict φ _[j] to $\boldsymbol{\varphi}_{[j]}\in ({}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v}))^{\prime}$. Since ${}_{a}\bigotimes_{k\neq j}U_{k}^{\min}(\mathbf{v})$ is a finite-dimensional subspace of the normed space _a⨂_k≠j V _k, by the Hahn–Banach theorem, the algebraic functional φ _[j] can be extended to _a⨂_k≠j V _k such that $\overline {\boldsymbol{\varphi}_{[j]}}\in({}_{a}\bigotimes_{k\neq j}V_{k})^{\ast }$, and by $v_{j}=(\mathit{id}_{j}\otimes\overline{\boldsymbol{\varphi}_{[j]}})(\mathbf{v})\in U_{j}^{\mathrm{IV}}(\mathbf{v})$ the opposite inclusion $U_{j}^{\mathrm{II}}(\mathbf{v})\subset U_{j}^{\mathrm{IV}}(\mathbf{v})$ follows.

To prove that these spaces coincide with $U_{j}^{\min}(\mathbf{v})$, we apply the matricisation from Sect. 2.1.2. The isomorphism $\mathcal{M}_{j}$ from Definition 2.7 maps ${}_{a}\bigotimes_{k=1}^{d}V_{k}$ into V _j⊗_a V _[j] (cf. (2.5a)). Proposition 2.14 states that $U_{j}^{\min}(\mathbf{v})=U_{j}^{\mathrm{II}}(\mathbf{v})$ is the minimal subspace. So far, we have proved $\mathbf{v}\in U_{j}^{\min}(\mathbf{v})\otimes_{a}V_{[j]}$. From Lemma 2.16, the intersection over all 1≤j≤d yields $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v})$. □

We remark that for d≥3, in general, the dimensions of $U_{j}^{\min }(\mathbf{v})$ may be different.

Since $U_{j}^{\min}(\mathbf{v})$ is a subspace of V _j generated by elementary tensors from

$${}_{a}\bigotimes_{k\neq j}V_{k}^{\prime} =\operatorname{span} \bigl\{ (\varphi_{1}\otimes\cdots\otimes \varphi_{j-1}\otimes\varphi_{j+1}\otimes \cdots\otimes \varphi_{d}):\varphi_{k}\in V_{k}^{\prime},\ k\neq j \bigr\} $$

for 1≤j≤d, we can write

$$U_{j}^{\min}(\mathbf{v})=\operatorname{span} \bigl\{ ( \varphi_{1}\otimes \cdots\otimes\varphi_{j-1}\otimes \mathit{id}_{j}\otimes\varphi_{j+1}\otimes \cdots\otimes \varphi_{d}) (\mathbf{v}):\varphi_{k}\in V_{k}^{\prime},\ k\neq j \bigr\} , $$

and, if V _k is a normed space for 1≤k≤d, we can also write it as

$$U_{j}^{\min}(\mathbf{v})=\operatorname{span} \bigl\{ ( \varphi_{1}\otimes \cdots\otimes\varphi_{j-1}\otimes \mathit{id}_{j}\otimes\varphi_{j+1}\otimes \cdots\otimes \varphi_{d}) (\mathbf{v}):\varphi_{k}\in V_{k}^{\ast},\ k\neq j \bigr\} $$

for 1≤j≤d.

2.2.3 Hierarchies of Minimal Subspaces

We have introduced the minimal subspace $U_{j}^{\min}(\mathbf{v})\subset V_{j}$ for a singleton {j}⊂D:={1,2,…,d}. Instead we may consider general disjoint and non-empty subsets of α _i⊂D. For instance, let $\mathbf{v}\in{}_{a}\bigotimes_{j\in D}V_{j} =\mathbf{V}_{\alpha_{1}}\otimes\mathbf{V}_{\alpha_{2}}\otimes\mathbf{V}_{\alpha_{3}}$, where α ₁={1,2}, α ₂={3,4}, and α ₃={5,6,7}. Then we can conclude that there are minimal subspaces $\mathbf{U}_{\alpha_{\nu}}^{\min}(\mathbf{v})$ for ν=1,2,3, such that $\mathbf{v}\in{}_{a}\bigotimes_{\nu=1}^{3}\mathbf{U}_{\alpha_{\nu}}^{\min}(\mathbf{v})$. The relation between $U_{j}^{\min}(\mathbf{v})$ and $\mathbf{U}_{\alpha_{\nu}}^{\min}(\mathbf{v})$ is as follows.

Proposition 2.18

Let v∈V=_a⨂_j∈D V _j and ∅≠α⊂D. Then the minimal subspaces $\mathbf{U}_{\alpha}^{\min}(\mathbf{v})$ and $U_{j}^{\min}(\mathbf{v})$ for j∈α are related by

$$ \mathbf{U}_{\alpha}^{\min}(\mathbf{v})\subset{}_{a} \bigotimes_{j\in\alpha}U_{j}^{\min}( \mathbf{v}) . $$

(2.11)

Proof

We know that $\mathbf{v}\in\mathbf{U}={}_{a}\bigotimes_{j\in D}U_{j}^{\min}(\mathbf{v})$. We may write $\mathbf{U}=\mathbf{U}_{\alpha}\otimes_{a}\mathbf{U}_{\alpha^{c}}$, where α ^c=D∖α and $\mathbf{U}_{\beta}:={}_{a}\bigotimes_{j\in\beta}U_{j}^{\min}(\mathbf{v})$ for any subset β⊂D. Thus, $\mathbf{U}_{\alpha}^{\min}(\mathbf{v})$ must be contained in $\mathbf{U}_{\alpha}={}_{a}\bigotimes_{j\in\alpha}U_{j}^{\min}(\mathbf{v})$. □

An obvious generalisation of the previous results is given below.

Corollary 2.19

Let $\mathbf{v}\in\mathbf{V}={}_{a}\bigotimes_{j\in\mathcal{I}}V_{j}$. Assume that ∅≠α _i⊂D are pairwise disjoint for i=1,2,…,m. The minimal subspace for $\alpha:=\bigcup_{i=1}^{m}\alpha_{i}$ satisfies

$$ \mathbf{U}_{\alpha}^{\min}(\mathbf{v})\subset{}_{a} \bigotimes_{i=1}^{m} \mathbf{U}_{\alpha_{i}}^{\min}(\mathbf{v}) . $$

(2.12)

The algebraic characterisation of $\mathbf{U}_{\alpha}^{\min}(\mathbf{v})$ is analogous to that given in Theorem 2.17. Formulae (2.10a), (2.10b) become

(2.13)

where $( \mathit{id}_{\alpha}\otimes\boldsymbol{\varphi}_{\alpha^{c}} ) (\otimes_{j=1}^{d}v^{(j)})= ( \boldsymbol{\varphi}_{\alpha^{c}}(\otimes_{j\in\alpha^{c}}v^{(j)}) ) \otimes_{k\in\alpha}v^{(k)}$. The analogues of (2.10c), (2.10d) apply as soon as norms are defined on V _j and ${}_{a}\bigotimes_{j\in\alpha^{c}}V_{j}$.

3 Minimal Subspaces in a Banach Tensor Space

In this section we assume the existence of a norm, namely ∥⋅∥, defined on a tensor space V. More precisely, we introduce the following class of Banach spaces.

Definition 3.1

We say that V _∥⋅∥ is a Banach tensor space if there exist an algebraic tensor space V and a norm ∥⋅∥ on V such that V _∥⋅∥ is the completion of V with respect to a given norm ∥⋅∥, i.e.,

$$\mathbf{V}_{\Vert \cdot \Vert }:={}_{\Vert \cdot \Vert }\bigotimes _{j=1}^{d}V_{j} =\overline{ {}_{a}\bigotimes_{j=1}^{d}V_{j} }^{\Vert \cdot \Vert }. $$

If V _∥⋅∥ is a Hilbert space, we will say that V _∥⋅∥ is a Hilbert tensor space.

Next, we give some examples of Banach and Hilbert tensor spaces.

Example 3.2

For I _j⊂ℝ (1≤j≤d) and 1≤p<∞, the Sobolev space H ^N,p(I _j) consists of all univariate functions f from L ^p(I _j) with bounded norm^{Footnote 3}

$$ \Vert f\Vert _{N,p;I_{j}}:= \Biggl(\sum_{n=0}^{N} \int_{\mathrm{I}_{j}}\biggl \vert \frac{\mathrm{d}^{n}}{\mathrm{d}x^{n}}f\biggr \vert ^{p}\, \mathrm{d}x \Biggr)^{1/p}, $$

(3.1a)

whereas the space H ^N,p(I) of d-variate functions on I=I ₁×I ₂×…×I _d⊂ℝ^d is endowed with the norm

$$ \Vert f\Vert _{N,p}:= \biggl(\sum_{0\leq \vert \mathbf{n}\vert \leq N} \int_{\mathbf{I}}\bigl \vert \partial^{\mathbf{n}}f\bigr \vert ^{p}\,\mathrm{d}\mathbf{x} \biggr)^{1/p} $$

(3.1b)

where $\mathbf{n}\in\mathbb{N}_{0}^{d}$ is a multi-index of length $\vert \mathbf{n}\vert :=\sum_{j=1}^{d}n_{j}$. It is well known that H ^N,p(I _j) and H ^N,p(I) are reflexive and separable Banach spaces. Moreover, for p=2, the Sobolev spaces H ^N(I _j):=H ^N,2(I _j) and H ^N(I):=H ^N,2(I) are Hilbert spaces. As a first example,

$$H^{N,p}(\mathbf{I})={}_{\Vert \cdot \Vert _{N,p}} \bigotimes _{j=1}^{d}H^{N,p}(I_{j}) $$

is a Banach tensor space. Examples of Hilbert tensor spaces are

$$L^{2}(\mathbf{I})={}_{\Vert \cdot \Vert _{0,2}}\bigotimes _{j=1}^{d}L^{2}(I_{j}) \quad \text{and}\quad H^{N}(\mathbf{I})={}_{\Vert \cdot \Vert _{N,2}} \bigotimes_{j=1}^{d}H^{N}(I_{j}) \text{ for }N\in\mathbb{N}. $$

We recall that for the set of norms over a given vector space V, we can define a partial ordering ∥⋅∥₁≲∥⋅∥₂, if there exists a constant C such that ∥v∥₁≤C∥v∥₂ for all v∈V.

Given a vector space V, its completion with respect to a norm ∥⋅∥ yields a Banach space which we denote by $V_{\Vert \cdot \Vert }:=\overline{V}^{\Vert \cdot \Vert }$. Note that ∥⋅∥₁≲∥⋅∥₂ implies that $V_{\Vert \cdot \Vert _{2}}\subset V_{\Vert \cdot \Vert _{1}}$.

3.1 Tensor Product of Banach Spaces

Let ∥⋅∥_j, 1≤j≤d, be the norms of the vector spaces V _j appearing in $\mathbf{V}={}_{a}\bigotimes _{j=1}^{d}V_{j}$. By ∥⋅∥ we denote the norm on the tensor space V. Note that ∥⋅∥ is not determined by ∥⋅∥_j, but there are relations which are ‘reasonable’.

Any norm ∥⋅∥ on ${}_{a}\bigotimes_{j=1}^{d}V_{j}$ satisfying

$$ \Biggl\|\bigotimes _{j=1}^{d}v^{(j)}\Biggr\| =\prod_{j=1}^{d}\bigl\Vert v^{(j)}\bigr\Vert_{j} \quad\text{for all }v^{(j)}\in V_{j}\ ( 1\leq j\leq d ) $$

(3.2)

is called a cross norm. As usual, the dual norm to ∥⋅∥ is denoted by ∥⋅∥^∗. If ∥⋅∥ is a cross norm and ∥⋅∥^∗ is also a cross norm on ${}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}$, i.e.,

$$ \Biggl\|\bigotimes_{j=1}^{d}\varphi^{(j)}\Biggr\| ^{\ast}=\prod_{j=1}^{d}\bigl\Vert\varphi^{(j)} \bigr\Vert_{j}^{\ast}\quad\text{for all}\ \varphi^{(j)}\in V_{j}^{\ast}\ ( 1\leq j\leq d ) , $$

(3.3)

∥⋅∥ is called a reasonable cross norm.

Remark 3.3

Equation (3.2) implies the inequality $\Vert\bigotimes_{j=1}^{d}v^{(j)}\Vert\lesssim\prod_{j=1}^{d}\Vert v^{(j)}\Vert_{j}$, which is equivalent to the continuity of the tensor product mapping

(3.4)

given by $\otimes ( (v_{1},\ldots,v_{d}) ) =\otimes_{j=1}^{d}v_{j}$.

By standard arguments, continuity of the tensor product implies the following result.

Lemma 3.4

Let V _j,0 be dense in (V _j,∥⋅∥_j) for 1≤j≤d. Assume (3.4) to be continuous for some norm ∥⋅∥ defined on $\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j}$. Then ${}_{a}\bigotimes_{j=1}^{d}V_{j,0}$ is dense in V, so that $\overline{_{a}\bigotimes_{j=1}^{d}V_{j,0}}^{\Vert \cdot \Vert }=\mathbf{V}_{\Vert \cdot \Vert }$.

Example 3.5

It is well known that the norm ∥⋅∥_0,2 is a reasonable cross norm on ${}_{a}\bigotimes_{j=1}^{d}L^{2}(I_{j}) $, whereas ∥⋅∥_N,2 for N≥1 is not a reasonable cross norm on ${}_{a}\bigotimes_{j=1}^{d}H^{N}(I_{j}) $ (cf. Example 3.2).

Note that any functional $\boldsymbol{\varphi}=\otimes_{j=1}^{d}\varphi_{j}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}$ is also a linear map ${}_{a}\bigotimes_{j=1}^{d}V_{j} \rightarrow\mathbb{R}$, which is defined for elementary tensors by

$$\Biggl(\bigotimes _{j=1}^{d}\varphi_{j}\Biggr) \bigl( \otimes_{j=1}^{d}v_{j} \bigr) =\prod _{j=1}^{d}\varphi_{j}(v_{j}). $$

Thus, ${}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast} \subset({}_{a}\bigotimes_{j=1}^{d}V_{j})^{\prime}$. If ∥⋅∥ is a reasonable cross norm, then by (3.3) the map

$$\bigotimes :\mathop{\mathchoice{\raise-0.22em\hbox{\huge $\times$}} { \raise-0.05em\hbox{\Large $\times$}} {\hbox{\large $\times$}} { \times}}_{j=1}^{d} \bigl( V_{j}^{\ast}, \Vert \cdot \Vert _{j}^{\ast} \bigr) \longrightarrow \Biggl({}_{a} \bigotimes_{j=1}^{d}V_{j}^{\ast} ,\Vert \cdot \Vert ^{\ast}\Biggr) $$

is also continuous. Consequently, ${}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}\subset({}_{a}\bigotimes_{j=1}^{d}V_{j} )^{\ast}$.

Grothendieck [13] named the following norm ∥⋅∥_∨ the injective norm.

Definition 3.6

Let V _i be a Banach space with norm ∥⋅∥_i for 1≤i≤d. Then for $\mathbf{v}\in\mathbf{V}={}_{a} \bigotimes_{j=1}^{d}V_{j}$ define ∥⋅∥_∨ by

$$ \Vert \mathbf{v}\Vert _{\vee}:=\sup \biggl\{ \frac{\vert ( \varphi_{1}\otimes\varphi_{2}\otimes\cdots\otimes\varphi_{d} ) (\mathbf{v})\vert }{\prod_{j=1}^{d}\Vert\varphi_{j}\Vert_{j}^{\ast}}:0\neq \varphi_{j}\in V_{j}^{\ast},\ 1\leq j\leq d \biggr\} . $$

(3.5)

It is well known that the injective norm is a reasonable cross norm (see Lemma 1.6 in [20]). Further properties are given by the next proposition.

Proposition 3.7

The following statements hold.

(a)
The injective norm is the weakest reasonable cross norm on V; i.e., if ∥⋅∥ is a reasonable cross norm over V, then ∥⋅∥_∨≲∥⋅∥.
(b)
For any norm ∥⋅∥ on V satisfying ∥⋅∥_∨≲∥⋅∥, the inclusion ${}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast} \subset({}_{a}\bigotimes_{j=1}^{d}V_{j})^{\ast}$ holds.

Proof

Statement (a) is a classical result (cf. [20], [14]). To prove (b), we use the fact that ∥⋅∥_∨≲∥⋅∥ implies $\Vert \cdot \Vert _{\vee}^{\ast }\gtrsim \Vert \cdot \Vert ^{\ast}$ (see again [20], [14]). Then

$$\bigl\Vert\otimes_{j=1}^{d}\varphi_{j} \bigr\Vert^{\ast}\leq C \bigl\Vert\otimes_{j=1}^{d} \varphi_{j}\bigr\Vert_{\vee}^{\ast}\quad \bigl( \varphi_{j}\in V_{j}^{\ast },1\leq j\leq d\bigr) $$

for some C>0, and the proof ends using the fact that $\Vert \cdot \Vert _{\vee}^{\ast}$ is also a cross norm. □

3.2 Minimal Subspaces in a Banach Tensor Space

Let V be a tensor product of Banach spaces (V _i,∥⋅∥_i) for 1≤i≤d. Then, considering the injective norm on V _[j] for 1≤j≤d, for each $\mathbf{v}\in\mathbf{V,}$ we conclude from Theorem 2.17 that^{Footnote 4}

$$U_{j}^{\min}(\mathbf{v})=U_{j}^{\mathrm{I}}( \mathbf{v})=U_{j}^{\mathrm{II}}(\mathbf{v})=U_{j}^{\mathrm{III}}(\mathbf{v})=U_{j}^{\mathrm{IV}}( \mathbf{v}) $$

(cf. (2.10a)–(2.10d)). Assume that the norm ∥⋅∥ on V satisfies

$$ \Vert \cdot \Vert \gtrsim \Vert \cdot \Vert _{\vee} $$

(3.6)

(cf. Proposition 3.7a). This assumption ensures that the Banach tensor space V _∥⋅∥ is always a Banach subspace of the Banach tensor space $\mathbf{V}_{\Vert \cdot \Vert _{\vee}}$. This fact allows us to extend the definition of minimal subspaces to a Banach tensor space V _∥⋅∥ with a norm ∥⋅∥ satisfying (3.6). To this end, the following lemma will be useful.

Lemma 3.8

For 1≤i≤d, let (V _i,∥⋅∥_i) be Banach spaces. For fixed j∈{1,…,d} and a given $\boldsymbol{\varphi}_{[j]}=\bigotimes_{k\neq j}\varphi_{k}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$, the map id _j⊗φ _[j] belongs to $\mathcal{L}(\mathbf{V}_{\Vert \cdot \Vert _{\vee}},V_{j})$, i.e., id _j⊗φ _[j] is continuous on $( \mathbf{V,}\Vert \cdot \Vert _{\vee} ) $. Hence, there exists a unique extension $\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}\in\mathcal{L}(\mathbf{V}_{\Vert \cdot \Vert _{\vee}},V_{j})$. Moreover, $\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}\in \mathcal{L} ( \mathbf{V}_{\Vert \cdot \Vert },V_{j} ) $ holds for any norm ∥⋅∥ on V satisfying (3.6) with the operator norm

$$ \bigl \Vert \overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}} \bigr \Vert _{V_{j}\leftarrow\mathbf{V}}=\sup_{\Vert \mathbf{v}\Vert =1}\bigl \Vert (\overline{\mathit{id}_{j} \otimes\boldsymbol{\varphi}_{[j]}}) (\mathbf{v})\bigr \Vert _{j}\leq C\prod_{k\neq j}\Vert \varphi_{k}\Vert _{k}^{\ast}, $$

(3.7)

where the constant C is determined by the estimate in (3.6).

Proof

Let $\varphi_{j}\in V_{j}^{\ast}$ and use $\varphi_{j}\circ(\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]})=\bigotimes_{k=1}^{d}\varphi_{k}$ (cf. (2.7f)). Hence, continuity follows from

The last inequality holds for any norm on V satisfying ∥⋅∥≥(1/C)∥⋅∥_∨ and proves (3.7). The statement about the extension $\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}$ is standard. □

An immediate consequence of Lemma 3.8 and Theorem 2.17 is the following.

Corollary 3.9

For 1≤i≤d, let (V _i,∥⋅∥_i) be a Banach space and assume that ∥⋅∥ is a norm on V satisfying (3.6). Then for each algebraic tensor v∈V the representation

$$U_{j}^{\min}(\mathbf{v})=\biggl\{ ( \overline{\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]}} ) (\mathbf{v}):\boldsymbol{\varphi}_{[j]}\in \bigotimes_{k\neq j}V_{k}^{\ast}\biggr\} $$

holds for 1≤j≤d. Moreover, we can write

$$U_{j}^{\min}(\mathbf{v})=\operatorname{span} \bigl\{ (\overline{ \varphi_{1}\otimes\cdots\otimes\varphi_{j-1} \otimes \mathit{id}_{j}\otimes\varphi_{j+1}\otimes\cdots \otimes\varphi_{d}}) (\mathbf{v}):\varphi_{k}\in V_{k}^{\ast },\ k\neq j \bigr\} . $$

For the hierarchical format from [15] we need to extend the results to a minimal subspace in the tensor space V _α:=⨂_k∈α V _k, where α⊂D:={1,…,d} contains more than one index. Then the splitting V _j⊗V _[j] from above becomes $\mathbf{V}_{\alpha}\otimes \mathbf{V}_{\alpha^{c}}$, where $\mathbf{V}_{\alpha^{c}}:=\bigotimes _{k\in D\backslash\alpha}V_{k}$. The definition of, e.g., $U_{j}^{\mathrm{I}}(\mathbf{v})$ in (2.10a) becomes

$$\mathbf{U}_{\alpha}^{\mathrm{I}}(\mathbf{v}):= \Bigl\{ (\mathit{id}_{\alpha}\otimes \boldsymbol{\varphi}_{\alpha^{c}}) (\mathbf{v}): \text{\ }\boldsymbol{\varphi }_{\alpha^{c}}\in{}_{a}\bigotimes _{k\in\alpha}V_{k}^{\prime } \Bigr\} \subset\mathbf{V}_{\alpha}$$

involving the identity id _α∈L(V _α,V _α).

Remark 3.10

By arguments analogous to those above, we can show that

$$\mathbf{U}_{\alpha}^{\mathrm{I}}(\mathbf{v})=\mathbf{U}_{\alpha}^{\mathrm{III}}( \mathbf{v}):= \Bigl\{ (\mathit{id}_{\alpha}\otimes\boldsymbol{ \varphi}_{\alpha^{c}}) (\mathbf{v}):\text{\ }\boldsymbol{ \varphi}_{\alpha^{c}}\in{}_{a}\bigotimes _{k\in\alpha}V_{k}^{\ast} \Bigr\} . $$

In particular, $\overline{\mathit{id}_{\alpha}\otimes\boldsymbol{\varphi}_{\alpha^{c}}}\in\mathcal{L}(\mathbf{V}_{\Vert \cdot \Vert _{\vee}},V_{\alpha })\subset\mathcal{L} ( \mathbf{V}_{\Vert \cdot \Vert },V_{\alpha} ) $ holds.

3.3 Minimal Closed Subspaces in a Banach Tensor Space

3.3.1 Definitions

So far, $U_{j}^{\min}(\mathbf{v})$ has been defined for algebraic tensors only. From $\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}\in\mathcal{L} ( \mathbf{V}_{\Vert \cdot \Vert },V_{j} ) $, we can extend the definition of U _jmin(v) in Corollary 3.9 even to topological tensors v∈V _∥⋅∥∖V as follows.

Definition 3.11

For a given Banach tensor space V _∥⋅∥ with a norm ∥⋅∥ satisfying (3.6) we define the set

$$U_{j}^{\min}(\mathbf{v}):=\biggl\{ ( \overline{\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]}} ) (\mathbf{v}):\boldsymbol{\varphi}_{[j]}\in {}_{a} \bigotimes_{k\neq j}V_{k}^{\ast}\biggr\} $$

for each v∈V _∥⋅∥ and 1≤j≤d.

Observe that $(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v})$ is well defined, because $\overline{\mathit{id}_{j}\otimes \boldsymbol{\varphi}_{[j]}}$ is continuous and coincides with the standard definition when v∈V. Thus, for each v∈V _∥⋅∥ we can define its ‘minimal subspace’ by

$$ \mathbf{U}(\mathbf{v}):={}_{a}\bigotimes _{j=1}^{d}U_{j}^{\min }(\mathbf{v}) . $$

(3.8a)

If we take into account the topological properties of V _∥⋅∥, we may consider its closure with respect to the norm ∥⋅∥:

$$ \mathbf{U}_{\Vert \cdot \Vert }(\mathbf{v}):={}_{\Vert \cdot \Vert }\bigotimes _{j=1}^{d}\overline{U_{j}^{\min}( \mathbf{v})}^{\Vert \cdot \Vert _{j}} ={}_{\Vert \cdot \Vert }\bigotimes _{j=1}^{d}U_{j}^{\min}( \mathbf{v}) . $$

(3.8b)

The second identity is a consequence of Lemma 3.4. If v∈V, the set $U_{j}^{\min}(\mathbf{v})$ is a finite-dimensional subspace in V _j and therefore closed, i.e., $\overline {U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j}}=U_{j}^{\min }(\mathbf{v})$. In the general case of v∈V _∥⋅∥, the subspace $U_{j}^{\min}(\mathbf{v})$ may be not closed.

Before we discuss the Banach subspace U _∥⋅∥(v) in Sect. 3.3.3, we first analyse the properties of the subspace $U_{j}^{\min}(\mathbf{v})$. To this end we use the following definition.

Definition 3.12

We say that a sequence (x _n)_n∈ℕ in a Banach space X converges weakly to x∈X, if limφ(x _n)=φ(x) for all φ∈X ^∗. In this case, we write x _n⇀x.

3.3.2 Dependence of $U_{j}^{\min}(\mathbf{v})$ on v

The properties of the maps id _j⊗φ _[j] involved in the definition of $U_{j}^{\min}(\mathbf{v})$ are discussed in Lemma 3.13. As a consequence, we shall establish our main result in Theorem 3.15 about the dimensions of $U_{j}^{\min }(\mathbf{v}_{n})$ and $U_{j}^{\min}(\mathbf{v})$ for a weakly convergent sequence v _n⇀v.

Lemma 3.13

Assume that the norm of the Banach tensor space V _∥⋅∥ satisfies (3.6). Let $\boldsymbol{\varphi}_{[j]}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$ and v _n,v∈V _∥⋅∥ with v _n⇀v. Then weak convergence $(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v}_{n})\rightharpoonup(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v})$ holds in V _j.

Proof

Let $\boldsymbol{\varphi}_{[j]}=\otimes_{k\neq j}\varphi_{k}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$ be an elementary tensor. We have to show that

$$\varphi_{j} \bigl[ (\overline{\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]}}) (\mathbf{v}_{n}) \bigr] \rightarrow\varphi_{j} \bigl[ (\overline {\mathit{id}_{j}\otimes \boldsymbol{\varphi}_{[j]}}) (\mathbf{v}) \bigr] $$

holds for all $\varphi_{j}\in V_{j}^{\ast}$. By Lemma 3.8, $\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}:\mathbf{V}_{\Vert \cdot \Vert }\rightarrow V_{j}$ is continuous. Therefore, the composition $\varphi_{j}\circ(\overline {\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}}):\mathbf{V}_{\Vert \cdot \Vert }\rightarrow\mathbb{R}$ is a continuous functional belonging to $\mathbf{V}_{\Vert \cdot \Vert }^{\ast}$, and hence v _n⇀v implies

$$\bigl( \varphi_{j}\circ(\overline{\mathit{id}_{j}\otimes \boldsymbol{\varphi}_{[j]}}) \bigr) (\mathbf{v}_{n}) \rightarrow \bigl( \varphi_{j}\circ(\overline {\mathit{id}_{j} \otimes\boldsymbol{\varphi}_{[j]}}) \bigr) (\mathbf{v}). $$

This proves the lemma for an elementary tensor φ _[j]. The result extends immediately to finite linear combinations $\boldsymbol{\varphi}_{[j]}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$. □

Lemma 3.14

Assume N∈ℕ and $x_{n}^{(i)}\rightharpoonup x_{\infty}^{(i)}$ for 1≤i≤N with linearly independent $x_{\infty}^{(i)}\in X$. Then there is an n ₀ such that for all n≥n ₀ the N-tuples $(x_{n}^{(i)}:1\leq i\leq N)$ are linearly independent.

Proof

There are functionals φ ^(j)∈X ^∗ (1≤j≤N) with $\varphi^{(j)}(x_{\infty}^{(i)})=\delta_{ij}$ (cf. Remark 2.1). Set

$$\Delta_{n}:=\det \bigl( \bigl(\varphi^{(j)} \bigl(x_{n}^{(i)}\bigr)\bigr)_{i,j=1}^{N} \bigr) . $$

$x_{n}^{(i)}\rightharpoonup x_{\infty}^{(i)}$ implies $\varphi^{(j)}(x_{n}^{(i)})\rightarrow\varphi^{(j)}(x_{\infty}^{(i)})$. Continuity of the determinant proves $\Delta_{n}\rightarrow\Delta_{\infty}:=\det( ( \delta_{ij} )_{i,j=1}^{N})=1$. Hence, there is an n ₀ such that Δ_n>0 for all n≥n ₀, proving linear independence of $\{x_{n}^{(i)}:1\leq i\leq N\}$. □

Theorem 3.15

Assume that the norm of the Banach tensor space V _∥⋅∥ satisfies (3.6). Let v _n∈V _∥⋅∥ be a sequence with v _n⇀v∈V _∥⋅∥. Then ^{Footnote 5}

$$\dim\overline{U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j}}=\dim U_{j}^{\min}(\mathbf{v})\leq\liminf_{n\rightarrow\infty}\dim U_{j}^{\min}(\mathbf{v}_{n})\quad\text{for all}\ 1 \leq j\leq d. $$

Proof

Since $U_{j}^{\min}(\mathbf{v})$ is dense in $\overline{U_{j}^{\min }(\mathbf{v})}^{\Vert \cdot \Vert _{j}}$, the dimensions are identical in the sense of footnote 5. We can select a subsequence (again denoted by v _n) such that $\dim U_{j}^{\min}(\mathbf{v}_{n})$ is weakly increasing. If $\dim U_{j}^{\min}(\mathbf{v}_{n})\rightarrow\infty$ holds, nothing is to be proved. Therefore, assume that $\lim\dim U_{j}^{\min}(\mathbf{v}_{n})=N<\infty$. For an indirect proof assume that $\dim U_{j}^{\min}(\mathbf{v})>N$. Then, there are N+1 linearly independent vectors

$$b^{(i)}= \bigl( \overline{\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]}^{(i)}} \bigr) (\mathbf{v})\text{\quad with\ }\boldsymbol{\varphi}_{[j]}^{(i)}\in {}_{a}\bigotimes_{k\neq j}V_{k}^{\ast} \text{\ for\ }1\leq i\leq N+1. $$

By Lemma 3.13, the sequence $b_{n}^{(i)}:=(\overline {\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}^{(i)}})(\mathbf{v}_{n})\rightharpoonup b^{(i)}$ converges weakly. By Lemma 3.14, for large enough n, also $\{b_{n}^{(i)}:1\leq i\leq N+1\}$ is linearly independent. Because $b_{n}^{(i)}=(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}^{(i)}})(\mathbf{v}_{n})\in U_{j}^{\min}(\mathbf{v}_{n})$, this contradicts $\dim U_{j}^{\min}(\mathbf{v}_{n})\leq N$. □

For the hierarchical format from [15], id _j⊗φ _[j] must be replaced by $\mathit{id}_{\alpha}\otimes \boldsymbol{\varphi}_{_{\alpha^{c}}}$ (cf. Corollary 2.19 and Remark 3.10). Similar methods as above show the following generalisations:

(3.9a)

(3.9b)

Here, we equip the tensor space V _α=_a⨂_j∈α V _j with the injective norm ∥⋅∥_∨ from (3.5).

3.3.3 $\dim(U_{j}^{\min}(\mathbf{v}))<\infty$

Consider U(v) and U _∥⋅∥(v) from (3.8a,b). For algebraic tensors v we know that v∈U(v). However, the corresponding conjecture v∈U _∥⋅∥(v), in the general case, turns out to be not quite obvious. The statement v∈U _∥⋅∥(v) requires a sequence of $\mathbf{v}_{n}\in{}_{a}\bigotimes_{j=1}^{d}\overline{U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j}}$ with v=limv _n. We do not have a proof that this holds in general. A positive result holds for the Hilbert case (see Sect. 3.4) and if the subspaces $U_{j}^{\min}(\mathbf{v})$ are finite dimensional (see Theorem 3.16). In the general Banach case, we give a proof for v=limv _n, provided that the convergence is fast enough.

For practical applications, the finite-dimensional case is the most important one, since it follows from Theorem 3.15 with bounded $\liminf_{n\rightarrow\infty}\dim U_{j}^{\min}(\mathbf{v}_{n})$.

Theorem 3.16

Assume that V _∥⋅∥ is a Banach tensor space with ∥⋅∥ satisfying (3.6). For v∈V _∥⋅∥ and all 1≤j≤d assume that $\dim(U_{j}^{\min}(\mathbf{v}))<\infty$. Then v belongs to the (algebraic) tensor space ${}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min }(\mathbf{v}) =\mathbf{U}_{\Vert \cdot \Vert }(\mathbf{v})$.

Proof

Let $\{b_{j}^{(i)}:1\leq i\leq r_{j}\}$ be a basis of $U_{j}^{\min}(\mathbf{v})$. There are functionals $\varphi_{j}^{(i)}\in V_{j}^{\ast}$ with the property $\varphi_{j}^{(i)}(b_{j}^{(k)})=\delta_{ik}$. Define $\mathbf{a}_{\mathbf{i}}:=\bigotimes_{j=1}^{d}\varphi_{j}^{(i_{j})}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}$ and $\mathbf{b}_{\mathbf{i}}:=\bigotimes_{j=1}^{d}b_{j}^{(i_{j})}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v})$ for i=(i ₁,…,i _d) with 1≤i _j≤r _j. Any $\mathbf{u}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v}) $ is reproduced by

$$\mathbf{u}=\sum_{\mathbf{i}}\mathbf{a}_{\mathbf{i}}( \mathbf{u})\mathbf{b}_{\mathbf{i}}. $$

We set

$$ \mathbf{u}_{\mathbf{v}}:=\sum_{\mathbf{i}} \mathbf{a}_{\mathbf{i}}(\mathbf{v})\mathbf{b}_{\mathbf{i}}\in {}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v}) . $$

(3.10)

Thus, the theorem follows, if we prove that $\mathbf{v}=\mathbf{u}_{\mathbf{v}}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v})$. Observe that the norm ∥v−u _v∥_∨ is described by α(v−u _v) with a normalised $\boldsymbol{\alpha }=\bigotimes_{j=1}^{d}\alpha_{j}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}$. If we can show α(v−u _v)=0 for all α, the norm ∥v−u _v∥_∨ vanishes and v=u _v follows. Thus we need to show the following.

Claim.:: α(v−u _v)=0 holds for all $\boldsymbol{\alpha}=\bigotimes_{j=1}^{d}\alpha_{j}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}$.

To prove the claim, split each α _j into $\alpha_{j}^{(0)}+\sum_{i}c_{i}\varphi_{j}^{(i)}$ with $c_{i}:=\alpha_{j}(b_{j}^{(i)})$ and $\alpha_{j}^{(0)}:=\alpha_{j}-\sum_{i}c_{i}\varphi_{j}^{(i)}$. It follows that $\alpha_{j}^{(0)}(b_{j}^{(i)})=0$ for all i, i.e.,

$$ \alpha_{j}^{(0)}(u)=0\quad \mbox{for all}\ u\in U_{j}^{\min}(\mathbf{v}). $$

(3.11)

We expand the product

$$\boldsymbol{\alpha}=\bigotimes_{j=1}^{d} \alpha_{j}=\bigotimes_{j=1}^{d}\biggl(\alpha_{j}^{(0)}+ \sum_{i}c_{i}\varphi_{j}^{(i)}\biggr) =\bigotimes _{j=1}^{d}\biggl(\sum_{i}c_{i} \varphi_{j}^{(i)}\biggr)+A, $$

where all products contained in A have at least one factor $\alpha_{j}^{(0)}$. Consider such a product in A, where, without loss of generality, we assume j=1, i.e., $\alpha_{1}^{(0)}\otimes\boldsymbol{\gamma}_{[1]}$ with γ _[1]∈V _[1]. We conclude that $(\alpha_{1}^{(0)}\otimes\boldsymbol{\gamma}_{[1]})(\mathbf{u}_{\mathbf{v}})=0$, since $(\mathbf{id}_{[1]}\otimes\alpha_{1}^{(0)})(\mathbf{u}_{\mathbf{v}})=\mathbf{0}$ and $\alpha_{1}^{(0)}\otimes\boldsymbol{\gamma}_{[1]}=\boldsymbol{\gamma}_{[1]}\circ(\mathbf{id}_{[1]}\otimes\alpha_{1}^{(0)})$. Furthermore,

$$\bigl( \alpha_{1}^{(0)}\otimes\boldsymbol{ \gamma}_{[1]} \bigr) (\mathbf{v})=\alpha_{1}^{(0)}(w) \quad \mbox{for}\ w:=(\mathit{id}_{1}\otimes \boldsymbol{\gamma}_{[1]}) (\mathbf{v}). $$

By definition of $U_{1}^{\min}(\mathbf{v})$, $w\in U_{1}^{\min}(\mathbf{v})$ holds and $\alpha_{1}^{(0)}(w)=(\alpha_{1}^{(0)}\otimes\boldsymbol{\gamma }_{[1]})(\mathbf{v})=0$ follows from (3.11), and thus $(\alpha_{1}^{(0)}\otimes\boldsymbol{\gamma}_{[1]})(\mathbf{v}-\mathbf{u}_{\mathbf{v}})=0$ holds.

It remains to analyse $( \bigotimes_{j=1}^{d} ( \sum_{i}c_{i}\varphi_{j}^{(i)} ) ) (\mathbf{v}-\mathbf{u}_{\mathbf{v}})= ( \sum_{\mathbf{i}}\mathbf{c}_{\mathbf{i}}\mathbf{a}_{\mathbf{i}} ) (\mathbf{v}-\mathbf{u}_{\mathbf{v}})$ for $\mathbf{c} _{\mathbf{i}}:=\prod_{j=1}^{d}c_{i_{j}}$. Application to u _v yields

$$\biggl( \sum_{\mathbf{i}}\mathbf{c}_{\mathbf{i}} \mathbf{a}_{\mathbf{i}} \biggr) ( \mathbf{u}_{\mathbf{v}} ) =\sum _{\mathbf{i}}\mathbf{c}_{\mathbf{i}}\mathbf{a}_{\mathbf{i}}( \mathbf{v})\in\mathbb{R}$$

(cf. (3.10)). Since this value coincides with (∑_i c _i a _i)(v)=∑_i c _i a _i(v), we have proved

$$\Biggl(\bigotimes _{j=1}^{d} \biggl( \sum_{i}c_{i} \varphi_{j}^{(i)}\biggr) \Biggr) (\mathbf{v}-\mathbf{u}_{\mathbf{v}})=0. $$

Thus the claim follows, and thereby $\mathbf{v}=\mathbf{u}_{\mathbf{v}}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v}) $. □

3.3.4 $\dim(U_{j}^{\min}(\mathbf{v}))=\infty$

Recall that if v∈V then $\dim(U_{j}^{\min}(\mathbf{v}))< \infty$ for 1≤j≤d. Thus, under the assumption $\dim(U_{j}^{\min}(\mathbf{v}))=\infty$ for some j∈{1,2,…,d} we have v∈V _∥⋅∥∖V, and then v is defined as the limit of some Cauchy sequence in V.

For the next theorem we need a further assumption on the norm ∥⋅∥. A sufficient condition is that ∥⋅∥ is a uniform cross norm; i.e., it is a cross norm (cf. (3.2)) and satisfies

$$ \Biggl\| \Biggl(\bigotimes _{j=1}^{d}A_{j}\Biggr) (\mathbf{v}) \Biggr\| \leq \Biggl(\prod_{j=1}^{d} \Vert A_{j}\Vert_{V_{j}\leftarrow V_{j}} \Biggr) \Vert\mathbf{v}\Vert $$

(3.12)

for all $A_{j}\in\mathcal{L}(V_{j},V_{j})$ (1≤j≤d) and all $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}V_{j}$. The uniform cross norm property implies that ∥⋅∥ is a reasonable cross norm (cf. [25]). Hence, condition (3.6) is ensured (cf. Proposition 3.7a). A further consequence will be needed.

Lemma 3.17

Let ∥⋅∥ be a uniform cross norm on $\mathbf{V.}$ Note that V=V _[d]⊗_a V _d, where $\mathbf{V}_{[d]}:={}_{a}\bigotimes_{j=1}^{d-1}V_{j}$.

(a)
The map defined by
$$\Vert\mathbf{x}\Vert_{[d]}:=\Vert\mathbf{x}\otimes v_{d} \Vert ,\quad \text{\textit{where}}\ v_{d}\in V_{d},\ \Vert v_{d}\Vert_{d}=1, $$
does not depend on the choice of v _d. Therefore, it defines a norm on V _[d].
(b)
The norm ∥⋅∥ is a reasonable cross norm on V _[d]⊗V _d, i.e.,
$$ \Vert\mathbf{x}\otimes v_{d}\Vert=\Vert\mathbf{x}\Vert_{\lbrack d]} \Vert v_{d}\Vert_{d}\text{\quad \textit{and}\quad}\Vert\boldsymbol{ \varphi}_{[d]}\otimes\varphi_{d} \Vert^{\ast}=\Vert\boldsymbol{\varphi}_{[d]} \Vert_{[d]}^{\ast}\Vert\varphi_{d} \Vert_{d}^{\ast} $$
(3.13)
for x∈V _[d], v _d∈V _d, $\boldsymbol{\varphi }_{[d]}\in\mathbf{V}_{[d]}^{\ast}$, and $\varphi_{d}\in V_{d}^{\ast}$.
(c)
For $\varphi_{d}\in V_{d}^{\ast}$ and $\boldsymbol{\varphi}_{[d]}\in\mathbf{V}_{[d]}^{\ast}$, the following estimates hold:
$$ \bigl\Vert ( \mathbf{id}_{[d]}\otimes\varphi_{d} ) ( \mathbf{v})\bigr\Vert_{\lbrack d]}\leq\Vert\varphi_{d} \Vert_{d}^{\ast}\Vert\mathbf{v}\Vert\quad \text{\textit{and}}\quad \bigl\Vert(\mathit{id}_{d}\otimes\boldsymbol{\varphi}_{[d]}) (\mathbf{v})\bigr\Vert_{d}\leq\Vert\boldsymbol{\varphi}_{[d]} \Vert_{\lbrack d]}^{\ast}\Vert\mathbf{v}\Vert. $$
(3.14)

Proof

(a) Let $\varphi_{d}\in V_{d}^{\ast}$ be the functional with $\Vert\varphi_{d}\Vert_{d}^{\ast}=1$ and φ _d(v _d)=∥v _d∥_d (cf. (2.2)). Choose any w _d∈V _d with ∥w _d∥_d=1 and set $A_{d}:=w_{d}\varphi_{d}\in\mathcal{L}(V_{d},V_{d})$, i.e., A _d v=φ _d(v)w _d. From $\Vert A_{d}\Vert_{V_{d}\leftarrow V_{d}}=\Vert\varphi_{d}\Vert_{d}^{\ast}\Vert w_{d}\Vert_{d}=1$, the uniform cross norm property (3.12) with A _j=id for 1≤j≤d−1 implies ∥x⊗w _d∥=∥(id _[d]⊗A _d)(x⊗v _d)∥≤∥x⊗v _d∥. Interchanging the roles of w _d and v _d, we obtain ∥x⊗v _d∥=∥x⊗w _d∥.

(b1) ∥x⊗v _d∥=∥x∥_[d]∥v _d∥_d in (3.13) follows from the definition of ∥⋅∥_[d].

(b2) For any elementary tensor v=x _[d]⊗v _d≠0 we have $\frac{| ( \boldsymbol{\varphi}_{[d]}\otimes\varphi_{d} ) (\mathbf{v})|}{\Vert\mathbf{v}\Vert}\leq\Vert\boldsymbol{\varphi}_{[d]}\Vert_{[d]}^{\ast}\Vert\varphi_{d}\Vert_{d}^{\ast}$. Taking the supremum over all v=x _[d]⊗v _d, we obtain

$$\begin{aligned} \Vert\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d} \Vert^{\ast}&=\sup_{\mathbf{v}\neq0}\frac{|(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{v})|}{\Vert\mathbf{v}\Vert}\geq\sup_{\mathbf{v}=\mathbf{x}_{[d]}\otimes v_{d}\neq0} \frac{|(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{v})|}{\Vert\mathbf{v}\Vert} \\ &=\Vert\boldsymbol{\varphi}_{[d]}\Vert_{[ d]}^{\ast}\Vert\varphi_{d} \Vert_{d}^{\ast}. \end{aligned}$$

Define $\mathbf{A}:=\bigotimes_{j=1}^{d}A_{j}\in\mathcal{L}(\mathbf{V},\mathbf{V})$ by A _j=id (1≤j≤d−1) and $A_{d}=\hat{v}_{d}\varphi_{d}$ with $0\neq\hat{v}_{d}\in V_{d}$. Then Av is an elementary vector of the form $\mathbf{x}_{[d]}\otimes\hat{v}_{d}$ and $\Vert A_{d}\Vert_{V_{d}\leftarrow V_{d}}=\Vert\hat{v}_{d}\Vert_{d}\Vert\varphi_{d}\Vert_{d}^{\ast}$ holds. This fact and the cross norm property $\Vert\mathbf{Av}\Vert\leq\Vert\hat{v}_{d}\Vert_{d}\Vert\varphi_{d}\Vert_{d}^{\ast}\Vert\mathbf{v}\Vert$ lead us to

$$\Vert\boldsymbol{\varphi}_{[d]}\Vert_{[d]}^{\ast} \Vert\varphi_{d}\Vert_{d}^{\ast}\geq \frac{|(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{Av})|}{\Vert\mathbf{Av}\Vert}\geq\frac{|(\boldsymbol{\varphi }_{[d]}\otimes\varphi_{d})(\mathbf{Av})|}{\Vert\hat{v}_{d}\Vert_{d}\Vert\varphi_{d}\Vert_{d}^{\ast}\Vert\mathbf{v}\Vert}. $$

Since $(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{Av})= (\boldsymbol{\varphi}_{[d]}\otimes ( \varphi_{d}A_{d} )) (\mathbf{v})=\varphi_{d}(\hat{v}_{d})\cdot(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{v})$, the estimate can be continued by

$$\Vert\boldsymbol{\varphi}_{[d]}\Vert_{[d]}^{\ast} \Vert\varphi_{d}\Vert_{d}^{\ast}\geq \frac{\vert \varphi_{d}(\hat{v}_{d})\vert }{\Vert\hat{v}_{d}\Vert_{d}\Vert\varphi_{d}\Vert_{d}^{\ast}}\frac {|(\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d})(\mathbf{v})|}{\Vert \mathbf{v}\Vert}\quad \mbox{for all}\ 0\neq\hat{v}_{d} \in V_{d}. $$

As $\sup_{\hat{v}_{d}\neq0}\frac{\vert \varphi_{d}(\hat{v}_{d})\vert }{\Vert\hat{v}_{d}\Vert_{d}}=\Vert\varphi_{d}\Vert_{d}^{\ast}$, it follows that $\frac{| ( \boldsymbol{\varphi}_{[d]}\otimes\varphi_{d} ) (\mathbf{v})|}{\Vert\mathbf{v}\Vert}\leq\Vert\boldsymbol{\varphi }_{[d]}\Vert_{[d]}^{\ast}\Vert\varphi_{d}\Vert_{d}^{\ast}$ for all v∈V, so that $\Vert\boldsymbol{\varphi}_{[d]}\otimes\varphi_{d}\Vert^{\ast}\leq\Vert\boldsymbol{\varphi}_{[d]}\Vert_{[ d]}^{\ast}\Vert\varphi_{d}\Vert_{d}^{\ast}$. Together with the opposite inequality from above, we have proved the second equation in (3.13).

(c) Any $\mathbf{\psi}_{[d]}\in\mathbf{V}_{[d]}^{\ast}$ satisfies $\mathbf{\psi}_{[d]}\otimes\varphi_{d}=\mathbf{\psi}_{[d]} ( \mathbf{id}_{[d]}\otimes\varphi_{d} ) $. For v _[d]:=(id _[d]⊗φ _d)(v) there is a $\mathbf{\psi}_{[d]}\in\mathbf{V}_{[d]}^{\ast}$ with $\Vert\mathbf{\psi}_{[d]}\Vert_{[ d]}^{\ast}=1$ and $\vert \mathbf{\psi}_{[d]} ( \mathbf{v}_{[d]} ) \vert =\Vert\mathbf{v}_{[d]}\Vert_{[ d]}$ (cf. (2.2)). Hence,

proves the first inequality in (3.14). The second one can be proved analogously. □

The next result is proved, e.g., in DeVore–Lorentz [8, Chap. 9, §7] or Meise–Vogt [21, Proposition 10.6].

Lemma 3.18

Let Y⊂X be a subspace of a Banach space X with dim(Y)≤n. Then there exists a projection $\varPhi\in\mathcal{L}(X,X)$ onto Y such that

$$\Vert \varPhi \Vert _{X\leftarrow X}\leq\sqrt{n}. $$

The bound is sharp for general Banach spaces, but can be improved to ∥Φ∥_X←X≤n ^1/2−1/p for X=L ^p.

Before we state the next theorem we recall the following definition. In Sect. 1, we introduced the set $\mathcal{R}_{r}$ for the tensor space V. Since V≅V _[d]⊗_a V _d we now introduce the notation

$$\mathcal{R}_{r} ( \mathbf{V}_{[d]}\otimes_{a}V_{d} ) := \Biggl\{ \sum_{i=1}^{r} \mathbf{v}_{[d]}^{(i)}\otimes v_{d}^{(i)}: \mathbf{v}_{[d]}^{(i)} \in\mathbf{V}_{[d]} \ \text{and}\ v_{d}^{(i)} \in V_{d} \Biggr\} . $$

Theorem 3.19

Assume that V _∥⋅∥ is a Banach tensor space with a uniform cross norm ∥⋅∥. If v∈V _∥⋅∥∖V is the limit of a sequence {v _n}_n∈ℕ⊂V, where $\mathbf{v}_{n} \in\mathcal{R}_{r} ( \mathbf{V}_{[d]}\otimes_{a}V_{d} ) $ with r≤n, and a convergence rate given by

$$\Vert \mathbf{v}_{n}-\mathbf{v}\Vert \leq o\bigl(n^{-3/2} \bigr), $$

then v∈U _∥⋅∥(v).

Proof

Use the setting V≅V _[d]⊗_a V _d from Lemma 3.17. Since $\mathbf{v}_{n} \in\mathcal{R}_{r} ( \mathbf{V}_{[d]}\otimes_{a}V_{d} ) $, with r≤n, each v _n∈V has a representation in $\mathbf{U}_{[d]}^{\min}(\mathbf{v}_{n})\otimes U_{d}^{\min}(\mathbf{v}_{n})$ with $r:=\dim \mathbf{U}_{[d]}^{\min}(\mathbf{v}_{n})=\dim U_{d}^{\min}(\mathbf{v}_{n})\leq n$. Renaming r as n, we obtain the representation $\mathbf{v}_{n}=\sum_{i=1}^{n}\mathbf{v}_{[d]}^{(i)}\otimes v_{d}^{(i)}$. According to Corollary 2.15b, we can fix any basis $\{v_{d}^{(i)}\}$ of $U_{d}^{\min}(\mathbf{v}_{n})$ and recover $\mathbf{v}_{n}= \sum_{i=1}^{n}(\mathbf{id}_{[d]}\otimes\psi_{d}^{(i)})(\mathbf{v}_{n})\otimes v_{d}^{(i)}$ from the dual functionals $\{\psi_{d}^{(i)}\}$. We choose $v_{d}^{(i)}$ and $\psi_{d}^{(i)}$ according to Lemma 2.2 with $\Vert v_{d}^{(i)}\Vert_{d}=\Vert\psi_{d}^{(i)}\Vert_{d}^{\ast}=1$. Define

$$\mathbf{u}_{n}^{\mathrm{I}}:=\sum_{i=1}^{n} \bigl( \overline{\mathit{id}_{[d]}\otimes \psi_{d}^{(i)}} \bigr) (\mathbf{v})\otimes v_{d}^{(i)}\in \mathbf{U}_{[d]}^{\min }(\mathbf{v})\otimes_{a}V_{d}. $$

The triangle inequality yields

(3.15)

Note that

$$\mathbf{u}_{n}^{\mathrm{I}}\in\mathbf{U}_{[d],n}\otimes V_{d}\quad \text{with }\mathbf{U}_{[d],n}:=\operatorname{span} \bigl\{ \bigl( \overline{\mathbf{id}_{[d]}\otimes \psi_{d}^{(i)}} \bigr) (\mathbf{v}):1\leq i\leq n \bigr\} \subset\mathbf{U}_{[d]}^{\min}(\mathbf{v}), $$

where dimU _[d],n≤n.

On the other hand, according to Lemma 2.2, we can choose a basis $\{\mathbf{v}_{[d]}^{(i)}\}_{i=1}^{n}$ of $\mathbf{U}_{[d]}^{\min }(\mathbf{v}_{n})$ and its corresponding dual basis $\{\boldsymbol{\chi}_{[d]}^{(i)}\}_{i=1}^{n}$. An analogous proof shows that

$$\mathbf{u}_{n}^{\mathrm{II}}:=\sum_{i=1}^{n} \mathbf{v}_{[d]}^{(i)}\otimes \bigl( \overline{\mathit{id}_{d} \otimes\boldsymbol{\chi}_{[d]}^{(i)}} \bigr) (\mathbf{v}) $$

satisfies the properties

$$ \bigl\Vert\mathbf{u}_{n}^{\mathrm{II}}-\mathbf{v}_{n}\bigr\Vert\leq n\Vert\mathbf{v}-\mathbf{v}_{n}\Vert $$

(3.16)

and $\mathbf{u}_{n}^{\mathrm{II}}\in\mathbf{V}_{[d]}\otimes_{a}U_{d,n}$, where $U_{d,n}:=\operatorname{span}\{ (\overline{\mathit{id}_{d}\otimes\boldsymbol{\chi}_{[d]}^{(i)}})(\mathbf{v}):1\leq i\leq n\} $ has dimU _d,n≤n and is a subspace of $U_{d}^{\min}(\mathbf{v})$.

From Lemma 3.18 we choose the projection Φ _d onto the subspace U _d,n and define

$$\mathbf{u}_{n}:= ( \mathbf{id}_{[d]}\otimes \varPhi_{d} ) \mathbf{u}_{n}^{\mathrm{I}}\in U_{[d],n} \otimes_{a}U_{d,n}\subset\mathbf{U}_{[d]}^{\min}( \mathbf{v})\otimes_{a}U_{d}^{\min}(\mathbf{v}) \underset {\text{(2.11)}}{\subset}{}_{a} \bigotimes_{j=1}^{d}U_{j}^{\min}( \mathbf{v}) . $$

The uniform cross norm property (3.12) with A _j=id (1≤j≤d−1) and A _d=Φ _d implies the estimate $\Vert\mathbf{id}_{[d]}\otimes\varPhi_{d}\Vert_{\mathbf{V\leftarrow V}}=\Vert\varPhi_{d} \Vert_{\mathbf{V}_{d}\mathbf{\leftarrow V}_{d}}\leq\sqrt{n}$, where the latter bound is given by Lemma 3.18. Therefore,

Altogether, we get the estimate

The assumption ∥v−v _n∥≤o(n ^−3/2) implies ∥u _n−v∥→0. □

3.4 Minimal Closed Subspaces in a Hilbert Tensor Space

Let 〈⋅,⋅〉_j be a scalar product defined on V _j (1≤j≤d), i.e., V _j is a pre-Hilbert space. Then $\mathbf{V}={}_{a}\bigotimes_{j=1}^{d}V_{j}$ is again a pre-Hilbert space with a scalar product which is defined for elementary tensors $\mathbf{v}=\bigotimes_{j=1}^{d}v^{(j)}$ and $\mathbf{w}=\bigotimes_{j=1}^{d}w^{(j)}$ by

$$ \langle \mathbf{v,w} \rangle = \Biggl\langle \bigotimes _{j=1}^{d}v^{(j)},\bigotimes _{j=1}^{d}w^{(j)} \Biggr\rangle := \prod_{j=1}^{d} \bigl\langle v^{(j)},w^{(j)} \bigr\rangle_{j}\quad\text{for all}\ v^{(j)},w^{(j)}\in V_{j}. $$

(3.17)

This bilinear form has a unique extension 〈⋅,⋅〉:V×V→ℝ. One verifies that 〈⋅,⋅〉 is a scalar product, called the induced scalar product. Let V be equipped with the norm ∥⋅∥ corresponding to the induced scalar product 〈⋅,⋅〉. As usual, the Hilbert tensor space $\mathbf{V}_{\Vert \cdot \Vert }={}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}V_{j} $ is the completion of V with respect to ∥⋅∥. Since the norm ∥⋅∥ is derived via (3.17), it is easy to see that ∥⋅∥ is a reasonable and even uniform cross norm.

We recall that orthogonal projections $P\in\mathcal{L}(V,V)$ (V Hilbert space) are self-adjoint projections. P is an orthogonal projection onto the closed subspace $U:=\operatorname{range}(P)\subset V$, which leads to the direct sum V=U⊕U ^⊥, where $U^{\bot}=\operatorname{range}(\mathit{id}-P)$. Vice versa, each closed subspace U⊂V defines an orthogonal projection P with $U=\operatorname{range}(P)$.

Lemma 3.20

Let V _j be Hilbert spaces with subspaces U _j⊂V _j such that $V_{j} = U_{j} \oplus U_{j}^{\bot}$. The norm ∥⋅∥ of the Hilbert tensor space ${}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}V_{j}$ is defined via the scalar product (3.17). Then

$$\bigcap_{1\leq j\leq d} ( U_{j} \otimes_{\Vert \cdot \Vert }\mathbf{V}_{[j]} ) ={}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}U_{j} ,\quad \text{\textit{where}}\ \mathbf{V}_{[j]}:= {}_{a}\bigotimes_{k\neq j}V_{k} . $$

Proof

We consider the case d=2 only (d≥3 can be obtained by induction). Then the assertion to be proved is

$$( U_{1}\otimes_{\Vert \cdot \Vert }V_{2} ) \cap ( V_{1}\otimes_{\Vert \cdot \Vert }U_{2} ) =U_{1} \otimes_{\Vert \cdot \Vert }U_{2}. $$

The analogous statement for the algebraic tensor spaces holds by Lemma 2.11. The general rule $\overline{X\cap Y}\subset \overline{X}\cap\overline{Y}$ ($\bar{\mathbf{\cdot}}$ is the closure with respect to ∥⋅∥) implies that

The lemma is proved, if the opposite inclusion holds:

$$ ( U_{1}\otimes_{\Vert \cdot \Vert }V_{2} ) \cap ( V_{1}\otimes_{\Vert \cdot \Vert }U_{2} ) \subset U_{1}\otimes_{\Vert \cdot \Vert }U_{2}. $$

(3.18)

Let v∈U ₁⊗_∥⋅∥ V ₂. By definition, there is a sequence v _n∈U ₁⊗_a V ₂ with v _n→v. Let P ₁ be the orthogonal projection onto U ₁. Then (P ₁⊗id ₂)v _n=v _n proves $\mathcal{P}_{1}\mathbf{v}=\mathbf{v}$ for the extension $\mathcal{P}_{1}:=\overline{P_{1}\otimes \mathit{id}_{2}}$. Similarly, $\mathcal{P}_{2}\mathbf{v}=\mathbf{v}$ follows with $\mathcal{P}_{2}:=\overline{\mathit{id}_{1} \otimes P_{2}}$, where P ₂ is the orthogonal projection onto U ₂. Since P ₁⊗id ₂ and id ₁⊗P ₂ commute, the product P ₁⊗P ₂ is also an orthogonal projection. Its range is U ₁⊗_a U ₂, while U ₁⊗_∥⋅∥ U ₂ is the range of its extension $\mathcal{P}:=\overline{P_{1}\otimes P_{2}}=\mathcal{P}_{1}\mathcal{P}_{2}=\mathcal{P}_{2}\mathcal{P}_{1}$. Hence, $\mathcal{P}_{1}\mathbf{v}= \mathcal{P}_{2}\mathbf{v}=\mathbf{v}$ implies $\mathcal{P}\mathbf{v}=\mathbf{v,}$ i.e., v∈U ₁⊗_∥⋅∥ U ₂. This ends the proof of (3.18). □

Lemma 3.21

Let V _i (i=1,2) be Hilbert spaces, and U ₁⊂V ₁ a closed subspace. Then the direct sum $V_{1}=U_{1}\oplus U_{1}^{\bot}$ implies

$$V_{1}\otimes_{\Vert \cdot \Vert }V_{2}= ( U_{1} \otimes_{\Vert \cdot \Vert }V_{2} ) \oplus \bigl( U_{1}^{\bot}\otimes_{\Vert \cdot \Vert }V_{2} \bigr) . $$

Proof

Consider the ranges of $\mathcal{P}_{1}=\overline{P_{1}\otimes \mathit{id}_{2}}$ and $\mathit{id}-\mathcal{P}_{1}$, where P ₁ is the orthogonal projection onto U ₁. □

Unlike Theorem 3.19 for the Banach tensor space setting, we need no assumption on the speed of the convergence v _n→v to obtain the result v∈U _∥⋅∥(v).

Theorem 3.22

Assume that V _j are Hilbert spaces and that V is equipped with the norm ∥⋅∥ corresponding to the induced scalar product. Then for all v∈V _∥⋅∥ it follows that v∈U _∥⋅∥(v).

Proof

(1) In order to simplify the notation, we set $U_{j}:=\overline{U_{j}^{\min }(\mathbf{v})}^{\Vert \cdot \Vert _{j}}$ for 1≤j≤d. For all 1≤j≤d we may write V _∥⋅∥ as V _j⊗_∥⋅∥ V _[j]. If we succeed in proving v∈U _j⊗_∥⋅∥ V _[j], Lemma 3.20 implies $\mathbf{v}\in{}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}U_{j} =\mathbf{U}_{\Vert \cdot \Vert }(\mathbf{v})$.

(2) According to $\mathbf{V}_{\Vert \cdot \Vert }= ( U_{j}\otimes_{\Vert \cdot \Vert }\mathbf{V}_{[j]} ) \oplus(U_{j}^{\bot}\otimes_{\Vert \cdot \Vert }\mathbf{V}_{[j]})$ from Lemma 3.21, we split v into

$$\mathbf{v}=\mathbf{v}_{||}+\mathbf{v}_{\bot}\quad \mbox{with}\ \mathbf{v} _{||}\in U_{j}\otimes_{\Vert \cdot \Vert } \mathbf{V}_{[j]}\text{ and }\mathbf{v}_{\bot}\in U_{j}^{\bot}\otimes_{\Vert \cdot \Vert }\mathbf{V}_{[j]}. $$

For an indirect proof we assume v _⊥≠0. Then there are $v_{j}\in U_{j}^{\bot}$ and v _[j]∈V _[j] with 〈v _j⊗v _[j],v _⊥〉=〈v _j⊗v _[j],v〉≠0 (otherwise there are no algebraic tensors converging to v _⊥). For $\boldsymbol{\varphi}_{[j]}:= \langle \mathbf{v}_{[j]},\cdot \rangle_{[j]}\in\mathbf{V}_{[j]}^{\ast}$ one verifies

$$\langle v_{j}\otimes\mathbf{v}_{[j]},\mathbf{v} \rangle = \bigl\langle v_{j},(\mathit{id}_{j}\otimes\boldsymbol{ \varphi}_{[j]}) (\mathbf{v}) \bigr\rangle . $$

The definition of $U_{j}^{\min}(\mathbf{v})$ yields (id _j⊗φ _[j])(v)∈U _j. Since $v_{j}\in U_{j}^{\bot}$, we obtain the contradiction 〈v _j⊗v _[j],v _⊥〉=〈v _j⊗v _[j],v〉=0. Hence v _⊥=0 proves the statement v∈U _j⊗_∥⋅∥ V _[j] needed in part (1). □

So far, we have assumed that the norm ∥⋅∥ of the Hilbert space V corresponds to the induced scalar product. In principle, we may also define another scalar product 〈⋅,⋅〉_V on V together with another norm $\Vert \cdot \Vert _{\mathbf{V}}\mathbf{.}$ In this case, we have to assume that ∥⋅∥_V is a uniform cross norm (at least, $\Vert (\bigotimes_{j=1}^{d}A_{j})(\mathbf{v})\Vert \leq C(\prod_{j=1}^{d}\Vert A_{j}\Vert_{V_{j}\leftarrow V_{j}})\Vert\mathbf{v}\Vert$ must hold for some constant C). This ensures that the projections $\mathcal{P}_{j}$ (as defined in the proof of Lemma 3.20) belong to $\mathcal{L}(\mathbf{V},\mathbf{V})$. Furthermore, (3.6) holds. Scalar products like 〈v _j⊗v _[j],v〉 in the proof above are to be replaced with (φ _j⊗φ _[j])(v), where, as usual, $\varphi_{j}\in V_{j}^{\ast}$ is defined via φ _j(⋅)=〈v _j,⋅〉_j. Then we can state again that v∈U _∥⋅∥(v).

4 On the Best $\mathcal{T}_{\mathbf{r}}$ Approximation in a Banach Tensor Space

4.1 Main Statement

Theorem 4.1

Let V _∥⋅∥ be a reflexive Banach tensor space with a norm satisfying (3.6). Then for each v∈V _∥⋅∥ there exists $\mathbf{w}\in\mathcal{T}_{\mathbf{r}}$ such that

$$ \Vert\mathbf{v}-\mathbf{w}\Vert=\min_{\mathbf{u}\in\mathcal{T}_{\mathbf{r}}}\Vert\mathbf{v}-\mathbf{u} \Vert. $$

(4.1)

Proof

Combine Theorem 4.2 and Proposition 4.3 given below. □

A subset M⊂X is called weakly closed, if x _n∈M and x _n⇀x implies x∈M. Note that ‘weakly closed’ is stronger than ‘closed’, i.e., M weakly closed ⇒ M closed.

Theorem 4.2

[4] Let (X,∥⋅∥) be a reflexive Banach space with a weakly closed subset ∅≠M⊂X. Then the following minimisation problem has a solution: For any x∈X there exists v∈M with

$$\Vert x-v\Vert =\min\bigl\{\Vert x-w\Vert :w\in M\bigr\}. $$

Proposition 4.3

Let V _∥⋅∥ be a Banach tensor space with a norm satisfying (3.6). Then the set $\mathcal{T}_{\mathbf{r}}$ is weakly closed.

Proof

Let $\{\mathbf{v}_{n}\}\subset\mathcal{T}_{\mathbf{r}}$ be such that v _n⇀v. Then there are subspaces U _j,n⊂V _j such that v _n∈U _j,n with dimU _j,n=r _j. Since $U_{j}^{\min}(\mathbf{v}_{n})\subset U_{j,n}$, $\dim U_{j}^{\min}(\mathbf{v}_{n})\leq r_{j}$ holds for all n∈ℕ. Consequently, by Theorem 3.15, $\dim U_{j}^{\min}(\mathbf{v})\leq r_{j}$. Thus, $U_{j}^{\min}(\mathbf{v})$ is finite dimensional. From Theorem 3.16 we conclude that $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d}U_{j}^{\min}(\mathbf{v}) $ and, thereby, $\mathbf{v}\in\mathcal{T}_{\mathbf{r}}$. □

Corollary 4.4

A statement analogous to Theorem 4.1 also holds for the set $\mathcal{H}_{\mathfrak{r}}$ appearing for the hierarchical format from [15] and the format from [24]. The proof uses the fact that $\mathcal{H}_{\mathfrak{r}}$ is weakly closed.

Since the assumption of reflexivity excludes important spaces, we add some remarks on this subject. The existence of a minimiser or ‘nearest point’ w in a certain set A⊂X to some v∈V∖A is a well-studied subject. A set A is called ‘proximinal’ if ∥v−w∥=min_u∈A∥v−u∥ has at least one solution w∈A. Without the assumption of reflexivity, there are statements ensuring under certain conditions that the set of points in V∖A possessing nearest points in A is dense (e.g., Edelstein [10]). A smaller class than the weakly closed subsets A are the closed and convex subsets. However, even for closed and convex subsets one cannot avoid reflexivity, in general, because of the following result (cf. [4, Proposition 4]). Note that the sets $\mathcal{T}_{\mathbf{r}}$ and $\mathcal{H}_{\mathfrak{r}}$ are not convex, but weakly closed, as we will show with the help of minimal subspaces.

Theorem 4.5

All closed and convex subsets are proximinal, if and only if the underlying Banach space is reflexive.

4.2 Generalisation to the Intersection of Finitely Many Banach Tensor Spaces

We recall that the assumption (3.6) implies ${}_{a}\bigotimes_{j=1}^{d}V_{j}^{\ast}\subset({}_{a} \bigotimes_{j=1}^{d}V_{j})^{\ast}$ (cf. Proposition 3.7b). For certain Banach tensor spaces this property does not hold. Therefore, we have to check whether some of the results given in the previous section can be extended to this case. Thus, in this section we introduce the intersection tensor spaces. We also study sequences of minimal subspaces in this framework in order to prove the existence of a best $\mathcal{T}_{\mathbf{r}}$ approximation. To illustrate this situation we give the following example.

Recall that $\Vert f\Vert _{C^{1}(I)}=\max_{x\in I} \{ \vert f(x)\vert ,\vert f^{\prime}(x)\vert \} $ is the norm of continuously differentiable functions in one variable x∈I⊂ℝ. The naming ∥⋅∥_1,mix of the following norm is derived from the mixed derivative involved.

Example 4.6

Let I and J be compact intervals in ℝ and consider $V=(C^{1}(I), \Vert \cdot \Vert _{C^{1}(I)})$ and $W=(C^{1}(J),\Vert \cdot \Vert _{C^{1}(J)})$. For the tensor space V⊗_a W we introduce the norm

$$ \begin{aligned}[b] \Vert \varphi \Vert _{C_{\mathrm{mix}}^{1}(I\times J)}&:=\Vert \varphi \Vert _{1,\mathrm{mix}} \\ &:= \max_{ ( x,y ) \in I\times J} \biggl\{ \bigl \vert \varphi(x,y)\bigr \vert ,\biggl \vert \frac{\partial }{\partial x}\varphi(x,y)\biggr \vert ,\biggl \vert \frac{\partial}{\partial y} \varphi(x,y)\biggr \vert ,\biggl \vert \frac{\partial^{2}}{\partial x\partial y}\varphi(x,y)\biggr \vert \biggr\} . \end{aligned} $$

(4.2)

It can be shown that ∥⋅∥_1,mix is a reasonable cross norm. However, the standard norm of C ¹(I×J) given by

$$ \Vert \varphi \Vert _{C^{1}(I\times J)}:=\max_{ ( x,y ) \in I\times J} \biggl\{ \bigl \vert \varphi(x,y)\bigr \vert ,\biggl \vert \frac{\partial}{\partial x}\varphi(x,y) \biggr \vert ,\biggl \vert \frac{\partial }{\partial y}\varphi(x,y)\biggr \vert \biggr\} $$

(4.3)

is not a reasonable cross norm.

We have seen that the space C ¹(I×J) is not the straightforward result of the tensor product C ¹(I)⊗C ¹(J). The norm ∥⋅∥_1,mix from (4.2) turns out to be a reasonable cross norm, but then the resulting space $C_{\mathrm{mix}}^{1}(I\times J)$ is a smaller space than C ¹(I×J). Vice versa, the dual norm $\Vert \cdot \Vert _{C^{1}(I\times J)}^{\ast}$ of C ¹(I×J) is not bounded for v ^∗⊗w ^∗∈V ^∗⊗W ^∗. Therefore, it is not a reasonable cross norm.

The family of Sobolev spaces H ^m,p(I _j) for m=0,1,…,N is an example of a scale of Banach spaces which we introduce below. From now on, we fix integers N _j and denote the j-th scale by

$$ V_{j}=V_{j}^{(0)}\supset V_{j}^{(1)} \supset\cdots\supset V_{j}^{(N_{j})}\quad \text{with dense embedding,} $$

(4.4)

which means that $V_{j}^{(n)}$ is a dense subspace of $(V_{j}^{(n-1)},\Vert \cdot \Vert _{j,n-1})$ for n=1,…,N _j. This fact implies that the corresponding norms satisfy

$$ \Vert \cdot \Vert _{j,n}\gtrsim \Vert \cdot \Vert _{j,m} \quad \text{for}\ N_{j}\geq n\geq m\geq0\text{ on }V_{j}^{(n)}. $$

(4.5)

It is an easy exercise to see that all $V_{j}^{(n)}$ (1≤n≤N _j) are dense in $(V_{j}^{(0)},\Vert \cdot \Vert _{j,0})$.

Definition 4.7

Under the given assumptions for (4.4) we say that a subset $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ is an admissible index set, if it satisfies

(4.6a)

(4.6b)

(4.6c)

For each n in an admissible index set $\mathcal{N}$, we define the tensor space

$$ \mathbf{V}^{(\mathbf{n})}:={}_{a}\bigotimes _{j=1}^{d}V_{j}^{(n_{j})} . $$

(4.7)

All spaces V ⁽ⁿ⁾ are subspaces of $\mathbf{V}^{(\mathbf{0})}={}_{a}\bigotimes_{j=1}^{d}V_{j}$ (recall that $V_{j}=V_{j}^{(0)}$). Assume that the following conditions hold:

(a)
For each admissible $\mathbf{n}\in\mathbb{N}_{0}^{d}$, a norm ∥⋅∥_n on V ⁽ⁿ⁾ exists satisfying ∥⋅∥_n≤∥⋅∥_m for $\mathbf{n}\leq\mathbf{m}\in\mathcal{N}$, and
(b)
the norm ∥⋅∥₀ on $\mathbf{V}^{(\mathbf{0})}={}_{a}\bigotimes_{j=1}^{d}V_{j}$ satisfies (3.6).

Now, we introduce the Banach tensor space

$$ \mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}:={}_{\Vert \cdot \Vert _{\mathbf{n}}}\bigotimes _{j=1}^{d}V_{j}^{(n_{j})} . $$

(4.8)

Note that for each $\mathbf{n}\in\mathcal{N}$, if $\mathbf{n}\leq\mathbf{m}\in\mathcal{N}$, then $\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{m})}\subset\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}$. From Lemma 2.16 one derives the following result.

Lemma 4.8

Let $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ be an admissible index set. Then the following statements hold:

(a)
$\mathbf{V}^{(\mathbf{n})}=\bigcap_{j=1}^{d}\mathbf{V}^{(\mathbf{n}_{j})}$ for all $\mathbf{n}\in\mathcal{N}$, where $\mathbf{n}_{j}:=(\underset{j-1}{\underbrace{0,\ldots,0}},n_{j},\underset{d-j}{\underbrace{0,\ldots,0}})\in\mathbb{N}_{0}^{d}$, for 1≤j≤d.
(b)
$\bigcap_{\mathbf{n}\in\mathcal{N}}\mathbf{V}^{(\mathbf{n})}=\mathbf{V}^{(N_{1},\ldots,N_{d})}={}_{a}\bigotimes_{j=1}^{d}V_{j}^{(N_{j})}$.

Proof

From Lemma 2.16 we have

$$\bigcap_{j=1}^{d}\mathbf{V}^{(\mathbf{n}_{j})}= {}_{a}\bigotimes_{j=1}^{d} \bigcap_{i=1}^{d}V_{j}^{(n_{j}\delta_{i,j})} ={}_{a}\bigotimes_{j=1}^{d}V_{j}^{(n_{j})} , $$

and statement (a) follows. Also, by Lemma 2.16,

$$\bigcap_{\mathbf{n}\in\mathcal{N}}\mathbf{V}^{(\mathbf{n})}= {}_{a}\bigotimes_{j=1}^{d} \bigcap_{\mathbf{n}\in\mathcal{N}}V_{j}^{(n_{j})} $$

and, by (4.4), statement (b) is proved. □

Let $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ be an admissible index set. From Lemma 4.8b it follows that the intersection of the set of tensor spaces $\{\mathbf{V}^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ is the tensor space $\mathbf{V}^{(N_{1},\ldots,N_{d})}\subset\mathbf{V}_{\Vert \cdot \Vert }^{(N_{1},\ldots,N_{d})}$. Observe that the index (N ₁,…,N _d) does not necessarily belong to the index set $\mathcal{N}$. Also, by Lemma 4.8a, we obtain the following minimal representation:

$$ \mathbf{V}^{(N_{1},\ldots,N_{d})}=\bigcap_{j=1}^{d} \mathbf{V}^{(\mathbf{N}_{j})}. $$

(4.9)

Next, we introduce the Banach space induced by intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$.

Definition 4.9

Let $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ be an admissible index set. The Banach space $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ induced by the intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ is defined by

$$ \mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}:=\bigcap_{\mathbf{n}\in\mathcal{N}} \mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}\quad \text{with the intersection norm }\Vert \mathbf{v}\Vert _{\mathcal{N}}:=\max_{\mathbf{n}\in\mathcal{N}}\Vert \mathbf{v}\Vert _{\mathbf{n}} $$

(4.10)

or an equivalent one.

Next, we consider elementary tensors from the tensor space V ⁽⁰⁾.

Proposition 4.10

Let $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ be an admissible index set. Then

$$\mathbf{V}^{(\mathbf{0})}\cap\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}=\mathbf{V}^{(N_{1},\ldots,N_{d})}$$

holds. In particular, each $\mathbf{v}\in\mathbf{V}^{(\mathbf{0})}\cap\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ has a representation $\mathbf{v}=\sum_{i=1}^{r}\bigotimes_{j=1}^{d}v_{j}^{(i)}$ with $v_{j}^{(i)}\in V_{j}^{(N_{j})}$ and a minimal number r of terms.

Proof

By definition (4.10), $\mathbf{V}^{(\mathbf{0})}\cap\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}=\bigcap_{\mathbf{n}\in\mathcal{N}} ( \mathbf{V}^{(\mathbf{0})}\cap \mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})} ) $ holds. Since $\mathbf{v}\in\mathbf{V}^{(\mathbf{0})}\cap\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}$ is an elementary tensor, it belongs to V ⁽⁰⁾∩V ⁽ⁿ⁾=V ⁽ⁿ⁾. From Lemma 2.16 it follows that $\mathbf{v}\in\bigcap_{\mathbf{n}\in\mathcal{N}}\mathbf{V}^{(\mathbf{n})}={}_{a}\bigotimes_{j=1}^{d} ( \bigcap_{\mathbf{n}\in\mathcal{N}}V_{j}^{(n_{j})} )$. By condition (4.6c), one of the n _j equals N _j, which implies $\mathbf{v}\in{}_{a}\bigotimes_{j=1}^{d} ( \bigcap_{\mathbf{n}\in\mathcal{N}}V_{j}^{(n_{j})} ) ={}_{a}\bigotimes_{j=1}^{d}V_{j}^{(N_{j})}$. □

Corollary 4.11

The set $\mathbf{V}^{(N_{1},\ldots,N_{d})}$ is dense in $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ with respect to the $\Vert \cdot \Vert _{\mathcal{N}}$-topology.

Proof

The inclusions $\mathbf{V}^{(N_{1},\ldots,N_{d})}\subset\mathbf{V}^{(\mathbf{n})}\subset\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}$ are dense for all $\mathbf{n}\in\mathcal{N}$ (cf. (4.4)). Definition (4.10) yields the assertion. □

Example 4.12

Fix N>0 and consider the Sobolev spaces $V_{j}^{n}=H^{n,p}(I_{j})$ for 0≤n≤N and 1≤j≤d. The standard choice of $\mathcal{N}$ is given by

$$ \mathcal{N}:= \bigl\{ \mathbf{n}\in\mathbb{N}_{0}^{d}\text{ with }\vert \mathbf{n}\vert \leq N \bigr\} \quad(\text{here }N_{j}=N\text{\ for all }1\leq j\leq d). $$

(4.11)

In this situation we have $\mathbf{V}^{\mathbf{n}}=H^{n_{1},p}(I_{1})\otimes_{a}\cdots\otimes_{a}H^{n_{d},p}(I_{d})$ for each $\mathbf{n}\in\mathcal{N}$, and

$$\mathbf{V}^{(N,\ldots,N)}={}_{a}\bigotimes _{j=1}^{d}H^{N,p}(I_{j}) . $$

The choice of the norm in V ⁿ is

$$ \Vert f\Vert _{\mathbf{n}}:= \biggl(\sum_{\mathbf{0}\leq\mathbf{k}\leq\mathbf{n}} \int_{\mathbf{I}} \bigl \vert \partial^{\mathbf{k}}f \bigr \vert ^{p}\,\mathrm{d}x \biggr)^{1/p}, $$

(4.12)

while in $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ we take

$$\Vert f\Vert _{\mathcal{N}}:= \biggl( \sum_{\mathbf{n}\in\mathcal{N}} \Vert f\Vert _{\mathbf{n}}^{p} \biggr)^{1/p}, $$

which is equivalent to the usual norm ∥⋅∥_N,p. Then, by Corollary 4.11,

$$ {}_{\Vert \cdot \Vert _{N,p}}\bigotimes_{j=1}^{d}H^{N,p}(I_{j}) =\bigcap_{\mathbf{n}\in\mathcal{N}} \overline{H^{n_{1},p}(I_{1})\otimes_{a} \cdots\otimes_{a}H^{n_{d},p}(I_{d})}^{\Vert \cdot \Vert _{\mathbf{n}}}. $$

(4.13)

Observe that ${}_{\Vert \cdot \Vert _{N,p}}\bigotimes_{j=1}^{d}H^{N,p}(I_{j})$ is a Banach subspace of the Banach space H ^N,p(I). Moreover, for each $\mathbf{N}_{j}=(0,\ldots ,0,N,0,\ldots,0)\in\mathcal{N}$, we have

$$\Vert f\Vert _{\mathbf{N}_{j}}= \Biggl(\sum_{k=0}^{N} \int_{\mathbf{I}}\bigl \vert \partial_{x_{j}}^{k}f \bigr \vert ^{p}\,\mathrm{d}x \Biggr)^{1/p}, $$

which is clearly a cross norm in

$$\mathbf{V}^{(\mathbf{N}_{j})}=L^{p}(I_{1}) \otimes_{a}\cdots\otimes_{a}L^{p}(I_{j-1})\otimes_{a}H^{N,p}(I_{j}) \otimes_{a}L^{p}(I_{j+1})\otimes \cdots \otimes_{a}L^{p}(I_{d}), $$

for 1≤j≤d. In particular for p=2 we obtain

$$ H^{N}(\mathbf{I})={}_{\Vert \cdot \Vert _{N,2}}\bigotimes _{j=1}^{d}H^{N}(I_{j}) =\bigcap_{\mathbf{n}\in\mathcal{N}}\overline{H^{n_{1}}(I_{1}) \otimes_{a}\cdots\otimes_{a}H^{n_{d}}(I_{d})}^{\Vert \cdot \Vert _{\mathbf{n}}}, $$

(4.14)

and in this case the norm $\Vert \cdot \Vert _{\mathbf{N}_{j}}$ in

$$\mathbf{V}^{(\mathbf{N}_{j})}=L^{2}(I_{1}) \otimes_{a}\cdots\otimes_{a}L^{2}(I_{j-1})\otimes_{a}H^{N}(I_{j}) \otimes_{a}L^{2}(I_{j+1})\otimes \cdots \otimes_{a}L^{2}(I_{d}) $$

is generated by the induced scalar product (3.17) for 1≤j≤d. Consequently, it is a reasonable cross norm.

Note that Proposition 4.10 states that all functions from the algebraic tensor space ${}_{a}\bigotimes_{j=1}^{d}C^{0}(I_{j})\cap C^{1}(\mathbf{I})$ are already in $\mathbf{V}_{\mathrm{mix}}=C_{\mathrm{mix}}^{1}(\mathbf{I})$ (see Eq. (4.2)), which is a proper subspace of C ¹(I).

Example 4.13

Fix N>0 and consider $V_{j}^{n}=H^{n,p}(I_{j})$ for 0≤n≤N and 1≤j≤d. Now, we consider the set

$$ \mathcal{N}:= \bigl\{ \mathbf{n}\in\mathbb{N}_{0}^{d}\text{ with }\vert \mathbf{n}\vert \leq N \bigr\} \cup\bigl\{(N,N,\ldots,N)\bigr\}. $$

(4.15)

In this situation

$$\mathbf{V}^{(N,\ldots,N)}=H^{N,p}(I_{1}) \otimes_{a}\cdots\otimes_{a}H^{N,p}(I_{d}). $$

The norm in V ⁿ is also given by (4.12). In particular, the norm

$$\Vert f\Vert _{\mathbf{mix}}:=\Vert f\Vert _{(N,\ldots,N)}= \biggl(\sum _{\mathbf{k}\leq(N,\ldots,N)}\int_{\mathbf{I}}\bigl \vert \partial^{\mathbf{k}}f\bigr \vert ^{p}\,\mathrm{d}x \biggr)^{1/p}$$

in V ^(N,…,N) is a cross norm. Since in $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ we take

$$\Vert f\Vert _{\mathcal{N}}:=\biggl(\sum_{\mathbf{n}\in\mathcal{N}} \Vert f\Vert _{\mathbf{n}}^{p}\biggr)^{1/p}, $$

which is equivalent to the ∥⋅∥_mix-norm, by Corollary 4.11, we obtain

Thus,

$${}_{\Vert \cdot \Vert _{\mathbf{mix}}}\bigotimes_{j=1}^{d}H^{N,p}(I_{j}) \varsubsetneqq{}_{\Vert \cdot \Vert _{N,p}}\bigotimes _{j=1}^{d}H^{N,p}(I_{j}) . $$

In particular, for p=2, we have

$$ {}_{\Vert \cdot \Vert _{\mathbf{mix}}}\bigotimes_{j=1}^{d}H^{N}(I_{j}) \varsubsetneqq{}_{\Vert \cdot \Vert _{N,2}}\bigotimes _{j=1}^{d}H^{N}(I_{j}) =H^{N}(\mathbf{I}). $$

(4.16)

Moreover, it is easy to see that the ∥⋅∥_mix-norm is generated by the induced scalar product (3.17) of H ^N(I _j) for 1≤j≤d and satisfies condition (3.6). This fact implies that Theorem 4.1 holds for the Hilbert tensor space ${}_{\Vert \cdot \Vert _{\mathbf{mix}}}\bigotimes_{j=1}^{d}H^{N}(I_{j})$.

Thus, a natural question arising in this example is whether Theorem 4.1 holds for the Hilbert tensor space H ^N(I) characterised by (4.14).

From Proposition 4.10, there are different equivalent versions of how to define the minimal subspace $U_{j}^{\min}(\mathbf{v})=\operatorname*{span}\{v_{j}^{(i)}:1\leq i\leq r\}$ for $\mathbf{v}=\sum_{i=1}^{r}\bigotimes_{j=1}^{d}v_{j}^{(i)}$. Here, we can state the following.

Corollary 4.14

Let $\mathcal{N}\subset\mathbb{N}_{0}^{d}$ be an admissible index set. For each $\mathbf{v}\in\mathbf{V}^{(N_{1},\ldots,N_{d})}$,

$$U_{j}^{\min}(\mathbf{v})=\biggl\{(\mathit{id}_{j}\otimes\varphi_{[ j]}) (\mathbf{v}):\varphi_{[ j]}\in \biggl({}_{a}\bigotimes _{k\neq j}V_{k}^{(N_{k})} \biggr)^{\prime}\biggr\}\subset V_{j}^{(N_{j})}$$

holds for 1≤j≤d.

Corollary 4.14 cannot be extended as Corollary 3.9 was for the Banach space case. A simple counterexample is f∈C ¹(I×J) with f(x,y)=F(x+y) and F∉C ². Choose φ∈C ¹(J)^∗ as $\varphi=\delta_{\eta}^{\prime }$. Then φ(f)(x)=−F′(x+η)∈C ⁰(I), but φ(f) is not in C ¹(I) in contrast to Corollary 4.14. While, in Corollary 4.14, we could take functionals from $( {}_{a}\bigotimes_{k\neq j}V_{k}^{(n_{k})}) ^{\prime}$ for any n bounded by n _k≤N _k, we now have to restrict the functionals to n=0. Because of the notation $V_{k}^{(0)}=V_{k}$, the definition coincides with the usual one:

$$\overline{U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j,0}}:= \overline{ \biggl\{ ( \overline{\mathit{id}_{j}\otimes\varphi_{[ j]}} ) (\mathbf{v}):\varphi_{[ j]}\in \biggl({}_{a} \bigotimes _{k\neq j}V_{k} \biggr)^{\ast} \biggr \}}^{\Vert \cdot \Vert _{j,0}}, $$

where the completion is performed with respect to the norm ∥⋅∥_j,0 of $V_{j}^{(0)}$.

In the following we show that the same results can be derived as in the standard case. Condition (3.6) used before must be adapted to the situation of the intersection space. Consider the tuples $\mathbf{N}_{j}=(0,\ldots,0,N_{j},0,\ldots,0)\in\mathcal{N}$ from (4.6c) and the corresponding tensor space

$$\mathbf{V}^{(\mathbf{N}_{j})}=V_{1}\otimes_{a}\cdots \otimes_{a}V_{j-1}\otimes_{a}V_{j}^{(N_{j})} \otimes_{a}V_{j+1}\otimes\cdots\otimes_{a}V_{d}$$

endowed with the norm $\Vert \cdot \Vert _{\mathbf{\mathbf{N}}_{j}}$. From now on, we denote by $\Vert \cdot \Vert _{\vee (\mathbf{N}_{j})}$ the injective norm defined from the Banach spaces $V_{1},\ldots,V_{j-1},V_{j}^{(N_{j})},V_{j+1},\ldots,V_{d}$.

Theorem 4.15

Assume that $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ is a reflexive Banach space induced by the intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert _{\mathbf{n}}}^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ and

$$ \Vert \cdot \Vert _{\vee(\mathbf{N}_{j})}\lesssim \Vert \cdot \Vert _{\mathbf{\mathbf{N}}_{j}} \quad\text{for all }1\leq j\leq d. $$

(4.17)

Then, for each $\mathbf{v}\in\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$, there exists $\mathbf{w}\in\mathcal{T}_{\mathbf{r}}$ such that

$$\Vert\mathbf{v}-\mathbf{w}\Vert=\min_{\mathbf{u}\in\mathcal{T}_{\mathbf{r}}}\Vert\mathbf{v}-\mathbf{u} \Vert. $$

Lemma 4.16

Assume that $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ is a Banach space induced by the intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert _{\mathbf{n}}}^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ satisfying assumption (4.17). Let $\boldsymbol{\varphi}_{[j]}\in{}_{a}\bigotimes_{k\neq j}V_{k}^{\ast}$ and $\mathbf{v}_{n},\mathbf{v}\in\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ with v _n⇀v. Then weak convergence $(\overline{\mathit{id}_{j}\otimes \boldsymbol{\varphi}_{[j]}})(\mathbf{v}_{n})\rightharpoonup(\overline {\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v})$ holds in $V_{j}^{(N_{j})}$.

Proof

Repeat the proof of Lemma 3.13 and note that $\varphi_{j}\in(V_{j}^{(N_{j})})^{\ast}$ composed with an elementary tensor φ _[j]=⨂_k≠j φ _k ($\varphi_{k}\in V_{k}^{\ast}$) yields $\boldsymbol{\varphi}=\bigotimes_{k=1}^{d}\varphi_{k}\in{}_{a}\bigotimes_{k=1}^{d}(V_{k}^{(n_{k})})^{\ast}$ with n _k=0 for k≠j and n _j=N _j. By (4.17) and Proposition 3.7b, φ belongs to $( \mathbf{V}^{(\mathbf{N}_{j})} )^{\ast}$. □

Corollary 4.17

Under the assumptions of Lemma 4.16, $\overline {U_{j}^{\min}(\mathbf{v})}^{\Vert \cdot \Vert _{j,0}}\subset V_{j}^{(N_{j})}$ holds for all $\mathbf{v}\in\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ and 1≤j≤d.

Proof

Let $\mathbf{v}_{m}\in\mathbf{V}^{(N_{1},\ldots,N_{d})}$ be a sequence with $\mathbf{v}_{m}\rightarrow\mathbf{v}\in\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ By definition (4.10) of the intersection norm, $\Vert \mathbf{v}_{m}-\mathbf{v}\Vert _{\mathbf{\mathbf{N}}_{j}}\rightarrow0$ holds for all j. Then (3.7) shows that $\Vert(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v}-\mathbf{v}_{m})\Vert_{j,N_{j}}\rightarrow0$. Since $(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v}_{m})\in V_{j}^{(N_{j})}$ by Proposition 4.10, the limit of $(\overline{\mathit{id}_{j}\otimes\boldsymbol{\varphi}_{[j]}})(\mathbf{v})$ also belongs to $V_{j}^{(N_{j})}$. □

Lemma 4.18

Assume that $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ is a Banach space induced by the intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert _{\mathbf{n}}}^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ satisfying assumption (4.17). For $\mathbf{v}_{m}\in\mathbf{V}^{(N_{1},\ldots,N_{d})}$ assume $\mathbf{v}_{m}\rightharpoonup\mathbf{v}\in\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$. Then

$$\dim U_{j}^{\min}(\mathbf{v})=\dim\overline{U_{j}^{\min}( \mathbf{v})}^{\Vert \cdot \Vert _{j,0}}\leq\liminf_{m\rightarrow\infty}\dim U_{j}^{\min}(\mathbf{v}_{m})\quad\text{\textit{for all} }1 \leq j\leq d. $$

Proof

We can repeat the proof from Theorem 3.15. □

Finally, in a similar way as in Proposition 4.3, we can also obtain the following statement.

Proposition 4.19

Assume that $\mathbf{V}_{\Vert \cdot \Vert _{\mathcal{N}}}$ is a Banach space induced by the intersection of the set of Banach tensor spaces $\{\mathbf{V}_{\Vert \cdot \Vert }^{(\mathbf{n})}:\mathbf{n}\in\mathcal{N}\}$ satisfying the assumption (4.17). Then the set $\mathcal{T}_{\mathbf{r}}$ is weakly closed.

Proof of Theorem 4.15

The proof is a consequence of Theorem 4.2 and Proposition 4.19. □

Example 4.20

Now, we return to H ^N(I) characterised as an intersection of Hilbert tensor spaces by (4.14). Recall that $\Vert \cdot \Vert _{\mathbf{N}_{j}}$ is a reasonable cross norm in

$$\mathbf{V}^{(\mathbf{N}_{j})}=L^{2}(I_{1}) \otimes_{a}\cdots\otimes_{a}L^{2}(I_{j-1})\otimes_{a}H^{N,2}(I_{j}) \otimes_{a}L^{2}(I_{j+1})\otimes \cdots \otimes_{a}L^{2}(I_{d}) $$

for 1≤j≤d. Then condition (4.17) holds, and we obtain the existence of a best $\mathcal{T}_{\mathbf{r}}$ approximation in this space.

For Hilbert spaces, Uschmajew [27] has proved the existence of minimisers of (1.3) using particular properties of Hilbert spaces.

4.3 Some Consequences of the Best $\mathcal{T}_{\mathbf{r}}$ Approximation in a Hilbert Tensor Space

In this section we assume that the Hilbert space $\mathbf{V}_{\Vert \cdot \Vert }:={}_{\Vert \cdot \Vert }\bigotimes_{j=1}^{d}V_{j}$, which is the completion of $\mathbf{V}:={}_{a}\bigotimes_{j=1}^{d}V_{j}$ with respect to ∥⋅∥, has the property that the set $\mathcal{T}_{\mathbf{r}}$ is weakly closed. Then for each $\mathbf{r}\in\mathbb{N}_{0}^{d}$ with r≥1, we can define a map from V _∥⋅∥ to [0,∞), using

$$ \Vert\mathbf{u}\Vert_{\mathbf{r}}:=\max\bigl\{\bigl|\langle\mathbf{v}, \mathbf{u}\rangle\bigr|:\mathbf{v}\in\mathcal{T}_{\mathbf{r}}, \Vert \mathbf{v}\Vert=1\bigr\}. $$

(4.18)

Observe that if ∥⋅∥ is induced by the scalar products of V _j, then ∥⋅∥₁=∥⋅∥_∨. For general r≥1 we obtain the following result.

Theorem 4.21

Assume that in the Hilbert space V _∥⋅∥ the set $\mathcal{T}_{\mathbf{r}}$ is weakly closed. Then for each $\mathbf{r}\in\mathbb{N}_{0}^{d}$ with r≥1, the following statements hold.

(a)
For each u∈V _∥⋅∥ the equality
$$\Vert\mathbf{u}\Vert_{\mathbf{r}}^{2}=\Vert\mathbf{u} \Vert^{2}-\min_{\mathbf{v}\in\mathcal{T}_{\mathbf{r}}}\Vert\mathbf{u}-\mathbf{v} \Vert^{2}$$
holds.
(b)
If $\mathbf{s}\in\mathbb{N}_{0}^{d}$ satisfies r≤s, then ∥u∥_r≤∥u∥_s for all u∈V _∥⋅∥.
(c)
∥⋅∥_r is a norm on V _∥⋅∥.

Proof

To prove (a) let $\mathcal{D}= \{ \mathbf{w}:\mathbf{w}\in\mathcal{T}_{\mathbf{r}}\text{ and }\Vert\mathbf{w}\Vert=1 \} $. Then

$$\min_{\mathbf{v}\in\mathcal{T}_{\mathbf{r}}}\Vert\mathbf{u}-\mathbf{v}\Vert= \min_{\mathbf{w}\in\mathcal{D},\lambda\in\mathbb{R}}\Vert\mathbf{u}-\lambda\mathbf{w}\Vert. $$

Note that ∥u−λ w∥²=∥u∥²−2λ〈u,w〉+λ ². The minimum of $\Vert\mathbf{u}-\lambda\mathbf{w}\Vert_{2}^{2}$ for $\mathbf{w}\in \mathcal{D}$ is obtained for λ=〈u,w〉>0 and equals $\Vert\mathbf{u}-\lambda\mathbf{w}\Vert_{2}^{2}=\Vert \mathbf{u}\Vert_{2}^{2}-|\langle\mathbf{u},\mathbf{w}\rangle|^{2}$. Thus,

$$\min_{\mathbf{v}\in\mathcal{T}_{\mathbf{r}}}\Vert\mathbf{u}-\mathbf{v}\Vert^{2}= \min_{\lambda\in\mathbb{R}, \mathbf{w}\in\mathcal{D}}\Vert\mathbf{u}-\lambda\mathbf{w} \Vert^{2}=\min_{\mathbf{w}\in\mathcal{D}}\bigl\Vert\mathbf{u}-\langle\mathbf{u}, \mathbf{w}\rangle\mathbf{w}\bigr\Vert^{2}=\Vert\mathbf{u} \Vert^{2}-\max_{\mathbf{w}\in\mathcal{D}}\bigl|\langle \mathbf{u},\mathbf{w} \rangle\bigr|^{2}, $$

and from (4.18) statement (a) follows.

Since r≤s implies $\mathcal{T}_{\mathbf{r}}\subset\mathcal{T}_{\mathbf{s}}$, statement (b) follows from (4.18).

To prove (c) note that the norm axiom ∥λ u∥_r=|λ|∥u∥_r and the triangle inequality are standard. To prove that u≠0 implies ∥u∥_r>0, note that if ∥u∥_r=0 we have 〈u,v〉=0 for all $\mathbf{v}\in\mathcal{T}_{\mathbf{r}}$. Since $\operatorname{span}\mathcal{T}_{\mathbf{r}}$ is dense in V _∥⋅∥, we obtain that u=0. □

Let $V_{1}=\mathbb{R}^{n_{1}}$ and $V_{2}=\mathbb{R}^{n_{2}}$ be equipped with the usual Euclidean norm. Then V ₁⊗_a V ₂ is isomorphic to matrices from $\mathbb{R}^{n_{1}\times n_{2}}$ with the Frobenius norm ∥⋅∥. It is not difficult to see that ∥u∥_(1,1) coincides with σ ₁, the first singular value of the singular value decomposition of u.

Notes

Note that the meaning of id _[j] and id _[k] may differ: in the second line of (2.7b), (id _[k]⊗A _k)∈L(V,V _[k]⊗_a W _k) and (id _[j]⊗A _j)∈L(V _[k]⊗_a W _k,V _[j,k]⊗_a W _j⊗_a W _k) (cf. (2.5b)), whereas in the third one (id _[j]⊗A _j)∈L(V,V _[j]⊗_a W _j) and (id _[k]⊗A _k)∈L(V _[j]⊗_a W _j,V _[j,k]⊗_a W _k⊗_a W _j).
Recall that an elementary tensor is a tensor of the form v ₁⊗⋯⊗v _d.
In (3.1a) it suffices to have the terms for n=0 and n=N. The derivatives are to be understood as weak derivatives.
We recall that the definition of $U_{j}^{\mathrm{IV}}(\mathbf{v})$ requires the definition of a norm on V _[j]. The following arguments will be based on $U_{j}^{\mathrm{III}}(\mathbf{v})$.
Here, infinite dimensions are identified and not considered as possibly different infinite cardinalities.

References

A. Ammar, B. Mokdad, F. Chinesta, R. Keunings, A new family of solvers for some classes of multidimensional partial differential equations encountered in kinetic theory modelling of complex fluids, J. Non-Newton. Fluid Mech. 139(3), 153–176 (2006).
Article MATH Google Scholar
C.J. Appellof, E.R. Davidson, Strategies for analyzing data from video fluorometric monitoring of liquid-chromatographic effluents, Anal. Chem. 53(13), 2053–2056 (1981).
Article Google Scholar
G. Berkooz, P. Holmes, J.L. Lumley, The proper orthogonal decomposition in the analysis of turbulent flows, Annu. Rev. Fluid Mech. 25, 539–575 (1993).
Article MathSciNet Google Scholar
J.M. Borwein, Proximality and Chebyshev sets, Optim. Lett. 1, 21–32 (2007).
Article MathSciNet MATH Google Scholar
E. Cancès, V. Ehrlacher, T. Lelievre, Convergence of a greedy algorithm for high-dimensional convex nonlinear problems. Math. Models Methods Appl. Sci. 21, 2433–2467 (2011).
Article MathSciNet MATH Google Scholar
J.D. Carroll, J.J. Chang, Analysis of individual differences in multidimensional scaling via an n-way generalization of Eckart-Young decomposition, Psychometrika 35, 283–319 (1970).
Article MATH Google Scholar
V. de Silva, L.-H. Lim, Tensor rank and ill-posedness of the best low-rank approximation problem, SIAM J. Matrix Anal. Appl. 30, 1084–1127 (2008).
Article MathSciNet Google Scholar
R.A. DeVore, G.G. Lorentz, Constructive Approximation (Springer, Berlin, 1993).
MATH Google Scholar
A. Doostan, G. Iaccarino, A least-squares approximation of partial differential equations with high-dimensional random inputs, J. Comput. Phys. 228(12), 4332–4345 (2009).
Article MathSciNet MATH Google Scholar
M. Edelstein, Weakly proximinal sets, J. Approx. Theory 18, 1–8 (1976).
Article MathSciNet MATH Google Scholar
A. Falcó, Algorithms and numerical methods for high dimensional financial market models, Rev. Econ. Financ. 20, 51–68 (2010).
Google Scholar
W.H. Greub, Linear Algebra, 4th edn. Graduate Text in Mathematics (Springer, Berlin, 1981)
MATH Google Scholar
A. Grothendieck, Résumé de la théorie métrique des produit tensoriels topologiques, Bol. Soc. Mat. São Paulo 8, 1–79 (1953/56).
MathSciNet Google Scholar
W. Hackbusch, Tensor Spaces and Numerical Tensor Calculus (Springer, Berlin, 2012).
Book MATH Google Scholar
W. Hackbusch, S. Kühn, A new scheme for the tensor representation, J. Fourier Anal. Appl. 15, 706–722 (2009).
Article MathSciNet MATH Google Scholar
F.L. Hitchcock, The expression of a tensor or a polyadic as a sum of products, J. Math. Phys. 6, 164–189 (1927).
MATH Google Scholar
R. Hübener, V. Nebendahl, W. Dür, Concatenated tensor network states, New J. Phys. 12, 025004 (2010).
Article Google Scholar
T.G. Kolda, B.W. Bader, Tensor decompositions and applications, SIAM Rev. 51, 455–500 (2009).
Article MathSciNet MATH Google Scholar
L. De Lathauwer, J. Vandewalle, Dimensionality reduction in higher-order signal processing and rank—(r ₁,r ₂,…,r _n) reduction in multilinear algebra, Linear Algebra Appl. 391, 31–55 (2004).
Article MathSciNet MATH Google Scholar
W.A. Light, E.W. Cheney, Approximation Theory in Tensor Product Spaces. Lect. Notes Math., vol. 1169 (Springer, Berlin, 1985).
MATH Google Scholar
R. Meise, D. Vogt, Introduction to Functional Analysis (Clarendon, Oxford, 1997).
MATH Google Scholar
A. Nouy, A generalized spectral decomposition technique to solve a class of linear stochastic partial differential equations, Comput. Methods Appl. Mech. Eng. 96(45–48), 4521–4537 (2007).
Article MathSciNet Google Scholar
A. Nouy, Proper generalized decompositions and separated representations for the numerical solution of high dimensional stochastic problems, Arch. Comput. Methods Eng. 17(4), 403–434 (2010).
Article MathSciNet Google Scholar
I.V. Oseledets, E.E. Tyrtyshnikov, TT-cross approximation for multidimensional arrays, Linear Algebra Appl. 432, 70–88 (2010).
Article MathSciNet MATH Google Scholar
B. Simon, Uniform crossnorms, Pac. J. Math. 46, 555–560 (1973).
MATH Google Scholar
L.R. Tucker, Some mathematical notes on three-mode factor analysis, Psychometrika 31, 279–311 (1966)
Article MathSciNet Google Scholar
A. Uschmajew, Convex maximization problems on non-compact stiefel manifolds with application to orthogonal tensor approximations, Numer. Math. 115, 309–331 (2010).
Article MathSciNet MATH Google Scholar
M.A.O. Vasilescu, D. Terzopoulos, Multilinear analysis of image ensembles: tensorfaces, in ECCV 2002: Proceedings of the 7th European Conference on Computer Vision. Lecture Notes in Comput. Sci., vol. 2350 (Springer, Berlin, 2002), pp. 447–460.
Chapter Google Scholar
G. Vidal, Efficient classical simulation of slightly entangled quantum computations, Phys. Rev. Lett. 91, 147902 (2003).
Article Google Scholar
H. Wang, N. Ahuja, Compact representation of multidimensional data using tensor rank-one decomposition, in ICPR 2004: Proceedings of the 17th International Conference on Pattern Recognition, vol. 1 (2004), pp. 44–47.
Chapter Google Scholar

Download references

Acknowledgements

This work is partially supported by the PRCEU-UCH30/10 grant of the Universidad CEU Cardenal Herrera.

Author information

Authors and Affiliations

Departamento de Ciencias Físicas, Matemáticas y de la Computación, Universidad CEU Cardenal Herrera, San Bartolome 55, 46115, Alfara del Patriarca (Valencia), Spain
Antonio Falcó
Max-Planck-Institut Mathematik in den Naturwissenschaften, Inselstr. 22, 04103, Leipzig, Germany
Wolfgang Hackbusch

Authors

Antonio Falcó
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Hackbusch
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wolfgang Hackbusch.

Additional information

Communicated by Wolfgang Dahmen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Falcó, A., Hackbusch, W. On Minimal Subspaces in Tensor Representations. Found Comput Math 12, 765–803 (2012). https://doi.org/10.1007/s10208-012-9136-6

Download citation

Received: 19 November 2010
Revised: 06 May 2011
Accepted: 22 June 2011
Published: 09 October 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10208-012-9136-6

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

On Minimal Subspaces in Tensor Representations

Abstract

Similar content being viewed by others

Tree-based tensor formats

The Set of Orthogonal Tensor Trains

The minimal and maximal operator ideals associated to $$(n+1)$$ -tensor norms of Michor’s type

1 Introduction

2 Minimal Subspaces in an Algebraic Tensor Space

Remark 2.1

Lemma 2.2

2.1 Algebraic Tensor Spaces

2.1.1 Definitions and Elementary Facts

Lemma 2.3

Proof

Lemma 2.4

Proof

Definition 2.5

Remark 2.6

2.1.2 Matricisation

Definition 2.7

Example 2.8

Definition 2.9

Lemma 2.10

Proof

2.2 Minimal Subspaces

2.2.1 Case d=2

Lemma 2.11

Proof

Definition 2.12

Lemma 2.13

Proof

Proposition 2.14

Proof

Corollary 2.15

Proof

2.2.2 Definition in the General Case

Lemma 2.16

Proof

Theorem 2.17

Proof

2.2.3 Hierarchies of Minimal Subspaces

Proposition 2.18

Proof

Corollary 2.19

3 Minimal Subspaces in a Banach Tensor Space

Definition 3.1

Example 3.2

3.1 Tensor Product of Banach Spaces

Remark 3.3

Lemma 3.4

Example 3.5

Definition 3.6

Proposition 3.7

Proof

3.2 Minimal Subspaces in a Banach Tensor Space

Lemma 3.8

Proof

Corollary 3.9

Remark 3.10

3.3 Minimal Closed Subspaces in a Banach Tensor Space

3.3.1 Definitions

Definition 3.11

Definition 3.12

3.3.2 Dependence of \(U_{j}^{\min}(\mathbf{v})\) on v

Lemma 3.13

Proof

Lemma 3.14

Proof

Theorem 3.15

Proof

3.3.3 \(\dim(U_{j}^{\min}(\mathbf{v}))<\infty\)

Theorem 3.16

Proof

3.3.4 \(\dim(U_{j}^{\min}(\mathbf{v}))=\infty\)

Lemma 3.17

Proof

Lemma 3.18

Theorem 3.19

Proof

3.4 Minimal Closed Subspaces in a Hilbert Tensor Space