On the complexity of multivariate polynomial division

Sparse interpolation [1, 5, 2, 13] provides an interesting paradigm for efficient computations with multivariate polynomials. In particular, under suitable hypothesis, multiplication of sparse polynomials can be carried out in quasi-linear time, in terms of the expected output size. More recently, other multiplication algorithms have also been investigated, which outperform naive and sparse interpolation under special circumstances [14, 12]. An interesting question is how to exploit such asymptotically faster multiplication algorithms for the purpose of polynomial elimination. In this paper, we will focus on the reduction of a multivariate polynomial with respect to an autoreduced set of other polynomials and show that fast multiplication algorithms can indeed be exploited in this context in an asymptotically quasi-optimal way.

Consider the polynomial ring

over an effective field

with an effective zero test. Given a polynomial

, we call

the support of

. The naive multiplication of two sparse polynomials

requires a priori

operations in

. This upper bound is sharp if

and

are very sparse, but pessimistic if

and

are dense.

Assuming that

has characteristic zero, a better algorithm was proposed in [2] (see also [1, 5] for some background). The complexity of this algorithm can be expressed in the expected size

of the output (when no cancellations occur). It is shown that

and

can be multiplied using only

operations in

, where

stands for the complexity of multiplying two univariate polynomials in

of degrees

. Unfortunately, the algorithm in [2] has two drawbacks:

In practice, the second drawback is of less importance. Indeed, especially when the coefficients in

can become large, then the computation of

is often cheap with respect to the multiplication

itself, even if we compute

in a naive way.

Recently, several algorithms were proposed for removing the drawbacks of [2]. First of all, in [13] we proposed a practical algorithm with essentially the same advantages as the original algorithm from [2], but with a good bit complexity and a variant which also works in positive characterisic. However, it still requires a bound for

and it only works for special kinds of fields

(which nevertheless cover the most important cases such as

and finite fields). Even faster algorithms were proposed in [9, 14], but these algorithms only work for special supports. Yet another algorithm was proposed in [7, 12]. This algorithm has none of the drawbacks of [2], but its complexity is suboptimal (although better than the complexity of naive multiplication).

At any rate, these recent developments make it possible to rely on fast sparse polynomial multiplication as a building block, both in theory and in practice. This makes it natural to study other operations on multivariate polynomials with this building block at our disposal. One of the most important such operations is division.

The multivariate analogue of polynomial division is the reduction of a polynomial

with respect to an autoreduced tuple

of other polynomials. This leads to a relation

such that none of the terms occurring in

can be further reduced with respect to

. In this paper, we are interested in the computation of

as well as

. We will call this the problem of extended reduction, in analogy with the notion of an “extended g.c.d.”.

Now in the univariate context, “relaxed power series” provide a convenient technique for the resolution of implicit equations [6, 7, 8, 10]. One major advantage of this technique is that it tends to respect most sparsity patterns which are present in the input data and in the equations. The main technical tool in this paper (see section 3) is to generalize this technique to the setting of multivariate polynomials, whose terms are ordered according to a specific admissible ordering on the monomials. This will make it possible to rewrite (1) as a so called recursive equation (see section 4.2), which can be solved in a relaxed manner. Roughly speaking, the cost of the extended reduction then reduces to the cost of the relaxed multiplications

. Up to a logarithmic overhead, we will show (theorem 7) that this cost is the same as the cost of checking the relation (1).

In order to simplify the exposition, we will adopt a simplified sparse complexity model throughout this paper. In particular, our complexity analysis will not take into account the computation of support bounds for products or results of the extended reduction. Bit complexity issues will also be left aside in this paper. We finally stress that our results are mainly of theoretical interest since none of the proposed algorithms have currently been implemented. Nevertheless, practical gains are not to be excluded, especially in the case of small

, high degrees and dense supports.

2Notations

Let

be an effective field with an effective zero test and let

be indeterminates. We will denote

for any

and

. In particular,

. For any subset

we will denote by

the final segment generated by

for the partial ordering

Let

be a total ordering on

which is compatible with addition. Two particular such orderings are the lexicographical ordering

and the reverse lexicographical ordering

For instance, the graded reverse lexicographical ordering

is obtained by taking

3Relaxed multiplication

3.1Relaxed power series

Let us briefly recall the technique of relaxed power series computations, which is explained in more detail in [7]. In this computational model, a univariate power series

is regarded as a stream of coefficients

. When performing an operation

on power series it is required that the coefficient

of the result is output as soon as sufficiently many coefficients of the inputs are known, so that the computation of

does not depend on the further coefficients. For instance, in the case of a multiplication

, we require that

is output as soon as

and

are known. In particular, we may use the naive formula

for the computation of

The additional constraint on the time when coefficients should be output admits the important advantage that the inputs may depend on the output, provided that we add a small delay. For instance, the exponential

of a power series

may be computed in a relaxed way using the formula

and the right-hand side only depends on the previously computed coefficients

. More generally, equations of the form

which have this property are called recursive equations and we refer to [11] for a mechanism to transform fairly general implicit equations into recursive equations.

The main drawback of the relaxed approach is that we cannot directly use fast algorithms on polynomials for computations with power series. For instance, assuming that

has sufficiently many

-th roots of unity and that field operations in

can be done in time

, two polynomials of degrees

can be multiplied in time

, using FFT multiplication [3]. Given the truncations

and

at order

of power series

, we may thus compute the truncated product

in time

as well. This is much faster than the naive

relaxed multiplication algorithm for the computation of

. However, the formula for

when using FFT multiplication depends on all input coefficients

and

, so the fast algorithm is not relaxed (we will say that FFT multiplication is a zealous algorithm). Fortunately, efficient relaxed multiplication algorithms do exist:

Theorem 1. [6, 7, 4] Let be the time complexity for the multiplication of polynomials of degrees in . Then there exists a relaxed multiplication algorithm for series in at order of time complexity .

Remark 2. In fact, the algorithm from theorem 1 generalizes to the case when the multiplication on

is replaced by an arbitrary bilinear “multiplication”

, where

and

are effective modules over an effective ring

. If

denotes the time complexity for multiplying two polynomials

and

of degrees

, then we again obtain a relaxed multiplication for series

and

at order

of time complexity

Theorem 3. [10] If admits a primitive -th root of unity for all , then there exists a relaxed multiplication algorithm of time complexity . In practice, the existence of a -th root of unity with suffices for multiplication up to order .

3.2Relaxed multivariate Laurent series

Let

be an effective ring. A power series

is said to be computable if there is an algorithm which takes

on input and produces the coefficient

on output. We will denote by

the set of such series. Then

is an effective ring for relaxed addition, subtraction and multiplication.

A computable Laurent series is a formal product

with

and

. The set

of such series forms an effective ring for the addition, subtraction and multiplication defined by

is an effective field with an effective zero test, then we may also define an effective division on

, but this operation will not be needed in what follows.

Assume now that

is replaced by a finite number of variables

. Then an element of

will also be called a “computable lexicographical Laurent series”. Any non zero

has a natural valuation

, by setting

, etc. The concept of recursive equations naturally generalizes to the multivariate context. For instance, for an infinitesimal Laurent series

(that is,

, where

), the formula

Conversely, given

, we define

by substituting

. We will call the transformations

and

tagging resp. untagging; they provide us with a relaxed mechanism to compute with multivariate polynomials in

, such that the admissible ordering

is respected. For instance, we may compute the relaxed product of two polynomials

by computing the relaxed product

and substituting

in the result. We notice that tagging is an injective operation which preserves the size of the support.

3.3Complexity analysis

Assume now that we are given

and a set

such that

. We assume that

is a function such that the (zealous) product

can be computed in time

. We will also assume that

is an increasing function of

. In [2, 15], it is shown that we may take

Let us now study the complexity of sparse relaxed multiplication of

and

. We will use the classical algorithm for fast univariate relaxed multiplication from [6, 7], of time complexity

. We will also consider semi-relaxed multiplication as in [8], where one of the arguments

is completely known in advance and only the other one is computed in a relaxed manner.

Theorem 4. With the above notations, the relaxed product of and can be computed in time

Proof. In order to simplify our exposition, we will rather prove the theorem for a semi-relaxed product of

(relaxed) and

(known in advance). As shown in [8], the general case reduces to this special case. We will prove by induction over

that the semi-relaxed product can be computed using at most

operations in

is sufficiently large. For

, we have nothing to do, so assume that

Let us first consider the semi-relaxed product of

and

with respect to

. Setting

, the computation of this product corresponds (see the right-hand side of figure 1) to the computation of

zealous

products (i.e.

products of polynomials of degrees

zealous

products, and so on until

zealous

products. We finally need to perform

semi-relaxed

products of series in

only.

More precisely, assume that

and

have valuations

resp.

and let

stand for the coefficient of

. We also define

and notice that the

are pairwise disjoint. In the semi-relaxed multiplication, we have to compute the zealous

products

for all

. Since

The total time spent in performing all zealous

block multiplications with

is therefore bounded by

Let us next consider the remaining

semi-relaxed products. If

, then these are really scalar products, whence the remaining work can clearly be performed in time

is sufficiently large. If

, then for each

, we have

By the induction hypothesis, we may therefore perform this semi-relaxed product in time

. A similar argument as above now yields the bound

for performing all

semi-relaxed block products. The total execution time (which also takes into account the final additions) is therefore bounded by

. This completes the induction.

Remark 5. In practice, the computation of zealous products of the form

is best done in the untagged model, i.e. using the formula

Proceeding this way allows us to use any of our preferred algorithms for sparse polynomial multiplication. In particular, we may use [14] or [12].

4Polynomial reduction

4.1Naive extended reduction

Consider a tuple

. We say that

is autoreduced if

for all

and

for all

. Given such a tuple

and an arbitrary polynomial

, we say that

is reduced with respect to

for all

and

. An extended reduction of

with respect to

is a tuple

with

such that

is reduced with respect to

. The naive algorithm extended-reduce below computes an extended reduction of

Algorithm extended-reduce

Input: and an autoreduced tuple

Output: an extended reduction of with respect to

While is not reduced with respect to do

Let be minimal and such that for some

Remark 6. Although an extended reduction is usually not unique, the one computed by extended-reduce is uniquely determined by the fact that, in our main loop, we take

minimal with

for some

. This particular extended reduction is also characterized by the fact that

need to be known beforehand. These upper bounds are easily computed as a function of

by the variant supp-extended-reduce of extended-reduce below. We recall from the end of the introduction that we do not take into account the cost of this computation in our complexity analysis. In reality, the execution time of supp-extended-reduce is similar to the one of extended-reduce, except that potentially expensive operations in

are replaced by boolean operations of unit cost. We also recall that support bounds can often be obtained by other means for specific problems.

Algorithm supp-extended-reduce

Input: subsets and of as above

Output: subsets and of as above

Let be minimal with for some

4.2Relaxed extended reduction

Using the relaxed multiplication from section 3, we are now in a position to replace the algorithm extended-reduce by a new algorithm, which directly computes

using the equation (3). In order to do this, we still have to put it in a recursive form which is suitable for relaxed resolution.

In particular,

yields the “leading term” of the extended reduction

. We also denote by

the corresponding operator from

which sends

is recursive, whence the extended reduction can be computed using

multivariate relaxed multiplications

. With

and

as in the previous section, theorem 4 therefore implies:

Theorem 7. We may compute the extended reduction of with respect to in time

Remark 8. Following remark 2, we also notice that

, the

and

may be replaced by vectors of polynomials in

(regarded as polynomials with coefficients in

), in the case that several polynomials need to be reduced simultaneously.

Bibliography

[1]: M. Ben-Or and P. Tiwari. A deterministic algorithm for sparse multivariate polynomial interpolation. In STOC '88: Proceedings of the twentieth annual ACM symposium on Theory of computing, pages 301–309, New York, NY, USA, 1988. ACM Press.
[2]: J. Canny, E. Kaltofen, and Y. Lakshman. Solving systems of non-linear polynomial equations faster. In Proc. ISSAC '89, pages 121–128, Portland, Oregon, A.C.M., New York, 1989. ACM Press.
[3]: J.W. Cooley and J.W. Tukey. An algorithm for the machine calculation of complex Fourier series. Math. Computat., 19:297–301, 1965.
[4]: M. J. Fischer and L. J. Stockmeyer. Fast on-line integer multiplication. Proc. 5th ACM Symposium on Theory of Computing, 9:67–72, 1974.
[5]: D. Y. Grigoriev and M. Karpinski. The matching problem for bipartite graphs with polynomially bounded permanents is in NC. In Proceedings of the 28th IEEE Symposium on the Foundations of Computer Science, pages 166–172, 1987.
[6]: J. van der Hoeven. Lazy multiplication of formal power series. In W. W. Küchlin, editor, Proc. ISSAC '97, pages 17–20, Maui, Hawaii, July 1997.
[7]: J. van der Hoeven. Relax, but don't be too lazy. JSC, 34:479–542, 2002.
[8]: J. van der Hoeven. Relaxed multiplication using the middle product. In Manuel Bronstein, editor, Proc. ISSAC '03, pages 143–147, Philadelphia, USA, August 2003.
[9]: J. van der Hoeven. The truncated Fourier transform and applications. In J. Gutierrez, editor, Proc. ISSAC 2004, pages 290–296, Univ. of Cantabria, Santander, Spain, July 4–7 2004.
[10]: J. van der Hoeven. New algorithms for relaxed multiplication. JSC, 42(8):792–802, 2007.
[11]: J. van der Hoeven. From implicit to recursive equations. Technical report, HAL, 2011. http://hal.archives-ouvertes.fr/hal-00583125.
[12]: J. van der Hoeven and G. Lecerf. On the complexity of blockwise polynomial multiplication. In Proc. ISSAC '12, pages 211–218, Grenoble, France, July 2012.
[13]: J. van der Hoeven and G. Lecerf. On the bit-complexity of sparse polynomial multiplication. JSC, 50:227–254, 2013.
[14]: J. van der Hoeven and É. Schost. Multi-point evaluation in higher dimensions. AAECC, 24(1):37–52, 2013.
[15]: E. Kaltofen and Y. N. Lakshman. Improved sparse multivariate polynomial interpolation algorithms. In ISSAC '88: Proceedings of the international symposium on Symbolic and algebraic computation, pages 467–474. Springer Verlag, 1988.
[16]: L. Robbiano. Term orderings on the polynominal ring. In European Conference on Computer Algebra (2), pages 513–517, 1985.

1Introduction

2Notations

3Relaxed multiplication

3.1Relaxed power series

3.2Relaxed multivariate Laurent series

3.3Complexity analysis

4Polynomial reduction

4.1Naive extended reduction

4.2Relaxed extended reduction

Bibliography