<\body> and Applications>>>||<\author-address> de Mathématiques ( 425) Université Paris-Sud 91405 Orsay Cedex France Email: >||-multiplication, multivariate polynomials, multivariate power series.>|> <\abstract> In this paper, we present a truncated version of the classical Fast Fourier Transform. When applied to polynomial multiplication, this algorithm has the nice property of eliminating the ``jumps'' in the complexity at powers of two. When applied to the multiplication of multivariate polynomials or truncated multivariate power series, we gain a logarithmic factor with respect to the best previously known algorithms. >>>>>>>>>>>> Let \1/2> be an effective ring of constants ( the usual arithmetic operations >, > and >> can be carried out by algorithm). If > has a primitive -th root of unity with >, then the product of two polynomials \[X]> with n> can be computed in time using the Fast Fourier Transform or . If > does not admit a primitive -th root of unity, then one needs an additional overhead of in order to carry out the multiplication, by artificially adding new root of unity . Besides the fact that the asymptotic complexity of the involves a large constant factor, another classical drawback is that the complexity function admits important jumps at each power of two. These jumps can be reduced by using )>-th roots of unity for small . They can also be smoothened by decomposing )\(n+\)>-multiplications as n>-, \>- and )\\>-multiplications. However, these tricks are not very elegant, cumbersome to implement, and they do not allow to completely eliminate the jump problem. In section , we present a new kind of ``Truncated Fourier Transform'' or , which allows for the fast evaluation of a polynomial \[X]> in any number of well-chosen roots of unity. This algorithm coincides with the usual if is a power of two, but it behaves smoothly for intermediate values. In section , we also show that the inverse operation of interpolation can be carried out with the same complexity (modulo a few additional shifts). The permits to speed up the multiplication of univariate polynomials with a constant factor between and . In the case of multivariate polynomials, the repeated gain of such a constant factor leads to the gain of a non-trivial asymptotic factor. More precisely, assuming that > admits sufficiently >-th roots of unity, we will show in section

that the product of two multivariate polynomials \[z,\,z]> can be computed in time , where > and . The best previously known algorithm , based on sparse polynomial multiplication, has time complexity s)>. In section we finally give an algorithm for the multiplication of truncated multivariate power series. This algorithm, which has time complexity s)>, again improves the best previously known algorithm by a factor of . Moreover, both in the cases of multivariate polynomials and power series, we expect the corresponding constant factor to be better. Let > be an effective ring of constants, > with \> and \\> a primitive -th root of unity ( =\1>). The discrete Fast Fourier Transform () of an -tuple ,\,a)\\> (with respect to >) is the -tuple ,\,

)=FFT>(a)\\> with <\equation*> =

a*\. In other words, =A(\)>, where \[X]> denotes the polynomial +a*X+\+a*X>. The can be computed efficiently using binary splitting: writing <\equation*> (a,\,a)=(b,c,\,b,c), we recursively compute the Fourier transforms of ,\,b)> and ,\,c)> <\eqnarray*> >(b,\,b)>||,\,);>>|>(c,\,c)>||,\,).>>>> Then we have <\eqnarray*> >(a,\,a)>||+,\,+*\>>|||-,\,-*\).>>>> This algorithm requires n> multiplications with powers of > and additions (or subtractions). In practice, it is most efficient to implement an in-place variant of the above algorithm. We will denote by > the bitwise mirror of at length (for instance, =24> and =26>). At step , we start with the vector <\equation*> x=(x,\,x)=(a,\,a). At step {1,\,p}>, we set <\equation> +j>>>|+j>>>>>>=|*m>>>||\*m>>>>>>+j>>>|+j>>>>>>. for all {0,2,\,n/m-2}> and {0,\,m-1}>, where =2>. Using induction over , it can easily be seen that <\equation*> x+j>=(FFT>>(a,a+j>,\,a+j>))>, for all {0,\,n/m-1}> and {0,\,m-1}>. In particular, <\eqnarray*> >||>>>|>||>>>>> for all {0,\,n-1}>. This algorithm of ``repeated crossings'' is illustrated in figure . <\big-figure|> Schematic representation of a Fast Fourier Transform for . The black dots correspond to the >, the upper row being ,\,x)=(a,\,a)> and the lower row ,\,x)=(,,,,\,)>. A classical application of the is the multiplication of polynomials +\+a*X> and +\+b*X>. Assuming that n>, we first evaluate and in ,\,\

> using the : <\eqnarray*> ,A(\

))>||>(a,\,a)>>|,B(\

))>||>(b,\,b)>>>> We next compute the evaluations ,A(\

)*B(\

)))> of at ,\

>. We finally have to recover from these values using the inverse . But the inverse with respect to > is nothing else as times the direct with respect to

>. Indeed, for all ,\,a)\\> and all {0,\,n-1}>, we have <\equation> FFT1>>(FFT>(a))=

a*\

=n*a, since <\equation*>

=0 whenever k>. This yields a multiplication algorithm of time complexity in [X]>, when assuming that > admits enough primitive >-th roots of unity. In the case that > does not, then new roots of unity can be added artificially so as to yield an algorithm of time complexity .

The algorithm from the previous section has the disadvantage that needs to be a power of two. If we want to multiply two polynomials \[X]> such that >, then we need to carry out the at precision , thereby losing a factor of . This factor can be reduced using several tricks. For instance, one may decompose the )\(n+\)>-product into an n> product, an \>-product and an )\\>-product. This is efficient for small