Département de Mathématiques (Bât. 425), Université Paris-Sud, 91405 Orsay Cedex, France

Abstract. A real number $x$ is said to be effective if there exists an algorithm which, given a required tolerance $\varepsilon \in \mathbb{Z}\,2^{\mathbb{Z}}$, returns a binary approximation $\tilde{x} \in \mathbb{Z}\,2^{\mathbb{Z}}$ for $x$ with $|\tilde{x}-x| \le \varepsilon$. Effective real numbers are interesting in areas of numerical analysis where numerical instability is a major problem. One key problem with effective real numbers is to perform intermediate computations at the smallest precision which is sufficient to guarantee an exact end-result. In this paper we first review two classical techniques to achieve this: a priori error estimates and interval analysis. We next present two new techniques: ``relaxed evaluations'' reduce the amount of re-evaluations at larger precisions, while ``balanced error estimates'' automatically provide good tolerances for intermediate computations.

1. Introduction

A dyadic number is a number of the form $m\,2^{e}$ with $m, e \in \mathbb{Z}$. We denote by $\mathbb{D} = \mathbb{Z}\,2^{\mathbb{Z}}$ the set of dyadic numbers and by $\mathbb{D}^{>} = \{x \in \mathbb{D} : x > 0\}$ the set of positive dyadic numbers. Given $x \in \mathbb{R}$ and $\varepsilon \in \mathbb{D}^{>}$, an $\varepsilon$-approximation for $x$ is a number $\tilde{x} \in \mathbb{D}$ with $|\tilde{x} - x| \le \varepsilon$. An approximation algorithm for $x \in \mathbb{R}$ is an algorithm which takes a tolerance $\varepsilon \in \mathbb{D}^{>}$ on input and which returns an $\varepsilon$-approximation for $x$. A real number $x \in \mathbb{R}$ is said to be effective if it admits an approximation algorithm.

The aim of this paper is to review several recent techniques for computations with effective real numbers and to present a few new ones. Effective real numbers can be useful in areas of numerical analysis where numerical instability is a major problem, like singularity theory. In such areas, multiple precision arithmetic is often necessary and the required precisions for auxiliary computations may vary heavily. We hope that the development of an adequate computational theory for effective real numbers will allow us both to perform the error analysis automatically and to determine the precisions at which intermediate computations have to take place.

We recall that there exists no general zero-test for effective real numbers. Nevertheless, exact or heuristically reliable zero-tests do exist for interesting subfields, which contain transcendental constants. However, this topic will not be studied in this paper and we refer to [1,2] for some recent work on this matter.

In an object-oriented language like C++, a natural way to represent an effective real number is by an abstract object with a method which corresponds to the approximation algorithm. When using this representation, which will be discussed in more detail in section 2, it is natural to perform a priori error estimations for common operations on real numbers. In other words, if we want to compute $y = f(x_1,\ldots,x_r)$ with tolerance $\varepsilon$, then we determine tolerances $\varepsilon_1,\ldots,\varepsilon_r$ such that the multiple precision evaluation of $f$ at arbitrary $\varepsilon_i$-approximations for the $x_i$ always yields an $\varepsilon$-approximation for $y$. In some cases, such a priori error estimates are quite pessimistic.

An alternative technique for computing with effective real numbers is interval arithmetic. In this case, we approximate an effective real number $x$ by an interval $[\tilde x^-, \tilde x^+]$ which contains $x$. Then the evaluation of $y = f(x_1,\ldots,x_r)$ comes down to the determination of an interval $[\tilde y^-, \tilde y^+]$ with
\[ f([\tilde x_1^-, \tilde x_1^+],\ldots,[\tilde x_r^-, \tilde x_r^+]) \subseteq [\tilde y^-, \tilde y^+]. \]
For continuous functions $f$, when starting with very precise approximations of the $x_i$ (i.e. when the widths $\tilde x_i^+ - \tilde x_i^-$ are very small), the obtained error estimate for $y$ also becomes as small as desired. In section 3, this technique of a posteriori error estimates will be reviewed in more detail, as well as an efficient representation for high-precision intervals. Unfortunately, in some cases, both a priori and a posteriori error estimates are quite pessimistic.
In sections 4 and 5, we will therefore present two new techniques for the computation of ``adaptive error estimates''. These techniques combine the advantages of a priori and a posteriori error estimates, while eliminating their major disadvantages. Moreover, they remain reasonably easy to implement in a general purpose system for computations with effective real numbers. Under certain restrictions, the new techniques are also close to being optimal. This point will be discussed in section 6.

The approaches presented in sections 2 and 3 have been proposed, often independently, by several authors [3,4,5], and we notice the existence of several similarities with the theory of effective power series [6,7]. In the past, we experimentally implemented such techniques in the case of power series [7] and real numbers [8] (the latter implementation was not officially distributed). We are currently working on a more robust C++ library based on the ideas in this paper [9]. Currently, this library contains the basic arithmetic operations. Some other implementations with similar objectives are known to the author [10,11]. All these implementations are free software and they are mainly based on the GMP and MPFR libraries [12,13].

2. A priori error estimates

A natural way to represent an effective real number $x$ is by its approximation algorithm. Conceptually speaking, this means that we view $x$ as a black box which can be asked for approximations up to any desired precision:
\[ \varepsilon \;\longrightarrow\; \fbox{$x$} \;\longrightarrow\; \tilde x, \qquad |\tilde x - x| \le \varepsilon. \]
In an object oriented language like C++, this can be implemented using an abstract base class real_rep with a virtual method for the approximation algorithm. Effective real numbers will then be pointers to real_rep. For instance,

<\code>
class real_rep {
public:
  virtual dyadic approx (const dyadic& tol) = 0;
};
typedef real_rep* real;
</code>

Here dyadic stands for the class of dyadic numbers. In practice, these may be taken to be arbitrary precision floating point numbers. For simplicity, we also do not care about memory management. In a real implementation, one would need a mechanism for reference counting or conservative garbage collection.

Now assume that we want to evaluate $y = f(x_1,\ldots,x_r)$ for a given tolerance $\varepsilon \in \mathbb{D}^{>}$, where $x_1,\ldots,x_r$ are effective real numbers and $f$ is an operation like $+$, $\times$ or $\exp$. In order to make $y$ effective, we need to construct an approximation algorithm for $y$ as a function of $x_1,\ldots,x_r$. This involves both the dyadic evaluation of $f$ at dyadic approximations of the $x_i$ and the determination of tolerances $\varepsilon_1,\ldots,\varepsilon_r$ for these approximations. More precisely, $\varepsilon_1,\ldots,\varepsilon_r$ should be such that for any $\tilde x_1,\ldots,\tilde x_r \in \mathbb{D}$ with $|\tilde x_i - x_i| \le \varepsilon_i$ $(i=1,\ldots,r)$, we have
\[ |\tilde f_{\varepsilon}(\tilde x_1,\ldots,\tilde x_r) - f(x_1,\ldots,x_r)| \le \varepsilon, \]
where $\tilde f_{\varepsilon}$ stands for a dyadic approximation algorithm for $f$ which depends on the tolerance $\varepsilon$. For instance, in the case when $f$ is the addition, we may use exact arithmetic on $\mathbb{D}$ (so that $\tilde f_{\varepsilon} = f$ for all $\varepsilon$) and take $\varepsilon_1 = \varepsilon_2 = \varepsilon/2$. This yields the following class for representing sums of real numbers:

<\code>
class add_real_rep: public real_rep {
  real x, y;
public:
  add_real_rep (const real& x2, const real& y2):
    x (x2), y (y2) {}
  dyadic approx (const dyadic& tol) {
    return x->approx (tol >> 1) + y->approx (tol >> 1); }
};
</code>

The addition can now be implemented as follows:

<\code>
inline real operator + (const real& x, const real& y) {
  return new add_real_rep (x, y); }
</code>

Notice that, in a sense, we have really represented the sum of $x$ and $y$ by the expression $x + y$ (more generally, such expressions are dags, i.e. directed acyclic graphs). Nevertheless, the representation using an abstract class provides additional flexibility.
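To see what a priori estimates look like for a non-linear operation, here is a sketch of a possible mul_real_rep class. It is not taken from the library described in this paper; it assumes an additional virtual method bound() on real_rep returning a non-zero rough upper bound for the absolute value of the represented number (such a method is discussed below), as well as downward rounded divisions on dyadic numbers.

<\code>
// Sketch only: a priori error estimates for multiplication, assuming a
// hypothetical virtual method bound() with |x| <= x->bound () and
// divisions on dyadic numbers which are rounded downwards.
class mul_real_rep: public real_rep {
  real x, y;
public:
  mul_real_rep (const real& x2, const real& y2):
    x (x2), y (y2) {}
  dyadic approx (const dyadic& tol) {
    dyadic Bx= x->bound ();           // Bx >= |x|
    dyadic tol_y= tol / (2 * Bx);     // tolerance for y
    dyadic By= y->bound () + tol_y;   // By >= |y~| for any tol_y-approximation y~
    dyadic tol_x= tol / (2 * By);     // tolerance for x
    // |x~ y~ - x y| <= |x~ - x| |y~| + |x| |y~ - y|
    //               <= tol_x By + Bx tol_y <= tol
    return x->approx (tol_x) * y->approx (tol_y); }
};
</code>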
For instance, we may attach additional information to the class real_rep, like the best currently known approximation for the number (thereby avoiding unnecessary recomputations). In practice, it is also good to provide an additional abstract method for computing a rough upper bound for the number, as assumed in the above sketch. This gives a fine-grained control over potential cancellations.

The above approach heavily relies on the a priori computation of error estimates (i.e. the computation of the $\varepsilon_i$). If no additional techniques are used, then this leads to the following disadvantages:

P1. We do not take advantage of the fact that the numeric evaluation of $\tilde f_{\varepsilon}(\tilde x_1,\ldots,\tilde x_r)$ may lead to an approximation of $y$ which is far better than the required tolerance $\varepsilon$. Indeed, multiple precision computations are usually done with a precision which is a multiple of the number of bits $w$ in a machine word. ``On average'', we therefore gain something like $w/2$ bits of precision. In section 4, we will show that it may actually be profitable to systematically compute more than necessary.

P2. The error estimates may be pessimistic, due to badly balanced expressions. For instance, consider the $\varepsilon$-approximation of a sum $x_1+(x_2+(x_3+\cdots+(x_{r-1}+x_r)\cdots))$, which corresponds to a completely unbalanced tree. Then the above technique would lead to the computation of an $(\varepsilon/2^{i})$-approximation of $x_i$ for $i < r$ and an $(\varepsilon/2^{r-1})$-approximation of $x_r$. If $r$ is large, then $\varepsilon/2^{r-1}$ is unnecessarily small, since a mere $(\varepsilon/r)$-approximation for each $x_i$ would do. In section 5, we will consider a general technique for computing ``balanced error estimates''. Badly balanced expressions naturally occur when evaluating polynomials using Horner's rule.

3. A posteriori error estimates

An alternative technique for computing with effective real numbers is interval arithmetic. The idea is to systematically compute interval approximations instead of floating point approximations. These intervals must be such that the real numbers we are interested in are certified to lie in their respective interval approximations. More precisely, given $x \in \mathbb{R}$ and $\varepsilon \in \mathbb{D}^{>}$, an $\varepsilon$-interval for $x$ is a closed interval $[\tilde x^-, \tilde x^+] \ni x$ with $\tilde x^-, \tilde x^+ \in \mathbb{D}$ and $\tilde x^+ - \tilde x^- \le 2\,\varepsilon$.

Concretely speaking, we may represent such intervals by their endpoints $\tilde x^-$ and $\tilde x^+$. Alternatively, if the precisions of $\tilde x^-$ and $\tilde x^+$ are large, then it may be more efficient to represent the interval by its center $(\tilde x^- + \tilde x^+)/2$ and its radius $(\tilde x^+ - \tilde x^-)/2$. Indeed, the exact endpoints of the interval are not that important. Hence, modulo a slight increase of the radius, we may always assume that the radius can be stored in a ``single precision dyadic number'' $m\,2^{e}$ with $m \in \{0,\ldots,2^{w}-1\}$, where $w$ is the number of bits in a machine word. This trick allows us to reduce the number of multiple precision computations by a factor of two.

Now assume that we want to compute an $\varepsilon$-approximation for $y = f(x_1,\ldots,x_r)$, where $x_1,\ldots,x_r$ are effective real numbers and where $f$ is a continuous function. Assume also that we have reasonable initial $\varepsilon_i$-intervals for the $x_i$. Then, starting with a low precision $n$, we first compute $(\varepsilon_i/2^{n})$-intervals $[\tilde x_i^-, \tilde x_i^+]$ for the $x_i$. We next evaluate $f$ using interval arithmetic. This yields an interval $[\tilde y^-, \tilde y^+]$ with
\[ f([\tilde x_1^-, \tilde x_1^+],\ldots,[\tilde x_r^-, \tilde x_r^+]) \subseteq [\tilde y^-, \tilde y^+]. \]
If $\tilde y^+ - \tilde y^- \le 2\,\varepsilon$, then $(\tilde y^- + \tilde y^+)/2$ is an $\varepsilon$-approximation for $y$. Otherwise, we increase the precision $n$ and repeat the computations. Under relatively mild assumptions on the way we evaluate $f$, this procedure eventually stops, since $f$ is continuous.
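The following self-contained sketch illustrates the center-radius idea in ordinary double precision. It is an illustration only: a real implementation would use a multiple precision center, as explained above, together with much sharper outward rounding. The helpers up and half_ulp are ad hoc devices introduced here so that the enclosures remain certified in spite of rounding errors (ignoring underflow).

<\code>
#include <cmath>
#include <cstdio>
#include <limits>

// Minimal "center +/- radius" interval sketch in double precision.
struct ball {
  double c;  // center
  double r;  // radius: certified bound on |true value - c|
};

// round a nonnegative double upwards by one ulp (crude outward rounding)
static double up (double x) {
  return std::nextafter (x, std::numeric_limits<double>::infinity ());
}

// bound on the rounding error of one operation whose result is x
static double half_ulp (double x) {
  return std::ldexp (std::fabs (x), -53);
}

ball add (ball a, ball b) {
  double c= a.c + b.c;
  // true sum lies in c +/- (a.r + b.r + rounding error of c)
  double r= up (up (a.r + b.r) + half_ulp (c));
  return ball {c, r};
}

ball mul (ball a, ball b) {
  double c= a.c * b.c;
  // |x y - c| <= |a.c| b.r + |b.c| a.r + a.r b.r + rounding error of c
  double r= up (up (up (std::fabs (a.c) * b.r) + up (std::fabs (b.c) * a.r))
                + up (up (a.r * b.r) + half_ulp (c)));
  return ball {c, r};
}

int main () {
  ball x= {1.0, 1e-10}, y= {3.0, 1e-10};
  ball z= mul (add (x, y), y);   // encloses (x + y) * y
  std::printf ("%.17g +/- %.3g\n", z.c, z.r);
}
</code>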
Although this technique of a posteriori error estimates does solve the problems P1 and P2 raised in the previous section, it also induces some new problems. Most importantly, we have lost the fine-grained control over the precisions of intermediate computations during the evaluation of $f(x_1,\ldots,x_r)$. Indeed, we only have control over the overall starting precision $n$. This disadvantage is reflected in two ways:

P3. It is not clear how to increase $n$. Ideally speaking, $n$ should be increased in such a way that the computation time of $[\tilde y^-, \tilde y^+]$ is doubled at each iteration. In that case (see section 6), the overall computation time is bounded by a constant times the computation time at the least precision which leads to an $\varepsilon$-approximation for $y$. Now the problem is that this ``overall computation time'' can be estimated well by hand for elementary operations like $+$, $\times$, $\exp$ and $\log$, but not necessarily for more complicated functions.

P4. Consider the case when, somewhere during the evaluation of $f$, we need to compute the sum $u + x$ of a very large number $u$ and a very small number $x$. Assume moreover that $u$ can be approximated very fast, but that the approximation of $x$ requires a lot of time. Since $x$ is very small, it will then be possible to skip the computation of $x$, by setting $u + x \approx u$, unless the required precision is very large. However, the above technique of a posteriori error estimates does not allow for this optimization.

4. Relaxed evaluation

Relaxed evaluation can be used in order to ``combine the good ideas'' of sections 2 and 3. The idea is to endow real_rep with an extra field best which corresponds to the best currently known interval approximation of the real number. Moreover, when requesting a better approximation, we will actually compute a much better approximation, so as to avoid expensive recomputations when we repeatedly ask for slightly better approximations. This anticipating strategy was first introduced in the case of power series, where it leads to important algorithmic gains [7,14]. In our context, the proposed method is quite different though, because of the problem with carries.

More precisely, assume that we have an $r$-ary operation $f$, such that the $n$-digit approximation of $f(x_1,\ldots,x_r)$ at dyadic numbers with at most $n$ digits has time complexity $T(n)$. The complexity of an elementary operation is usually a well-understood, regular function (in particular, $T$ is increasing). For instance, we have $T(n) \sim c_{+}\,n$ for addition and $T(n) \sim c_{\times}\,n^{\alpha}$ for multiplication, where $\alpha \ge 1$ and $\alpha$ decreases slowly from $2$ to $1$ as $n$ increases.

Now assume that we have an $n$-digit approximation of $f(x_1,\ldots,x_r)$ and that we request a slightly better approximation. Then we let $n' > n$ be such that $T(n') \ge 2\,T(n)$ and we replace our $n$-digit approximation by an $n'$-digit approximation. This strategy has the property that the successive evaluation of $f(x_1,\ldots,x_r)$ at $1,\ldots,n$ digits requires a time $T^{*}(n) \le T(n_1)+\cdots+T(n_k)$, where $n_1 < \cdots < n_k$ are such that $n_{k-1} < n \le n_k$ and $T(n_{i+1}) \ge 2\,T(n_i)$ for $i=1,\ldots,k-1$. Consequently,
\[ T^{*}(n) \le T(n_k) + \tfrac{1}{2}\,T(n_k) + \tfrac{1}{4}\,T(n_k) + \cdots \le 2\,T(n_k) \le 4\,T(n_{k-1}) \le 4\,T(n). \]
More generally, the evaluation of $f(x_1,\ldots,x_r)$ at different precisions $n_1,\ldots,n_l$ requires at most four times as much time as the evaluation of $f(x_1,\ldots,x_r)$ at precision $\max\{n_1,\ldots,n_l\}$.

The combination of a priori and a posteriori error estimates in the relaxed strategy clearly solves problems P1 and P2. Furthermore, at the level of the class real_rep, we have a fine-grained control over how to increase the computational precision, which also deals with problem P4. Moreover, we have shown how to perform this increase in an efficient way, as a function of $T$. This avoids expensive recomputations when $x$ occurs many times in a dag. In other words, the relaxed technique also solves problem P3.
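As a quick sanity check of this factor four bound, the following standalone snippet simulates the strategy for a hypothetical cost function $T(n) = n^2$ (an arbitrary model, roughly the cost of naive multiplication) and compares the total cost of answering requests at every precision $1,\ldots,n$ with $4\,T(n)$.

<\code>
#include <cstdio>

// Hypothetical cost function, for illustration only: T(n) = n^2.
static long long T (long long n) { return n * n; }

int main () {
  const long long n_max= 100000;
  long long total= 0;   // cost of all re-evaluations performed
  long long cur= 0;     // precision of the best known approximation
  for (long long n= 1; n <= n_max; n++) {
    if (n <= cur) continue;            // stored approximation good enough
    long long next= cur == 0 ? n : cur;
    while (next < n || T (next) < 2 * T (cur))
      next++;                          // smallest n' >= n with T(n') >= 2 T(cur)
    total += T (next);                 // re-evaluate at the overshot precision
    cur= next;
  }
  std::printf ("total cost of all re-evaluations: %lld\n", total);
  std::printf ("4 * T(n_max)                    : %lld\n", 4 * T (n_max));
}
</code>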
Let us illustrate the relaxed strategy on the concrete example of the computation of the sine function. We assume that we have implemented a suitable class interval for certified arbitrary precision computations with intervals. First of all, real_rep now becomes:

<\code>
class real_rep {
protected:
  interval best;
public:
  inline real_rep (): best (interval::fuzzy) {}
  virtual interval approx (const dyadic& tol) = 0;
};
</code>

Here interval::fuzzy stands for the interval $(-\infty, \infty)$. Sines of numbers are represented by instances of the following class:

<\code>
class sin_real_rep: public real_rep {
  real x;
public:
  inline sin_real_rep (const real& x2): x (x2) {
    best= interval (-1, 1); }
  interval approx (const dyadic& tol);
};
</code>

The approximation algorithm is given by

<\code>
interval sin_real_rep::approx (const dyadic& tol) {
  if (tol < radius (best)) {
    interval xa= x->approx (tol * (1 - DELTA));
    int required_prec= 2 - expo (tol);
    int proposed_prec= next_prec (-expo (radius (best)));
    xa= truncate (xa, max (required_prec, proposed_prec));
    best= sin (xa);
  }
  return best;
}
</code>

This algorithm requires some explanations. First of all, DELTA stands for a small number like $2^{-w}$, where $w$ is the machine word size. We use it for countering the effect of rounding errors in the subsequent computations. Next, expo stands for the function which gives the exponent of a dyadic number; we have $2^{\operatorname{expo}(x)-1} \le |x| < 2^{\operatorname{expo}(x)}$ for all non-zero $x \in \mathbb{D}$. The function next_prec (see below) applied to a precision $n$ computes a precision $n' > n$ such that $T(n') \ge 2\,T(n)$, where $T$ stands for the complexity of the evaluation of the sine function. Now we recall that the argument $x$ may already have been approximated much better than we need here (this is typically the case if a more precise value for the argument is needed in another part of the global computation). We therefore truncate the precision of xa to the precision we actually need, before computing the next approximation for the sine of $x$.

The function next_prec may for instance be implemented as follows:

<\code>
int next_prec (int prec) {
  if (prec <= 16 * WORD_SIZE) return 1.41 * prec + WORD_SIZE;
  else if (prec <= 256 * WORD_SIZE) return 1.55 * prec;
  else if (prec <= 4096 * WORD_SIZE) return 1.71 * prec;
  else return 2 * prec;
}
</code>

The different thresholds correspond to the precisions where we use naive multiplication, Karatsuba multiplication and two ranges of F.F.T. multiplication. The multiplication factors are chosen so that the evaluation cost approximately doubles from one precision to the next: for instance, $1.41 \approx \sqrt{2}$ doubles a quadratic cost and $1.55 \approx 2^{1/\log_2 3}$ doubles a Karatsuba cost $\sim n^{\log_2 3}$. We finally have the following algorithm for computing sines:

<\code>
inline real sin (const real& x) {
  return new sin_real_rep (x); }
</code>

Remark 1. A current limitation of the above approach is that it only takes into account the local computational complexity at each node and not the global complexity of all induced computations. For instance, in the routine approx for the computation of $\sin x$, it may happen that the precision $n'$ of xa is only slightly higher than the precision $n$ of best. In that case, the next approximation of $\sin x$ is only computed at precision $n' = n + O(1)$ and not at a precision (like next_prec($n$)) which doubles the computation time. When this problem occurs repeatedly, we thereby lose the principal advantage of the relaxed approach. Nevertheless, it should be noticed that the slight redundancy in the precision $n'$ can only occur if this higher precision was requested by a cheaper computation than the present one (except when $f$ has an exponential computational complexity). In fact, the author currently does not know of any natural example of a computation where the above problem occurs systematically (so as to make the computation time increase by more than a constant factor).

One remedy is to delay the actual evaluation until the very end and first gradually increase precisions until we are sure that the total re-evaluation cost is about twice as large as the previous evaluation cost (considering only those nodes which are re-evaluated). However, this is only possible if we do not need the actual result of a re-evaluation during the ``prospective phase'' in which we increase the precisions. Yet another approach would be to systematically use the real number analogue of relaxed power series evaluations [7]. However, this requires the re-implementation of all basic operations on floating point numbers in a relaxed way. It also involves an additional overhead.
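To make the growth prescribed by next_prec concrete, the following standalone snippet (with WORD_SIZE set to 64 as an arbitrary assumption) iterates the function above, starting from one machine word, and prints the resulting ladder of precisions. Roughly twenty steps suffice to go from one machine word to over a million bits, which is why systematically overshooting in this way remains cheap.

<\code>
#include <cstdio>

const int WORD_SIZE= 64;   // assumed machine word size, for illustration

// next_prec as in the text above
int next_prec (int prec) {
  if (prec <= 16 * WORD_SIZE) return 1.41 * prec + WORD_SIZE;
  else if (prec <= 256 * WORD_SIZE) return 1.55 * prec;
  else if (prec <= 4096 * WORD_SIZE) return 1.71 * prec;
  else return 2 * prec;
}

int main () {
  int prec= WORD_SIZE;
  while (prec < (1 << 20)) {           // up to about a million bits
    int next= next_prec (prec);
    std::printf ("%8d -> %8d bits (ratio %.2f)\n",
                 prec, next, double (next) / prec);
    prec= next;
  }
}
</code>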
5. Balanced error estimates

One of the main problems with a priori error estimates, as we mentioned in section 2, is that badly balanced expressions lead to overly pessimistic bounds. In this section we introduce the general technique of ``balanced error estimates'', which solves this problem as well as possible in a general purpose system. The idea behind balanced error estimates is to distribute the required tolerance over the subexpressions in a way which is proportional to their weights. Here leaves have weight $1$ and the weight of an expression $f(x_1,\ldots,x_r)$ is given by $\operatorname{wt} f(x_1,\ldots,x_r) = \operatorname{wt} x_1 + \cdots + \operatorname{wt} x_r$.

Remark 2. We recall that expressions are really dags, so the size of an expression may be exponentially smaller than its weight. Indeed, this can be seen by constructing an expression using repeated squarings $x, x \cdot x, (x \cdot x) \cdot (x \cdot x), \ldots$. It is therefore important to represent the weight by a dyadic number (i.e. in floating point form) with an exponent which fits into a machine word.

Let us first illustrate the strategy of balanced error estimates in the case of addition. So assume that we want to compute an $\varepsilon$-interval for a sum $x + y$, where $x$ has weight $p$ as an expression and $y$ has weight $q$. Then we will compute a $\frac{p}{p+q}\,\varepsilon$-interval for $x$ and a $\frac{q}{p+q}\,\varepsilon$-interval for $y$. For example, in the case of the expression $x_1+(x_2+(x_3+(x_4+x_5)))$, a tolerance $\varepsilon$ at the root is distributed as follows: $x_1$ receives $\frac{1}{5}\,\varepsilon$ and the subexpression $x_2+(x_3+(x_4+x_5))$ receives $\frac{4}{5}\,\varepsilon$, which is in turn split into $\frac{1}{5}\,\varepsilon$ for $x_2$ and $\frac{3}{5}\,\varepsilon$ for $x_3+(x_4+x_5)$, and so on, so that each leaf $x_i$ ends up with the tolerance $\frac{1}{5}\,\varepsilon$. Subtraction is treated in a similar way as addition.

Remark 3. Consider a dag of weight $w$ with one root and only additions and subtractions. If a tolerance $\varepsilon$ is specified at the root, then the above algorithm requires a tolerance $w_i\,\varepsilon/w$ at each node of weight $w_i$. Notice that any correct algorithm should require a tolerance of at most $\varepsilon$ instead of $w_i\,\varepsilon/w$ at such a node. In other words, the precision with respect to which we compute never exceeds the absolutely necessary precision by more than $\log_2 w$ binary digits.

Remark 4. Unfortunately, $\log_2 w$ can be linear in the size of the dag, as we saw in the case of repeated squarings (and more generally in the case of loops or recursive functions). Nevertheless, if $w$ is exponential in the size, then the precision loss generally only causes the overall complexity to increase by a constant factor. Let us illustrate this point by an example: consider the computation of a $2^{-n}$-approximation of $x_n$, where
\[ x_n = f(f(\cdots f(1)\cdots)) \quad (n \text{ times}), \qquad f(x) = x + x + x. \]
Then our algorithm requires the computation of approximations of the $x_i$ at precisions of at most $n + n\log_2 3$ binary digits, which takes a time $O(n\,(n + n\log 3)) = O(n^2)$. The optimal algorithm would only require an approximation of each $x_i$ at precision $n + (n-i)\log_2 3$, which again leads to a global complexity of $O(n^2)$. This example should be compared to the computation of a $1$-approximation of the badly balanced sum
\[ 1+(1+(\cdots+(1+1)\cdots)) \]
with $n$ additions. Our balanced algorithm only requires a time $O(n\log n)$, which is essentially optimal in this case, whereas a non-balanced algorithm would yield a quadratic complexity $O(n^2)$.
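Returning to the addition example above, the following self-contained sketch builds the right comb $x_1+(x_2+(\cdots+(x_9+x_{10})\cdots))$ as a toy expression tree and distributes a root tolerance proportionally to the weights. Every leaf ends up with one tenth of the tolerance, instead of the exponentially small shares produced by the halving rule of section 2. The node type and the use of double precision tolerances are illustration-only simplifications.

<\code>
#include <cstdio>
#include <memory>
#include <vector>

// Toy expression nodes: either a leaf or a sum of two subexpressions.
// Leaves have weight 1 and wt (a + b) = wt (a) + wt (b), as in the text.
struct node {
  std::unique_ptr<node> left, right;   // both null for a leaf
  long weight;
  node (): weight (1) {}
  node (std::unique_ptr<node> l, std::unique_ptr<node> r):
    left (std::move (l)), right (std::move (r)),
    weight (left->weight + right->weight) {}
};

// Balanced distribution: each operand receives a share of the
// tolerance which is proportional to its weight.
void distribute (const node* n, double tol, std::vector<double>& leaves) {
  if (!n->left) { leaves.push_back (tol); return; }
  double p= n->left->weight, q= n->right->weight;
  distribute (n->left.get (), tol * p / (p + q), leaves);
  distribute (n->right.get (), tol * q / (p + q), leaves);
}

int main () {
  // Build the right comb x1 + (x2 + (... + (x9 + x10)...)).
  std::unique_ptr<node> e= std::make_unique<node> ();
  for (int i= 0; i < 9; i++)
    e= std::make_unique<node> (std::make_unique<node> (), std::move (e));
  std::vector<double> leaves;
  distribute (e.get (), 1.0, leaves);               // root tolerance 1
  for (double t: leaves) std::printf ("%.3f ", t);  // each leaf gets 1/10
  std::printf ("\n");
}
</code>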
In the case of multiplication, the above additive distribution of tolerances can no longer be used and one has to resort to ``relative tolerances''. Here a relative tolerance $\delta$ for $x$ corresponds to an absolute tolerance of $\varepsilon = \delta\,|x|$. Now let $x$ and $y$ be effective real numbers of weights $p$ and $q$. Assume for simplicity that we are in the regular case when we have good starting approximations $\tilde x, \tilde y \in \mathbb{D}$ for $x$ and $y$, say of relative tolerances at most $2^{-w}$, where $w$ is the machine word size. Then, in order to compute an $\varepsilon$-interval for $x\,y$, we first distribute $\delta = \frac{\varepsilon}{|\tilde x|\,|\tilde y|}\,(1-2^{-w+2})$ over $\delta_x = \frac{p}{p+q}\,\delta$ and $\delta_y = \frac{q}{p+q}\,\delta$, and then compute a $(\delta_x\,|x|)$-interval for $x$ and a $(\delta_y\,|y|)$-interval for $y$. The product of these approximations yields an $\varepsilon$-interval for $x\,y$.

More generally, assume that $x_1,\ldots,x_r$ are effective real numbers of weights $p_1,\ldots,p_r$ and that we want to compute an $\varepsilon$-interval for $y = f(x_1,\ldots,x_r)$, where $f$ is a differentiable $r$-ary operation for which we are given an arbitrary precision evaluation algorithm at dyadic intervals. Before balancing tolerances as above, we first have to determine the quantities which have to be subdivided ($\varepsilon$ in the case of addition and $\delta$ -- really $\delta_x\,|x|$ and $\delta_y\,|y|$, as we will see below -- in the case of multiplication). We define the critical tolerances for $x_1,\ldots,x_r$ to be the minimal numbers $\varepsilon_1,\ldots,\varepsilon_r$ such that for any intervals $[\tilde x_1^-, \tilde x_1^+] \ni x_1,\ldots,[\tilde x_r^-, \tilde x_r^+] \ni x_r$ with
\[ f([\tilde x_1^-, \tilde x_1^+],\ldots,[\tilde x_r^-, \tilde x_r^+]) \subseteq [y-\varepsilon, y+\varepsilon], \qquad (1) \]
we have $\tilde x_i^+ - \tilde x_i^- \le \varepsilon_i$ $(i=1,\ldots,r)$. Let us assume for simplicity that we are in the regular case when there exist (computable) bounds $B_1,\ldots,B_r$ with
\[ 0 < \frac{B_i}{2} \le \left| \frac{\partial f}{\partial x_i} (\tilde x_1,\ldots,\tilde x_r) \right| \le 2\,B_i, \]
which are valid for all $\tilde x_i \in [x_i - \frac{4\varepsilon}{B_i}, x_i + \frac{4\varepsilon}{B_i}]$ $(i=1,\ldots,r)$. Then (1) implies in particular that $\tilde x_i^+ - \tilde x_i^- \le \frac{4\varepsilon}{B_i}$ for $i=1,\ldots,r$. Conversely, $\tilde x_i^+ - \tilde x_i^- \le \frac{\varepsilon}{2\,r\,B_i}$ $(i=1,\ldots,r)$ implies (1). This proves that $\frac{\varepsilon}{2\,r\,B_i} \le \varepsilon_i \le \frac{4\,\varepsilon}{B_i}$. For instance, in the case of addition, we may take $B_1 = B_2 = 1$ and we obtain $\frac{\varepsilon}{4} \le \varepsilon_i \le 4\,\varepsilon$. In the case of multiplication, we may take $B_1 = |\tilde y|$ and $B_2 = |\tilde x|$ and obtain $\varepsilon_1 \approx \frac{\varepsilon}{|\tilde y|}$ and $\varepsilon_2 \approx \frac{\varepsilon}{|\tilde x|}$ up to bounded factors. Now, assuming that we have sharp lower bounds $\tilde\varepsilon_i$ for the $\varepsilon_i$ (like $\tilde\varepsilon_i = \frac{\varepsilon}{2\,r\,B_i}$), we compute a $\bigl(\tilde\varepsilon_i\,\frac{p_i}{p_1+\cdots+p_r}\bigr)$-interval for each $x_i$. Then we obtain the requested $\varepsilon$-interval for $y$ by evaluating $f$ at these intervals at a sufficient precision.

Remark 5. It can be checked by structural induction that this generalized scheme for the distribution of tolerances has the same essential property as in remark 3: given a dag of weight $w$, the precision with respect to which we compute never exceeds the absolutely necessary precision by more than about $\log_2 w$ binary digits.

The approach of balanced error estimates is more delicate in the irregular case. It is instructive to first understand the case of multiplication well; the other operations can probably be treated in a similar way. So assume that we want to compute a product $x\,y$. Let us first consider the semi-regular case when we still have a good initial approximation for one of the arguments, say $x$, but not for the other one. Then we first compute a $\bigl(\frac{\varepsilon}{|\tilde x|}\,(1-2^{-w+2})\bigr)$-interval for $y$. If this yields an approximation $\tilde y$ for $y$ with a good relative tolerance, then we may proceed as in the regular case. Otherwise, we obtain at least an upper bound for $|y|$ and we let this upper bound play the role of $|\tilde y|$. This leaves us with the purely irregular case when we have a strictly positive lower bound neither for $|x|$ nor for $|y|$. In this case, we have the choice between computing a more precise approximation for $x$ or for $y$, but it is not clear which choice is optimal from the complexity point of view. One solution to this problem may be to continuously improve the precisions of the approximations for both $x$ and $y$, while distributing the available computation time equally over $x$ and $y$. It is not clear yet to us how to estimate the complexity of such an algorithm. Another solution, which is easier to implement, is to use rough upper bounds for $|x|$ and $|y|$ instead of $|\tilde x|$ and $|\tilde y|$, while increasing the precision continuously. However, this strategy is certainly not optimal in some cases.
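As a small worked check of the regular case above, the following snippet computes the balanced tolerances for a product and verifies that the worst-case error of the product stays below the requested tolerance. All numerical values are arbitrary choices for illustration, and the slack factor $1-10^{-7}$ plays the role of the factor $1-2^{-w+2}$ above, absorbing the second order term.

<\code>
#include <cstdio>
#include <cmath>

int main () {
  double x= 3.14159, y= 2.71828;   // good starting approximations x~, y~
  double p= 2, q= 3;               // weights of x and y as expressions
  double eps= 1e-6;                // requested absolute tolerance on x*y

  // Relative tolerance to distribute, with a small slack factor.
  double delta= eps / (std::fabs (x) * std::fabs (y)) * (1 - 1e-7);
  double dx= delta * p / (p + q);  // relative tolerance for x
  double dy= delta * q / (p + q);  // relative tolerance for y
  double ax= dx * std::fabs (x);   // absolute tolerance for x
  double ay= dy * std::fabs (y);   // absolute tolerance for y

  // Worst-case error of the product for perturbations within tolerance:
  double err= ax * std::fabs (y) + ay * std::fabs (x) + ax * ay;
  std::printf ("absolute tolerances: %.3g (x), %.3g (y)\n", ax, ay);
  std::printf ("worst-case product error %.6g, eps %.6g\n", err, eps);
  std::printf ("check: %s\n", err <= eps ? "err <= eps" : "err > eps");
}
</code>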
6. Conclusion and notes on optimality

We have presented two new ``adaptive error estimating'' techniques for computing with effective real numbers. We believe these techniques to be optimal in virtually all situations which occur in practice. It remains interesting to study the theoretical optimality in more detail.

This first requires the introduction of a suitable notion of optimality. Assume that we have a library of effective real functions $f_1,\ldots,f_p$, where $f_i$ has arity $r_i$ (constants corresponding to arity $0$). Then any computation using this library involves only a finite number of expressions $E_1,\ldots,E_q$ constructed using the $f_i$. These expressions actually correspond to a directed acyclic graph, since they may share common subexpressions. Assume that the aim of the computation is to obtain an $\varepsilon_i$-approximation for each $E_i$ with $i \in S$, for some subset $S \subseteq \{1,\ldots,q\}$. A certified solution to this problem consists of assigning a dyadic interval to each node of the dag, such that

- The interval $[\tilde y_i^-, \tilde y_i^+]$ associated to each $E_i$ with $i \in S$ satisfies $\tilde y_i^+ - \tilde y_i^- \le 2\,\varepsilon_i$.
- For each subexpression $E = f_j(E_{i_1},\ldots,E_{i_{r_j}})$ with $r_j > 0$, the interval associated to $E$ is obtained by evaluating $f_j$ at the intervals associated to $E_{i_1},\ldots,E_{i_{r_j}}$.

The certified solution is said to be optimal, if the time needed to compute all interval labels is minimal. Modulo rounding errors, finding such an optimal solution corresponds to determining optimal tolerances for the leaves of the dag (which is equivalent to determining the lengths of the intervals associated to the leaves).

Remark 6. The above modeling does simplify reality a bit: in practice, some of the expressions $E_1,\ldots,E_q$ may themselves depend on the way we compute things. Also, we may require the $\varepsilon_i$-approximations for the $E_i$ during the intermediate computations, and in an arbitrary order. A more elaborate model would therefore consider a sequence of problems of the above type, with growing dags, growing sets $S$ and decreasing tolerances $\varepsilon_i$.

Clearly, it is difficult to efficiently find optimal certified solutions. In many cases, it should nevertheless be possible to find almost optimal solutions, for which the cost of the $\varepsilon_i$-approximations exceeds the optimal cost by at most a constant factor. We have already pointed out that our strategies are non-optimal in general (see remarks 1 and 4, and the discussion on the irregular case in section 5). Nevertheless, in some special cases, we do approach optimality. First of all, if we are computing with expression trees (and not dags), then the problem in remark 1 cannot occur and the precision loss in remark 4 is at most logarithmic in the size of the expression. Secondly, assume that all approximation algorithms for the $f_i$ have a polynomial time complexity $O(n^{\alpha_i})$, and consider a computation which essentially spends its time on high precision arithmetic. Then the precision loss in remark 4 only causes a slow-down by a constant factor, since $O((2\,n)^{\alpha_i}) = O(n^{\alpha_i})$.
Bibliography

[1] D. Richardson, How to recognize zero, J. Symbolic Comput. 24 (1997), 627-645.
[2] J. van der Hoeven, Zero-testing, witness conjectures and differential diophantine approximation, Tech. Rep. 2001-62, Prépublications d'Orsay, 2001.
[3] J. Blanck, V. Brattka, P. Hertling (Eds.), Computability and complexity in analysis, Vol. 2064 of Lect. Notes in Comp. Sc., Springer, 2001.
[4] J. Blanck, General purpose exact real arithmetic, Tech. Rep. CSR 21-200, Luleå University of Technology, Sweden, 2002.
[5] M. Grimmer, K. Petras, N. Revol, Multiple precision interval packages: comparing different approaches, Tech. Rep. RR 2003-32, LIP, École Normale Supérieure de Lyon, 2003.
[6] J. van der Hoeven, Automatic asymptotics, Ph.D. thesis, École polytechnique, France, 1997.
[7] J. van der Hoeven, Relax, but don't be too lazy, J. Symbolic Comput. 34 (2002), 479-542.
[8] J. van der Hoeven, GMPX, a C-extension library for GMP, no longer maintained, 1999.
[9] J. van der Hoeven, Mmxlib, a C++ core library for Mathemagix, 2003-2004.
[10] N. Müller, iRRAM, exact arithmetic in C++, 2000-2004.
[11] N. Revol, MPFI, a multiple precision interval arithmetic library, 2001-2004.
[12] T. Granlund et al., GMP, the GNU multiple precision arithmetic library, 1991-2004.
[13] G. Hanrot, V. Lefèvre, K. Ryde, P. Zimmermann, MPFR, a C library for multiple-precision floating-point computations with exact rounding, 2000-2004.
[14] J. van der Hoeven, Lazy multiplication of formal power series, in: W. W. Küchlin (Ed.), Proc. ISSAC '97, Maui, Hawaii, 1997, pp. 17-20.