s'authentifier
version française rss feed
Fiche détaillée  Récupérer au format
Versions disponibles :
ensl-00335792, version 2
Informatique/Arithmétique des ordinateurs
Computing floating-point square roots via bivariate polynomial evaluation
Claude-Pierre Jeannerod1, 2, Hervé Knochel3, Christophe Monat3, Guillaume Revy4
1 :  LIP - Laboratoire de l'Informatique du Parallélisme
2 :  Inria Grenoble Rhône-Alpes / LIP Laboratoire de l'Informatique du Parallélisme - ARENAIRE
3 :  ST-GRENOBLE - STMicroelectronics (Grenoble)
4 :  ParLab - Parallel Computing Laboratory
[ARENAIRE - Arithmétique des ordinateurs]
In this paper we show how to reduce the computation of correctly-rounded square roots of binary floating-point data to the fixed-point evaluation of some particular integer polynomials in two variables. By designing parallel and accurate evaluation schemes for such bivariate polynomials, we show further that this approach allows for high instruction-level parallelism (ILP) exposure, and thus potentially low latency implementations. Then, as an illustration, we detail a C implementation of our method in the case of IEEE 754-2008 binary32 floating-point data (formerly called single precision in the 1985 version of the IEEE 754 standard). This software implementation, which assumes 32-bit integer arithmetic only, is almost complete in the sense that it supports special operands, subnormal numbers, and all rounding modes, but not exception handling (that is, status flags are not set). Finally we have carried out experiments with this implementation using the ST200 VLIW compiler from STMicroelectronics. The results obtained demonstrate the practical interest of our approach in that context: for all rounding modes, the generated assembly code is optimally scheduled and has indeed low latency (23 cycles).
Anglais
Binary floating-point arithmetic – square root – correct rounding – polynomial evaluation – instruction-level parallelism – rounding error analysis – C software implementation – VLIW integer processor.
B2.4, G.1.9, G.4
LIP research report RR2008-38
Liste des fichiers attachés à ce document : 
PDF
JeKnMoRe09-ieeeTC-revision.pdf(1 MB)