Computing floating-point square roots via bivariate polynomial evaluation

Abstract : In this paper we show how to reduce the computation of correctly-rounded square roots of binary floating-point data to the fixed-point evaluation of some particular integer polynomials in two variables. By designing parallel and accurate evaluation schemes for such bivariate polynomials, we show further that this approach allows for high instruction-level parallelism (ILP) exposure, and thus potentially low latency implementations. Then, as an illustration, we detail a C implementation of our method in the case of IEEE 754-2008 binary32 floating-point data (formerly called single precision in the 1985 version of the IEEE 754 standard). This software implementation, which assumes 32-bit integer arithmetic only, is almost complete in the sense that it supports special operands, subnormal numbers, and all rounding modes, but not exception handling (that is, status flags are not set). Finally we have carried out experiments with this implementation using the ST200 VLIW compiler from STMicroelectronics. The results obtained demonstrate the practical interest of our approach in that context: for all rounding modes, the generated assembly code is optimally scheduled and has indeed low latency (23 cycles).
Type de document :
Pré-publication, Document de travail
LIP research report RR2008-38. 2010
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger
Contributeur : Claude-Pierre Jeannerod <>
Soumis le : vendredi 26 mars 2010 - 10:22:31
Dernière modification le : jeudi 17 janvier 2019 - 15:16:03
Document(s) archivé(s) le : mercredi 30 novembre 2016 - 16:28:48


Fichiers produits par l'(les) auteur(s)


  • HAL Id : ensl-00335792, version 2



Claude-Pierre Jeannerod, Hervé Knochel, Christophe Monat, Guillaume Revy. Computing floating-point square roots via bivariate polynomial evaluation. LIP research report RR2008-38. 2010. 〈ensl-00335792v2〉



Consultations de la notice


Téléchargements de fichiers