Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Computing floating-point square roots via bivariate polynomial evaluation

Abstract : In this paper we show how to reduce the computation of correctly-rounded square roots of binary floating-point data to the fixed-point evaluation of some particular integer polynomials in two variables. By designing parallel and accurate evaluation schemes for such bivariate polynomials, we show further that this approach allows for high instruction-level parallelism (ILP) exposure, and thus potentially low latency implementations. Then, as an illustration, we detail a C implementation of our method in the case of IEEE 754-2008 binary32 floating-point data (formerly called single precision in the 1985 version of the IEEE 754 standard). This software implementation, which assumes 32-bit integer arithmetic only, is almost complete in the sense that it supports special operands, subnormal numbers, and all rounding modes, but not exception handling (that is, status flags are not set). Finally we have carried out experiments with this implementation using the ST200 VLIW compiler from STMicroelectronics. The results obtained demonstrate the practical interest of our approach in that context: for all rounding modes, the generated assembly code is optimally scheduled and has indeed low latency (23 cycles).
Document type :
Preprints, Working Papers, ...
Complete list of metadata

Cited literature [18 references]  Display  Hide  Download
Contributor : Claude-Pierre Jeannerod Connect in order to contact the contributor
Submitted on : Friday, March 26, 2010 - 10:22:31 AM
Last modification on : Friday, February 4, 2022 - 3:19:14 AM
Long-term archiving on: : Wednesday, November 30, 2016 - 4:28:48 PM


Files produced by the author(s)


  • HAL Id : ensl-00335792, version 2



Claude-Pierre Jeannerod, Hervé Knochel, Christophe Monat, Guillaume Revy. Computing floating-point square roots via bivariate polynomial evaluation. 2010. ⟨ensl-00335792v2⟩



Record views


Files downloads