A new binary floating-point division algorithm and its software implementation on the ST231 processor

Claude-Pierre Jeannerod 1, 2 Hervé Knochel 3 Christophe Monat 3 Guillaume Revy 1, 2, * Gilles Villard 1, 2
* Auteur correspondant
2 ARENAIRE - Computer arithmetic
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : This paper deals with the design and implementation of low latency software for binary floating-point division with correct rounding to nearest. The approach we present here targets a VLIW integer processor of the ST200 family, and is based on fast and accurate programs for evaluating some particular bivariate polynomials. We start by giving approximation and evaluation error conditions that are sufficient to ensure correct rounding. Then we describe the heuristics used to generate such evaluation programs, as well as those used to automatically validate their accuracy. Finally, we propose, for the binary32 format, a complete C implementation of the resulting division algorithm. With the ST200 compiler and compared to previous implementations, the speed-up observed with our approach is by a factor of almost 1.8.
Type de document :
Pré-publication, Document de travail
LIP research report RR2008-39. 2009
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger

https://hal-ens-lyon.archives-ouvertes.fr/ensl-00335892
Contributeur : Claude-Pierre Jeannerod <>
Soumis le : mercredi 18 mars 2009 - 13:53:29
Dernière modification le : vendredi 20 juillet 2018 - 11:36:03
Document(s) archivé(s) le : samedi 26 novembre 2016 - 06:46:09

Fichier

fpdiv-hal-V2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : ensl-00335892, version 2

Collections

Citation

Claude-Pierre Jeannerod, Hervé Knochel, Christophe Monat, Guillaume Revy, Gilles Villard. A new binary floating-point division algorithm and its software implementation on the ST231 processor. LIP research report RR2008-39. 2009. 〈ensl-00335892v2〉

Partager

Métriques

Consultations de la notice

236

Téléchargements de fichiers

102