Optimizing correctly-rounded reciprocal square roots for embedded VLIW cores

Claude-Pierre Jeannerod; Guillaume Revy

Communication Dans Un Congrès Année : 2009

Optimizing correctly-rounded reciprocal square roots for embedded VLIW cores

(1, 2) , (1, 2)

1
2

Claude-Pierre Jeannerod

Fonction : Auteur
PersonId : 855152

Computer arithmetic

Laboratoire de l'Informatique du Parallélisme

Guillaume Revy

Fonction : Auteur
PersonId : 20753
IdHAL : guillaume-revy
IdRef : 14704040X

Computer arithmetic

Laboratoire de l'Informatique du Parallélisme

Résumé

This paper presents an optimized software implementation of the reciprocal square root function, for IEEE binary32 floating-point data and with correct rounding to nearest. The main feature of this implementation is high instruction level parallelism (ILP) exposure, which results here from an extension of the polynomial evaluation-based method of~\cite{JeKnMoRe08} as well as from the design of a specific rounding procedure. This implementation proves to be very efficient for some VLIW processor cores like STMicroelectronics' ST231 (used mainly for embedded media processing), where a low latency of 29 cycles has been measured.

Mots clés

Reciprocal square root Binary floating-point arithmetic Correct rounding Polynomial evaluation Software implementation VLIW processor core

Domaines

Arithmétique des ordinateurs

Fichier principal

JeannerodRevyAsilomar09-finalversion.pdf (226.1 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Claude-Pierre Jeannerod : Connectez-vous pour contacter le contributeur

https://ens-lyon.hal.science/ensl-00391185

Soumis le : mercredi 25 novembre 2009-11:25:32

Dernière modification le : jeudi 11 mai 2023-11:56:10

Archivage à long terme le : samedi 26 novembre 2016-16:15:25

Dates et versions

ensl-00391185 , version 1 (03-06-2009)

ensl-00391185 , version 2 (25-11-2009)

Identifiants

HAL Id : ensl-00391185 , version 2

Citer

Claude-Pierre Jeannerod, Guillaume Revy. Optimizing correctly-rounded reciprocal square roots for embedded VLIW cores. Asilomar Conference on Signals, Systems, and Computers, Nov 2009, United States. ⟨ensl-00391185v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-LYON CNRS INRIA UNIV-LYON1 INRIA2 UDL

151 Consultations

663 Téléchargements

Optimizing correctly-rounded reciprocal square roots for embedded VLIW cores

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager