High-level algorithms for correctly-rounded reciprocal square roots
We analyze two fast and accurate algorithms recently presented by Borges for computing x^{-1/2} in binary floating-point arithmetic (assuming that efficient and correctly-rounded FMA and square root are available). The first algorithm is based on the Newton-Raphson iteration, and the second one uses...
Saved in:
| Published in: | Proceedings - Symposium on Computer Arithmetic pp. 18 - 25 |
|---|---|
| Main Authors: | , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
01.09.2022
|
| Subjects: | |
| ISSN: | 2576-2265 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | We analyze two fast and accurate algorithms recently presented by Borges for computing x^{-1/2} in binary floating-point arithmetic (assuming that efficient and correctly-rounded FMA and square root are available). The first algorithm is based on the Newton-Raphson iteration, and the second one uses an order-3 iteration. We give attainable relative-error bounds for these two algorithms, build counterexamples showing that in very rare cases they do not provide a correctly-rounded result, and characterize precisely when such failures happen in IEEE 754 binary32 and binary64 arithmetics. We then give a generic (i.e., precision-independent) algorithm that always returns a correctly-rounded result, and show how it can be simplified and made more efficient in the important cases of binary32 and binary64. |
|---|---|
| ISSN: | 2576-2265 |
| DOI: | 10.1109/ARITH54963.2022.00013 |