Skip to main content
Log in

Decimal Square Root: Algorithm and Hardware Implementation

  • Published:
Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Abstract

We propose a new digit recurrence decimal square root (DSR) design and provide its ASIC implementation. The interim square root digits are in \([ {-5,5} ]\). The proposed architecture generally follows that of a previous radix-10 divider. However, it provides novel solutions with regard to few DSR-specific challenges. For example, complex error analysis shows that only four (out of sixteen) digits of partial square root is sufficient to estimate partial remainders that are required for the more complicated square root digit selection. This design performs about 10 % faster and consumes 28 % less area than the previously reported ASIC digit recurrence decimal square rooter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. H. Baliga, N. Cooray , E. Gamsaragan, P. Smith, K. Yoon, J. Abel, A. Valles, Improvements in the Intel Core2 Penryn processor family architecture and microarchitecture. Intel Technol. J., 179–192 (2008). http://www.intel.com/technology/itj/2008/v12i3/3-paper/1-abstract.htm

  2. N. Burgess, C.N. Hinds, Design of the ARM VFP11 divide and square root synthesisable macrocell, in Proceedings of the 18th IEEE Symposium on Computer Arithmetic, pp. 87–96 (2007)

  3. F.Y. Busaba, C.A. Krygowski, W.H. Li, E.M. Schwarz, S.R. Carlough, The IBM z900 decimal arithmetic unit, in Asilomar Conference on Signals, Systems, and Computers, vol. 2, pp. 1335–1339 (2001)

  4. M.D. Ercegovac, T. Lang, On-the-fly rounding. IEEE Trans. Comput. 41(12), 1497–1503 (1992)

    Article  Google Scholar 

  5. M.D. Ercegovac, T. Lang, Digital Arithmetic (Morgan Kaufmann Publishers, Los Altos, 2004)

    Google Scholar 

  6. M.D. Ercegovac, R. McIlhenny, Design and FPGA implementation of radix-10 algorithm for division with limited precision primitives, in Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers (2008)

  7. M.D. Ercegovac, R. McIlhenny, Design and FPGA implementation of radix-10 algorithm for square root with limited precision primitives, in Proceedings of the 43rd Asilomar Conference on Signals, Systems and Computers (2009)

  8. M.D. Ercegovac, R. McIlhenny, Design and FPGA implementation of radix-10 combined division/square root algorithm with limited precision primitives, in Proceedings of the 44rd Asilomar Conference on Signals, Systems and Computers (2010)

  9. G. Gerwig, H. Wetter, E.M. Schwarz, J. Haess, High performance floating-point unit with 116 bit wide divider, in Proceedings of the 16th Symposium on Computer Arithmetic, pp. 87–94 (2003)

  10. L. Han, S. Ko, High speed parallel decimal multiplication with redundant internal encoding. IEEE Trans. Comput. 62(5), 956–968 (2013)

    Article  MathSciNet  Google Scholar 

  11. IEEE Inc., IEEE 754-2008 Standard for Floating-Point Arithmetic (2008)

  12. G. Jaberipur, B. Parhami, M. Ghodsi, Weighted two-valued digit-set encodings: unifying efficient hardware representation schemes for redundant number systems. IEEE Trans. Circuits Syst. I 52(7), 1348–1357 (2005)

    Article  MathSciNet  Google Scholar 

  13. A. Kaivani, A. Hosseiny, G. Jaberipur, Improving the speed of decimal divider. IET Comput. Digit. Tech. 5(5), 393–404 (2011)

    Article  Google Scholar 

  14. A. Kaivani, S.-B. Ko, Decimal SRT square root: algorithm and architecture. Circuits Syst. Signal Process. 32(5), 2137–2150 (2013)

    Article  Google Scholar 

  15. T. Lang, A. Nannarelli, A radix-10 digit-recurrence division unit: algorithm and architecture. IEEE Trans. Comput. 56(6), 727–739 (2007)

    Article  MathSciNet  Google Scholar 

  16. H. Nikmehr, B. Philips, Fast decimal floating-point division. IEEE Trans. VLSI 14(9), 951–961 (2006)

    Article  Google Scholar 

  17. NVIDIA. Fermi. NVIDIA’s Next Generation CUDA Computer Architecture. http://www.nvidia.com/content/PDF/fermiwhitepapers/NVIDIAFermiComputeArchitectureWhitepaper.pdf

  18. S.F. Oberman, Floating-point division and square root algorithms and implementation in the AMD-K7 microprocessor, in Proceedings of the 14th Symposium on Computer Arithmetic, pp. 106–115 (1999)

  19. R. Raafat, A. Mohamed, H.A.H. Fahmy, Y. Farouk, M. Elkhouly, T. Eldeeb, R. Samy, Decimal Floating-Point Square-Root Unit Using Newton–Raphson Iterations. United States Patent Application Publication, Pub. No: US 2012/0011182 A1 (2012)

  20. R.K. Richards, Arithmetic Operations in Digital Computers (Van Nostrand, New York, 1955)

    Google Scholar 

  21. M. Schmookler, A. Weinberger, High speed decimal addition. IEEE Trans. Comput. C-20(8), 862–866 (1971)

  22. E.M. Schwarz, Power6 decimal divider, in Proceedings of the IEEE International Conference on Applied-Specific Systems, Architecture and Processors (ASAP), pp. 128–133 (2007)

  23. A. Vazquez, E. Antelo, E.P. Montuschi, (2007) A radix-10 SRT divider based on alternative BCD codings, in XXV IEEE International Conference on Computer Design (ICCD 2007), Lake Tahoe, CA, USA, pp. 280–287

  24. L.K. Wang, M.J. Schulte, (2004) Decimal floating-point division using Newton-Raphson iteration, in Proceedings of the IEEE International Conference on Applied-Specific Systems, Architecture, and Processors (ASAP), pp. 84–95

  25. L.K. Wang, M.J. Schulte, (2005) Decimal floating-point square root using Newton–Raphson iteration, in Proceedings of the IEEE International Conference on Applied-Specific Systems, Architecture, Processors (ASAP), pp. 309–315

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ghassem Jaberipur.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hosseiny, A., Jaberipur, G. Decimal Square Root: Algorithm and Hardware Implementation. Circuits Syst Signal Process 35, 4195–4219 (2016). https://doi.org/10.1007/s00034-015-0215-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00034-015-0215-1

Keywords

Navigation