ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Computer Physics Communications
Volume 178, Issue 11, 1 June 2008, Pages 817-834
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (484 K)

  E-mail Article   
  Add to my Quick Links   
Bookmark and share in 2collab (opens in new window)
Request permission to reuse this article
  Cited By in Scopus (0)
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.cpc.2008.01.045    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2008 Elsevier B.V. All rights reserved.

On ERI sorting for SIMD execution of large-scale Hartree–Fock SCF

Tirath Ramdasa, Corresponding Author Contact Information, E-mail The Corresponding Author, Gregory K. Egana, E-mail The Corresponding Author, David Abramsonb, E-mail The Corresponding Author and Kim K. Baldridgec, E-mail The Corresponding Author

aCentre for Telecommunications and Information Engineering, Monash University, Melbourne, Australia bCentre for Distributed Systems and Software Engineering, Monash University, Melbourne, Australia cOrganic Chemistry Institute, University of Zurich, Switzerland

Received 23 August 2007; 
revised 18 January 2008; 
accepted 22 January 2008. 
Available online 1 February 2008.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Given the resurgent attractiveness of single-instruction-multiple-data (SIMD) processing, it is important for high-performance computing applications to be SIMD-capable. The Hartree–Fock SCF (HF-SCF) application, in it's canonical form, cannot fully exploit SIMD processing. Prior attempts to implement Electron Repulsion Integral (ERI) sorting functionality to essentially “SIMD-ify” the HF-SCF application have met frustration because of the low throughput of the sorting functionality. With greater awareness of computer architecture, we discuss how the sorting functionality may be practically implemented to provide high-performance. Overall system performance analysis, including memory locality analysis, is also conducted, and further emphasises that a system with ERI sorting is capable of very high throughput. We discuss two alternative implementation options, with one immediately accessible software-based option discussed in detail. The impact of workload characteristics on expected performance is also discussed, and it is found that in general as basis set size increases the potential performance of the system also increases. Consideration is given to conventional CPUs, GPUs, FPGAs, and the Cell Broadband Engine architecture.

Keywords: Hartree–Fock self consistent field; Electron repulsion integrals; Single-instruction-multiple-data processing

PACS classification codes: 02.60.Pn; 07.05.Bx; 31.15.Ar; 89.20.Ff

Article Outline

1. Introduction
2. Application characteristics
3. Computer architecture overview
3.1. Previous vectorisation efforts
3.2. ERIC special-purpose processor
3.3. Functional requirements of ERI sorting
3.4. Proposed hardware architecture
4. System-level details
4.1. Tuple sorting
4.2. Searchword LUT
5. Performance
5.1. SIMD unit utilisation
5.2. Basis set access locality
6. Impact of workload characteristics
6.1. Unrolling contractions
6.2. Thread-matching with contractions
6.3. Workload characteristics
7. Integral blocks
8. Implementation practicalities
8.1. Software solution
8.2. Fully custom hardware solution
8.3. SIMD ERI processing
9. Conclusions and further work
References

















 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.