Skip to main content
Log in

Tracing application program execution on the CRAY X-MP and CRAY-2

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Important insights into program operation can be gained by observing dynamic execution behavior. Unfortunately, many high-performance machines provide execution profile summaries as the only tool for performance investigation. We have developed a tracing library for the CRAY X-MP and CRAY-2 supercomputers that supports the low-overhead capture of execution events for sequential and multitasked programs. This library has been extended to use the automatic instrumentation facilities on these machines, allowing trace data from routine entry and exit, and other program segments, to be captured. To assess the utility of the trace-based tools, three of the Perfect Benchmark codes have been tested in scalar and vector modes with the tracing instrumentation. In addition to computing summary execution statistics from the traces, interesting execution dynamics appear when studying the trace histories. It is also possible to model application performance based on properties identified from traces. Our conclusion is that adding tracing support in Cray supercomputers can have significant returns in improved performance characterization and evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Berry, M. 1989. The Perfect Club Benchmarks: Effective performance evaluation of supercomputers. The Internat. J. Supercomputer Applications, 3, 3 (fall), 5–40.

    Google Scholar 

  • Carrington, D. 1983. Profiling under ELSI UNIX. Software Practice and Experience, 16, 9 (Sept.), 865–873.

    Google Scholar 

  • Cray Research, Inc. 1989. UNICOS Performance Utilities Reference Manual. May.

  • Fromm, H., Hercksen, U., Herzog, U., John, K., Klar, R., and Kleinoder, W. 1983. Experiences with performance measurement and modeling of a processor array. IEEE Trans. Comps., 32, 1 (Jan.).

    Google Scholar 

  • Gehringer, E., Siewiorek, D., and Segall, Z. 1987. Parallel Processing: The CM * Experience. Digital Press.

  • Graham, S., Kessler, P., and McKusick, M. 1982. gprof: A call graph execution profiler. In Proc., SIGPLAN '82 Symp. on Compiler Construction (Boston, June), ACM Press, pp. 120–126.

  • Graham, S., Kessler, P., and McKusick, M. 1983. An execution profiler for modular programs. Software Practice and Experience, 13: 671–685.

    Google Scholar 

  • Jameson, A. 1983. Solution of the Euler equations for a two-dimensional transonic flow by a multigrid method. Applied Math. and Comp., 13:327.

    Google Scholar 

  • Larson, J. 1985. CRAY X-MP hardware performance monitor. Cray Channels.

  • Larson, J., and Lutz, R. 1985. Perftrace user guide. Tech. rept., Cray Research, Inc. (Aug.).

  • Malony, A. 1990. Performance observability. Ph.D. thesis, Dept. of Comp. Sci., Univ. of Ill. at Urbana-Champaign, Urbana, Ill.

    Google Scholar 

  • Malony, A. 1991. Event based performance perturbation: A case study. In Third ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (to appear).

  • Malony, A., Reed, D., and Wijshoff, H. 1989. Performance measurement intrusion and perturbation analysis. Tech. rept. CSRD-923, Univ. of Ill., Center for Supercomputing Research and Development, Urbana, Ill.

    Google Scholar 

  • Mellor-Crummey, J. 1989. Debugging and analysis of large-scale parallel programs. Ph.D. thesis, Dept. of Comp. Sci., Univ. of Rochester, Rochester, N.Y.

    Google Scholar 

  • Model, M. 1978. Monitoring system behavior in a complex computational environment. Ph.D. thesis, Stanford Univ., Stanford, Calif.

    Google Scholar 

  • Noor, A., and Peters, J. 1985. Model-size reduction techniques for the analysis of symmetric anisotropic structures. Eng. Computations, 2, 4 (Apr.), 285.

    Google Scholar 

  • Orszag, S. 1984. Order and disorder in two and three-dimensional Benard convection. J. Fluid Mechanics, 174:1.

    Google Scholar 

  • Pointer, L. 1990. Perfect: Performance evaluation for cost-effective transformations—Report 2. Tech. rept. CSRD No. 964, Univ. of Ill. at Urbana-Champaign, Center for Supercomputing Research and Development, Urbana, Ill.

    Google Scholar 

  • Segall, Z., and Rudolph, L. 1985. PIE: A programming and instrumentation environment for parallel processing. IEEE Software, 2, 6 (Nov.), 22–37.

    Google Scholar 

  • Simmons, M., Koskela, R., and Bucher, L, eds. 1989. Instrumentation for Future Parallel Computing Systems. ACM Press.

  • Simmons, M., Koskela, R., and Bucher, I., eds. 1990. Parallel Computer Systems: Performance Instrumentation and Visualization. ACM Press.

Download references

Author information

Authors and Affiliations

Authors

Additional information

Supported in part by the National Science Foundation under Grants No. NSF MIP-88-07775 and No. NSF ASC-84-04556, and the NASA Ames Research Center Grant No. NCC-2-559.

Supported in part by the National Science Foundation under grant NSF ASC-84-04556.

Supported in part by the National Science Foundation under grants NSF CCR-86-57696, NSF CCR-87-06653 and NSF CDA-87-22836 and by the National Aeronautics and Space Administration under NASA Contract Number NAG-1-613.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Malony, A.D., Larson, J.L. & Reed, D.A. Tracing application program execution on the CRAY X-MP and CRAY-2. J Supercomput 5, 137–162 (1991). https://doi.org/10.1007/BF00127841

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00127841

Keywords

Navigation