
A model and compilation strategy for out-of-core data parallel programs

Published: 01 August 1995

Abstract

It is widely acknowledged in high-performance computing circles that parallel input/output needs substantial improvement in order to make scalable computers truly usable. We present a data storage model that allows processors independent access to their own data and a corresponding compilation strategy that integrates data-parallel computation with data distribution for out-of-core problems. Our results compare several communication methods and I/O optimizations using two out-of-core problems, Jacobi iteration and LU factorization.
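To make the out-of-core idea concrete, the following is a minimal sketch of one Jacobi sweep over an array that lives in a file and is processed one slab at a time. It is not the compilation strategy or storage model from the paper: it is a single-process, one-dimensional illustration, and the file layout, the slab size, and all function names (`write_local_array`, `read_slab`, `jacobi_sweep_out_of_core`) are assumptions made for this example.

```python
import os
import tempfile
from array import array

ITEM = array("d").itemsize  # bytes per stored double


def write_local_array(path, values):
    """Write one (hypothetical) processor-local array to its own file."""
    with open(path, "wb") as f:
        array("d", values).tofile(f)


def read_slab(f, start, count):
    """Read `count` doubles beginning at element index `start`."""
    f.seek(start * ITEM)
    buf = array("d")
    buf.fromfile(f, count)
    return buf


def jacobi_sweep_out_of_core(src_path, dst_path, n, slab):
    """One 1-D Jacobi sweep: new[i] = (old[i-1] + old[i+1]) / 2.

    At most `slab` + 2 input elements are in memory at a time.
    The boundary values at i = 0 and i = n - 1 are copied unchanged.
    """
    with open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        for lo in range(0, n, slab):
            hi = min(lo + slab, n)
            # Extend the read by one halo element per side where one exists.
            rlo, rhi = max(lo - 1, 0), min(hi + 1, n)
            old = read_slab(src, rlo, rhi - rlo)
            new = array("d")
            for i in range(lo, hi):
                if i == 0 or i == n - 1:
                    new.append(old[i - rlo])
                else:
                    new.append(0.5 * (old[i - 1 - rlo] + old[i + 1 - rlo]))
            new.tofile(dst)  # slabs are written back in ascending order


def jacobi_sweep_in_core(values):
    """Reference sweep with the whole array in memory, for comparison."""
    n = len(values)
    return [
        values[i] if i in (0, n - 1) else 0.5 * (values[i - 1] + values[i + 1])
        for i in range(n)
    ]


if __name__ == "__main__":
    workdir = tempfile.mkdtemp()
    src = os.path.join(workdir, "old.dat")
    dst = os.path.join(workdir, "new.dat")
    data = [float(i * i) for i in range(10)]
    write_local_array(src, data)
    jacobi_sweep_out_of_core(src, dst, len(data), slab=3)
    with open(dst, "rb") as f:
        result = list(read_slab(f, 0, len(data)))
    print(result == jacobi_sweep_in_core(data))
```

The slab size sets the trade-off in this sketch: larger slabs mean fewer reads but more memory, and each slab re-reads up to two halo elements that neighbouring slabs also read.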




Published In

ACM SIGPLAN Notices, Volume 30, Issue 8
Aug. 1995
226 pages
ISSN: 0362-1340
EISSN: 1558-1160
DOI: 10.1145/209937

PPOPP '95: Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
August 1995
234 pages
ISBN: 0897917006
DOI: 10.1145/209936
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

Publisher

Association for Computing Machinery

New York, NY, United States



