article

Free access

A model and compilation strategy for out-of-core data parallel programs

Authors:

Michael PalecznyAuthors Info & Claims

ACM SIGPLAN Notices, Volume 30, Issue 8

Pages 1 - 10

https://doi.org/10.1145/209937.209938

Published: 01 August 1995 Publication History

PDF eReader

Abstract

It is widely acknowledged in high-performance computing circles that parallel input/output needs substantial improvement in order to make scalable computers truly usable. We present a data storage model that allows processors independent access to their own data and a corresponding compilation strategy that integrates data-parallel computation with data distribution for out-of-core problems. Our results compare several communication methods and I/O optimizations using two out-of-core problems, Jacobi iteration and LU factorization.

References

[1]

J.R. Allen and K. Kennedy. Automatic translation of Fortran programs to vector form. ACM Transactions on Programming Languages and Systems, 9(4):491-542, October 1987.

Digital Library

Google Scholar

[2]

W. Abu-Sufah. Improving the Performance of Virtual Memory Computers. PhD thesis, Dept. of Computer Science, University of illinois at Urbana-Champaign, 1979.

Digital Library

Google Scholar

[3]

A. Choudhary, R. Bordawekar, M. Harry, R. Krishnaiyer, R. Ponnusamy, T. Singh, and R. Thakur. PASSION: Parallel and Scalable Software for Input-Output. Technical Report SCCS- 636, NPAC, Syracuse University, Sep 1994.

Google Scholar

[4]

S. Cart and K. Kennedy. Compiler blockabilty of numerical algorithms. Prec. of Supercomputing'92, November 1992.

Digital Library

Google Scholar

[5]

J. del Rosario, R. Bordawekar, and A. Choudhary. Improved parallel I/O via a two-phase runtime access strategy. In Proceedings of the Workshop on I/O in Parallel Computer Systems at IPPS '93, April 1993.

Google Scholar

[6]

G. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, and C. Tseng. Fortran D language specifications. Technical Report COMP TR90- 141, Rice University, 1990.

Google Scholar

[7]

G. Fox. The architecture of problems and portable parallel software systems. Technical Report SCCS-78b, Northeast Parallel Architectures Center, Syracuse University, Syracuse, NY 13244, 1991.

Google Scholar

[8]

High Performance Fortran Forum. High Performance Fortran language specification. Scientific Programming, 2(1-2):1-170, 1993.

Crossref

Google Scholar

[9]

D. Kotz. Disk-Directed I/O for MIMD multiprocessors. Technical Report PCS-TR94-226, Dept. of Computer Science, Dartmouth College, July 1994.

Digital Library

Google Scholar

[10]

T. Mowry. Tolerating Latency Through Software Controlled Data Prefetching. PhD thesis, Department of Computer Science, Stanford University, March 1994.

Digital Library

Google Scholar

[11]

K.S. Trivedi. Prepaging and applications to the STAR-100 computer. In Proceedings of the Symposium on High Speed Computer and Algorithm Organization, pages 435-446, April 1977.

Google Scholar

[12]

K.S. Trivedi. On the paging performance of array algorithms. IEEE Transactions on Computers, C-26(10):938-947, October 1977.

Digital Library

Google Scholar

[13]

H. Zima, P. Brezany, B. Chapman, P. Mehrotra, and A. Schwald. Vienna Fortran - a Language Specification. Technical Report ICASE Interim Report 21, MS 132c, ICASE, NASA, Hampton VA 23681, 1992.

Google Scholar

Cited By

View all

Gómez-Iglesias AVienne JHamidouche KSimmons CBarth WPanda D(2015)Scalable Out-of-core OpenSHMEM Library for HPCRevised Selected Papers of the Second Workshop on OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies - Volume 939710.1007/978-3-319-26428-8_9(138-153)Online publication date: 4-Aug-2015
https://dl.acm.org/doi/10.1007/978-3-319-26428-8_9
Ha LKruger JComba JSilva CJoshi S(2012)ISPIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2012.3218:6(838-851)Online publication date: 1-Jun-2012
https://dl.acm.org/doi/10.1109/TVCG.2012.32
Ha LKrüger JComba JJoshi SSilva C(2011)Optimal multi-image processing streaming framework on parallel heterogeneous systemsProceedings of the 11th Eurographics conference on Parallel Graphics and Visualization10.5555/2386230.2386232(1-10)Online publication date: 10-Apr-2011
https://dl.acm.org/doi/10.5555/2386230.2386232
Show More Cited By

Index Terms

A model and compilation strategy for out-of-core data parallel programs
1. Computing methodologies
  1. Concurrent computing methodologies
    1. Concurrent programming languages
2. Software and its engineering
  1. Software notations and tools
    1. Compilers
    2. General programming languages
      1. Language types
        Concurrent programming languages

Recommendations

Compilation of out-of-core data parallel programs for distributed memory machines
Special issue on input/output in parallel computer systems

We are developing a compiler and runtime support system called PASSION (Parallel And Scalable Software for Input-Output), to translate out-of-core programs written in a data parallel language like HPF to message passing node programs with explicit ...
Advanced data-parallel compilation
A model and compilation strategy for out-of-core data parallel programs
PPOPP '95: Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming

It is widely acknowledged in high-performance computing circles that parallel input/output needs substantial improvement in order to make scalable computers truly usable. We present a data storage model that allows processors independent access to their ...

Comments

Information & Contributors

Information

Published In

ACM SIGPLAN Notices Volume 30, Issue 8

Aug. 1995

226 pages

ISSN:0362-1340

EISSN:1558-1160

DOI:10.1145/209937

Editors:
Richard L. Wexelblat
IRS IS:AO, Washington, DC
,
Jeanne Ferrante
Univ. of California, San Diego
,
David Padua
Univ. of Illinois at Urbana-Champaign, Urbana

Issue’s Table of Contents

PPOPP '95: Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
August 1995
234 pages
ISBN:0897917006
DOI:10.1145/209936
Chairmen:
Jeanne Ferrante
Univ. of California, San Diego
,
David Padua
Univ. of Illinois at Urbana-Champaign, Urbana
,
Editor:
Richard L. Wexelblat
IRS IS:AO, Washington, DC

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 1995

Published in SIGPLAN Volume 30, Issue 8

Check for updates

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

53
Total Citations
View Citations
375
Total Downloads

Downloads (Last 12 months)63
Downloads (Last 6 weeks)21

Reflects downloads up to 23 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Gómez-Iglesias AVienne JHamidouche KSimmons CBarth WPanda D(2015)Scalable Out-of-core OpenSHMEM Library for HPCRevised Selected Papers of the Second Workshop on OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies - Volume 939710.1007/978-3-319-26428-8_9(138-153)Online publication date: 4-Aug-2015
https://dl.acm.org/doi/10.1007/978-3-319-26428-8_9
Ha LKruger JComba JSilva CJoshi S(2012)ISPIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2012.3218:6(838-851)Online publication date: 1-Jun-2012
https://dl.acm.org/doi/10.1109/TVCG.2012.32
Ha LKrüger JComba JJoshi SSilva C(2011)Optimal multi-image processing streaming framework on parallel heterogeneous systemsProceedings of the 11th Eurographics conference on Parallel Graphics and Visualization10.5555/2386230.2386232(1-10)Online publication date: 10-Apr-2011
https://dl.acm.org/doi/10.5555/2386230.2386232
Ma WKrishnamoorthy SAgrawal G(2011)Practical Loop Transformations for Tensor Contraction Expressions on Multi-level Memory HierarchiesCompiler Construction10.1007/978-3-642-19861-8_15(266-285)Online publication date: 2011
https://doi.org/10.1007/978-3-642-19861-8_15
Xiaogang Li Ruoming Jin Agrawal G(2003)A compilation framework for distributed memory parallelization of data mining algorithmsProceedings International Parallel and Distributed Processing Symposium10.1109/IPDPS.2003.1213080(8)Online publication date: 2003
https://doi.org/10.1109/IPDPS.2003.1213080
Ferreira RAgrawal GSaltz J(2002)Data parallel language and compiler support for data intensive applicationsParallel Computing10.1016/S0167-8191(02)00093-528:5(725-748)Online publication date: 1-May-2002
https://dl.acm.org/doi/10.1016/S0167-8191%2802%2900093-5
Ferreira RAgrawal GJin RSaltz J(2001)Compiling Data Intensive Applications with Spatial CoordinatesLanguages and Compilers for Parallel Computing10.1007/3-540-45574-4_22(339-354)Online publication date: 4-Dec-2001
https://doi.org/10.1007/3-540-45574-4_22
Sun N(1999)Reference implementation of scalable I/O low-level API on Intel ParagonJournal of Computer Science and Technology10.1007/BF0294850914:3(206-223)Online publication date: May-1999
https://doi.org/10.1007/BF02948509
Frens JWise D(1997)Auto-blocking matrix-multiplication or tracking BLAS3 performance from source codeACM SIGPLAN Notices10.1145/263767.26378932:7(206-216)Online publication date: 21-Jun-1997
https://dl.acm.org/doi/10.1145/263767.263789
Frens JWise DSchreiber RPingali K(1997)Auto-blocking matrix-multiplication or tracking BLAS3 performance from source codeProceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming10.1145/263764.263789(206-216)Online publication date: 21-Jun-1997
https://dl.acm.org/doi/10.1145/263764.263789
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Cited By

Index Terms

Recommendations

Compilation of out-of-core data parallel programs for distributed memory machines

Advanced data-parallel compilation