Abstract
Weather forecasting, especially snowfall prediction, was critical in the 2018 Winter Olympics, where the accuracy of the predictions was of key importance for the planning of the different Olympic events. It was a significant challenge for the authors to meet the requirements in time and forecast resolution, while doing their best to be as competitive as possible. All the forecasts were obtained using the Weather Research and Forecasting (WRF) model, executed on the GALGO supercomputer. In order to obtain the best performance and meet the required execution times, different combinations of compilers, Message Passing Interface (MPI) libraries and computing platforms were tested to seek the best combinations. This work proposes an empirical score of special interest to supercomputer maintainers, developers and scientists, which can be useful to obtain the best WRF configuration for their systems. Additionally, we found substantial performance differences when using different combinations of compilers, MPI libraries and hybrid shared memory paradigms, although these differences varied depending on the underlying platform. As conclusion, after all the tests we performed, we chose the combination with Intel compilers, Intel MPI library and OpenMP for the production system tasked to perform the weather forecasts for the Winter Olympic Games.
Similar content being viewed by others
References
Balachandran L, Almomany A, Alquraan A (2014) GCC vs. ICC comparison using parsec benchmarks. Int J Innov Technol Explor Eng 4(7):76–82
Han Y (2013) Application performance evaluation on different compiler optimizations. Adv Comput Sci Appl (ACSA) 2(3):410–415
Machado RS, Almeida RB, Jardim AD, Pernas AM, Yamin AC, Cavalheiro GGH (2017) Comparing performance of C compilers optimizations on different multicore architectures. In 2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), pp 25–30
Madani B, Al-Shaikh R (2010) Performance modeling and MPI evaluation using Westmere-based Infiniband HPC cluster. In: 2010 Fourth UKSim European Symposium on Computer Modeling and Simulation, pp 363–368
Ong H, Farrell PA (2000) Performance comparison of LAM/MPI, MPICH, and MVICH on a Linux cluster connected by a Gigabit Ethernet network. In: Proceedings of the 4th Annual Linux Showcase & Conference—Volume 4, ALS’00, USENIX Association, Berkeley, pp 31–31
Vinter B, Bjørndalen JM , Anshus OJ, Larsen T (2004) A comparison of three MPI implementations. In: Proceedings of Communicating Process Architectures (CPA 2004). Morgan Kaufman
Drosinos N, Koziris N (2004) Performance comparison of pure MPI vs hybrid MPI-OpenMP parallelization models on SMP clusters. In: Proceedings of 18th International Parallel and Distributed Processing Symposium, 2004. p 15
Jin H, Jespersen D, Mehrotra P, Biswas R, Huang L, Chapman B (2011) High performance computing using MPI and OpenMP on multi-core parallel systems. Parallel Comput 37(9):562–575 Emerging Programming Paradigms for Large-Scale Scientific Computing
Rabenseifner R, Hager G, Jost G (2009) Hybrid MPI/OpenMP parallel programming on clusters of multi-core SMP nodes. In: 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp 427–436
Chorley MJ, Walker DW (2010) Performance analysis of a hybrid MPI/OpenMP application on multi-core clusters. J Comput Sci 1(3):168–174
Kotobi A, Abdul Hamid NAW, Othman M, Hussin M (2014) Performance analysis of hybrid OpenMP/MPI based on multi-core cluster architecture. In: 2014 International Conference on Computational Science and Technology (ICCST), pp 1–6
Evans JP, Ekström M, Ji F (2012) Evaluating the performance of a WRF physics ensemble over south-east Australia. Clim Dyn 39(6):1241–1258
Andraju P, Kanth LA, Kumari VK, Rao SVB (2019) Performance optimization of operational WRF model configured for Indian Monsoon region. Earth Syst Environ 3:231–239
Bayá R, Porrini C, Pedemonte M, Ezzatti P (2018) Task parallelism in the WRF model through computation offloading to many-core devices. In: 2018 26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), pp 596–600
Malakar P, Saxena V, George T, Mittal R, Kumar S, Naim AG, Husain SA, bin Hj H (2012) Performance evaluation and optimization of nested high resolution weather simulations. In: Kaklamanis C, Papatheodorou T, Spirakis PG (eds) Euro-par 2012 parallel processing. Springer, Berlin, pp 805–817
Shainer G, Liu T, Michalakes J, Liberman J, Layton J, Celebioglu O, Schultz S, Mora J, Cownie D (2009) Weather Research and Forecast (WRF) model performance and profiling analysis on advanced multi-core HPC clusters. In: 10th LCI International Conference on High-Performance Clustered Computing
Langkamp T, Böhner J (2011) Influence of the compiler on multi-CPU performance of WRFv3. Geosci Model Dev 4(3):611–623
Christidis Z (2015) Performance and scaling of WRF on three different parallel supercomputers. In: Kunkel JM, Ludwig T (eds) High performance computing. Springer, Cham, pp 514–528
Osthoff C, Souto RP, Dias PLS, Panetta J, Lopes P (2012) Atmospheric model cluster performance evaluation on hybrid MPI/OpenMP/Cuda programming model platform. In: 2012 31st International Conference of the Chilean Computer Science Society, pp 216–222
Schepke C, Maillard N, Osthoff C, Dias PS, Pannetta J (2010) Performance evaluation of an atmospheric simulation model on multi-core environments. In: Proceedings of Conferencia Latino Americana de Computación de Alto Rendimiento, pp 330–332
Wang Y, Jiang J, Zhang J, He J, Zhang H, Chi X, Yue T (2018) An efficient parallel algorithm for the coupling of global climate models and regional climate models on a large-scale multi-core cluster. J Supercomput 74:3999–4018
Morrison H, Milbrandt JA (2015) Parameterization of cloud microphysics based on the prediction of bulk ice particle properties part. i: scheme description and idealized tests. J Atmos Sci 72(1):287–311
Hong SY, Noh Y, Dudhia J (2006) A new vertical diffusion package with an explicit treatment of entrainment processes. Mon Weather Rev 134(9):2318–2341
Mlawer Eli J, Taubman Steven J, Brown Patrick D, Iacono Michael J, Clough Shepard A (1997) Radiative transfer for inhomogeneous atmospheres: RRTM, a validated correlated-k model for the longwave. J Geophys Res Atmos 102(D14):16663–16682
Kain JS, Fritsch JM (1993) Convective parameterization for mesoscale models: The Kain-Fritsch scheme. In: Emanuel KA, Raymond DJ (eds) The representation of cumulus convection in numerical models. Meteorological monographs. American Meteorological Society, Boston, MA
Niu GY, Yang ZL, Mitchell KE, Chen F, Ek MB, Barlage M, Kumar A, Manning K, Niyogi D, Rosero E et al (2011) The community Noah land surface model with multiparameterization options (Noah-MP): 1. Model description and evaluation with local-scale measurements. J Geophys Res Atmos 116(D12109):1���19
Hong S-Y, Dudhia J (2004) Testing of a new nonlocal boundary layer vertical diffusion scheme in numerical weather prediction applications. In: 20th Conference on Weather Analysis and Forecasting/16th Conference on Numerical Weather Prediction
Acknowledgements
Funding from projects CGL2013-48367-P, CGL2016-80609-R (AEI/FEDER, UE), and 1365002970/KMA2018-00721 (Korea Meteorological Administration) is gratefully acknowledged. RM acknowledges support from Grant FPI BES-2014-069430 for conducting his PhD.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Moreno, R., Arias, E., Cazorla, D. et al. Seeking the best Weather Research and Forecasting model performance: an empirical score approach. J Supercomput 76, 9629–9653 (2020). https://doi.org/10.1007/s11227-020-03219-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-020-03219-9