Abstract
Data streaming applications are an important class of data-intensive systems. Performance is an essential quality of such systems. It is, for example, expressed by the delay of analysis results or the utilization of system resources. Architecture-level decisions such as the configuration of sources, sinks and operations, their deployment or the choice of technology impact the performance. Current component-based performance prediction approaches cannot accurately predict the performance of those systems, because they do not support the metrics that are specific to data streaming applications and only approximate the behavior of data stream operations instead of expressing it explicitly. In particular, operations that group multiple data events and thus introduce timing dependencies between different calls to the system are not represented sufficiently. In this paper, we present an approach for modeling networks of data stream operations including their parameters with the goal of predicting the performance of the resulting composed data streaming application. The approach is based on a component-based performance model with queueing semantics for processing resources. Our evaluation shows that our model can more accurately express the behavior of the system, resulting in a more expressive performance model compared to a well-encapsulated component-based model without data stream operations.
Partially funded by the German Research Foundation (DFG) as part of the Research Training Group GRK 2153: “Energy Status Data – Informatics Methods for its Collection, Analysis and Exploitation”.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
The data is available publicly via the website of the challenge [9].
References
Aliabadi, S.K., et al.: Analytical composite performance models for big data applications. J. Netw. Comput. Appl. 142, 63–75 (2019)
Arasu, A., Babu, S., Widom, J.: The CQL continuous query language: semantic foundations and query execution. VLDB J. 15(2), 121–142 (2006)
Casale, G., Li, C.: Enhancing big data application design with the DICE framework. In: Mann, Z.Á., Stolz, V. (eds.) ESOCC 2017. CCIS, vol. 824, pp. 164–168. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-79090-9_13
Castiglione, A., et al.: Modeling performances of concurrent big data applications. Softw. Pract. Exper. 45(8), 1127–1144 (2015)
DICE consortium: Deliverable 2.4 DICE Deployment Abstractions, European Union’s Horizon 2020 programme (2017). http://www.dice-h2020.eu/deliverables/
DICE consortium: Deliverable 3.4 DICE simulation tools, European Union’s Horizon 2020 programme (2017). http://www.dice-h2020.eu/deliverables/
Happe, L., Buhnova, B., Reussner, R.: Stateful component-based performance models. Softw. Syst. Model. 13(4), 1319–1343 (2013). https://doi.org/10.1007/s10270-013-0336-6
Hummel, O., et al.: A collection of software engineering challenges for big data system development. In: Euromicro SEAA, pp. 362–369. IEEE (2018)
Jerzak, Z., Ziekow, H.: DEBS 2014 grand challenge: smart homes - DEBS.org. https://debs.org/grand-challenges/2014/
Jerzak, Z., Ziekow, H.: The DEBS 2014 grand challenge. In: DEBS 2014, pp. 266–269. ACM (2014)
Kroß, J., Krcmar, H.: Model-based performance evaluation of batch and stream applications for big data. In: MASCOTS, pp. 80–86. IEEE (2017)
Kroß, J., Krcmar, H.: PerTract: model extraction and specification of big data systems for performance prediction by the example of apache spark and hadoop. Big Data Cogn. Comput. 3(3), 47 (2019)
Maddodi, G., Jansen, S., Overeem, M.: Aggregate architecture simulation in event-sourcing applications using layered queuing networks. In: ICPE 2020, pp. 238–245. ACM (2020)
Mazkatli, M., et al.: Incremental calibration of architectural performance models with parametric dependencies. In: ICSA 2020. IEEE (2020)
Meijer, E.: Your mouse is a database. ACM Queue 10(3), 20 (2012)
Rathfelder, C.: Modelling event-based interactions in component-based architectures for quantitative system evaluation. In: The Karlsruhe Series on Software Design and Quality, KIT Scientific Publishing (2013)
Reussner, R.H., et al.: Modeling and Simulating Software Architectures - The Palladio Approach. MIT Press, Cambridge (2016)
Sachs, K.: Performance modeling and benchmarking of event-based systems. Ph.D. thesis, Darmstadt University of Technology (2011)
Werle, D.: GitHub repository of palladio indirections. https://github.com/PalladioSimulator/Palladio-Addons-Indirections
Werle, D.: Data Stream Operations as First-Class Entities in Component-Based Performance Models - Auxiliary Material (2020). https://doi.org/10.5281/zenodo.3937718
Werle, D., Seifermann, S., Koziolek, A.: Data stream operations as first-class entities in palladio. In: SSP 2019. Softwaretechnik Trends (2019)
Wu, E., Diao, Y., Rizvi, S.: High-performance complex event processing over streams. In: SIGMOD, pp. 407–418. ACM (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Werle, D., Seifermann, S., Koziolek, A. (2020). Data Stream Operations as First-Class Entities in Component-Based Performance Models. In: Jansen, A., Malavolta, I., Muccini, H., Ozkaya, I., Zimmermann, O. (eds) Software Architecture. ECSA 2020. Lecture Notes in Computer Science(), vol 12292. Springer, Cham. https://doi.org/10.1007/978-3-030-58923-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-58923-3_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58922-6
Online ISBN: 978-3-030-58923-3
eBook Packages: Computer ScienceComputer Science (R0)