Skip to main content

Showing 1–14 of 14 results for author: Siegmund, N

  1. arXiv:2410.08801  [pdf, other

    cs.SE cs.IR

    A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency Validation

    Authors: Sebastian Simon, Alina Mailach, Johannes Dorn, Norbert Siegmund

    Abstract: Retrieval-augmented generation (RAG) is an umbrella of different components, design decisions, and domain-specific adaptations to enhance the capabilities of large language models and counter their limitations regarding hallucination and outdated and missing knowledge. Since it is unclear which design decisions lead to a satisfactory performance, developing RAG systems is often experimental and ne… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  2. arXiv:2203.10356  [pdf, other

    cs.SE

    On Debugging the Performance of Configurable Software Systems: Developer Needs and Tailored Tool Support

    Authors: Miguel Velez, Pooyan Jamshidi, Norbert Siegmund, Sven Apel, Christian Kästner

    Abstract: Determining whether a configurable software system has a performance bug or it was misconfigured is often challenging. While there are numerous debugging techniques that can support developers in this task, there is limited empirical evidence of how useful the techniques are to address the actual needs that developers have when debugging the performance of configurable software systems; most techn… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  3. arXiv:2106.02716  [pdf, other

    cs.SE

    VEER: Enhancing the Interpretability of Model-based Optimizations

    Authors: Kewen Peng, Christian Kaltenecker, Norbert Siegmund, Sven Apel, Tim Menzies

    Abstract: Many software systems can be tuned for multiple objectives (e.g., faster runtime, less required memory, less network traffic or energy consumption, etc.). Optimizers built for different objectives suffer from "model disagreement"; i.e., they have different (or even opposite) insights and tactics on how to optimize a system. Model disagreement is rampant (at least for configuration problems). Yet p… ▽ More

    Submitted 12 February, 2023; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: 27 pages, 7 figures, 4 tables, accepted by EMSE

    ACM Class: D.2; K.6.3

  4. arXiv:2102.06395  [pdf, other

    cs.SE

    White-Box Performance-Influence Models: A Profiling and Learning Approach

    Authors: Max Weber, Sven Apel, Norbert Siegmund

    Abstract: Many modern software systems are highly configurable, allowing the user to tune them for performance and more. Current performance modeling approaches aim at finding performance-optimal configurations by building performance models in a black-box manner. While these models provide accurate estimates, they cannot pinpoint causes of observed performance behavior to specific code regions. This does n… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  5. arXiv:2101.05362  [pdf, other

    cs.SE

    White-Box Analysis over Machine Learning: Modeling Performance of Configurable Systems

    Authors: Miguel Velez, Pooyan Jamshidi, Norbert Siegmund, Sven Apel, Christian Kästner

    Abstract: Performance-influence models can help stakeholders understand how and where configuration options and their interactions influence the performance of a system. With this understanding, stakeholders can debug performance behavior and make deliberate configuration decisions. Current black-box techniques to build such models combine various sampling and learning strategies, resulting in tradeoffs bet… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ICSE'21

  6. arXiv:1911.12643  [pdf, other

    cs.SE cs.LG

    Predicting Performance of Software Configurations: There is no Silver Bullet

    Authors: Alexander Grebhahn, Norbert Siegmund, Sven Apel

    Abstract: Many software systems offer configuration options to tailor their functionality and non-functional properties (e.g., performance). Often, users are interested in the (performance-)optimal configuration, but struggle to find it, due to missing information on influences of individual configuration options and their interactions. In the past, various supervised machine-learning techniques have been u… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

  7. arXiv:1905.02066  [pdf, other

    cs.SE

    ConfigCrusher: Towards White-Box Performance Analysis for Configurable Systems

    Authors: Miguel Velez, Pooyan Jamshidi, Florian Sattler, Norbert Siegmund, Sven Apel, Christian Kastner

    Abstract: Stakeholders of configurable systems are often interested in knowing how configuration options influence the performance of a system to facilitate, for example, the debugging and optimization processes of these systems. Several black-box approaches can be used to obtain this information, but they either sample a large number of configurations to make accurate predictions or miss important performa… ▽ More

    Submitted 14 July, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

  8. arXiv:1801.02175  [pdf, other

    cs.SE

    Finding Faster Configurations using FLASH

    Authors: Vivek Nair, Zhe Yu, Tim Menzies, Norbert Siegmund, Sven Apel

    Abstract: Finding good configurations for a software system is often challenging since the number of configuration options can be large. Software engineers often make poor choices about configuration or, even worse, they usually use a sub-optimal configuration in production, which leads to inadequate performance. To assist engineers in finding the (near) optimal configuration, this paper introduces FLASH, a… ▽ More

    Submitted 1 September, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

  9. arXiv:1712.07440  [pdf, other

    cs.SE

    On the Relation of External and Internal Feature Interactions: A Case Study

    Authors: Sergiy Kolesnikov, Norbert Siegmund, Christian Kästner, Sven Apel

    Abstract: Detecting feature interactions is imperative for accurately predicting performance of highly-configurable systems. State-of-the-art performance prediction techniques rely on supervised machine learning for detecting feature interactions, which, in turn, relies on time consuming performance measurements to obtain training data. By providing information about potentially interacting features, we can… ▽ More

    Submitted 22 January, 2018; v1 submitted 20 December, 2017; originally announced December 2017.

  10. arXiv:1709.02280  [pdf, other

    stat.ML cs.PF cs.SE

    Transfer Learning for Performance Modeling of Configurable Systems: An Exploratory Analysis

    Authors: Pooyan Jamshidi, Norbert Siegmund, Miguel Velez, Christian Kästner, Akshay Patel, Yuvraj Agarwal

    Abstract: Modern software systems provide many configuration options which significantly influence their non-functional properties. To understand and predict the effect of configuration options, several sampling and learning strategies have been proposed, albeit often with significant cost to cover the highly dimensional configuration space. Recently, transfer learning has been applied to reduce the effort… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: To appear in 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2017), 12 pages

  11. arXiv:1704.00234  [pdf, other

    cs.SE

    Transfer Learning for Improving Model Predictions in Highly Configurable Software

    Authors: Pooyan Jamshidi, Miguel Velez, Christian Kästner, Norbert Siegmund, Prasad Kawthekar

    Abstract: Modern software systems are built to be used in dynamic environments using configuration capabilities to adapt to changes and external uncertainties. In a self-adaptation context, we are often interested in reasoning about the performance of the systems under different configurations. Usually, we learn a black-box model based on real measurements to predict the performance of the system given a sp… ▽ More

    Submitted 20 April, 2017; v1 submitted 1 April, 2017; originally announced April 2017.

    Comments: To be published in the proceedings of the 12th International Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS'17)

  12. Using Bad Learners to find Good Configurations

    Authors: Vivek Nair, Tim Menzies, Norbert Siegmund, Sven Apel

    Abstract: Finding the optimally performing configuration of a software system for a given setting is often challenging. Recent approaches address this challenge by learning performance models based on a sample set of configurations. However, building an accurate performance model can be very expensive (and is often infeasible in practice). The central insight of this paper is that exact performance values (… ▽ More

    Submitted 28 June, 2017; v1 submitted 19 February, 2017; originally announced February 2017.

    Comments: 11 pages, 11 figures

  13. arXiv:1701.08106  [pdf, other

    cs.SE cs.LG

    Faster Discovery of Faster System Configurations with Spectral Learning

    Authors: Vivek Nair, Tim Menzies, Norbert Siegmund, Sven Apel

    Abstract: Despite the huge spread and economical importance of configurable software systems, there is unsatisfactory support in utilizing the full potential of these systems with respect to finding performance-optimal configurations. Prior work on predicting the performance of software configurations suffered from either (a) requiring far too many sample configurations or (b) large variances in their predi… ▽ More

    Submitted 3 August, 2017; v1 submitted 27 January, 2017; originally announced January 2017.

    Comments: 26 pages, 6 figures

  14. arXiv:1603.05038  [pdf, other

    stat.CO physics.data-an stat.AP

    CoinCalc -- A new R package for quantifying simultaneities of event series

    Authors: Jonathan F. Siegmund, Nicole Siegmund, Reik V. Donner

    Abstract: We present the new R package CoinCalc for performing event coincidence analysis (ECA), a novel statistical method to quantify the simultaneity of events contained in two series of observations, either as simultaneous or lagged coincidences within a user-specific temporal tolerance window. The package also provides different analytical as well as surrogate-based significance tests (valid under diff… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.