Showing 1–32 of 32 results for author: Sarro, F

  1. arXiv:2410.02482  [pdf, other]

    cs.SE

    It is Giving Major Satisfaction: Why Fairness Matters for Developers

    Authors: Emeralda Sesari, Federica Sarro, Ayushi Rastogi

    Abstract: Software practitioners often face unfairness in their work, such as unequal recognition of contributions, gender bias, and unclear criteria for performance reviews. While the link between fairness and job satisfaction has been established in other fields, its relevance to software professionals remains underexplored. This study aims to examine how fairness perceptions relate to job satisfaction am…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  2. arXiv:2409.19028  [pdf, other]

    cs.CL cs.SE quant-ph

    Exploring LLM-Driven Explanations for Quantum Algorithms

    Authors: Giordano d'Aloisio, Sophie Fortz, Carol Hanna, Daniel Fortunato, Avner Bensoussan, Eñaut Mendiluze Usandizaga, Federica Sarro

    Abstract: Background: Quantum computing is a rapidly growing new programming paradigm that brings significant changes to the design and implementation of algorithms. Understanding quantum algorithms requires knowledge of physics and mathematics, which can be challenging for software developers. Aims: In this work, we provide a first analysis of how LLMs can support developers' understanding of quantum cod…

    Submitted 26 September, 2024; originally announced September 2024.

  3. arXiv:2407.14982  [pdf, other]

    cs.CV cs.AI

    GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation

    Authors: Jingzhi Gong, Sisi Li, Giordano d'Aloisio, Zishuo Ding, Yulong Ye, William B. Langdon, Federica Sarro

    Abstract: Tuning the parameters and prompts for improving AI-based text-to-image generation has remained a substantial yet unaddressed challenge. Hence we introduce GreenStableYolo, which improves the parameters and prompts for Stable Diffusion to both reduce GPU inference time and increase image generation quality using NSGA-II and Yolo. Our experiments show that despite a relatively slight trade-off (18…

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: This paper is published in the SSBSE Challenge Track 2024

  4. arXiv:2405.03489  [pdf, other]

    cs.SE

    On the Influence of Data Resampling for Deep Learning-Based Log Anomaly Detection: Insights and Recommendations

    Authors: Xiaoxue Ma, Huiqi Zou, Pinjia He, Jacky Keung, Yishu Li, Xiao Yu, Federica Sarro

    Abstract: Numerous Deep Learning (DL)-based approaches have gained attention in software Log Anomaly Detection (LAD), yet class imbalance in training data remains a challenge, with anomalies often comprising less than 1% of datasets like Thunderbird. Existing DLLAD methods may underperform in severely imbalanced datasets. Although data resampling has proven effective in other software engineering tasks, it…

    Submitted 22 October, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 18 pages, 3 figures

  5. Understanding Fairness in Software Engineering: Insights from Stack Exchange

    Authors: Emeralda Sesari, Federica Sarro, Ayushi Rastogi

    Abstract: Software practitioners discuss problems at work with peers, in-person and online. These discussions can be technical (e.g., how to fix a bug?) and social (e.g., how to assign work fairly?). While there is a growing body of knowledge exploring fairness problems and solutions in the human and social factors of software engineering, most focus has been on specific problems. This study provides fairne…

    Submitted 2 August, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) 2024

  6. arXiv:2402.03186  [pdf, other]

    cs.SE

    Charting The Evolution of Solidity Error Handling

    Authors: Charalambos Mitropoulos, Maria Kechagia, Chrysostomos Maschas, Sotiris Ioannidis, Federica Sarro, Dimitris Mitropoulos

    Abstract: The usage of error handling in Solidity smart contracts is vital because smart contracts perform transactions that should be verified. Transactions that are not carefully handled may lead to program crashes and vulnerabilities, implying financial loss and legal consequences. While Solidity designers attempt to constantly update the language with new features, including error-handling (EH) feature…

    Submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2401.09275  [pdf, other]

    cs.SE

    Hot Fixing Software: A Comprehensive Review of Terminology, Techniques, and Applications

    Authors: Carol Hanna, David Clark, Federica Sarro, Justyna Petke

    Abstract: A hot fix is an unplanned improvement to a specific time-critical issue deployed to a software system in production. While hot fixing is an essential and common activity in software maintenance, it has never been surveyed as a research activity. Thus, such a review is long overdue. In this paper, we conduct a comprehensive literature review of work on hot fixing. We highlight the fields where this…

    Submitted 15 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  8. arXiv:2311.10516  [pdf, other]

    cs.SE

    User-Centric Deployment of Automated Program Repair at Bloomberg

    Authors: David Williams, James Callan, Serkan Kirbas, Sergey Mechtaev, Justyna Petke, Thomas Prideaux-Ghee, Federica Sarro

    Abstract: Automated program repair (APR) tools have unlocked the potential for the rapid rectification of codebase issues. However, to encourage wider adoption of program repair in practice, it is necessary to address the usability concerns related to generating irrelevant or out-of-context patches. When software engineers are presented with patches they deem uninteresting or unhelpful, they are burdened wi…

    Submitted 17 November, 2023; originally announced November 2023.

  9. arXiv:2311.04710  [pdf, other]

    cs.SE cs.AI

    The Quest for Content: A Survey of Search-Based Procedural Content Generation for Video Games

    Authors: Mar Zamorano, Carlos Cetina, Federica Sarro

    Abstract: Demand for video games is constantly increasing, which requires the costly production of large amounts of content. To address this challenge, researchers have developed Search-Based Procedural Content Generation (SBPCG), that is, the (semi-)automated creation of content through search algorithms. We survey the current state of SBPCG, reporting work that appeared in the field between 2011 and 2022 and identifying…

    Submitted 8 November, 2023; originally announced November 2023.

  10. arXiv:2310.19813  [pdf, ps, other]

    cs.SE cs.AI cs.LG cs.NE

    Enhancing Genetic Improvement Mutations Using Large Language Models

    Authors: Alexander E. I. Brownlee, James Callan, Karine Even-Mendoza, Alina Geiger, Carol Hanna, Justyna Petke, Federica Sarro, Dominik Sobania

    Abstract: Large language models (LLMs) have been successfully applied to software engineering tasks, including program repair. However, their application in search-based techniques such as Genetic Improvement (GI) is still largely unexplored. In this paper, we evaluate the use of LLMs as mutation operators for GI to improve the search process. We expand the Gin Java GI toolkit to call OpenAI's API to genera…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at the Symposium on Search-Based Software Engineering (SSBSE) 2023

    Journal ref: Arcaini, P., Yue, T., Fredericks, E.M. (eds) Search-Based Software Engineering. SSBSE 2023. Lecture Notes in Computer Science, vol 14415. Springer, Cham

  11. arXiv:2308.12264  [pdf, other]

    cs.LG cs.AI cs.PF cs.SE

    Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy Measurement

    Authors: Saurabhsingh Rajput, Tim Widmayer, Ziyuan Shang, Maria Kechagia, Federica Sarro, Tushar Sharma

    Abstract: With the increasing usage, scale, and complexity of Deep Learning (DL) models, their rapidly growing energy consumption has become a critical concern. Promoting green development and energy awareness at different granularities is the need of the hour to limit carbon emissions of DL systems. However, the lack of standard and repeatable tools to accurately measure and optimize energy consumption at…

    Submitted 1 February, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

  12. arXiv:2308.02935  [pdf, other]

    cs.CY cs.AI cs.CV cs.SE

    Bias Behind the Wheel: Fairness Testing of Autonomous Driving Systems

    Authors: Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Federica Sarro, Ying Zhang, Xuanzhe Liu

    Abstract: This paper conducts fairness testing of automated pedestrian detection, a crucial but under-explored issue in autonomous driving systems. We evaluate eight state-of-the-art deep learning-based pedestrian detectors across demographic groups on large-scale real-world datasets. To enable thorough fairness testing, we provide extensive annotations for the datasets, resulting in 8,311 images with 16,07…

    Submitted 17 October, 2024; v1 submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM Transactions on Software Engineering and Methodology (TOSEM)

  13. arXiv:2308.01923  [pdf, other]

    cs.LG cs.AI cs.CY cs.SE

    Fairness Improvement with Multiple Protected Attributes: How Far Are We?

    Authors: Zhenpeng Chen, Jie M. Zhang, Federica Sarro, Mark Harman

    Abstract: Existing research mostly improves the fairness of Machine Learning (ML) software regarding a single protected attribute at a time, but this is unrealistic given that many users have multiple protected attributes. This paper conducts an extensive study of fairness improvement regarding multiple protected attributes, covering 11 state-of-the-art fairness improvement methods. We analyze the effective…

    Submitted 4 April, 2024; v1 submitted 25 July, 2023; originally announced August 2023.

    Comments: Accepted by the 46th International Conference on Software Engineering (ICSE 2024). Please include ICSE in any citations

  14. arXiv:2306.01620  [pdf, other]

    cs.SE

    SuperFlow: Performance Testing for Serverless Computing

    Authors: Jinfeng Wen, Zhenpeng Chen, Federica Sarro, Xuanzhe Liu

    Abstract: Serverless computing is an emerging cloud computing paradigm that allows software engineers to develop cloud applications as a set of functions (called serverless functions). However, accurately obtaining the performance (i.e., response latency) of serverless functions is challenging due to the highly dynamic nature of the environment in which they run. To tackle this problem, a possible solution…

    Submitted 2 June, 2023; originally announced June 2023.

  15. arXiv:2305.18084  [pdf, other]

    cs.SE

    Assess and Summarize: Improve Outage Understanding with Large Language Models

    Authors: Pengxiang Jin, Shenglin Zhang, Minghua Ma, Haozhe Li, Yu Kang, Liqun Li, Yudong Liu, Bo Qiao, Chaoyun Zhang, Pu Zhao, Shilin He, Federica Sarro, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Cloud systems have become increasingly popular in recent years due to their flexibility and scalability. Each time cloud computing applications and services hosted on the cloud are affected by a cloud outage, users can experience slow response times, connection issues or total service disruption, resulting in a significant negative business impact. Outages are usually comprised of several concurri…

    Submitted 29 May, 2023; originally announced May 2023.

  16. arXiv:2305.04309  [pdf, other]

    cs.SE

    Revisiting the Performance of Serverless Computing: An Analysis of Variance

    Authors: Jinfeng Wen, Zhenpeng Chen, Federica Sarro, Xuanzhe Liu

    Abstract: Serverless computing is an emerging cloud computing paradigm, which allows software engineers to develop applications at the granularity of function (called serverless functions). However, multiple identical runs of the same serverless functions can show different performance (i.e., response latencies) due to the highly dynamic underlying environment where these functions are executed. We conduct…

    Submitted 7 May, 2023; originally announced May 2023.

  17. arXiv:2209.08321  [pdf, other]

    cs.SE cs.CY cs.LG

    Enhanced Fairness Testing via Generating Effective Initial Individual Discriminatory Instances

    Authors: Minghua Ma, Zhao Tian, Max Hort, Federica Sarro, Hongyu Zhang, Qingwei Lin, Dongmei Zhang

    Abstract: Fairness testing aims at mitigating unintended discrimination in the decision-making process of data-driven AI systems. Individual discrimination may occur when an AI model makes different decisions for two distinct individuals who are distinguishable solely according to protected attributes, such as age and race. Such instances reveal biased AI behaviour, and are called Individual Discriminatory…

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 19 pages, 7 figures

  18. arXiv:2209.00437  [pdf, ps, other]

    cs.SE

    Agile Effort Estimation: Have We Solved the Problem Yet? Insights From A Second Replication Study (GPT2SP Replication Report)

    Authors: Vali Tawosi, Rebecca Moussa, Federica Sarro

    Abstract: Fu and Tantithamthavorn have recently proposed GPT2SP, a Transformer-based deep learning model for SP estimation of user stories. They empirically evaluated the performance of GPT2SP on a dataset shared by Choetkiertikul et al., including 16 projects with a total of 23,313 issues. They benchmarked GPT2SP against two baselines (namely the naive Mean and Median estimators) and the method previously pr…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Report

  19. Test-based Patch Clustering for Automatically-Generated Patches Assessment

    Authors: Matias Martinez, Maria Kechagia, Anjana Perera, Justyna Petke, Federica Sarro, Aldeida Aleti

    Abstract: Previous studies have shown that Automated Program Repair (APR) techniques suffer from the overfitting problem. Overfitting happens when a patch is run and the test suite does not reveal any error, but the patch actually does not fix the underlying bug or it introduces a new defect that is not covered by the test suite. Therefore, the patches generated by APR tools need to be validated by human pr…

    Submitted 27 August, 2024; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Published in Springer Empirical Software Engineering, Volume 29, article number 116 (2024)

  20. arXiv:2207.10223  [pdf, other]

    cs.SE

    Fairness Testing: A Comprehensive Survey and Analysis of Trends

    Authors: Zhenpeng Chen, Jie M. Zhang, Max Hort, Mark Harman, Federica Sarro

    Abstract: Unfair behaviors of Machine Learning (ML) software have garnered increasing attention and concern among software engineers. To tackle this issue, extensive research has been dedicated to conducting fairness testing of ML software, and this paper offers a comprehensive survey of existing studies in this field. We collect 100 papers and organize them based on the testing workflow (i.e., how to test)…

    Submitted 6 March, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted by ACM Transactions on Software Engineering and Methodology (TOSEM 2024). Please include TOSEM in any citations

  21. arXiv:2207.07068  [pdf, other]

    cs.LG

    Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey

    Authors: Max Hort, Zhenpeng Chen, Jie M. Zhang, Mark Harman, Federica Sarro

    Abstract: This paper provides a comprehensive survey of bias mitigation methods for achieving fairness in Machine Learning (ML) models. We collect a total of 341 publications concerning bias mitigation for ML classifiers. These methods can be distinguished based on their intervention procedure (i.e., pre-processing, in-processing, post-processing) and the technique they apply. We investigate how existing bi…

    Submitted 11 October, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 52 pages, 7 figures

  22. arXiv:2207.03277  [pdf, other]

    cs.SE cs.AI

    A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers

    Authors: Zhenpeng Chen, Jie M. Zhang, Federica Sarro, Mark Harman

    Abstract: Software bias is an increasingly important operational concern for software engineers. We present a large-scale, comprehensive empirical study of 17 representative bias mitigation methods for Machine Learning (ML) classifiers, evaluated with 11 ML performance metrics (e.g., accuracy), 4 fairness metrics, and 20 types of fairness-performance trade-off assessment, applied to 8 widely-adopted softwar…

    Submitted 10 February, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted by ACM Transactions on Software Engineering and Methodology (TOSEM 2023). Please include TOSEM in any citations

  23. arXiv:2207.01705  [pdf, other]

    cs.SE cs.LG

    Do Not Take It for Granted: Comparing Open-Source Libraries for Software Development Effort Estimation

    Authors: Rebecca Moussa, Federica Sarro

    Abstract: In the past two decades, several Machine Learning (ML) libraries have become freely available. Many studies have used such libraries to carry out empirical investigations on predictive Software Engineering (SE) tasks. However, the differences stemming from using one library over another have been overlooked, implicitly assuming that using any of these libraries would provide the user with the same…

    Submitted 4 July, 2022; originally announced July 2022.

  24. arXiv:2202.12074  [pdf, ps, other]

    cs.SE cs.AI cs.LG

    On The Effectiveness of One-Class Support Vector Machine in Different Defect Prediction Scenarios

    Authors: Rebecca Moussa, Danielle Azar, Federica Sarro

    Abstract: Defect prediction aims at identifying software components that are likely to cause faults before software is made available to the end-user. To date, this task has been modeled as a two-class classification problem; however, its nature also allows it to be formulated as a one-class classification task. Previous studies show that One-Class Support Vector Machine (OCSVM) can outperform two-class cl…

    Submitted 23 March, 2024; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Published at SANER'24 (Winner of the Best RENE paper award) see https://conf.researchr.org/details/saner-2024/saner-2024-reproducibility-studies-and-negative-results--rene--track-/78/On-The-Effectiveness-of-One-Class-Support-Vector-Machine-in-Different-Defect-Predicti

  25. arXiv:2202.00979  [pdf, other]

    cs.SE

    A Versatile Dataset of Agile Open Source Software Projects

    Authors: Vali Tawosi, Afnan Al-Subaihin, Rebecca Moussa, Federica Sarro

    Abstract: Agile software development is nowadays a widely adopted practice in both open-source and industrial software projects. Agile teams typically heavily rely on issue management tools to document new issues and keep track of outstanding ones, in addition to storing their technical details, effort estimates, assignment to developers, and more. Previous work utilised the historical information stored in…

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: 5 pages, 1 figure

  26. arXiv:2201.05401  [pdf, other]

    cs.SE cs.LG stat.ML

    Agile Effort Estimation: Have We Solved the Problem Yet? Insights From A Replication Study

    Authors: Vali Tawosi, Rebecca Moussa, Federica Sarro

    Abstract: In the last decade, several studies have explored automated techniques to estimate the effort of agile software development. We perform a close replication and extension of a seminal work proposing the use of Deep Learning for Agile Effort Estimation (namely Deep-SE), which has set the state-of-the-art since. Specifically, we replicate three of the original research questions aiming at investigati…

    Submitted 17 December, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in IEEE Transactions on Software Engineering (TSE, 2022)

  27. arXiv:2110.09610  [pdf, other]

    cs.SE cs.LG

    A Survey on Machine Learning Techniques for Source Code Analysis

    Authors: Tushar Sharma, Maria Kechagia, Stefanos Georgiou, Rohit Tiwari, Indira Vats, Hadi Moazen, Federica Sarro

    Abstract: The advancements in machine learning techniques have encouraged researchers to apply these techniques to a myriad of software engineering tasks that use source code analysis, such as testing and vulnerability detection. Such a large number of studies hinders the community from understanding the current research landscape. This paper aims to summarize the current knowledge in applied machine learni…

    Submitted 13 September, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

  28. Sentinel: A Hyper-Heuristic for the Generation of Mutant Reduction Strategies

    Authors: Giovani Guizzo, Federica Sarro, Jens Krinke, Silvia Regina Vergilio

    Abstract: Mutation testing is an effective approach to evaluate and strengthen software test suites, but its adoption is currently limited by the mutants' execution computational cost. Several strategies have been proposed to reduce this cost (a.k.a. mutation cost reduction strategies); however, none of them has proven to be effective for all scenarios since they often need an ad-hoc manual selection and con…

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: in IEEE Transactions on Software Engineering

  29. arXiv:2010.16063  [pdf, other]

    cs.SE

    Do Users Care about Ad's Performance Costs? Exploring the Effects of the Performance Costs of In-App Ads on User Experience

    Authors: Cuiyun Gao, Jichuan Zeng, Federica Sarro, David Lo, Irwin King, Michael R. Lyu

    Abstract: Context: In-app advertising is the primary source of revenue for many mobile apps. The cost of advertising (ad cost) is non-negligible for app developers to ensure a good user experience and continuous profits. Previous studies mainly focus on addressing the hidden performance costs generated by ads, including consumption of memory, CPU, data traffic, and battery. However, there is no research ona…

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: 14 pages, accepted by Information and Software Technology (IST)

  30. arXiv:2010.03525  [pdf]

    cs.SE cs.GL

    Empirical Standards for Software Engineering Research

    Authors: Paul Ralph, Nauman bin Ali, Sebastian Baltes, Domenico Bianculli, Jessica Diaz, Yvonne Dittrich, Neil Ernst, Michael Felderer, Robert Feldt, Antonio Filieri, Breno Bernard Nicolau de França, Carlo Alberto Furia, Greg Gay, Nicolas Gold, Daniel Graziotin, Pinjia He, Rashina Hoda, Natalia Juristo, Barbara Kitchenham, Valentina Lenarduzzi, Jorge Martínez, Jorge Melegati, Daniel Mendez, Tim Menzies, Jefferson Molleri, et al. (18 additional authors not shown)

    Abstract: Empirical Standards are natural-language models of a scientific community's expectations for a specific kind of study (e.g. a questionnaire survey). The ACM SIGSOFT Paper and Peer Review Quality Initiative generated empirical standards for research methods commonly used in software engineering. These living documents, which should be continuously revised to reflect evolving consensus around resear…

    Submitted 4 March, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: For the complete standards, supplements and other resources, see https://github.com/acmsigsoft/EmpiricalStandards

  31. FrUITeR: A Framework for Evaluating UI Test Reuse

    Authors: Yixue Zhao, Justin Chen, Adriana Sejfia, Marcelo Schmitt Laser, Jie Zhang, Federica Sarro, Mark Harman, Nenad Medvidovic

    Abstract: UI testing is tedious and time-consuming due to the manual effort required. Recent research has explored opportunities for reusing existing UI tests from an app to automatically generate new tests for other apps. However, the evaluation of such techniques currently remains manual, unscalable, and unreproducible, which can waste effort and impede progress in this emerging area. We introduce FrUITeR…

    Submitted 3 November, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: ESEC/FSE 2020

  32. arXiv:1312.0086  [pdf, ps, other]

    cs.NE cs.DC

    A Framework for Genetic Algorithms Based on Hadoop

    Authors: Filomena Ferrucci, M-Tahar Kechadi, Pasquale Salza, Federica Sarro

    Abstract: Genetic Algorithms (GAs) are powerful metaheuristic techniques mostly used in many real-world applications. The sequential execution of GAs requires considerable computational power both in time and resources. Nevertheless, GAs are naturally parallel and accessing a parallel platform such as Cloud is easy and cheap. Apache Hadoop is one of the common services that can be used for parallel applicat…

    Submitted 15 December, 2013; v1 submitted 30 November, 2013; originally announced December 2013.