Skip to main content

Showing 1–25 of 25 results for author: Aleti, A

  1. arXiv:2409.07829  [pdf, other

    cs.SE

    Enabling Cost-Effective UI Automation Testing with Retrieval-Based LLMs: A Case Study in WeChat

    Authors: Sidong Feng, Haochuan Lu, Jianqin Jiang, Ting Xiong, Likun Huang, Yinglin Liang, Xiaoqin Li, Yuetang Deng, Aldeida Aleti

    Abstract: UI automation tests play a crucial role in ensuring the quality of mobile applications. Despite the growing popularity of machine learning techniques to generate these tests, they still face several challenges, such as the mismatch of UI elements. The recent advances in Large Language Models (LLMs) have addressed these issues by leveraging their semantic understanding capabilities. However, a sign… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  2. Experimental evaluation of architectural software performance design patterns in microservices

    Authors: Willem Meijer, Catia Trubiani, Aldeida Aleti

    Abstract: Microservice architectures and design patterns enhance the development of large-scale applications by promoting flexibility. Industrial practitioners perceive the importance of applying architectural patterns but they struggle to quantify their impact on system quality requirements. Our research aims to quantify the effect of design patterns on system performance metrics, e.g., service latency and… ▽ More

    Submitted 20 August, 2024; originally announced September 2024.

    Comments: The Journal of Systems & Software (2024)

  3. arXiv:2406.11753  [pdf, other

    cs.CL cs.LG

    A Semantic-based Layer Freezing Approach to Efficient Fine-Tuning of Language Models

    Authors: Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang

    Abstract: Finetuning language models (LMs) is crucial for adapting the models to downstream data and tasks. However, full finetuning is usually costly. Existing work, such as parameter-efficient finetuning (PEFT), often focuses on \textit{how to finetune} but neglects the issue of \textit{where to finetune}. As a pioneering work on answering where to finetune (at the layer level), we conduct a semantic anal… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures, under peer-review

  4. arXiv:2405.03326  [pdf, other

    cs.SE

    PAFOT: A Position-Based Approach for Finding Optimal Tests of Autonomous Vehicles

    Authors: Victor Crespo-Rodriguez, Neelofar, Aldeida Aleti

    Abstract: Autonomous Vehicles (AVs) are prone to revolutionise the transportation industry. However, they must be thoroughly tested to avoid safety violations. Simulation testing plays a crucial role in finding safety violations of Automated Driving Systems (ADSs). This paper proposes PAFOT, a position-based approach testing framework, which generates adversarial driving scenarios to expose safety violation… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Pre-print from AST 2024 conference

  5. arXiv:2402.11910  [pdf, other

    cs.SE

    Enhancing Large Language Models for Text-to-Testcase Generation

    Authors: Saranya Alagarsamy, Chakkrit Tantithamthavorn, Chetan Arora, Aldeida Aleti

    Abstract: Context: Test-driven development (TDD) is a widely employed software development practice that involves developing test cases based on requirements prior to writing the code. Although various methods for automated test case generation have been proposed, they are not specifically tailored for TDD, where requirements instead of code serve as input. Objective: In this paper, we introduce a text-to-t… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  6. arXiv:2401.16184  [pdf, other

    cs.CL cs.LG

    Vocabulary-Defined Semantics: Latent Space Clustering for Improving In-Context Learning

    Authors: Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang

    Abstract: In-context learning enables language models (LM) to adapt to downstream data or tasks by incorporating few samples as demonstrations within the prompts. It offers strong performance without the expense of fine-tuning. However, the performance of in-context learning can be unstable depending on the quality, format, or order of demonstrations, which in turn exacerbates the difficulty of optimization… ▽ More

    Submitted 14 October, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: under peer-review

  7. arXiv:2312.05356  [pdf, other

    cs.SE cs.CL cs.LG

    Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation

    Authors: Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang

    Abstract: Large Language Models (LLMs) have already gained widespread adoption in software engineering, particularly in code generation tasks. However, updating these models with new knowledge can be prohibitively expensive, yet it is essential to maximize their utility, such as implementing a hotfix technique to address urgent or critical LLM errors. In this paper, we propose \textsc{MENT}, a novel and eff… ▽ More

    Submitted 5 August, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 12 pages, 7 figures, 7 tables, under peer-review

  8. Instance Space Analysis of Search-Based Software Testing

    Authors: Neelofar Neelofar, Kate Smith-Miles, Mario Andres Munoz, Aldeida Aleti

    Abstract: Search-based software testing (SBST) is now a mature area, with numerous techniques developed to tackle the challenging task of software testing. SBST techniques have shown promising results and have been successfully applied in the industry to automatically generate test cases for large and complex software systems. Their effectiveness, however, is problem-dependent. In this paper, we revisit the… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Software Engineering, 49(4), 2642-2660 (2022)

  9. arXiv:2311.08049  [pdf, other

    cs.SE

    Towards Reliable AI: Adequacy Metrics for Ensuring the Quality of System-level Testing of Autonomous Vehicles

    Authors: Neelofar Neelofar, Aldeida Aleti

    Abstract: AI-powered systems have gained widespread popularity in various domains, including Autonomous Vehicles (AVs). However, ensuring their reliability and safety is challenging due to their complex nature. Conventional test adequacy metrics, designed to evaluate the effectiveness of traditional software testing, are often insufficient or impractical for these systems. White-box metrics, which are speci… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 12 pages, 7 figures

  10. arXiv:2309.03554  [pdf, other

    cs.SE

    Software Testing of Generative AI Systems: Challenges and Opportunities

    Authors: Aldeida Aleti

    Abstract: Software Testing is a well-established area in software engineering, encompassing various techniques and methodologies to ensure the quality and reliability of software systems. However, with the advent of generative artificial intelligence (GenAI) systems, new challenges arise in the testing domain. These systems, capable of generating novel and creative outputs, introduce unique complexities tha… ▽ More

    Submitted 11 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

  11. arXiv:2303.06283  [pdf, other

    cs.SE

    Closing the Loop for Software Remodularisation -- REARRANGE: An Effort Estimation Approach for Software Clustering-based Remodularisation

    Authors: Alvin Jian Jia Tan, Chun Yong Chong, Aldeida Aleti

    Abstract: Software remodularization through clustering is a common practice to improve internal software quality. However, the true benefit of software clustering is only realized if developers follow through with the recommended refactoring suggestions, which can be complex and time-consuming. Simply producing clustering results is not enough to realize the benefits of remodularization. For the recommended… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at ICSE23 Poster Track

  12. arXiv:2302.10352  [pdf, other

    cs.SE

    A3Test: Assertion-Augmented Automated Test Case Generation

    Authors: Saranya Alagarsamy, Chakkrit Tantithamthavorn, Aldeida Aleti

    Abstract: Test case generation is an important activity, yet a time-consuming and laborious task. Recently, AthenaTest -- a deep learning approach for generating unit test cases -- is proposed. However, AthenaTest can generate less than one-fifth of the test cases correctly, due to a lack of assertion knowledge and test signature verification. In this paper, we propose A3Test, a DL-based test case generatio… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: Under Review at ACM Transactions on Software Engineering and Methodology

  13. arXiv:2212.07566  [pdf, other

    cs.SE

    Identifying and Explaining Safety-critical Scenarios for Autonomous Vehicles via Key Features

    Authors: Neelofar, Aldeida Aleti

    Abstract: Ensuring the safety of autonomous vehicles (AVs) is of utmost importance and testing them in simulated environments is a safer option than conducting in-field operational tests. However, generating an exhaustive test suite to identify critical test scenarios is computationally expensive as the representation of each test is complex and contains various dynamic and static features, such as the AV u… ▽ More

    Submitted 28 November, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: 28 pages, 6 figures

    ACM Class: D.2.5

  14. Test-based Patch Clustering for Automatically-Generated Patches Assessment

    Authors: Matias Martinez, Maria Kechagia, Anjana Perera, Justyna Petke, Federica Sarro, Aldeida Aleti

    Abstract: Previous studies have shown that Automated Program Repair (APR) techniques suffer from the overfitting problem. Overfitting happens when a patch is run and the test suite does not reveal any error, but the patch actually does not fix the underlying bug or it introduces a new defect that is not covered by the test suite. Therefore, the patches generated by apr tools need to be validated by human pr… ▽ More

    Submitted 27 August, 2024; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Published in Springer Empirical Software Engineering, Volume 29, article number 116 (2024)

  15. arXiv:2110.02682  [pdf, other

    cs.SE

    How good does a Defect Predictor need to be to guide Search-Based Software Testing?

    Authors: Anjana Perera, Burak Turhan, Aldeida Aleti, Marcel Böhme

    Abstract: Defect predictors, static bug detectors and humans inspecting the code can locate the parts of the program that are buggy before they are discovered through testing. Automated test generators such as search-based software testing (SBST) techniques can use this information to direct their search for test cases to likely buggy code, thus speeding up the process of detecting existing bugs. However, o… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 12 pages, 4 figures

    ACM Class: D.2.5

  16. Defect Prediction Guided Search-Based Software Testing

    Authors: Anjana Perera, Aldeida Aleti, Marcel Böhme, Burak Turhan

    Abstract: Today, most automated test generators, such as search-based software testing (SBST) techniques focus on achieving high code coverage. However, high code coverage is not sufficient to maximise the number of bugs found, especially when given a limited testing budget. In this paper, we propose an automated test generation technique that is also guided by the estimated degree of defectiveness of the s… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 13 pages, 8 figures

    ACM Class: D.2.5

    Journal ref: In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE '20), 2020

  17. arXiv:2107.01766  [pdf, other

    cs.SE

    E-SC4R: Explaining Software Clustering for Remodularisation

    Authors: Alvin Jian Jia Tan, Chun Yong Chong, Aldeida Aleti

    Abstract: Maintenance of existing software requires a large amount of time for comprehending the source code. The architecture of a software, however, may not be clear to maintainers if up to date documentations are not available. Software clustering is often used as a remodularisation and architecture recovery technique to help recover a semantic representation of the software design. Due to the diverse do… ▽ More

    Submitted 2 October, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: 31 pages

  18. arXiv:2012.01708  [pdf, other

    cs.SE

    Feature-Based Software Design Pattern Detection

    Authors: Najam Nazar, Aldeida Aleti, Yaokun Zheng

    Abstract: Software design patterns are standard solutions to common problems in software design and architecture. Knowing that a particular module implements a design pattern is a shortcut to design comprehension. Manually detecting design patterns is a time consuming and challenging task, therefore, researchers have proposed automatic design pattern detection techniques. However, these techniques show low… ▽ More

    Submitted 2 December, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted in Journal of Systems and Software (JSS)

  19. E-APR: Mapping the Effectiveness of Automated Program Repair

    Authors: Aldeida Aleti, Matias Martinez

    Abstract: Automated Program Repair (APR) is a fast growing area with numerous new techniques being developed to tackle one of the most challenging software engineering problems. APR techniques have shown promising results, giving us hope that one day it will be possible for software to repair itself. In this paper, we focus on the problem of objective performance evaluation of APR techniques. We introduce a… ▽ More

    Submitted 8 June, 2021; v1 submitted 10 February, 2020; originally announced February 2020.

    Journal ref: Empirical Software Engineering 2021

  20. arXiv:2001.02872  [pdf, other

    cs.AI cs.DM

    The Neighbours' Similar Fitness Property for Local Search

    Authors: Mark Wallace, Aldeida Aleti

    Abstract: For most practical optimisation problems local search outperforms random sampling - despite the "No Free Lunch Theorem". This paper introduces a property of search landscapes termed Neighbours' Similar Fitness (NSF) that underlies the good performance of neighbourhood search in terms of local improvement. Though necessary, NSF is not sufficient to ensure that searching for improvement among the ne… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

  21. arXiv:1912.02535  [pdf, other

    cs.NE cs.AI

    Is perturbation an effective restart strategy?

    Authors: Aldeida Aleti, Mark Wallace, Markus Wagner

    Abstract: Premature convergence can be detrimental to the performance of search methods, which is why many search algorithms include restart strategies to deal with it. While it is common to perturb the incumbent solution with diversification steps of various sizes with the hope that the search method will find a new basin of attraction leading to a better local optimum, it is usually not clear how big the… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  22. Robotic Hierarchical Graph Neurons. A novel implementation of HGN for swarm robotic behaviour control

    Authors: Phillip Smith, Aldeida Aleti, Vincent C. S. Lee, Robert Hunjet, Asad Khan

    Abstract: This paper explores the use of a novel form of Hierarchical Graph Neurons (HGN) for in-operation behaviour selection in a swarm of robotic agents. This new HGN is called Robotic-HGN (R-HGN), as it matches robot environment observations to environment labels via fusion of match probabilities from both temporal and intra-swarm collections. This approach is novel for HGN as it addresses robotic obser… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Journal ref: Expert Systems with Applications 2021

  23. arXiv:1910.12412  [pdf, other

    cs.NE

    Swarm Behaviour Evolution via Rule Sharing and Novelty Search

    Authors: Phillip Smith, Robert Hunjet, Aldeida Aleti, Asad Khan

    Abstract: We present in this paper an exertion of our previous work by increasing the robustness and coverage of the evolution search via hybridisation with a state-of-the-art novelty search and accelerate the individual agent behaviour searches via a novel behaviour-component sharing technique. Via these improvements, we present Swarm Learning Classifier System 2.0 (SLCS2), a behaviour evolving algorithm w… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

  24. arXiv:1910.09811  [pdf, other

    astro-ph.SR astro-ph.IM

    A data-driven model of nucleosynthesis with chemical tagging in a lower-dimensional latent space

    Authors: Andrew R. Casey, John C. Lattanzio, Aldeida Aleti, David L. Dowe, Joss Bland-Hawthorn, Sven Buder, Geraint F. Lewis, Sarah L. Martell, Thomas Nordlander, Jeffrey D. Simpson, Sanjib Sharma, Daniel B. Zucker

    Abstract: Chemical tagging seeks to identify unique star formation sites from present-day stellar abundances. Previous techniques have treated each abundance dimension as being statistically independent, despite theoretical expectations that many elements can be produced by more than one nucleosynthetic process. In this work we introduce a data-driven model of nucleosynthesis where a set of latent factors (… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Accepted to ApJ

  25. An Efficient Method for Uncertainty Propagation in Robust Software Performance Estimation

    Authors: Aldeida Aleti, Catia Trubiani, André van Hoorn, Pooyan Jamshidi

    Abstract: Software engineers often have to estimate the performance of a software system before having full knowledge of the system parameters, such as workload and operational profile. These uncertain parameters inevitably affect the accuracy of quality evaluations, and the ability to judge if the system can continue to fulfil performance requirements if parameter results are different from expected. Previ… ▽ More

    Submitted 14 January, 2018; originally announced January 2018.