-
Final Cooling With Thick Wedges for a Muon Collider
Authors:
D. Fu,
A. Badea. K. Folan Di Petrillo,
D. Neuffer,
D. Stratakis
Abstract:
In the final cooling stages for a muon collider, the transverse emittances are reduced while the longitudinal emittance is allowed to increase. In previous studies, Final cooling used absorbers within very high field solenoids to cool low-momentum muons. Simulations of the systems did not reach the desired cooling design goals. In this study, we develop and optimize a different conceptual design f…
▽ More
In the final cooling stages for a muon collider, the transverse emittances are reduced while the longitudinal emittance is allowed to increase. In previous studies, Final cooling used absorbers within very high field solenoids to cool low-momentum muons. Simulations of the systems did not reach the desired cooling design goals. In this study, we develop and optimize a different conceptual design for the final 4D cooling channel, which is based on using dense wedge absorbers. We used G4Beamline to simulate the channel and Python to generate and analyze particle distributions. We optimized the design parameters of the cooling channel and produced conceptual designs (corresponding to possible starting points for the input beam) which achieve transverse cooling in both x and y by a factor of $\approx$ 3.5. These channels achieve a lower transverse and longitudinal emittance than the best previously published design.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Interim report for the International Muon Collider Collaboration (IMCC)
Authors:
C. Accettura,
S. Adrian,
R. Agarwal,
C. Ahdida,
C. Aimé,
A. Aksoy,
G. L. Alberghi,
S. Alden,
N. Amapane,
D. Amorim,
P. Andreetto,
F. Anulli,
R. Appleby,
A. Apresyan,
P. Asadi,
M. Attia Mahmoud,
B. Auchmann,
J. Back,
A. Badea,
K. J. Bae,
E. J. Bahng,
L. Balconi,
F. Balli,
L. Bandiera,
C. Barbagallo
, et al. (362 additional authors not shown)
Abstract:
The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accele…
▽ More
The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accelerator complex, detectors and physics for a future muon collider. In 2023, European Commission support was obtained for a design study of a muon collider (MuCol) [3]. This project started on 1st March 2023, with work-packages aligned with the overall muon collider studies. In preparation of and during the 2021-22 U.S. Snowmass process, the muon collider project parameters, technical studies and physics performance studies were performed and presented in great detail. Recently, the P5 panel [4] in the U.S. recommended a muon collider R&D, proposed to join the IMCC and envisages that the U.S. should prepare to host a muon collider, calling this their "muon shot". In the past, the U.S. Muon Accelerator Programme (MAP) [5] has been instrumental in studies of concepts and technologies for a muon collider.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
SyDRA: An Approach to Understand Game Engine Architecture
Authors:
Gabriel C. Ullmann,
Yann-Gaël Guéhéneuc,
Fabio Petrillo,
Nicolas Anquetil,
Cristiano Politowski
Abstract:
Game engines are tools to facilitate video game development. They provide graphics, sound, and physics simulation features, which would have to be otherwise implemented by developers. Even though essential for modern commercial video game development, game engines are complex and developers often struggle to understand their architecture, leading to maintainability and evolution issues that negati…
▽ More
Game engines are tools to facilitate video game development. They provide graphics, sound, and physics simulation features, which would have to be otherwise implemented by developers. Even though essential for modern commercial video game development, game engines are complex and developers often struggle to understand their architecture, leading to maintainability and evolution issues that negatively affect video game productions. In this paper, we present the Subsystem-Dependency Recovery Approach (SyDRA), which helps game engine developers understand game engine architecture and therefore make informed game engine development choices. By applying this approach to 10 open-source game engines, we obtain architectural models that can be used to compare game engine architectures and identify and solve issues of excessive coupling and folder nesting. Through a controlled experiment, we show that the inspection of the architectural models derived from SyDRA enables developers to complete tasks related to architectural understanding and impact analysis in less time and with higher correctness than without these models.
△ Less
Submitted 14 July, 2024; v1 submitted 8 June, 2024;
originally announced June 2024.
-
A Distributed Approach for Persistent Homology Computation on a Large Scale
Authors:
Riccardo Ceccaroni,
Lorenzo Di Rocco,
Umberto Ferraro Petrillo,
Pierpaolo Brutti
Abstract:
Persistent homology (PH) is a powerful mathematical method to automatically extract relevant insights from images, such as those obtained by high-resolution imaging devices like electron microscopes or new-generation telescopes. However, the application of this method comes at a very high computational cost, that is bound to explode more because new imaging devices generate an ever-growing amount…
▽ More
Persistent homology (PH) is a powerful mathematical method to automatically extract relevant insights from images, such as those obtained by high-resolution imaging devices like electron microscopes or new-generation telescopes. However, the application of this method comes at a very high computational cost, that is bound to explode more because new imaging devices generate an ever-growing amount of data. In this paper we present PixHomology, a novel algorithm for efficiently computing $0$-dimensional PH on 2D images, optimizing memory and processing time. By leveraging the Apache Spark framework, we also present a distributed version of our algorithm with several optimized variants, able to concurrently process large batches of astronomical images. Finally, we present the results of an experimental analysis showing that our algorithm and its distributed version are efficient in terms of required memory, execution time, and scalability, consistently outperforming existing state-of-the-art PH computation tools when used to process large datasets.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Emerging Results on Automated Support for Searching and Selecting Evidence for Systematic Literature Review Updates
Authors:
Bianca Minetto Napoleão,
Ritika Sarkar,
Sylvain Hallé,
Fabio Petrillo,
Marcos Kalinowski
Abstract:
Context: The constant growth of primary evidence and Systematic Literature Reviews (SLRs) publications in the Software Engineering (SE) field leads to the need for SLR Updates. However, searching and selecting evidence for SLR updates demands significant effort from SE researchers. Objective: We present emerging results on an automated approach to support searching and selecting studies for SLR up…
▽ More
Context: The constant growth of primary evidence and Systematic Literature Reviews (SLRs) publications in the Software Engineering (SE) field leads to the need for SLR Updates. However, searching and selecting evidence for SLR updates demands significant effort from SE researchers. Objective: We present emerging results on an automated approach to support searching and selecting studies for SLR updates in SE. Method: We developed an automated tool prototype to perform the snowballing search technique and support selecting relevant studies for SLR updates using Machine Learning (ML) algorithms. We evaluated our automation proposition through a small-scale evaluation with a reliable dataset from an SLR replication and its update. Results: Effectively automating snowballing-based search strategies showed feasibility with minor losses, specifically related to papers without Digital Object Identifier (DOI). The ML algorithm giving the highest performance to select studies for SLR updates was Linear Support Vector Machine, with approximately 74% recall and 15% precision. Using such algorithms with conservative thresholds to minimize the risk of missing papers can significantly reduce evidence selection efforts. Conclusion: The preliminary results of our evaluation point in promising directions, indicating the potential of automating snowballing search efforts and of reducing the number of papers to be manually analyzed by about 2.5 times when selecting evidence for updating SLRs in SE.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
BDD-Based Framework with RL Integration: An approach for videogames automated testing
Authors:
Vincent Mastain,
Fabio Petrillo
Abstract:
Testing plays a vital role in software development, but in the realm of video games, the process differs from traditional software development practices. Game developers typically rely on human testers who are provided with checklists to evaluate various elements. While major game developers already employ automated testing using script-based bots, the increasing complexity of video games is pushi…
▽ More
Testing plays a vital role in software development, but in the realm of video games, the process differs from traditional software development practices. Game developers typically rely on human testers who are provided with checklists to evaluate various elements. While major game developers already employ automated testing using script-based bots, the increasing complexity of video games is pushing the limits of scripted solutions, necessitating the adoption of more advanced testing strategies. To assist game studios in enhancing the quality of their games through automated testing, we propose the integration of Behavior Driven Development (BDD) with Reinforcement Learning (RL). This positional paper summarizes our proposal and framework under development.
△ Less
Submitted 8 October, 2023;
originally announced November 2023.
-
SPIRT: A Fault-Tolerant and Reliable Peer-to-Peer Serverless ML Training Architecture
Authors:
Amine Barrak,
Mayssa Jaziri,
Ranim Trabelsi,
Fehmi Jaafar,
Fabio Petrillo
Abstract:
The advent of serverless computing has ushered in notable advancements in distributed machine learning, particularly within parameter server-based architectures. Yet, the integration of serverless features within peer-to-peer (P2P) distributed networks remains largely uncharted. In this paper, we introduce SPIRT, a fault-tolerant, reliable, and secure serverless P2P ML training architecture. desig…
▽ More
The advent of serverless computing has ushered in notable advancements in distributed machine learning, particularly within parameter server-based architectures. Yet, the integration of serverless features within peer-to-peer (P2P) distributed networks remains largely uncharted. In this paper, we introduce SPIRT, a fault-tolerant, reliable, and secure serverless P2P ML training architecture. designed to bridge this existing gap.
Capitalizing on the inherent robustness and reliability innate to P2P systems, SPIRT employs RedisAI for in-database operations, leading to an 82\% reduction in the time required for model updates and gradient averaging across a variety of models and batch sizes. This architecture showcases resilience against peer failures and adeptly manages the integration of new peers, thereby highlighting its fault-tolerant characteristics and scalability. Furthermore, SPIRT ensures secure communication between peers, enhancing the reliability of distributed machine learning tasks. Even in the face of Byzantine attacks, the system's robust aggregation algorithms maintain high levels of accuracy. These findings illuminate the promising potential of serverless architectures in P2P distributed machine learning, offering a significant stride towards the development of more efficient, scalable, and resilient applications.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Exploring the Impact of Serverless Computing on Peer To Peer Training Machine Learning
Authors:
Amine Barrak,
Ranim Trabelsi,
Fehmi Jaafar,
Fabio Petrillo
Abstract:
The increasing demand for computational power in big data and machine learning has driven the development of distributed training methodologies. Among these, peer-to-peer (P2P) networks provide advantages such as enhanced scalability and fault tolerance. However, they also encounter challenges related to resource consumption, costs, and communication overhead as the number of participating peers g…
▽ More
The increasing demand for computational power in big data and machine learning has driven the development of distributed training methodologies. Among these, peer-to-peer (P2P) networks provide advantages such as enhanced scalability and fault tolerance. However, they also encounter challenges related to resource consumption, costs, and communication overhead as the number of participating peers grows. In this paper, we introduce a novel architecture that combines serverless computing with P2P networks for distributed training and present a method for efficient parallel gradient computation under resource constraints.
Our findings show a significant enhancement in gradient computation time, with up to a 97.34\% improvement compared to conventional P2P distributed training methods. As for costs, our examination confirmed that the serverless architecture could incur higher expenses, reaching up to 5.4 times more than instance-based architectures. It is essential to consider that these higher costs are associated with marked improvements in computation time, particularly under resource-constrained scenarios. Despite the cost-time trade-off, the serverless approach still holds promise due to its pay-as-you-go model. Utilizing dynamic resource allocation, it enables faster training times and optimized resource utilization, making it a promising candidate for a wide range of machine learning applications.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Visualising Game Engine Subsystem Coupling
Authors:
Gabriel C. Ullmann,
Yann-Gaël Guéhéneuc,
Fabio Petrillo,
Nicolas Anquetil,
Cristiano Politowski
Abstract:
Game engines support video game development by providing functionalities such as graphics rendering or input/output device management. However, their architectures are often overlooked, which hinders their integration and extension. In this paper, we use an approach for architecture recovery to create architectural models for 10 open-source game engines. We use these models to answer the following…
▽ More
Game engines support video game development by providing functionalities such as graphics rendering or input/output device management. However, their architectures are often overlooked, which hinders their integration and extension. In this paper, we use an approach for architecture recovery to create architectural models for 10 open-source game engines. We use these models to answer the following questions: Which subsystems more often couple with one another? Do game engines share subsystem coupling patterns? We observe that the Low-Level Renderer, Platform Independence Layer and Resource Manager are frequently coupled to the game engine Core. By identifying the most frequent coupling patterns, we describe an emergent game engine architecture and discuss how it can be used by practitioners to improve system understanding and maintainability.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Assessing Video Game Balance using Autonomous Agents
Authors:
Cristiano Politowski,
Fabio Petrillo,
Ghizlane ElBoussaidi,
Gabriel C. Ullmann,
Yann-Gaël Guéhéneuc
Abstract:
As the complexity and scope of games increase, game testing, also called playtesting, becomes an essential activity to ensure the quality of video games. Yet, the manual, ad-hoc nature of game testing leaves space for automation. In this paper, we research, design, and implement an approach to supplement game testing to balance video games with autonomous agents. We evaluate our approach with two…
▽ More
As the complexity and scope of games increase, game testing, also called playtesting, becomes an essential activity to ensure the quality of video games. Yet, the manual, ad-hoc nature of game testing leaves space for automation. In this paper, we research, design, and implement an approach to supplement game testing to balance video games with autonomous agents. We evaluate our approach with two platform games. We bring a systematic way to assess if a game is balanced by (1) comparing the difficulty levels between game versions and issues with the game design, and (2) the game demands for skill or luck.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
An Exploratory Approach for Game Engine Architecture Recovery
Authors:
Gabriel C. Ullmann,
Yann-Gaël Guéhéneuc,
Fabio Petrillo,
Nicolas Anquetil,
Cristiano Politowski
Abstract:
Game engines provide video game developers with a wide range of fundamental subsystems for creating games, such as 2D/3D graphics rendering, input device management, and audio playback. Developers often integrate these subsystems with other applications or extend them via plugins. To integrate or extend correctly, developers need a broad system architectural understanding. However, architectural i…
▽ More
Game engines provide video game developers with a wide range of fundamental subsystems for creating games, such as 2D/3D graphics rendering, input device management, and audio playback. Developers often integrate these subsystems with other applications or extend them via plugins. To integrate or extend correctly, developers need a broad system architectural understanding. However, architectural information is not always readily available and is often overlooked in this kind of system. In this work, we propose an approach for game engine architecture recovery and explore the architecture of three popular open-source game engines (Cocos2d-x, Godot, and Urho3D). We perform manual subsystem detection and use Moose, a platform for software analysis, to generate architectural models. With these models, we answer the following questions: Which subsystems are present in game engines? Which subsystems are more often coupled with one another? Why are these subsystems coupled with each other? Results show that the platform independence, resource management, world editor, and core subsystems are frequently included by others and therefore act as foundations for the game engines. Furthermore, we show that, by applying our approach, game engine developers can understand whether subsystems are related and divide responsibilities. They can also assess whether relationships among subsystems are appropriate for the game engine.
△ Less
Submitted 20 March, 2023; v1 submitted 4 March, 2023;
originally announced March 2023.
-
Architecting Peer-to-Peer Serverless Distributed Machine Learning Training for Improved Fault Tolerance
Authors:
Amine Barrak,
Fabio Petrillo,
Fehmi Jaafar
Abstract:
Distributed Machine Learning refers to the practice of training a model on multiple computers or devices that can be called nodes. Additionally, serverless computing is a new paradigm for cloud computing that uses functions as a computational unit. Serverless computing can be effective for distributed learning systems by enabling automated resource scaling, less manual intervention, and cost reduc…
▽ More
Distributed Machine Learning refers to the practice of training a model on multiple computers or devices that can be called nodes. Additionally, serverless computing is a new paradigm for cloud computing that uses functions as a computational unit. Serverless computing can be effective for distributed learning systems by enabling automated resource scaling, less manual intervention, and cost reduction. By distributing the workload, distributed machine learning can speed up the training process and allow more complex models to be trained. Several topologies of distributed machine learning have been established (centralized, parameter server, peer-to-peer). However, the parameter server architecture may have limitations in terms of fault tolerance, including a single point of failure and complex recovery processes. Moreover, training machine learning in a peer-to-peer (P2P) architecture can offer benefits in terms of fault tolerance by eliminating the single point of failure. In a P2P architecture, each node or worker can act as both a server and a client, which allows for more decentralized decision making and eliminates the need for a central coordinator. In this position paper, we propose exploring the use of serverless computing in distributed machine learning training and comparing the performance of P2P architecture with the parameter server architecture, focusing on cost reduction and fault tolerance.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Optimizing Trigger-Level Track Reconstruction for Sensitivity to Exotic Signatures
Authors:
K. F. Di Petrillo,
J. N. Farr,
C. Guo,
T. R. Holmes,
J. Nelson,
K. Pachal
Abstract:
Many compelling beyond the Standard Model scenarios predict signals that result in unconventional charged particle trajectories. Signatures for which unusual tracks are the most conspicuous feature of the event pose significant challenges for experiments at the Large Hadron Collider (LHC), particularly for the trigger. This article presents a study of track-based triggers for a representative set…
▽ More
Many compelling beyond the Standard Model scenarios predict signals that result in unconventional charged particle trajectories. Signatures for which unusual tracks are the most conspicuous feature of the event pose significant challenges for experiments at the Large Hadron Collider (LHC), particularly for the trigger. This article presents a study of track-based triggers for a representative set of long-lived and unconventional signatures at the upcoming High Luminosity LHC, as well as resulting recommendations for the target parameters of a hardware-based tracking system. Scenarios studied include large multiplicities of low momentum tracks produced in a soft-unclustered-energy-pattern model, displaced leptons and anomalous prompt tracks predicted in a Supersymmetry model with long-lived staus, and displaced hadrons predicted in a Higgs portal scenario with long-lived scalars.
△ Less
Submitted 12 January, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
An Approach to Build Consistent Software Architecture Diagrams Using Devops System Descriptors
Authors:
Jalves Nicacio,
Fabio Petrillo
Abstract:
System architecture diagrams play an essential role in understanding system architecture. They encourage more active discussion among participants and make it easier to recall system details. However, system architecture diagrams often diverge from the software. As a result, they can interfere with the understanding and maintenance of the software. We propose an approach to build system architectu…
▽ More
System architecture diagrams play an essential role in understanding system architecture. They encourage more active discussion among participants and make it easier to recall system details. However, system architecture diagrams often diverge from the software. As a result, they can interfere with the understanding and maintenance of the software. We propose an approach to build system architecture diagrams using DevOps system descriptors to improve the consistency of architecture diagrams. To produce our approach, we survey problems with architecture diagrams in the software industry, developing guidelines for creating architecture diagrams. Next, we produce a taxonomy for system descriptor concepts and a process to convert system descriptors into architecture diagrams. We evaluate our approach through a case study. In this case study, we defined a Docker Compose descriptor for a newsfeed system and transformed it into a system architectural diagram using the proposed approach. Our results indicate that, currently, system descriptors generally lead to consistent diagrams only to a limited extent. However, the case study's observations indicate that the proposed approach is promising and demonstrates that system descriptors have the potential to create more consistent architectural diagrams. Further evaluation in controlled and empirical experiments is necessary to test our hypothesis in more detail.
△ Less
Submitted 21 October, 2022;
originally announced October 2022.
-
Report of the Topical Group on Physics Beyond the Standard Model at Energy Frontier for Snowmass 2021
Authors:
Tulika Bose,
Antonio Boveia,
Caterina Doglioni,
Simone Pagan Griso,
James Hirschauer,
Elliot Lipeles,
Zhen Liu,
Nausheen R. Shah,
Lian-Tao Wang,
Kaustubh Agashe,
Juliette Alimena,
Sebastian Baum,
Mohamed Berkat,
Kevin Black,
Gwen Gardner,
Tony Gherghetta,
Josh Greaves,
Maxx Haehn,
Phil C. Harris,
Robert Harris,
Julie Hogan,
Suneth Jayawardana,
Abraham Kahn,
Jan Kalinowski,
Simon Knapen
, et al. (297 additional authors not shown)
Abstract:
This is the Snowmass2021 Energy Frontier (EF) Beyond the Standard Model (BSM) report. It combines the EF topical group reports of EF08 (Model-specific explorations), EF09 (More general explorations), and EF10 (Dark Matter at Colliders). The report includes a general introduction to BSM motivations and the comparative prospects for proposed future experiments for a broad range of potential BSM mode…
▽ More
This is the Snowmass2021 Energy Frontier (EF) Beyond the Standard Model (BSM) report. It combines the EF topical group reports of EF08 (Model-specific explorations), EF09 (More general explorations), and EF10 (Dark Matter at Colliders). The report includes a general introduction to BSM motivations and the comparative prospects for proposed future experiments for a broad range of potential BSM models and signatures, including compositeness, SUSY, leptoquarks, more general new bosons and fermions, long-lived particles, dark matter, charged-lepton flavor violation, and anomaly detection.
△ Less
Submitted 18 October, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Muon Collider Forum Report
Authors:
K. M. Black,
S. Jindariani,
D. Li,
F. Maltoni,
P. Meade,
D. Stratakis,
D. Acosta,
R. Agarwal,
K. Agashe,
C. Aime,
D. Ally,
A. Apresyan,
A. Apyan,
P. Asadi,
D. Athanasakos,
Y. Bao,
E. Barzi,
N. Bartosik,
L. A. T. Bauerdick,
J. Beacham,
S. Belomestnykh,
J. S. Berg,
J. Berryhill,
A. Bertolin,
P. C. Bhat
, et al. (160 additional authors not shown)
Abstract:
A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently availab…
▽ More
A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently available technology. The topic generated a lot of excitement in Snowmass meetings and continues to attract a large number of supporters, including many from the early career community. In light of this very strong interest within the US particle physics community, Snowmass Energy, Theory and Accelerator Frontiers created a cross-frontier Muon Collider Forum in November of 2020. The Forum has been meeting on a monthly basis and organized several topical workshops dedicated to physics, accelerator technology, and detector R&D. Findings of the Forum are summarized in this report.
△ Less
Submitted 8 August, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Game Engine Comparative Anatomy
Authors:
Gabriel C. Ullmann,
Cristiano Politowski,
Yann-Gaël Guéhéneuc,
Fabio Petrillo
Abstract:
Video game developers use game engines as a tool to manage complex aspects of game development. While engines play a big role in the success of games, to the best of our knowledge, they are often developed in isolation, in a closed-source manner, without architectural discussions, comparison, and collaboration among projects. In this work in progress, we compare the call graphs of two open-source…
▽ More
Video game developers use game engines as a tool to manage complex aspects of game development. While engines play a big role in the success of games, to the best of our knowledge, they are often developed in isolation, in a closed-source manner, without architectural discussions, comparison, and collaboration among projects. In this work in progress, we compare the call graphs of two open-source engines: Godot 3.4.4 and Urho3D 1.8. While static analysis tools could provide us with a general picture without precise call graph paths, the use of a profiler such as Callgrind allows us to also view the call order and frequency. These graphs give us insight into the engines' designs. We showed that, by using Callgrind, we can obtain a high-level view of an engine's architecture, which can be used to understand it. In future work, we intend to apply both dynamic and static analysis to other open-source engines to understand architectural patterns and their impact on aspects such as performance and maintenance.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.
-
Towards Continuous Systematic Literature Review in Software Engineering
Authors:
Bianca Minetto Napoleão,
Fabio Petrillo,
Sylvain Hallé,
Marcos Kalinowski
Abstract:
Context: New scientific evidence continuously arises with advances in Software Engineering (SE) research. Conventionally, Systematic Literature Reviews (SLRs) are not updated or updated intermittently, leaving gaps between updates, during which time the SLR may be missing crucial new evidence. Goal: We propose and evaluate a concept and process called Continuous Systematic Literature Review (CSLR)…
▽ More
Context: New scientific evidence continuously arises with advances in Software Engineering (SE) research. Conventionally, Systematic Literature Reviews (SLRs) are not updated or updated intermittently, leaving gaps between updates, during which time the SLR may be missing crucial new evidence. Goal: We propose and evaluate a concept and process called Continuous Systematic Literature Review (CSLR) in SE. Method: To elaborate on the CSLR concept and process, we performed a synthesis of evidence by conducting a meta-ethnography, addressing knowledge from varied research areas. Furthermore, we conducted a case study to evaluate the CSLR process. Results: We describe the resulting CSLR process in BPMN format. The case study results provide indications on the importance and feasibility of applying CSLR in practice to continuously update SLR evidence in SE. Conclusion: The CSLR concept and process provide a feasible and systematic way to continuously incorporate new evidence into SLRs, supporting trustworthy and up-to-date evidence for SLRs in SE.
△ Less
Submitted 8 June, 2022;
originally announced June 2022.
-
Track-Based Triggers for Exotic Signatures
Authors:
K. F. Di Petrillo,
J. N. Farr,
C. Guo,
T. R. Holmes,
J. Nelson,
K. Pachal
Abstract:
Several compelling beyond the Standard Model scenarios predict signals that result in unconventional charged particle trajectories. Signatures for which unusual tracks are the most conspicuous feature pose significant challenges for experiments at the Large Hadron Collider (LHC), particularly for the trigger. This article presents a study of track-based triggers for a representative set of long-li…
▽ More
Several compelling beyond the Standard Model scenarios predict signals that result in unconventional charged particle trajectories. Signatures for which unusual tracks are the most conspicuous feature pose significant challenges for experiments at the Large Hadron Collider (LHC), particularly for the trigger. This article presents a study of track-based triggers for a representative set of long-lived and unconventional signatures at the upcoming High Luminosity LHC. Scenarios studied include large multiplicities of low momentum tracks produced in a soft-unclustered-energy-pattern model, displaced leptons and anomalous prompt tracks predicted in a Supersymmetry model with long-lived staus, and displaced hadrons predicted in a Higgs portal scenario with long-lived scalars. Trigger efficiency is measured as a function of the baseline parameters of a track trigger, including transverse momentum and impact parameter. Recommendations for future hardware-based track triggers are presented.
△ Less
Submitted 28 June, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Towards Automated Video Game Testing: Still a Long Way to Go
Authors:
Cristiano Politowski,
Yann-Gaël Guéhéneuc,
Fabio Petrillo
Abstract:
As the complexity and scope of game development increase, playtesting remains an essential activity to ensure the quality of video games. Yet, the manual, ad-hoc nature of playtesting gives space to improvements in the process. In this study, we investigate gaps between academic solutions in the literature for automated video game testing and the needs of video game developers in the industry. We…
▽ More
As the complexity and scope of game development increase, playtesting remains an essential activity to ensure the quality of video games. Yet, the manual, ad-hoc nature of playtesting gives space to improvements in the process. In this study, we investigate gaps between academic solutions in the literature for automated video game testing and the needs of video game developers in the industry. We performed a literature review on video game automated testing and applied an online survey with video game developers. The literature results show a rise in research topics related to automated video game testing. The survey results show that game developers are skeptical about using automated agents to test games. We conclude that there is a need for new testing approaches that did not disrupt the developer workflow. As for the researchers, the focus should be on the testing goal and testing oracle.
△ Less
Submitted 10 March, 2022; v1 submitted 25 February, 2022;
originally announced February 2022.
-
Video Game Project Management Anti-patterns
Authors:
Gabriel C. Ullmann,
Cristiano Politowski,
Yann-Gaël Guéhéneuc,
Fabio Petrillo,
João Eduardo Montandon
Abstract:
Project Management anti-patterns are well-documented in the software-engineering literature, and studying them allows understanding their impacts on teams and projects. The video game development industry is known for its mismanagement practices, and therefore applying this knowledge would help improving game developers' productivity and well-being. In this paper, we map project management anti-pa…
▽ More
Project Management anti-patterns are well-documented in the software-engineering literature, and studying them allows understanding their impacts on teams and projects. The video game development industry is known for its mismanagement practices, and therefore applying this knowledge would help improving game developers' productivity and well-being. In this paper, we map project management anti-patterns to anti-patterns reported by game developers in the gray literature. We read 440 postmortems problems, identified anti-pattern candidates, and related them with definitions from the software-engineering literature. We discovered that most anti-pattern candidates could be mapped to anti-patterns in the software-engineering literature, except for Feature Creep, Feature Cuts, Working on Multiple Projects, and Absent or Inadequate Tools. We discussed the impact of the unmapped candidates on the development process while also drawing a parallel between video games and traditional software development. Future works include validating the definitions of the candidates via survey with practitioners and also considering development anti-patterns.
△ Less
Submitted 11 March, 2022; v1 submitted 12 February, 2022;
originally announced February 2022.
-
Log severity level classification: an approach for systems in production
Authors:
Eduardo Mendes,
Fabio Petrillo
Abstract:
Context: Logs are often the primary source of information for system developers and operations engineers to understand and diagnose the behavior of a software system in production. In many cases, logs are the only evidence available for fault investigation. Problem: However, the inappropriate choice of log severity level can impact the amount of log data generated and, consequently, quality. This…
▽ More
Context: Logs are often the primary source of information for system developers and operations engineers to understand and diagnose the behavior of a software system in production. In many cases, logs are the only evidence available for fault investigation. Problem: However, the inappropriate choice of log severity level can impact the amount of log data generated and, consequently, quality. This storage overhead can impact the performance of log-based monitoring systems, as excess log data comes with increased aggregate noise, making it challenging to utilize what is actually important when trying to do diagnostics. Goal: This research aims to decrease the overheads of monitoring systems by processing the severity level of log data from systems in production. Approach: To achieve this goal, we intend to deepen the knowledge about the log severity levels and develop an automated approach to log severity level classification, demonstrating that reducing log severity level "noise" improves the monitoring of systems in production. Conclusion: We hope that the set of contributions from this work can improve the monitoring activities of software systems and contribute to the creation of knowledge that improves logging practices
△ Less
Submitted 21 December, 2021;
originally announced December 2021.
-
Review of opportunities for new long-lived particle triggers in Run 3 of the Large Hadron Collider
Authors:
Juliette Alimena,
James Beacham,
Freya Blekman,
Adrián Casais Vidal,
Xabier Cid Vidal,
Matthew Citron,
David Curtin,
Albert De Roeck,
Nishita Desai,
Karri Folan Di Petrillo,
Yuri Gershtein,
Louis Henry,
Tova Holmes,
Brij Jashal,
Philip James Ilten,
Sascha Mehlhase,
Javier Montejo Berlingen,
Arantza Oyanguren,
Giovanni Punzi,
Murilo Santana Rangel,
Federico Leo Redi,
Lorenzo Sestini,
Emma Torro,
Carlos Vázquez Sierra,
Maarten van Veghel
, et al. (53 additional authors not shown)
Abstract:
Long-lived particles (LLPs) are highly motivated signals of physics Beyond the Standard Model (BSM) with great discovery potential and unique experimental challenges. The LLP search programme made great advances during Run 2 of the Large Hadron Collider (LHC), but many important regions of signal space remain unexplored. Dedicated triggers are crucial to improve the potential of LLP searches, and…
▽ More
Long-lived particles (LLPs) are highly motivated signals of physics Beyond the Standard Model (BSM) with great discovery potential and unique experimental challenges. The LLP search programme made great advances during Run 2 of the Large Hadron Collider (LHC), but many important regions of signal space remain unexplored. Dedicated triggers are crucial to improve the potential of LLP searches, and their development and expansion is necessary for the full exploitation of the new data. The public discussion of triggers has therefore been a relevant theme in the recent LLP literature, in the meetings of the LLP@LHC Community workshop and in the respective experiments. This paper documents the ideas collected during talks and discussions at these Workshops, benefiting as well from the ideas under development by the trigger community within the experimental collaborations. We summarise the theoretical motivations of various LLP scenarios leading to highly elusive signals, reviewing concrete ideas for triggers that could greatly extend the reach of the LHC experiments. We thus expect this document to encourage further thinking for both the phenomenological and experimental communities, as a stepping stone to further develop the LLP@LHC physics programme.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
Log severity levels matter: A multivocal mapping
Authors:
Eduardo Mendes,
Fabio Petrillo
Abstract:
The choice of log severity level can be challenging and cause problems in producing reliable logging data. However, there is a lack of specifications and practical guidelines to support this challenge. In this study, we present a multivocal systematic mapping of log severity levels from peer-reviewed literature, logging libraries, and practitioners' views. We analyzed 19 severity levels, 27 studie…
▽ More
The choice of log severity level can be challenging and cause problems in producing reliable logging data. However, there is a lack of specifications and practical guidelines to support this challenge. In this study, we present a multivocal systematic mapping of log severity levels from peer-reviewed literature, logging libraries, and practitioners' views. We analyzed 19 severity levels, 27 studies, and 40 logging libraries. Our results show redundancy and semantic similarity between the levels and a tendency to converge the levels for a total of six levels. Our contributions help leverage the reliability of log entries: (i) mapping the literature about log severity levels, (ii) mapping the severity levels in logging libraries, (iii) a set of synthesized six definitions and four general purposes for severity levels. We recommend that developers use a standard nomenclature, and for logging library creators, we suggest providing accurate and unambiguous definitions of log severity levels.
△ Less
Submitted 6 December, 2021; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Mapping breakpoint types: an exploratory study
Authors:
Eduardo Andreetta Fontana,
Fabio Petrillo
Abstract:
Debugging is a relevant task for finding bugs during software development, maintenance, and evolution. During debugging, developers use modern IDE debuggers to analyze variables, step execution, and set breakpoints. Observing IDE debuggers, we find several breakpoint types. However, what are the breakpoint types? The goal of our study is to map the breakpoint types among IDEs and academic literatu…
▽ More
Debugging is a relevant task for finding bugs during software development, maintenance, and evolution. During debugging, developers use modern IDE debuggers to analyze variables, step execution, and set breakpoints. Observing IDE debuggers, we find several breakpoint types. However, what are the breakpoint types? The goal of our study is to map the breakpoint types among IDEs and academic literature. Thus, we mapped the gray literature on the documentation of the nine main IDEs used by developers according to the three public rankings. In addition, we performed a systematic mapping of academic literature over 68 articles describing breakpoint types. Finally, we analyzed the developers understanding of the main breakpoint types through a questionnaire. We present three main contributions: (1) the mapping of breakpoint types (IDEs and literature), (2) compiled definitions of breakpoint types, (3) a breakpoint type taxonomy. Our contributions provide the first step to organize breakpoint IDE taxonomy and lexicon, and support further debugging research.
△ Less
Submitted 11 November, 2021; v1 submitted 31 August, 2021;
originally announced September 2021.
-
Continuous Systematic Literature Review: An Approach for Open Science
Authors:
Bianca Minetto Napoleão,
Fabio Petrillo,
Sylvain Hallé
Abstract:
Systematic Literature Reviews (SLRs) play an important role in the Evidence-Based Software Engineering scenario. With the advance of the computer science field and the growth of research publications, new evidence continuously arises. This fact impacts directly on the purpose of keeping SLRs up-to-date which could lead researchers to obsolete conclusions or decisions about a research problem or in…
▽ More
Systematic Literature Reviews (SLRs) play an important role in the Evidence-Based Software Engineering scenario. With the advance of the computer science field and the growth of research publications, new evidence continuously arises. This fact impacts directly on the purpose of keeping SLRs up-to-date which could lead researchers to obsolete conclusions or decisions about a research problem or investigation. Creating and maintaining SLRs up-to-date demand a significant effort due to several reasons such as the rapid increase in the amount of evidence, limitation of available databases and lack of detailed protocol documentation and data availability. Conventionally, in software engineering SLRs are not updated or updated intermittently leaving gaps between updates during which time the SLR may be missing important new research. In order to address these issues, we propose the concept, process and tooling support of Continuous Systematic Literature Review (CSLR) in SE aiming to keep SLRs constantly updated with the promotion of open science practices. This positional paper summarizes our proposal and approach under development.
△ Less
Submitted 29 August, 2021;
originally announced August 2021.
-
The Power of Word-Frequency Based Alignment-Free Functions: a Comprehensive Large-scale Experimental Analysis -- Version 3
Authors:
Giuseppe Cattaneo,
Umberto Ferraro Petrillo,
Raffaele Giancarlo,
Francesco Palini,
Chiara Romualdi
Abstract:
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error). However, assessment of their power, i.e., their ability to identify true similarity, has been limited to some members of the D2 family by experiment…
▽ More
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error). However, assessment of their power, i.e., their ability to identify true similarity, has been limited to some members of the D2 family by experimental studies on short sequences, not adequate for current applications, where sequence lengths may vary considerably. Such a State of the Art is methodologically problematic, since information regarding a key feature such as power is either missing or limited. Results: By concentrating on a representative set of word-frequency based AF functions, we perform the first coherent and uniform evaluation of the power, involving also Type I error for completeness. Two Alternative models of important genomic features (CIS Regulatory Modules and Horizontal Gene Transfer), a wide range of sequence lengths from a few thousand to millions, and different values of k have been used. As a result, we provide a characterization of those AF functions that is novel and informative. Indeed, we identify weak and strong points of each function considered, which may be used as a guide to choose one for analysis tasks. Remarkably, of the fifteen functions that we have considered, only four stand out, with small differences between small and short sequence length scenarios. Finally, in order to encourage the use of our methodology for validation of future AF functions, the Big Data platform supporting it is public.
△ Less
Submitted 19 October, 2021; v1 submitted 27 June, 2021;
originally announced June 2021.
-
Towards auto-completion on software requirements statements
Authors:
Carlos Alberto dos Santos,
Fabio Petrillo
Abstract:
As software systems become more complex, modern software development requires more attention to human perspectives, and active participation of development teams in requirements elicitation tasks. In this context, incomplete or ambiguous requirements descriptions do not guide the development of good software products. We hypothesize that the text auto-completion feature improves the quality of the…
▽ More
As software systems become more complex, modern software development requires more attention to human perspectives, and active participation of development teams in requirements elicitation tasks. In this context, incomplete or ambiguous requirements descriptions do not guide the development of good software products. We hypothesize that the text auto-completion feature improves the quality of the software requirements artifacts. We present the motivation for this study, related works, our approach and future research efforts.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
Towards Logging Noisiness Theory: quality aspects to characterize unwanted log entries
Authors:
Eduardo Mendes,
Fabio Petrillo
Abstract:
Context: Logging tasks track the system's functioning by keeping records of evidence that have been analyzed by monitoring and observability activities. For these activities to be effective, it is necessary to consider the quality of the consumed information. Problem: However, the presence of noise - unwanted information - compromises the log files' quality. The noisiness of a log file can be affe…
▽ More
Context: Logging tasks track the system's functioning by keeping records of evidence that have been analyzed by monitoring and observability activities. For these activities to be effective, it is necessary to consider the quality of the consumed information. Problem: However, the presence of noise - unwanted information - compromises the log files' quality. The noisiness of a log file can be affected among other things by: (i) the wrong severity log choices, (ii) the production of duplicate entries, (iii) the incompleteness of the information, (iv) the inappropriate format of the entries, (v) the amount of information generated. Objective: This work aims to broadly define the concept of noise in the context of logging, proposing the initial steps of Logging Noisiness, a theory on quality aspects to characterize unwanted log entries.
△ Less
Submitted 5 June, 2021;
originally announced June 2021.
-
What Makes a Game High-rated? Towards Factors of Video Game Success
Authors:
Gabriel Ullmann,
Cristiano Politowski,
Yann-Gäel Guéhéneuc,
Fabio Petrillo
Abstract:
As the video game market grows larger, it becomes harder to stand out from the crowd. Launching a successful game involves different aspects. But what are they? In this paper, we investigate some aspects of the high-rated games from a dataset of 200 projects. The results show that the none of the aspects of this study have a strong relationship with the game's success. A further analysis on the hi…
▽ More
As the video game market grows larger, it becomes harder to stand out from the crowd. Launching a successful game involves different aspects. But what are they? In this paper, we investigate some aspects of the high-rated games from a dataset of 200 projects. The results show that the none of the aspects of this study have a strong relationship with the game's success. A further analysis on the high-rated games shows that team, technical, and game-design aspects should be the main focus of the game developers.
△ Less
Submitted 13 May, 2022; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Towards Automated Acceptance testing for industrial robots
Authors:
Marcela G. dos Santos,
Fabio Petrillo
Abstract:
Industrial robots are important machines applied in numerous modern industries that execute repetitive tasks with high accuracy, replacing or supporting dangerous jobs. In this kind of system, with increased complexity in which cost is related to the time the system keeps working, the system must operate with a minimum number of failures. In other words, a quality aspect important in industry is r…
▽ More
Industrial robots are important machines applied in numerous modern industries that execute repetitive tasks with high accuracy, replacing or supporting dangerous jobs. In this kind of system, with increased complexity in which cost is related to the time the system keeps working, the system must operate with a minimum number of failures. In other words, a quality aspect important in industry is reliability. We hypothesize that Automated Acceptance Testing improves reliability for industrial robot program. We present the research question, the motivation for this study, our hypothesis and future research efforts.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Test beam characterization of sensor prototypes for the CMS Barrel MIP Timing Detector
Authors:
R. Abbott,
A. Abreu,
F. Addesa,
M. Alhusseini,
T. Anderson,
Y. Andreev,
A. Apresyan,
R. Arcidiacono,
M. Arenton,
E. Auffray,
D. Bastos,
L. A. T. Bauerdick,
R. Bellan,
M. Bellato,
A. Benaglia,
M. Benettoni,
R. Bertoni,
M. Besancon,
S. Bharthuar,
A. Bornheim,
E. Brücken,
J. N. Butler,
C. Campagnari,
M. Campana,
R. Carlin
, et al. (174 additional authors not shown)
Abstract:
The MIP Timing Detector will provide additional timing capabilities for detection of minimum ionizing particles (MIPs) at CMS during the High Luminosity LHC era, improving event reconstruction and pileup rejection. The central portion of the detector, the Barrel Timing Layer (BTL), will be instrumented with LYSO:Ce crystals and Silicon Photomultipliers (SiPMs) providing a time resolution of about…
▽ More
The MIP Timing Detector will provide additional timing capabilities for detection of minimum ionizing particles (MIPs) at CMS during the High Luminosity LHC era, improving event reconstruction and pileup rejection. The central portion of the detector, the Barrel Timing Layer (BTL), will be instrumented with LYSO:Ce crystals and Silicon Photomultipliers (SiPMs) providing a time resolution of about 30 ps at the beginning of operation, and degrading to 50-60 ps at the end of the detector lifetime as a result of radiation damage. In this work, we present the results obtained using a 120 GeV proton beam at the Fermilab Test Beam Facility to measure the time resolution of unirradiated sensors. A proof-of-concept of the sensor layout proposed for the barrel region of the MTD, consisting of elongated crystal bars with dimensions of about 3 x 3 x 57 mm$^3$ and with double-ended SiPM readout, is demonstrated. This design provides a robust time measurement independent of the impact point of the MIP along the crystal bar. We tested LYSO:Ce bars of different thickness (2, 3, 4 mm) with a geometry close to the reference design and coupled to SiPMs manufactured by Hamamatsu and Fondazione Bruno Kessler. The various aspects influencing the timing performance such as the crystal thickness, properties of the SiPMs (e.g. photon detection efficiency), and impact angle of the MIP are studied. A time resolution of about 28 ps is measured for MIPs crossing a 3 mm thick crystal bar, corresponding to an MPV energy deposition of 2.6 MeV, and of 22 ps for the 4.2 MeV MPV energy deposition expected in the BTL, matching the detector performance target for unirradiated devices.
△ Less
Submitted 16 July, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Towards improving architectural diagram consistency using system descriptors
Authors:
Jalves Nicacio,
Fabio Petrillo
Abstract:
Communication between practitioners is essential for the system's quality in the DevOps context. To improve this communication, practitioners often use informal diagrams to represent the components of a system. However, as systems evolve, it is a challenge to synchronize diagrams with production environments consistently. Hence, the inconsistency of architectural diagrams can affect communication…
▽ More
Communication between practitioners is essential for the system's quality in the DevOps context. To improve this communication, practitioners often use informal diagrams to represent the components of a system. However, as systems evolve, it is a challenge to synchronize diagrams with production environments consistently. Hence, the inconsistency of architectural diagrams can affect communication between practitioner and their understanding of systems. In this paper, we propose the use of system descriptors to improve deployment diagram consistency. We state two main hypotheses: (1) if an architectural diagram is generated from a valid system descriptor, then the diagram is consistent; (2) if a valid system descriptor is generated from an architectural diagram, then the diagram is consistent. We report a case study to explore our hypotheses. Furthermore, we constructed a system descriptor from the Netflix deployment diagram, and we applied our tool to generate a new architectural diagram. Finally, we compare the original and generated diagrams to evaluate our proposal. Our case study shows all Docker compose description elements can be graphically represented in the generated architectural diagram, and the generated diagram does not present inconsistent aspects of the original diagram. Thus, our preliminary results lead to further evaluation in controlled and empirical experiments to test our hypotheses.
△ Less
Submitted 25 March, 2021;
originally announced March 2021.
-
A Survey of Video Game Testing
Authors:
Cristiano Politowski,
Fabio Petrillo,
Yann-Gäel Guéhéneuc
Abstract:
Video-game projects are notorious for having day-one bugs, no matter how big their budget or team size. The quality of a game is essential for its success. This quality could be assessed and ensured through testing. However, to the best of our knowledge, little is known about video-game testing. In this paper, we want to understand how game developers perform game testing. We investigate, through…
▽ More
Video-game projects are notorious for having day-one bugs, no matter how big their budget or team size. The quality of a game is essential for its success. This quality could be assessed and ensured through testing. However, to the best of our knowledge, little is known about video-game testing. In this paper, we want to understand how game developers perform game testing. We investigate, through a survey, the academic and gray literature to identify and report on existing testing processes and how they could automate them. We found that game developers rely, almost exclusively, upon manual play-testing and the testers' intrinsic knowledge. We conclude that current testing processes fall short because of their lack of automation, which seems to be the natural next step to improve the quality of games while maintaining costs. However, the current game-testing techniques may not generalize to different types of games.
△ Less
Submitted 10 March, 2021;
originally announced March 2021.
-
Software Engineering for Robotic Systems:a systematic mapping study
Authors:
Marcela G. dos Santos,
Fabio Petrillo
Abstract:
Robots are being applied in a vast range of fields, leading researchers and practitioners to write tasks more complex than in the past. The robot software complexity increases the difficulty of engineering the robot's software components with quality requirements. Researchers and practitioners have applied software engineering (SE) approaches and robotic domains to address this issue in the last t…
▽ More
Robots are being applied in a vast range of fields, leading researchers and practitioners to write tasks more complex than in the past. The robot software complexity increases the difficulty of engineering the robot's software components with quality requirements. Researchers and practitioners have applied software engineering (SE) approaches and robotic domains to address this issue in the last two decades. This study aims to identify, classify and evaluate the current state-of-the-art Software Engineering for Robotic Systems (SERS). We systematically selected and analyzed 50 primary studies extracted from an automated search on Scopus digital library and manual search on the two editions of the RoSE workshop. We present three main contributions. Firstly, we provide an analysis from three following perspectives: demographics of publication, SE areas applied in robotics domains, and RSE findings. Secondly, we show a catalogue of research studies that apply software engineering techniques in the robotic domain, classified with the SWEBOK guide. We have identified 5 of 15 software engineering areas from the SWEBOK guide applied explicitly in robotic domains. The majority of the studies focused on the development phase (design, models and methods and construction). Testing and quality software areas have little coverage in SERS. Finally, we identify research opportunities and gaps in software engineering for robotic systems for future studies.
△ Less
Submitted 24 February, 2021;
originally announced February 2021.
-
What Skills do IT Companies look for in New Developers? A Study with Stack Overflow Jobs
Authors:
João Eduardo Montandon,
Cristiano Politowski,
Luciana Lourdes Silva,
Marco Tulio Valente,
Fabio Petrillo,
Yann-Gaël Guéhéneuc
Abstract:
Context: There is a growing demand for information on how IT companies look for candidates to their open positions. Objective: This paper investigates which hard and soft skills are more required in IT companies by analyzing the description of 20,000 job opportunities. Method: We applied open card sorting to perform a high-level analysis on which types of hard skills are more requested. Further, w…
▽ More
Context: There is a growing demand for information on how IT companies look for candidates to their open positions. Objective: This paper investigates which hard and soft skills are more required in IT companies by analyzing the description of 20,000 job opportunities. Method: We applied open card sorting to perform a high-level analysis on which types of hard skills are more requested. Further, we manually analyzed the most mentioned soft skills. Results: Programming languages are the most demanded hard skills. Communication, collaboration, and problem-solving are the most demanded soft skills. Conclusion: We recommend developers to organize their resumé according to the positions they are applying. We also highlight the importance of soft skills, as they appear in many job opportunities.
△ Less
Submitted 4 November, 2020;
originally announced November 2020.
-
Are the Old Days Gone? A Survey on Actual Software Engineering Processes in Video Game Industry
Authors:
Cristiano Politowski,
Lisandra Fontoura,
Fabio Petrillo,
Yann-Gaël Guéhéneuc
Abstract:
In the past 10 years, several researches studied video game development process who proposed approaches to improve the way how games are developed. These approaches usually adopt agile methodologies because of claims that traditional practices and the waterfall process are gone. However, are the "old days" really gone in the game industry? In this paper, we present a survey of software engineering…
▽ More
In the past 10 years, several researches studied video game development process who proposed approaches to improve the way how games are developed. These approaches usually adopt agile methodologies because of claims that traditional practices and the waterfall process are gone. However, are the "old days" really gone in the game industry? In this paper, we present a survey of software engineering processes in video game industry from postmortem project analyses. We analyzed 20 postmortems from Gamasutra Portal. We extracted their processes and modelled them through using the Business Process Model and Notation (BPMN). This work presents three main contributions. First, a postmortem analysis methodology to identify and extract project processes. Second, the study main result: \textbf{the "old days" are gone, but not completely}. \textbf{Iterative practices} are increasing and are applied to at least \textbf{65\% of projects} in which \textbf{45\% of this projects} explicitly adopted Agile practices. However, \textbf{waterfall} process is still applied at least \textbf{30\% of projects}. Finally, we discuss some implications, directions and opportunities for video game development community.
△ Less
Submitted 4 September, 2020;
originally announced September 2020.
-
Learning from the past: A process recommendation system for video game projects using postmortems experiences
Authors:
Cristiano Politowski,
Lisandra M. Fontoura,
Fabio Petrillo,
Yann-Gaël Guéhéneuc
Abstract:
Context: The video game industry is a billion dollar industry that faces problems in the way games are developed. One method to address these problems is using developer aid tools, such as Recommendation Systems. These tools assist developers by generating recommendations to help them perform their tasks. Objective: This article describes a systematic approach to recommend development processes fo…
▽ More
Context: The video game industry is a billion dollar industry that faces problems in the way games are developed. One method to address these problems is using developer aid tools, such as Recommendation Systems. These tools assist developers by generating recommendations to help them perform their tasks. Objective: This article describes a systematic approach to recommend development processes for video game projects, using postmortem knowledge extraction and a model of the context of the new project, in which "postmortems" are articles written by video game developers at the end of projects, summarizing the experience of their game development team. This approach aims to provide reflections about development processes used in the game industry as well as guidance to developers to choose the most adequate process according to the contexts they're in. Method: Our approach is divided in three separate phases: in the the first phase, we manually extracted the processes from the postmortems analysis; in the second one, we created a video game context and algorithm rules for recommendation; and finally in the third phase, we evaluated the recommended processes by using quantitative and qualitative metrics, game developers feedback, and a case study by interviewing a video game development team. Contributions: This article brings three main contributions. The first describes a database of developers' experiences extracted from postmortems in the form of development processes. The second defines the main attributes that a video game project contain, which it uses to define the contexts of the project. The third describes and evaluates a recommendation system for video game projects, which uses the contexts of the projects to identify similar projects and suggest a set of activities in the form of a process.
△ Less
Submitted 4 September, 2020;
originally announced September 2020.
-
Game Industry Problems: an Extensive Analysis of the Gray Literature
Authors:
Cristiano Politowski,
Fabio Petrillo,
Gabriel C. Ullmann,
Yann-Gaël Guéhéneuc
Abstract:
Context: Given its competitiveness, the video-game industry has a closed-source culture. Hence, little is known of the problems faced by game developers. However, game developers do share information about their games projects through postmortems, which describe informally what happened during the projects. Objective: The software-engineering research community and game developers would benefit fr…
▽ More
Context: Given its competitiveness, the video-game industry has a closed-source culture. Hence, little is known of the problems faced by game developers. However, game developers do share information about their games projects through postmortems, which describe informally what happened during the projects. Objective: The software-engineering research community and game developers would benefit from a state of the problems of the video game industry, in particular the problems faced by game developers, their evolution in time, and their root causes. This state of the practice would allow researchers and practitioners to work towards solving these problems. Method: We analyzed 200 postmortems from 1997 to 2019, resulting in 927 problems divided into 20 types. Through our analysis, we described the overall landscape of game industry problems in the past 23 years and how these problems evolved over the years. We also give details on the most common problems, their root causes, and possible solutions. We finally discuss suggestions for future projects. Results: We observe that (1) the game industry suffers from management and production problems in the same proportion; (2) management problems decreased over the years giving space to business problems, while production problems remained constant; (3a) technical and game design problems are decreasing over the years, the latter only after the last decade; (3b) problems related to the team increase over the last decade;(3c) marketing problems are the ones that had the biggest increase over the 23 years compared to other problem types; (4) finally, the majority of the main root causes are related to people, not technologies. Conclusions: In this paper we provide a state of the practice for researchers to understand and study video-game development problems. We also offer suggestions to help practitioners to avoid the most common problems.
△ Less
Submitted 22 January, 2021; v1 submitted 4 September, 2020;
originally announced September 2020.
-
A Large Scale Empirical Study of the Impact of Spaghetti Code and Blob Anti-patterns on Program Comprehension
Authors:
Cristiano Politowski,
Foutse Khomh,
Simone Romano,
Giuseppe Scanniello,
Fabio Petrillo,
Yann-Gaël Guéhéneuc,
Abdou Maiga
Abstract:
Context: Several studies investigated the impact of anti-patterns (i.e., "poor" solutions to recurring design problems) during maintenance activities and reported that anti-patterns significantly affect the developers' effort required to edit files. However, before developers edit files, they must understand the source code of the systems. This source code must be easy to understand by developers.…
▽ More
Context: Several studies investigated the impact of anti-patterns (i.e., "poor" solutions to recurring design problems) during maintenance activities and reported that anti-patterns significantly affect the developers' effort required to edit files. However, before developers edit files, they must understand the source code of the systems. This source code must be easy to understand by developers. Objective: In this work, we provide a complete assessment of the impact of two instances of two anti-patterns, Blob or Spaghetti Code, on program comprehension. Method: We analyze the impact of these two anti-patterns through three empirical studies conducted at Polytechnique Montréal (Canada) with 24 participants; at Carlton University (Canada) with 30 participants; and at University Basilicata (Italy) with 79 participants. Results: We collect data from 372 tasks obtained thanks to 133 different participants from the three universities. We use three metrics to assess the developers' comprehension of the source code: (1) the duration to complete each task; (2) their percentage of correct answers; and, (3) the NASA task load index for their effort. Conclusions: We report that, although single occurrences of Blob or Spaghetti code anti-patterns have little effect on code comprehension, two occurrences of either Blob or Spaghetti Code significantly increases the developers' time spent in their tasks, reduce their percentage of correct answers, and increase their effort. Hence, we recommend that developers act on both anti-patterns, which should be refactored out of the source code whenever possible. We also recommend further studies on combinations of anti-patterns rather than on single anti-patterns one at a time.
△ Less
Submitted 4 September, 2020;
originally announced September 2020.
-
Applying system descriptors to address ambiguity on deployment diagrams
Authors:
Jalves Nicacio,
Fabio Petrillo
Abstract:
Communication between practitioners is essential for product quality in the DevOps context. This communication often takes place through deployment diagrams of a system under development. However, it is common diagrams to become ambiguous or inconsistent as the system progresses and goes to a continuous delivery pipeline or production. Moreover, diagrams could not follow the evolution of systems,…
▽ More
Communication between practitioners is essential for product quality in the DevOps context. This communication often takes place through deployment diagrams of a system under development. However, it is common diagrams to become ambiguous or inconsistent as the system progresses and goes to a continuous delivery pipeline or production. Moreover, diagrams could not follow the evolution of systems, and it is challenging to associate diagrams to production. In this paper, we propose the use of system descriptors to address the ambiguity of deployment diagrams. We state three main hypotheses (1) if a deployment diagram is generated from a valid system descriptor then the diagram is unambiguous; (2) if a valid system descriptor is generated from a deployment diagram then the descriptor is unambiguous; (3) if a diagram $μ$ generated from a descriptor $A$ is unambiguous and if a descriptor $B$ is generated from the diagram $μ$ equally unambiguous then descriptors $A$ and $B$ are equivalent. We report a case study to test our hypotheses. We constructed a system descriptor from Netflix deployment diagram, and we applied our tool to generate a new deployment diagram. Finally, we compare the original and generated diagrams to evaluate our proposal. Our case study shows the generated deployment diagrams are graphically equivalent to system descriptors and eliminated ambiguous aspects of the original diagram. Thus, our preliminary results lead to further evaluation in controlled and empirical experiments to test our hypotheses conclusively.
△ Less
Submitted 2 October, 2020; v1 submitted 25 August, 2020;
originally announced August 2020.
-
Open Source Software Development Process: A Systematic Review
Authors:
Bianca Minetto Napoleão,
Fabio Petrillo,
Sylvain Hallé
Abstract:
Open Source Software (OSS) has been recognized by the software development community as an effective way to deliver software. Unlike traditional software development, OSS development is driven by collaboration among developers spread geographically and motivated by common goals and interests. Besides this fact, it is recognized by OSS community the need of understand OSS development process and it…
▽ More
Open Source Software (OSS) has been recognized by the software development community as an effective way to deliver software. Unlike traditional software development, OSS development is driven by collaboration among developers spread geographically and motivated by common goals and interests. Besides this fact, it is recognized by OSS community the need of understand OSS development process and its activities. Our goal is to investigate the state-of-art about OSS process through conducting a systematic literature review providing an overview of how the OSS community has been investigating OSS process over past years identifying and summarizing OSS process activities and their characteristics as well as translating OSS process in a macro process through BPMN notation. As a result, we systematically analysed 33 studies presenting an overview of the state-of-art of researches regarding OSS process, a generalized OSS development macro process represented by BPMN notation with a detailed description of each OSS process activity and roles in OSS environment. We conclude that OSS process can be in practice further investigated by researchers. In addition, the presented OSS process can be used as a guide for OSS projects and being adapted according to each OSS project reality. It provides insights to managers and developers who want to improve their development process even in OSS and traditional environments. Finally, recommendations for OSS community regarding OSS process activities are provided.
△ Less
Submitted 11 August, 2020;
originally announced August 2020.
-
DR-Tools: a suite of lightweight open-source tools to measure and visualize Java source code
Authors:
Guilherme Lacerda,
Fabio Petrillo,
Marcelo Pimenta
Abstract:
In Software Engineering, some of the most critical activities are maintenance and evolution. However, to perform both with quality, minimizing impacts and risks, developers need to analyze and identify where the main problems come from previously. In this paper, we introduce DR-Tools Suite, a set of lightweight open-source tools that analyze and calculate source code metrics, allowing developers t…
▽ More
In Software Engineering, some of the most critical activities are maintenance and evolution. However, to perform both with quality, minimizing impacts and risks, developers need to analyze and identify where the main problems come from previously. In this paper, we introduce DR-Tools Suite, a set of lightweight open-source tools that analyze and calculate source code metrics, allowing developers to visualize the results in different formats and graphs. Also, we define a set of heuristics to help the code analysis. We conducted two case studies (one academic and one industrial) to collect feedback on the tools suite, on how we will evolve the tools, as well as insights to develop new tools that support developers in their daily work.
△ Less
Submitted 8 August, 2020;
originally announced August 2020.
-
FASTA/Q Data Compressors for MapReduce-Hadoop Genomics:Space and Time Savings Made Easy -- Version 1
Authors:
Umberto Ferraro Petrillo,
Francesco Palini,
Giuseppe Cattaneo,
Raffaele Giancarlo
Abstract:
Motivation: Storage of genomic data is a major cost for the Life Sciences, effectively addressed mostly via specialized data compression methods. For the same reasons of abundance in data production, the use of Big Data technologies is seen as the future for genomic data storage and processing, with MapReduce-Hadoop as leaders. Somewhat surprisingly, none of the specialized FASTA/Q compressors is…
▽ More
Motivation: Storage of genomic data is a major cost for the Life Sciences, effectively addressed mostly via specialized data compression methods. For the same reasons of abundance in data production, the use of Big Data technologies is seen as the future for genomic data storage and processing, with MapReduce-Hadoop as leaders. Somewhat surprisingly, none of the specialized FASTA/Q compressors is available within Hadoop. Indeed, their deployment there is not exactly immediate. Such a State of the Art is problematic. Results: We provide major advances in two different directions. Methodologically, we propose two general methods, with the corresponding software, that make very easy to deploy a specialized FASTA/Q compressor within MapReduce-Hadoop for processing files stored on the distributed Hadoop File System, with very little knowledge of Hadoop. Practically, we provide evidence that the deployment of those specialized compressors within Hadoop, not available so far, results in major cost savings, i.e., on large plant genomes, 30% less HDFS data blocks (one block=128MB), speed-up of at least x1.5 in I/O time and comparable or reduced network communication time with respect to the use of generic compressors available in Hadoop. Finally, we observe that these results hold also for the Apache Spark framework, when used to process FASTA/Q files stored on the Hadoop File System.
△ Less
Submitted 27 July, 2020;
originally announced July 2020.
-
Measurements of an AC-LGAD strip sensor with a 120 GeV proton beam
Authors:
Artur Apresyan,
Wei Chen,
Gabriele D'Amen,
Karri Folan Di Petrillo,
Gabriele Giacomini,
Ryan Heller,
Hakseong Lee,
Sergey Los,
Chang-Seong Moon,
Alessandro Tricoli
Abstract:
The development of detectors that provide high resolution in four dimensions has attracted wide-spread interest in the scientific community for several applications in high-energy physics, nuclear physics, medical imaging, mass spectroscopy as well as quantum information. In addition to high time resolution and thanks to the AC-coupling of the electrodes, LGAD silicon sensors can provide high reso…
▽ More
The development of detectors that provide high resolution in four dimensions has attracted wide-spread interest in the scientific community for several applications in high-energy physics, nuclear physics, medical imaging, mass spectroscopy as well as quantum information. In addition to high time resolution and thanks to the AC-coupling of the electrodes, LGAD silicon sensors can provide high resolution in the measurement of spatial coordinates of an incident minimum ionizing particle. Such AC-coupled LGADs, also known as AC-LGADs, are therefore considered as candidates for future detectors to provide 4-dimensional measurements in a single sensing device with 100$\%$ fill factor. This article presents the first characterization of an AC-LGAD sensor with a proton beam of 120 GeV momentum at Fermilab. The sensor consists of strips with 80 $μ$m width, fabricated at Brookhaven National Laboratory. The signal properties, efficiency, spatial, and time resolution are presented. The experimental results show that the time resolution of such an AC-LGAD is compatible to standard LGADs with similar gain, and that AC-LGADs can be segmented with fine pitches as standard strip or pixel detectors.
△ Less
Submitted 6 November, 2020; v1 submitted 2 June, 2020;
originally announced June 2020.
-
Alignment-free Genomic Analysis via a Big Data Spark Platform
Authors:
Umberto Ferraro Petrillo,
Francesco Palini,
Giuseppe Cattaneo,
Raffaele Giancarlo
Abstract:
Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a well established alternative to two and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive applications, the computation of AF functions is a Big Data problem, with the recent Literature indicating that the development of fast and scalable algorithms comp…
▽ More
Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a well established alternative to two and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive applications, the computation of AF functions is a Big Data problem, with the recent Literature indicating that the development of fast and scalable algorithms computing AF functions is a high-priority task. Somewhat surprisingly, despite the increasing popularity of Big Data technologies in Computational Biology, the development of a Big Data platform for those tasks has not been pursued, possibly due to its complexity. Results: We fill this important gap by introducing FADE, the first extensible, efficient and scalable Spark platform for Alignment-free genomic analysis. It supports natively eighteen of the best performing AF functions coming out of a recent hallmark benchmarking study. FADE development and potential impact comprises novel aspects of interest. Namely, (a) a considerable effort of distributed algorithms, the most tangible result being a much faster execution time of reference methods like MASH and FSWM; (b) a software design that makes FADE user-friendly and easily extendable by Spark non-specialists; (c) its ability to support data- and compute-intensive tasks. About this, we provide a novel and much needed analysis of how informative and robust AF functions are, in terms of the statistical significance of their output. Our findings naturally extend the ones of the highly regarded benchmarking study, since the functions that can really be used are reduced to a handful of the eighteen included in FADE.
△ Less
Submitted 23 October, 2021; v1 submitted 2 May, 2020;
originally announced May 2020.
-
Internet of Things Architectures: A Comparative Study
Authors:
Marcela G. dos Santos,
Darine Ameyed,
Fabio Petrillo,
Fehmi Jaafar,
Mohamed Cheriet
Abstract:
Over the past two decades, the Internet of Things (IoT) has become an underlying concept to a variety of solutions and technologies that it is now hardly possible to enumerate and describe all of them. The concept behind the Internet of Things is as powerful as it is complex, and for the components in the IoT solution tomesh together perfectly, they all have to be part of a well-thought-out struct…
▽ More
Over the past two decades, the Internet of Things (IoT) has become an underlying concept to a variety of solutions and technologies that it is now hardly possible to enumerate and describe all of them. The concept behind the Internet of Things is as powerful as it is complex, and for the components in the IoT solution tomesh together perfectly, they all have to be part of a well-thought-out structure. That is where understanding the IoT architecture becomes paramount. Because of the vast domain of IoT, there is no single consensus on IoT architecture. Different researchers and organizations proposed different architectures under a variety of classifications, mainly: conceptual, standard and, industrial or commercial adoption. It is indispensable to make a systematic analysis of IoT architecture to be able to compare the industrial proposals and identify their similarities and their differences. In this work, we summarize information about seven IoT industrial architectures in order to propose an approach that makes possible a comparative analysis between different IoT architectures. This work presents two main contributions: (i) an approach for analyzing and comparing IoTarchitectures using Layer-Model; (ii) a comparative study of seven industrial IoT architectures.
△ Less
Submitted 27 April, 2020;
originally announced April 2020.
-
Code Smells and Refactoring: A Tertiary Systematic Review of Challenges and Observations
Authors:
Guilherme Lacerda,
Fabio Petrillo,
Marcelo Pimenta,
Yann Gael Gueheneuc
Abstract:
In this paper, we present a tertiary systematic literature review of previous surveys, secondary systematic literature reviews, and systematic mappings. We identify the main observations (what we know) and challenges (what we do not know) on code smells and refactoring. We show that code smells and refactoring have a strong relationship with quality attributes, i.e., with understandability, mainta…
▽ More
In this paper, we present a tertiary systematic literature review of previous surveys, secondary systematic literature reviews, and systematic mappings. We identify the main observations (what we know) and challenges (what we do not know) on code smells and refactoring. We show that code smells and refactoring have a strong relationship with quality attributes, i.e., with understandability, maintainability, testability, complexity, functionality, and reusability. We argue that code smells and refactoring could be considered as the two faces of a same coin. Besides, we identify how refactoring affects quality attributes, more than code smells. We also discuss the implications of this work for practitioners, researchers, and instructors. We identify 13 open issues that could guide future research work. Thus, we want to highlight the gap between code smells and refactoring in the current state of software-engineering research. We wish that this work could help the software-engineering research community in collaborating on future work on code smells and refactoring.
△ Less
Submitted 22 April, 2020;
originally announced April 2020.
-
Are Game Engines Software Frameworks? A Three-perspective Study
Authors:
Cristiano Politowski,
Fabio Petrillo,
João Eduardo Montandon,
Marco Tulio Valente,
Yann-Gaël Guéhéneuc
Abstract:
Game engines help developers create video games and avoid duplication of code and effort, like frameworks for traditional software systems. In this paper, we explore open-source game engines along three perspectives: literature, code, and human. First, we explore and summarise the academic literature on game engines. Second, we compare the characteristics of the 282 most popular engines and the 28…
▽ More
Game engines help developers create video games and avoid duplication of code and effort, like frameworks for traditional software systems. In this paper, we explore open-source game engines along three perspectives: literature, code, and human. First, we explore and summarise the academic literature on game engines. Second, we compare the characteristics of the 282 most popular engines and the 282 most popular frameworks in GitHub. Finally, we survey 124 engine developers about their experience with the development of their engines. We report that: (1) Game engines are not well-studied in software-engineering research with few studies having engines as object of research. (2) Open-source game engines are slightly larger in terms of size and complexity and less popular and engaging than traditional frameworks. Their programming languages differ greatly from frameworks. Engine projects have shorter histories with less releases. (3) Developers perceive game engines as different from traditional frameworks. Generally, they build game engines to (a) better control the environment and source code, (b) learn about game engines, and (c) develop specific games. We conclude that open-source game engines have differences compared to traditional open-source frameworks although this differences do not demand special treatments.
△ Less
Submitted 19 September, 2020; v1 submitted 12 April, 2020;
originally announced April 2020.
-
A Tertiary and Secondary Study Canvas
Authors:
Bianca Minetto Napoleão,
Fabio Petrillo,
Sylvain Hallé
Abstract:
Over the past years, more secondary (Systematic Literature Reviews and Systematic Mappings) and tertiary studies have been conducted. Their conduction is considered a quite large task and labor-intensive since it involves a detailed process including a protocol development, which is one of the most challenging phase reported by the software engineering research community. In this scenario, we prop…
▽ More
Over the past years, more secondary (Systematic Literature Reviews and Systematic Mappings) and tertiary studies have been conducted. Their conduction is considered a quite large task and labor-intensive since it involves a detailed process including a protocol development, which is one of the most challenging phase reported by the software engineering research community. In this scenario, we propose a Secondary and Tertiary Study Canvas aiming to simplify and clarify the understanding of the steps that need to be performed during the secondary and tertiary process conduction, including the protocol development. For this, we synthesized and organized the existing secondary studies' protocols in a Canvas format as well as suggesting a step-based approach to assist the secondary and tertiary studies' conduction.
△ Less
Submitted 2 April, 2020;
originally announced April 2020.