Showing 1–21 of 21 results for author: Cossu, A

  1. arXiv:2404.14909  [pdf, other]

    cs.LG hep-th

    MultiSTOP: Solving Functional Equations with Reinforcement Learning

    Authors: Alessandro Trenta, Davide Bacciu, Andrea Cossu, Pietro Ferrero

    Abstract: We develop MultiSTOP, a Reinforcement Learning framework for solving functional equations in physics. This new methodology produces actual numerical solutions instead of bounds on them. We extend the original BootSTOP algorithm by adding multiple constraints derived from domain-specific knowledge, even in integral form, to improve the accuracy of the solution. We investigate a particular equation…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 Workshop on AI4DifferentialEquations In Science

  2. arXiv:2404.07817  [pdf, other]

    cs.LG cs.AI

    Calibration of Continual Learning Models

    Authors: Lanpei Li, Elia Piccoli, Andrea Cossu, Davide Bacciu, Vincenzo Lomonaco

    Abstract: Continual Learning (CL) focuses on maximizing the predictive performance of a model across a non-stationary stream of data. Unfortunately, CL models tend to forget previous knowledge, thus often underperforming when compared with an offline model trained jointly on the entire data stream. Given that any CL model will eventually make mistakes, it is of crucial importance to build calibrated CL mode…

    Submitted 12 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted at CLVISION workshop, CVPR 2024

  3. arXiv:2311.11908  [pdf, other]

    cs.LG cs.AI cs.CV

    Continual Learning: Applications and the Road Forward

    Authors: Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L. Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, Christoph H. Lampert, Martin Mundt, Razvan Pascanu, Adrian Popescu, Andreas S. Tolias, Joost van de Weijer, Bing Liu, Vincenzo Lomonaco, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Continual learning is a subfield of machine learning, which aims to allow machine learning models to continuously learn on new data, by accumulating knowledge without forgetting what was learned in the past. In this work, we take a step back, and ask: "Why should one care about continual learning in the first place?". We set the stage by examining recent continual learning papers published at four…

    Submitted 28 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  4. arXiv:2308.10328  [pdf, other]

    cs.LG

    A Comprehensive Empirical Evaluation on Online Continual Learning

    Authors: Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Hamed Hemati, Vincenzo Lomonaco, Joost Van de Weijer

    Abstract: Online continual learning aims to get closer to a live learning experience by learning directly on a stream of data with temporally shifting distribution and by storing a minimum amount of data from that stream. In this empirical evaluation, we evaluate various methods from the literature that tackle online continual learning. More specifically, we focus on the class-incremental setting in the con…

    Submitted 23 September, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: ICCV Visual Continual Learning Workshop 2023 accepted paper

  5. arXiv:2306.07218  [pdf, other]

    cs.LG cs.AI

    A Protocol for Continual Explanation of SHAP

    Authors: Andrea Cossu, Francesco Spinnato, Riccardo Guidotti, Davide Bacciu

    Abstract: Continual Learning trains models on a stream of data, with the aim of learning new information without forgetting previous knowledge. Given the dynamic nature of such environments, explaining the predictions of these models can be challenging. We study the behavior of SHAP value explanations in Continual Learning and propose an evaluation protocol to robustly assess the change of explanations in…

    Submitted 20 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: ESANN 2023, 6 pages, added link to code

  6. arXiv:2303.15888  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE

    Projected Latent Distillation for Data-Agnostic Consolidation in Distributed Continual Learning

    Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu, Joost van de Weijer

    Abstract: Distributed learning on the edge often comprises self-centered devices (SCDs) which learn local tasks independently and are unwilling to contribute to the performance of other SCDs. How do we achieve forward transfer at zero cost for the single SCDs? We formalize this problem as a Distributed Continual Learning scenario, where SCDs adapt to local tasks and a CL model consolidates the knowledge from…

    Submitted 28 March, 2023; originally announced March 2023.

  7. arXiv:2302.01766  [pdf, other]

    cs.LG

    Avalanche: A PyTorch Library for Deep Continual Learning

    Authors: Antonio Carta, Lorenzo Pellegrini, Andrea Cossu, Hamed Hemati, Vincenzo Lomonaco

    Abstract: Continual learning is the problem of learning from a nonstationary stream of data, a fundamental issue for sustainable and efficient training of deep neural networks over time. Unfortunately, deep learning libraries only provide primitives for offline training, assuming that the model's architecture and data are fixed. Avalanche is an open source library maintained by the ContinualAI non-profit organi…

    Submitted 2 February, 2023; originally announced February 2023.

  8. arXiv:2301.11396  [pdf, other]

    cs.LG

    Class-Incremental Learning with Repetition

    Authors: Hamed Hemati, Andrea Cossu, Antonio Carta, Julio Hurtado, Lorenzo Pellegrini, Davide Bacciu, Vincenzo Lomonaco, Damian Borth

    Abstract: Real-world data streams naturally include the repetition of previous concepts. From a Continual Learning (CL) perspective, repetition is a property of the environment and, unlike replay, cannot be controlled by the agent. Nowadays, the Class-Incremental (CI) scenario represents the leading test-bed for assessing and comparing CL strategies. This scenario type is very easy to use, but it never allo…

    Submitted 19 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023, 19 pages

  9. arXiv:2207.00010  [pdf, other]

    cs.LG cs.AI cs.HC

    Continual Learning for Human State Monitoring

    Authors: Federico Matteoni, Andrea Cossu, Claudio Gallicchio, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Continual Learning (CL) on time series data represents a promising but under-studied avenue for real-world applications. We propose two new CL benchmarks for Human State Monitoring. We carefully designed the benchmarks to mirror real-world environments in which new subjects are continuously added. We conducted an empirical evaluation to assess the ability of popular CL strategies to mitigate forge…

    Submitted 11 July, 2022; v1 submitted 29 June, 2022; originally announced July 2022.

    Comments: 6 pages, 4 figures, 2 tables, Accepted as oral at ESANN 2022

  10. arXiv:2206.11849  [pdf, other]

    cs.LG cs.AI cs.CV

    Sample Condensation in Online Continual Learning

    Authors: Mattia Sangermano, Antonio Carta, Andrea Cossu, Davide Bacciu

    Abstract: Online Continual Learning is a challenging learning scenario where the model must learn from a non-stationary stream of data where each sample is seen only once. The main challenge is to incrementally learn while avoiding catastrophic forgetting, namely the problem of forgetting previously acquired knowledge while learning from new data. A popular solution in this scenario is to use a small memor…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted as a conference paper at 2022 International Joint Conference on Neural Networks (IJCNN 2022). Part of 2022 IEEE World Congress on Computational Intelligence (IEEE WCCI 2022)

  11. arXiv:2205.09357  [pdf, other]

    cs.LG cs.AI

    Continual Pre-Training Mitigates Forgetting in Language and Vision

    Authors: Andrea Cossu, Tinne Tuytelaars, Antonio Carta, Lucia Passaro, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Pre-trained models are nowadays a fundamental component of machine learning research. In continual learning, they are commonly used to initialize the model before training on the stream of non-stationary data. However, pre-training is rarely applied during continual learning. We formalize and investigate the characteristics of the continual pre-training scenario in both language and vision environ…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: under review

  12. Practical Recommendations for Replay-based Continual Learning Methods

    Authors: Gabriele Merlin, Vincenzo Lomonaco, Andrea Cossu, Antonio Carta, Davide Bacciu

    Abstract: Continual Learning requires the model to learn from a stream of dynamic, non-stationary data without forgetting previous knowledge. Several approaches have been developed in the literature to tackle the Continual Learning challenge. Among them, Replay approaches have empirically proved to be the most effective ones. Replay operates by saving some samples in memory which are then used to rehearse k…

    Submitted 19 March, 2022; originally announced March 2022.

    Journal ref: ICIAP 2022 Workshops

  13. arXiv:2112.06511  [pdf, other]

    cs.LG cs.AI cs.CV

    Ex-Model: Continual Learning from a Stream of Trained Models

    Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Learning continually from non-stationary data streams is a challenging research topic of growing popularity in the last few years. Being able to learn, adapt, and generalize continually in an efficient, effective, and scalable way is fundamental for a sustainable development of Artificial Intelligence systems. However, an agent-centric view of continual learning requires learning directly from raw…

    Submitted 13 December, 2021; originally announced December 2021.

  14. arXiv:2112.02925  [pdf, other]

    cs.LG cs.AI

    Is Class-Incremental Enough for Continual Learning?

    Authors: Andrea Cossu, Gabriele Graffieti, Lorenzo Pellegrini, Davide Maltoni, Davide Bacciu, Antonio Carta, Vincenzo Lomonaco

    Abstract: The ability of a model to learn continually can be empirically assessed in different continual learning scenarios. Each scenario defines the constraints and the opportunities of the learning environment. Here, we challenge the current trend in the continual learning literature to experiment mainly on class-incremental scenarios, where classes present in one experience are never revisited. We posit…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Under review

  15. arXiv:2111.09437  [pdf, other]

    cs.AI cs.LG

    Sustainable Artificial Intelligence through Continual Learning

    Authors: Andrea Cossu, Marta Ziosi, Vincenzo Lomonaco

    Abstract: The increasing attention on Artificial Intelligence (AI) regulation has led to the definition of a set of ethical principles grouped into the Sustainable AI framework. In this article, we identify Continual Learning, an active area of AI research, as a promising approach towards the design of systems compliant with the Sustainable AI principles. While Sustainable AI outlines general desiderata for…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at the 2021 International Conference on AI for People (CAIP)

  16. arXiv:2105.07674  [pdf, ps, other]

    cs.LG cs.AI

    Continual Learning with Echo State Networks

    Authors: Andrea Cossu, Davide Bacciu, Antonio Carta, Claudio Gallicchio, Vincenzo Lomonaco

    Abstract: Continual Learning (CL) refers to a learning setup where data is non-stationary and the model has to learn without forgetting existing knowledge. The study of CL for sequential patterns revolves around trained recurrent networks. In this work, instead, we introduce CL in the context of Echo State Networks (ESNs), where the recurrent component is kept fixed. We provide the first evaluation of catas…

    Submitted 17 August, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted as oral at ESANN 2021

  17. arXiv:2104.00405  [pdf, other]

    cs.LG cs.AI cs.CV

    Avalanche: an End-to-End Library for Continual Learning

    Authors: Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuzzolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adrian Popescu, Christopher Kanan, Joost van de Weijer, et al. (3 additional authors not shown)

    Abstract: Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have witnessed a renewed and fast-growing interest in continual learning, especially within the deep learning community. However, algorithmic solutions are often difficult to re-implement, evaluate and port across different settings, where even results on standa…

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Official Website: https://avalanche.continualai.org

  18. arXiv:2103.15851  [pdf, other]

    cs.LG cs.AI

    Distilled Replay: Overcoming Forgetting through Synthetic Samples

    Authors: Andrea Rosasco, Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Replay strategies are Continual Learning techniques which mitigate catastrophic forgetting by keeping a buffer of patterns from previous experiences, which are interleaved with new data during training. The number of patterns stored in the buffer is a critical parameter which largely influences the final performance and the memory footprint of the approach. This work introduces Distilled Replay, a…

    Submitted 22 June, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

  19. arXiv:2103.11750  [pdf, other]

    cs.LG cs.AI

    Catastrophic Forgetting in Deep Graph Networks: an Introductory Benchmark for Graph Classification

    Authors: Antonio Carta, Andrea Cossu, Federico Errica, Davide Bacciu

    Abstract: In this work, we study the phenomenon of catastrophic forgetting in the graph representation learning scenario. The primary objective of the analysis is to understand whether classical continual learning techniques for flat and sequential data have a tangible impact on performance when applied to graph data. To do so, we experiment with a structure-agnostic model and a deep graph network in a rob…

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted at the 2021 Web Conference Workshop on Graph Learning Benchmarks (GLB 2021). Code available at https://github.com/diningphil/continual_learning_for_graphs

    ACM Class: I.2.6

  20. Continual Learning for Recurrent Neural Networks: an Empirical Evaluation

    Authors: Andrea Cossu, Antonio Carta, Vincenzo Lomonaco, Davide Bacciu

    Abstract: Learning continuously throughout the model's lifetime is fundamental to deploy machine learning solutions robust to drifts in the data distribution. Advances in Continual Learning (CL) with recurrent neural networks could pave the way to a large number of applications where incoming data is non-stationary, like natural language processing and robotics. However, the existing body of work on the topic is…

    Submitted 2 August, 2021; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Published in Neural Networks

    Journal ref: Neural Networks, Volume 143, 2021, pages 607-627

  21. Continual Learning with Gated Incremental Memories for sequential data processing

    Authors: Andrea Cossu, Antonio Carta, Davide Bacciu

    Abstract: The ability to learn in dynamic, nonstationary environments without forgetting previous knowledge, also known as Continual Learning (CL), is a key enabler for scalable and trustworthy deployments of adaptive solutions. While the importance of continual learning is largely acknowledged in machine vision and reinforcement learning problems, this is mostly under-documented for sequence processing tas…

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Accepted as a conference paper at 2020 International Joint Conference on Neural Networks (IJCNN 2020). Part of 2020 IEEE World Congress on Computational Intelligence (IEEE WCCI 2020)