Skip to main content

Showing 1–28 of 28 results for author: Strisciuglio, N

  1. arXiv:2410.03030  [pdf, other

    cs.CV cs.AI

    Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness

    Authors: Boqian Wu, Qiao Xiao, Shunxin Wang, Nicola Strisciuglio, Mykola Pechenizkiy, Maurice van Keulen, Decebal Constantin Mocanu, Elena Mocanu

    Abstract: It is generally perceived that Dynamic Sparse Training opens the door to a new era of scalability and efficiency for artificial neural networks at, perhaps, some costs in accuracy performance for the classification task. At the same time, Dense Training is widely accepted as being the "de facto" approach to train artificial neural networks if one would like to maximize their robustness against ima… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  2. arXiv:2408.13029  [pdf, other

    cs.CV

    Indoor scene recognition from images under visual corruptions

    Authors: Willams de Lima Costa, Raul Ismayilov, Nicola Strisciuglio, Estefania Talavera Martinez

    Abstract: The classification of indoor scenes is a critical component in various applications, such as intelligent robotics for assistive living. While deep learning has significantly advanced this field, models often suffer from reduced performance due to image corruption. This paper presents an innovative approach to indoor scene recognition that leverages multimodal data fusion, integrating caption-based… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  3. arXiv:2408.04077  [pdf, ps, other

    cs.CV

    PushPull-Net: Inhibition-driven ResNet robust to image corruptions

    Authors: Guru Swaroop Bennabhaktula, Enrique Alegre, Nicola Strisciuglio, George Azzopardi

    Abstract: We introduce a novel computational unit, termed PushPull-Conv, in the first layer of a ResNet architecture, inspired by the anti-phase inhibition phenomenon observed in the primary visual cortex. This unit redefines the traditional convolutional layer by implementing a pair of complementary filters: a trainable push kernel and its counterpart, the pull kernel. The push kernel (analogous to traditi… ▽ More

    Submitted 13 September, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted at ICPR 2024, code available at https://github.com/bgswaroop/pushpull-conv

  4. arXiv:2403.01944  [pdf, other

    cs.CV cs.LG

    Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification

    Authors: Puru Vaish, Shunxin Wang, Nicola Strisciuglio

    Abstract: Computer vision models normally witness degraded performance when deployed in real-world scenarios, due to unexpected changes in inputs that were not accounted for during training. Data augmentation is commonly used to address this issue, as it aims to increase data variety and reduce the distribution gap between training and test data. However, common visual augmentations might not guarantee exte… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  5. arXiv:2402.04239  [pdf, other

    cs.LG

    CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers

    Authors: Adjorn van Engelenhoven, Nicola Strisciuglio, Estefanía Talavera

    Abstract: The Transformer architecture has shown to be a powerful tool for a wide range of tasks. It is based on the self-attention mechanism, which is an inherently computationally expensive operation with quadratic computational complexity: memory usage and compute time increase quadratically with the length of the input sequences, thus limiting the application of Transformers. In this work, we propose a… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2401.16304  [pdf, other

    cs.CV cs.LG

    Regressing Transformers for Data-efficient Visual Place Recognition

    Authors: María Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Visual place recognition is a critical task in computer vision, especially for localization and navigation systems. Existing methods often rely on contrastive learning: image descriptors are trained to have small distance for similar images and larger distance for dissimilar ones in a latent space. However, this approach struggles to ensure accurate distance-based image similarity representation,… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in ICRA 2024

  7. arXiv:2312.05119  [pdf, other

    eess.IV cs.CV

    Quantifying white matter hyperintensity and brain volumes in heterogeneous clinical and low-field portable MRI

    Authors: Pablo Laso, Stefano Cerri, Annabel Sorby-Adams, Jennifer Guo, Farrah Mateen, Philipp Goebl, Jiaming Wu, Peirong Liu, Hongwei Li, Sean I. Young, Benjamin Billot, Oula Puonti, Gordon Sze, Sam Payabavash, Adam DeHavenon, Kevin N. Sheth, Matthew S. Rosen, John Kirsch, Nicola Strisciuglio, Jelmer M. Wolterink, Arman Eshaghi, Frederik Barkhof, W. Taylor Kimberly, Juan Eugenio Iglesias

    Abstract: Brain atrophy and white matter hyperintensity (WMH) are critical neuroimaging features for ascertaining brain injury in cerebrovascular disease and multiple sclerosis. Automated segmentation and quantification is desirable but existing methods require high-resolution MRI with good signal-to-noise ratio (SNR). This precludes application to clinical and low-field portable MRI (pMRI) scans, thus hamp… ▽ More

    Submitted 15 February, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  8. arXiv:2310.12677  [pdf, other

    cs.CV

    Case-level Breast Cancer Prediction for Real Hospital Settings

    Authors: Shreyasi Pathak, Jörg Schlötterer, Jeroen Geerdink, Jeroen Veltman, Maurice van Keulen, Nicola Strisciuglio, Christin Seifert

    Abstract: Breast cancer prediction models for mammography assume that annotations are available for individual images or regions of interest (ROIs), and that there is a fixed number of images per patient. These assumptions do not hold in real hospital settings, where clinicians provide only a final diagnosis for the entire mammography exam (case). Since data in real hospital settings scales with continuous… ▽ More

    Submitted 19 October, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 31 pages, 15 figures, 12 tables

  9. arXiv:2308.06622  [pdf, other

    cs.CV

    DFM-X: Augmentation by Leveraging Prior Knowledge of Shortcut Learning

    Authors: Shunxin Wang, Christoph Brune, Raymond Veldhuis, Nicola Strisciuglio

    Abstract: Neural networks are prone to learn easy solutions from superficial statistics in the data, namely shortcut learning, which impairs generalization and robustness of models. We propose a data augmentation strategy, named DFM-X, that leverages knowledge about frequency shortcuts, encoded in Dominant Frequencies Maps computed for image classification models. We randomly select X% training images of ce… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCVW2023

  10. arXiv:2307.15461  [pdf, other

    eess.IV cs.CV

    Defocus Blur Synthesis and Deblurring via Interpolation and Extrapolation in Latent Space

    Authors: Ioana Mazilu, Shunxin Wang, Sven Dummer, Raymond Veldhuis, Christoph Brune, Nicola Strisciuglio

    Abstract: Though modern microscopes have an autofocusing system to ensure optimal focus, out-of-focus images can still occur when cells within the medium are not all in the same focal plane, affecting the image quality for medical diagnosis and analysis of diseases. We propose a method that can deblur images as well as synthesize defocus blur. We train autoencoders with implicit and explicit regularization… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted at CAIP2023

  11. arXiv:2307.09829  [pdf, other

    cs.LG cs.CV

    What do neural networks learn in image classification? A frequency shortcut perspective

    Authors: Shunxin Wang, Raymond Veldhuis, Christoph Brune, Nicola Strisciuglio

    Abstract: Frequency analysis is useful for understanding the mechanisms of representation learning in neural networks (NNs). Most research in this area focuses on the learning dynamics of NNs for regression tasks, while little for classification. This study empirically investigates the latter and expands the understanding of frequency shortcuts. First, we perform experiments on synthetic datasets, designed… ▽ More

    Submitted 30 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV2023

  12. arXiv:2305.12854  [pdf, other

    eess.IV cs.CV cs.GR

    RDA-INR: Riemannian Diffeomorphic Autoencoding via Implicit Neural Representations

    Authors: Sven Dummer, Nicola Strisciuglio, Christoph Brune

    Abstract: Diffeomorphic registration frameworks such as Large Deformation Diffeomorphic Metric Mapping (LDDMM) are used in computer graphics and the medical domain for atlas building, statistical latent modeling, and pairwise and groupwise registration. In recent years, researchers have developed neural network-based approaches regarding diffeomorphic registration to improve the accuracy and computational e… ▽ More

    Submitted 30 July, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 41 pages, 27 figures (including subfigures), revised version, to be published in SIAM Journal on Imaging Sciences

    MSC Class: 58D05; 53A05; 58B20; 68T07; 62-07; 68U10

  13. arXiv:2305.06024  [pdf, other

    cs.CV

    A Survey on the Robustness of Computer Vision Models against Common Corruptions

    Authors: Shunxin Wang, Raymond Veldhuis, Christoph Brune, Nicola Strisciuglio

    Abstract: The performance of computer vision models are susceptible to unexpected changes in input images caused by sensor errors or extreme imaging environments, known as common corruptions (e.g. noise, blur, illumination changes). These corruptions can significantly hinder the reliability of these models when deployed in real-world scenarios, yet they are often overlooked when testing model generalization… ▽ More

    Submitted 14 September, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

  14. arXiv:2303.11739  [pdf, other

    cs.CV eess.IV

    Data-efficient Large Scale Place Recognition with Graded Similarity Supervision

    Authors: Maria Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Visual place recognition (VPR) is a fundamental task of computer vision for visual localization. Existing methods are trained using image pairs that either depict the same place or not. Such a binary indication does not consider continuous relations of similarity between images of the same place taken from different positions, determined by the continuous nature of camera pose. The binary similari… ▽ More

    Submitted 25 March, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  15. arXiv:2204.03503  [pdf, other

    cs.CL cs.AI cs.LG

    Survey on Automated Short Answer Grading with Deep Learning: from Word Embeddings to Transformers

    Authors: Stefan Haller, Adina Aldea, Christin Seifert, Nicola Strisciuglio

    Abstract: Automated short answer grading (ASAG) has gained attention in education as a means to scale educational tasks to the growing number of students. Recent progress in Natural Language Processing and Machine Learning has largely influenced the field of ASAG, of which we survey the recent research advancements. We complement previous surveys by providing a comprehensive analysis of recently published m… ▽ More

    Submitted 11 March, 2022; originally announced April 2022.

    Comments: Under review

  16. arXiv:2103.06638  [pdf, other

    cs.CV

    Generalized Contrastive Optimization of Siamese Networks for Place Recognition

    Authors: María Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Visual place recognition is a challenging task in computer vision and a key component of camera-based localization and navigation systems. Recently, Convolutional Neural Networks (CNNs) achieved high results and good generalization capabilities. They are usually trained using pairs or triplets of images labeled as either similar or dissimilar, in a binary fashion. In practice, the similarity betwe… ▽ More

    Submitted 20 April, 2023; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Published at CVPR2023 as arXiv:2303.11739

  17. arXiv:2103.01634  [pdf, other

    cs.CV eess.IV

    Brain-inspired algorithms for processing of visual data

    Authors: Nicola Strisciuglio

    Abstract: The study of the visual system of the brain has attracted the attention and interest of many neuro-scientists, that derived computational models of some types of neuron that compose it. These findings inspired researchers in image processing and computer vision to deploy such models to solve problems of visual data processing. In this paper, we review approaches for image processing and computer v… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  18. arXiv:2006.15373  [pdf, other

    cs.CV

    MTStereo 2.0: improved accuracy of stereo depth estimation withMax-trees

    Authors: Rafael Brandt, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Efficient yet accurate extraction of depth from stereo image pairs is required by systems with low power resources, such as robotics and embedded systems. State-of-the-art stereo matching methods based on convolutional neural networks require intensive computations on GPUs and are difficult to deploy on embedded systems. In this paper, we propose a stereo matching method, called MTStereo 2.0, for… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

  19. arXiv:1906.12151  [pdf, other

    cs.CV

    Place recognition in gardens by learning visual representations: data set and benchmark analysis

    Authors: Maria Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Visual place recognition is an important component of systems for camera localization and loop closure detection. It concerns the recognition of a previously visited place based on visual cues only. Although it is a widely studied problem for indoor and urban environments, the recent use of robots for automation of agricultural and gardening tasks has created new problems, due to the challenging a… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

    Comments: Accepted for the 18th International Conference on Computer Analysis of Images and Patterns

  20. arXiv:1901.10208  [pdf, ps, other

    cs.CV cs.LG

    A Push-Pull Layer Improves Robustness of Convolutional Neural Networks

    Authors: Nicola Strisciuglio, Manuel Lopez-Antequera, Nicolai Petkov

    Abstract: We propose a new layer in Convolutional Neural Networks (CNNs) to increase their robustness to several types of noise perturbations of the input images. We call this a push-pull layer and compute its response as the combination of two half-wave rectified convolutions, with kernels of opposite polarity. It is based on a biologically-motivated non-linear model of certain neurons in the visual system… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.

  21. arXiv:1901.06904  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Learning sound representations using trainable COPE feature extractors

    Authors: Nicola Strisciuglio, Mario Vento, Nicolai Petkov

    Abstract: Sound analysis research has mainly been focused on speech and music processing. The deployed methodologies are not suitable for analysis of sounds with varying background noise, in many cases with very low signal-to-noise ratio (SNR). In this paper, we present a method for the detection of patterns of interest in audio signals. We propose novel trainable feature extractors, which we call COPE (Com… ▽ More

    Submitted 22 March, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: Accepted for publication in Pattern Recognition

    Journal ref: Pattern Recognition (2019)

  22. arXiv:1811.10240  [pdf, other

    cs.CV

    Brain-inspired robust delineation operator

    Authors: Nicola Strisciuglio, George Azzopardi, Nicolai Petkov

    Abstract: In this paper we present a novel filter, based on the existing COSFIRE filter, for the delineation of patterns of interest. It includes a mechanism of push-pull inhibition that improves robustness to noise in terms of spurious texture. Push-pull inhibition is a phenomenon that is observed in neurons in area V1 of the visual cortex, which suppresses the response of certain simple cells for stimuli… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted at Brain-driven Computer Vision workshop at ECCV 2018

  23. arXiv:1804.01792  [pdf, other

    cs.RO

    TrimBot2020: an outdoor robot for automatic gardening

    Authors: Nicola Strisciuglio, Radim Tylecek, Michael Blaich, Nicolai Petkov, Peter Bieber, Jochen Hemming, Eldert van Henten, Torsten Sattler, Marc Pollefeys, Theo Gevers, Thomas Brox, Robert B. Fisher

    Abstract: Robots are increasingly present in modern industry and also in everyday life. Their applications range from health-related situations, for assistance to elderly people or in surgical operations, to automatic and driver-less vehicles (on wheels or flying) or for driving assistance. Recently, an interest towards robotics applied in agriculture and gardening has arisen, with applications to automatic… ▽ More

    Submitted 15 May, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: Accepted for publication at International Sympsium on Robotics 2018

  24. arXiv:1801.00688  [pdf, ps, other

    cs.CV cs.AI cs.SD eess.AS eess.IV

    Learning audio and image representations with bio-inspired trainable feature extractors

    Authors: Nicola Strisciuglio

    Abstract: Recent advancements in pattern recognition and signal processing concern the automatic learning of data representations from labeled training samples. Typical approaches are based on deep learning and convolutional neural networks, which require large amount of labeled training samples. In this work, we propose novel feature extractors that can be used to learn the representation of single prototy… ▽ More

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Accepted for publication in the journal "Eleectronic Letters on Computer Vision and Image Understanding"

  25. arXiv:1708.00672  [pdf, other

    cs.CV

    Action recognition by learning pose representations

    Authors: Alessia Saggese, Nicola Strisciuglio, Mario Vento, Nicolai Petkov

    Abstract: Pose detection is one of the fundamental steps for the recognition of human actions. In this paper we propose a novel trainable detector for recognizing human poses based on the analysis of the skeleton. The main idea is that a skeleton pose can be described by the spatial arrangements of its joints. Starting from this consideration, we propose a trainable pose detector, that can be configured on… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: Accepted at REACTS workshop (CAIP conference 2017)

  26. arXiv:1707.07747  [pdf, other

    cs.CV

    Detection of curved lines with B-COSFIRE filters: A case study on crack delineation

    Authors: Nicola Strisciuglio, George Azzopardi, Nicolai Petkov

    Abstract: The detection of curvilinear structures is an important step for various computer vision applications, ranging from medical image analysis for segmentation of blood vessels, to remote sensing for the identification of roads and rivers, and to biometrics and robotics, among others. %The visual system of the brain has remarkable abilities to detect curvilinear structures in noisy images. This is a n… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: Accepted at Computer Analysis of Images and Patterns (CAIP) 2017

  27. arXiv:1707.07438  [pdf, ps, other

    cs.CV

    Delineation of line patterns in images using B-COSFIRE filters

    Authors: Nicola Strisciuglio, Nicolai Petkov

    Abstract: Delineation of line patterns in images is a basic step required in various applications such as blood vessel detection in medical images, segmentation of rivers or roads in aerial images, detection of cracks in walls or pavements, etc. In this paper we present trainable B-COSFIRE filters, which are a model of some neurons in area V1 of the primary visual cortex, and apply it to the delineation of… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: International Work Conference on Bioinspired Intelligence, July 10-13, 2017

  28. arXiv:1703.09933  [pdf, other

    cs.CV

    Sentiment Recognition in Egocentric Photostreams

    Authors: Estefania Talavera, Nicola Strisciuglio, Nicolai Petkov, Petia Radeva

    Abstract: Lifelogging is a process of collecting rich source of information about daily life of people. In this paper, we introduce the problem of sentiment analysis in egocentric events focusing on the moments that compose the images recalling positive, neutral or negative feelings to the observer. We propose a method for the classification of the sentiments in egocentric pictures based on global and seman… ▽ More

    Submitted 29 March, 2017; originally announced March 2017.