Showing 1–28 of 28 results for author: Lebbah, M

  1. arXiv:2410.12264  [pdf, other]

    cs.LG cs.GT

    Game Theory Meets Statistical Mechanics in Deep Learning Design

    Authors: Djamel Bouchaffra, Fayçal Ykhlef, Bilal Faye, Hanane Azzag, Mustapha Lebbah

    Abstract: We present a novel deep graphical representation that seamlessly merges principles of game theory with laws of statistical mechanics. It performs feature extraction, dimensionality reduction, and pattern classification within a single learning framework. Our approach draws an analogy between neurons in a network and players in a game theory model. Furthermore, each neuron viewed as a classical par…

    Submitted 16 October, 2024; originally announced October 2024.

  2. arXiv:2410.06173  [pdf, other]

    cs.CL cs.AI cs.LG

    Manual Verbalizer Enrichment for Few-Shot Text Classification

    Authors: Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag, Santiago Cordoba Muñoz

    Abstract: With the continuous development of pre-trained language models, prompt-based training becomes a well-adopted paradigm that drastically improves the exploitation of models for many natural language processing tasks. Prompting also shows great performance compared to traditional fine-tuning when adapted to zero-shot or few-shot scenarios where the number of annotated data is limited. In this framewo…

    Submitted 8 October, 2024; originally announced October 2024.

  3. arXiv:2409.11059  [pdf, other]

    cs.CV cs.LG

    OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities

    Authors: Bilal Faye, Hanane Azzag, Mustapha Lebbah

    Abstract: Cross-modal alignment Learning integrates information from different modalities like text, image, audio and video to create unified models. This approach develops shared representations and learns correlations between modalities, enabling applications such as visual question answering and audiovisual content analysis. Current techniques rely on large modality-specific encoders, necessitating fine-…

    Submitted 18 September, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  4. arXiv:2409.10111  [pdf, other]

    cs.LG cs.CE cs.NE

    Evaluating the Efficacy of Instance Incremental vs. Batch Learning in Delayed Label Environments: An Empirical Study on Tabular Data Streaming for Fraud Detection

    Authors: Kodjo Mawuena Amekoe, Mustapha Lebbah, Gregoire Jaffre, Hanene Azzag, Zaineb Chelly Dagdia

    Abstract: Real-world tabular learning production scenarios typically involve evolving data streams, where data arrives continuously and its distribution may change over time. In such a setting, most studies in the literature regarding supervised learning favor the use of instance incremental algorithms due to their ability to adapt to changes in the data distribution. Another significant reason for choosing…

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 20 pages

  5. arXiv:2409.04759  [pdf, other]

    cs.CV cs.LG

    Adaptative Context Normalization: A Boost for Deep Learning in Image Processing

    Authors: Bilal Faye, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra

    Abstract: Deep Neural network learning for image processing faces major challenges related to changes in distribution across layers, which disrupt model convergence and performance. Activation normalization methods, such as Batch Normalization (BN), have revolutionized this field, but they rely on the simplified assumption that data distribution can be modelled by a single Gaussian distribution. To overcome…

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.16798

    Journal ref: ICIP 2024

  6. arXiv:2409.04757  [pdf, other]

    cs.LG

    Unsupervised Adaptive Normalization

    Authors: Bilal Faye, Hanane Azzag, Mustapha Lebbah, Fangchen Fang

    Abstract: Deep neural networks have become a staple in solving intricate problems, proving their mettle in a wide array of applications. However, their training process is often hampered by shifting activation distributions during backpropagation, resulting in unstable gradients. Batch Normalization (BN) addresses this issue by normalizing activations, which allows for the use of higher learning rates. Desp…

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.16798

    Journal ref: IJCNN 2024

  7. arXiv:2408.10787  [pdf, other]

    cs.CV cs.LG

    A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training

    Authors: Bilal Faye, Binta Sow, Hanane Azzag, Mustapha Lebbah

    Abstract: Object detection is a fundamental challenge in computer vision, centered on recognizing objects within images, with diverse applications in areas like image analysis, robotics, and autonomous vehicles. Although existing methods have achieved great success, they are often constrained by a fixed vocabulary of objects. To overcome this limitation, approaches like MDETR have redefined object detection…

    Submitted 22 October, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

  8. arXiv:2405.17027  [pdf, other]

    cs.LG

    Supervised Batch Normalization

    Authors: Bilal Faye, Mustapha Lebbah, Hanane Azzag

    Abstract: Batch Normalization (BN), a widely-used technique in neural networks, enhances generalization and expedites training by normalizing each mini-batch to the same mean and variance. However, its effectiveness diminishes when confronted with diverse data distributions. To address this challenge, we propose Supervised Batch Normalization (SBN), a pioneering approach. We expand normalization beyond trad…

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2403.16798  [pdf, other]

    cs.LG cs.AI cs.NE

    Cluster-Based Normalization Layer for Neural Networks

    Authors: Bilal Faye, Hanane Azzag, Mustapha Lebbah

    Abstract: Deep learning grapples with challenges in training neural networks, notably internal covariate shift and label shift. Conventional normalization techniques like Batch Normalization (BN) partially mitigate these issues but are hindered by constraints such as dependency on batch size and distribution assumptions. Similarly, mixture normalization (MN) encounters computational barriers in handling div…

    Submitted 18 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  10. arXiv:2403.04650  [pdf, other]

    cs.LG cs.AI

    Lightweight Cross-Modal Representation Learning

    Authors: Bilal Faye, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra

    Abstract: Low-cost cross-modal representation learning is crucial for deriving semantic representations across diverse modalities such as text, audio, images, and video. Traditional approaches typically depend on large specialized models trained from scratch, requiring extensive datasets and resulting in high resource and time costs. To overcome these challenges, we introduce a novel approach named Lightwei…

    Submitted 7 September, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Journal ref: ESANN 2024

  11. arXiv:2402.01050  [pdf, other]

    stat.ML cs.LG stat.CO

    Distributed MCMC inference for Bayesian Non-Parametric Latent Block Model

    Authors: Reda Khoufache, Anisse Belhadj, Hanene Azzag, Mustapha Lebbah

    Abstract: In this paper, we introduce a novel Distributed Markov Chain Monte Carlo (MCMC) inference method for the Bayesian Non-Parametric Latent Block Model (DisNPLBM), employing the Master/Worker architecture. Our non-parametric co-clustering algorithm divides observations and features into partitions using latent multivariate Gaussian block distributions. The workload on rows is evenly distributed among…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to PaKDD 2024

  12. arXiv:2312.11169  [pdf, other]

    stat.ML cs.LG

    Distributed Collapsed Gibbs Sampler for Dirichlet Process Mixture Models in Federated Learning

    Authors: Reda Khoufache, Mustapha Lebbah, Hanene Azzag, Etienne Goffinet, Djamel Bouchaffra

    Abstract: Dirichlet Process Mixture Models (DPMMs) are widely used to address clustering problems. Their main advantage lies in their ability to automatically estimate the number of clusters during the inference process through the Bayesian non-parametric framework. However, the inference becomes considerably slow as the dataset size increases. This paper proposes a new distributed Markov Chain Monte Carlo…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to SDM 2024

  13. arXiv:2309.17383  [pdf, other]

    cs.DC cs.LG

    Parallel Computation of Multi-Slice Clustering of Third-Order Tensors

    Authors: Dina Faneva Andriantsiory, Camille Coti, Joseph Ben Geloun, Mustapha Lebbah

    Abstract: Machine Learning approaches like clustering methods deal with massive datasets that present an increasing challenge. We devise parallel algorithms to compute the Multi-Slice Clustering (MSC) for 3rd-order tensors. The MSC method is based on spectral analysis of the tensor slices and works independently on each tensor mode. Such features fit well in the parallel paradigm via a distributed memory sy…

    Submitted 29 September, 2023; originally announced September 2023.

  14. arXiv:2305.11684  [pdf, other]

    cs.LG

    Self-Reinforcement Attention Mechanism For Tabular Learning

    Authors: Kodjo Mawuena Amekoe, Mohamed Djallel Dilmi, Hanene Azzag, Mustapha Lebbah, Zaineb Chelly Dagdia, Gregoire Jaffre

    Abstract: Apart from the high accuracy of machine learning models, what interests many researchers in real-life problems (e.g., fraud detection, credit scoring) is to find hidden patterns in data; particularly when dealing with their challenging imbalanced characteristics. Interpretability is also a key requirement that needs to accompany the used machine learning model. In this concern, often, intrinsicall…

    Submitted 19 May, 2023; originally announced May 2023.

  15. arXiv:2303.10154  [pdf, other]

    cs.NE cs.LG math.OC

    Epigenetics Algorithms: Self-Reinforcement-Attention mechanism to regulate chromosomes expression

    Authors: Mohamed Djallel Dilmi, Hanene Azzag, Mustapha Lebbah

    Abstract: Genetic algorithms are a well-known example of bio-inspired heuristic methods. They mimic natural selection by modeling several operators such as mutation, crossover, and selection. Recent discoveries about Epigenetics regulation processes that occur "on top of" or "in addition to" the genetic basis for inheritance involve changes that affect and improve gene expression. They raise the question of…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: submitted for GECCO conference

  16. arXiv:2303.07768  [pdf, other]

    cs.LG stat.ML

    DBSCAN of Multi-Slice Clustering for Third-Order Tensors

    Authors: Dina Faneva Andriantsiory, Joseph Ben Geloun, Mustapha Lebbah

    Abstract: Several methods for triclustering three-dimensional data require the cluster size or the number of clusters in each dimension to be specified. To address this issue, the Multi-Slice Clustering (MSC) for 3-order tensor finds signal slices that lie in a low dimensional subspace for a rank-one tensor dataset in order to find a cluster based on the threshold similarity. We propose an extension algorit…

    Submitted 24 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 13 pages, improved version, typos removed, text restructured, same results

  17. arXiv:2303.07757  [pdf, other]

    cs.LG stat.ML

    Multiway clustering of 3-order tensor via affinity matrix

    Authors: Dina Faneva Andriantsiory, Joseph Ben Geloun, Mustapha Lebbah

    Abstract: We propose a new method of multiway clustering for 3-order tensors via affinity matrix (MCAM). Based on a notion of similarity between the tensor slices and the spread of information of each slice, our model builds an affinity/similarity matrix on which we apply advanced clustering methods. The combination of all clusters of the three modes delivers the desired multiway clustering. Finally, MCAM a…

    Submitted 14 March, 2023; originally announced March 2023.

  18. Context Normalization Layer with Applications

    Authors: Bilal Faye, Mohamed-Djallel Dilmi, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra

    Abstract: Normalization is a pre-processing step that converts the data into a more usable representation. As part of the deep neural networks (DNNs), the batch normalization (BN) technique uses normalization to address the problem of internal covariate shift. It can be packaged as general modules, which have been extensively integrated into various DNNs, to stabilize and accelerate training, presumably lea…

    Submitted 2 February, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

  19. arXiv:2210.02089  [pdf]

    cs.LG cs.AI

    Transformer-based conditional generative adversarial network for multivariate time series generation

    Authors: Abdellah Madane, Mohamed-djallel Dilmi, Florent Forest, Hanane Azzag, Mustapha Lebbah, Jerome Lacaille

    Abstract: Conditional generation of time-dependent data is a task that has much interest, whether for data augmentation, scenario simulation, completing missing data, or other purposes. Recent works proposed a Transformer-based Time series generative adversarial network (TTS-GAN) to address the limitations of recurrent neural networks. However, this model assumes a unimodal distribution and tries to generat…

    Submitted 5 October, 2022; originally announced October 2022.

  20. arXiv:2201.05079  [pdf, other]

    cs.LG cs.DB

    Improved Multi-objective Data Stream Clustering with Time and Memory Optimization

    Authors: Mohammed Oualid Attaoui, Hanene Azzag, Mustapha Lebbah, Nabil Keskes

    Abstract: The analysis of data streams has received considerable attention over the past few decades due to sensors, social media, etc. It aims to recognize patterns in an unordered, infinite, and evolving stream of observations. Clustering this type of data requires some restrictions in time and memory. This paper introduces a new data stream clustering method (IMOC-Stream). This method, unlike the other c…

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 37 pages, 12 figures, 8 tables

  21. arXiv:2109.13027  [pdf, other]

    cs.CV

    Experience feedback using Representation Learning for Few-Shot Object Detection on Aerial Images

    Authors: Pierre Le Jeune, Mustapha Lebbah, Anissa Mokraoui, Hanene Azzag

    Abstract: This paper proposes a few-shot method based on Faster R-CNN and representation learning for object detection in aerial images. The two classification branches of Faster R-CNN are replaced by prototypical networks for online adaptation to new classes. These networks produce embeddings vectors for each generated box, which are then compared with class prototypes. The distance between an embedding an…

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 11 pages, 2 figures, accepted at ICMLA 2021 as a short paper

  22. arXiv:2109.10803  [pdf, other]

    cs.LG stat.ML

    Multi-Slice Clustering for 3-order Tensor Data

    Authors: Dina Faneva Andriantsiory, Joseph Ben Geloun, Mustapha Lebbah

    Abstract: Several methods of triclustering of three dimensional data require the specification of the cluster size in each dimension. This introduces a certain degree of arbitrariness. To address this issue, we propose a new method, namely the multi-slice clustering (MSC) for a 3-order tensor data set. We analyse, in each dimension or tensor mode, the spectral decomposition of each tensor slice, i.e. a matr…

    Submitted 22 September, 2021; originally announced September 2021.

  23. arXiv:2011.05847  [pdf, other]

    cs.NE cs.LG

    A Survey and Implementation of Performance Metrics for Self-Organized Maps

    Authors: Florent Forest, Mustapha Lebbah, Hanane Azzag, Jérôme Lacaille

    Abstract: Self-Organizing Map algorithms have been used for almost 40 years across various application domains such as biology, geology, healthcare, industry and humanities as an interpretable tool to explore, cluster and visualize high-dimensional data sets. In every application, practitioners need to know whether they can trust the resulting mapping, and perform model selection to tune algorithm…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 9 pages

    MSC Class: 62H30; 68T07 ACM Class: I.5.3; I.2.6

  24. arXiv:2008.00946  [pdf, other]

    cs.LG stat.ML

    Conditional Latent Block Model: a Multivariate Time Series Clustering Approach for Autonomous Driving Validation

    Authors: Etienne Goffinet, Anthony Coutant, Mustapha Lebbah, Hanane Azzag, Loïc Giraldi

    Abstract: Autonomous driving systems validation remains one of the biggest challenges car manufacturers must tackle in order to provide safe driverless cars. The high complexity stems from several factors: the multiplicity of vehicles, embedded systems, use cases, and the very high required level of reliability for the driving system to be at least as safe as a human driver. In order to circumvent these iss…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: 17 pages, 15 figures

  25. arXiv:2006.08530  [pdf, other]

    cs.LG stat.ML

    Selecting the Number of Clusters $K$ with a Stability Trade-off: an Internal Validation Criterion

    Authors: Alex Mourer, Florent Forest, Mustapha Lebbah, Hanane Azzag, Jérôme Lacaille

    Abstract: Model selection is a major challenge in non-parametric clustering. There is no universally admitted way to evaluate clustering results for the obvious reason that no ground truth is available. The difficulty to find a universal evaluation criterion is a consequence of the ill-defined objective of clustering. In this perspective, clustering stability has emerged as a natural and model-agnostic prin…

    Submitted 16 May, 2023; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted at PAKDD 2023

    MSC Class: 62H30 (Primary) 68T10 (Secondary) ACM Class: I.5.3

  26. arXiv:1903.04042  [pdf, other]

    cs.LG stat.ML

    Algorithms for an Efficient Tensor Biclustering

    Authors: Andriantsiory Dina Faneva, Mustapha Lebbah, Hanane Azzag, Gaël Beck

    Abstract: Consider a data set collected by (individuals-features) pairs in different times. It can be represented as a tensor of three dimensions (Individuals, features and times). The tensor biclustering problem computes a subset of individuals and a subset of features whose signal trajectories over time lie in a low-dimensional subspace, modeling similarity among the signal trajectories while allowing dif…

    Submitted 10 March, 2019; originally announced March 2019.

    Comments: Algorithms available on Clustering4Ever github, https://github.com/Clustering4Ever/Clustering4Ever

  27. arXiv:1902.04181  [pdf, other]

    cs.LG cs.AI stat.ML

    Nearest Neighbor Median Shift Clustering for Binary Data

    Authors: Gaël Beck, Tarn Duong, Mustapha Lebbah, Hanane Azzag

    Abstract: We describe in this paper the theory and practice behind a new modal clustering method for binary data. Our approach (BinNNMS) is based on the nearest neighbor median shift. The median shift is an extension of the well-known mean shift, which was designed for continuous data, to handle binary data. We demonstrate that BinNNMS can discover accurately the location of clusters in binary data with the…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Algorithms are available at https://github.com/Clustering4Ever/Clustering4Ever

  28. arXiv:1902.03833  [pdf, other]

    cs.LG cs.AI cs.DC stat.ML

    A Distributed and Approximated Nearest Neighbors Algorithm for an Efficient Large Scale Mean Shift Clustering

    Authors: Gaël Beck, Tarn Duong, Mustapha Lebbah, Hanane Azzag, Christophe Cérin

    Abstract: In this paper we target the class of modal clustering methods where clusters are defined in terms of the local modes of the probability density function which generates the data. The most well-known modal clustering method is the k-means clustering. Mean Shift clustering is a generalization of the k-means clustering which computes arbitrarily shaped clusters as defined as the basins of attraction…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Algorithms are available at https://github.com/Clustering4Ever/Clustering4Ever