Neural and Evolutionary Computing (cs.NE)

High-Order Associative Learning Based on Memristive Circuits for Efficient Learning
Shengbo Wang, Xuemeng Li, Jialin Ding, Weihao Ma, Ying Wang, Luigi Occhipinti, Arokia Nathan, Shuo Gao
Oct 23 2024 cs.NE eess.SP physics.app-ph arXiv:2410.16734v1

@misc{2410.16734, author = {Shengbo Wang and Xuemeng Li and Jialin Ding and Weihao Ma and Ying Wang and Luigi Occhipinti and Arokia Nathan and Shuo Gao}, title = {{H}igh-{O}rder {A}ssociative {L}earning {B}ased on {M}emristive {C}ircuits for {E}fficient {L}earning}, year = {2024}, eprint = {2410.16734}, note = {arXiv:2410.16734v1} }
PDF
Memristive associative learning has gained significant attention for its ability to mimic fundamental biological learning mechanisms while maintaining system simplicity. In this work, we introduce a high-order memristive associative learning framework with a biologically realistic structure. By utilizing memristors as synaptic modules and their state information to bridge different orders of associative learning, our design effectively establishes associations between multiple stimuli and replicates the transient nature of high-order associative learning. In Pavlov's classical conditioning experiments, our design achieves a 230% improvement in learning efficiency compared to previous works, with memristor power consumption in the synaptic modules remaining below 11 \muW. In large-scale image recognition tasks, we utilize a 20*20 memristor array to represent images, enabling the system to recognize and label test images with semantic information at 100% accuracy. This scalability across different tasks highlights the framework's potential for a wide range of applications, offering enhanced learning efficiency for current memristor-based neuromorphic systems.
The Neuromorphic Analog Electronic Nose
Shavika Rastogi, Nik Dennler, Michael Schmuker, André van Schaik
Oct 23 2024 cs.NE arXiv:2410.16677v1

@misc{2410.16677, author = {Shavika Rastogi and Nik Dennler and Michael Schmuker and André van Schaik}, title = {{T}he {N}euromorphic {A}nalog {E}lectronic {N}ose}, year = {2024}, eprint = {2410.16677}, note = {arXiv:2410.16677v1} }
PDF
Rapid detection of gas concentration is important in different domains like gas leakage monitoring, pollution control, and so on, for the prevention of health hazards. Out of different types of gas sensors, Metal oxide (MOx) sensors are extensively used in such applications because of their portability, low cost, and high sensitivity for specific gases. However, how to effectively sample the MOx data for the real-time detection of gas and its concentration level remains an open question. Here we introduce a simple analog front-end for one MOx sensor that encodes the gas concentration in the time difference between pulses of two separate pathways. This front-end design is inspired by the spiking output of a mammalian olfactory bulb. We show that for a gas pulse injected in a constant airflow, the time difference between pulses decreases with increasing gas concentration, similar to the spike time difference between the two principal output neurons in the olfactory bulb. The circuit design is further extended to a MOx sensor array and this sensor array front-end was tested in the same environment for gas identification and concentration estimation. Encoding of gas stimulus features in analog spikes at the sensor level itself may result in data and power-efficient real-time gas sensing systems in the future that can ultimately be used in uncontrolled and turbulent environments for longer periods without data explosion.
Real-time Sub-milliwatt Epilepsy Detection Implemented on a Spiking Neural Network Edge Inference Processor
Ruixin Lia, Guoxu Zhaoa, Dylan Richard Muir, Yuya Ling, Karla Burelo, Mina Khoei, Dong Wang, Yannan Xing, Ning Qiao
Oct 23 2024 eess.SP cs.LG cs.NE q-bio.NC arXiv:2410.16613v1

@misc{2410.16613, author = {Ruixin Lia and Guoxu Zhaoa and Dylan Richard Muir and Yuya Ling and Karla Burelo and Mina Khoei and Dong Wang and Yannan Xing and Ning Qiao}, title = {{R}eal-time {S}ub-milliwatt {E}pilepsy {D}etection {I}mplemented on a {S}piking {N}eural {N}etwork {E}dge {I}nference {P}rocessor}, year = {2024}, eprint = {2410.16613}, howpublished = {Computers in Biology and Medicine(2024), 183, 109225}, doi = {10.1016/j.compbiomed.2024.109225}, note = {arXiv:2410.16613v1} }
PDF
Analyzing electroencephalogram (EEG) signals to detect the epileptic seizure status of a subject presents a challenge to existing technologies aimed at providing timely and efficient diagnosis. In this study, we aimed to detect interictal and ictal periods of epileptic seizures using a spiking neural network (SNN). Our proposed approach provides an online and real-time preliminary diagnosis of epileptic seizures and helps to detect possible pathological conditions.To validate our approach, we conducted experiments using multiple datasets. We utilized a trained SNN to identify the presence of epileptic seizures and compared our results with those of related studies. The SNN model was deployed on Xylo, a digital SNN neuromorphic processor designed to process temporal signals. Xylo efficiently simulates spiking leaky integrate-and-fire neurons with exponential input synapses. Xylo has much lower energy requirments than traditional approaches to signal processing, making it an ideal platform for developing low-power seizure detection systems.Our proposed method has a high test accuracy of 93.3% and 92.9% when classifying ictal and interictal periods. At the same time, the application has an average power consumption of 87.4 uW(IO power) + 287.9 uW(computational power) when deployed to Xylo. Our method demonstrates excellent low-latency performance when tested on multiple datasets. Our work provides a new solution for seizure detection, and it is expected to be widely used in portable and wearable devices in the future.
Spiking Neural Networks as a Controller for Emergent Swarm Agents
Kevin Zhu, Connor Mattson, Shay Snyder, Ricardo Vega, Daniel S. Brown, Maryam Parsa, Cameron Nowzari
Oct 23 2024 cs.NE cs.MA cs.SY eess.SY arXiv:2410.16175v1

@misc{2410.16175, author = {Kevin Zhu and Connor Mattson and Shay Snyder and Ricardo Vega and Daniel S.~Brown and Maryam Parsa and Cameron Nowzari}, title = {{S}piking {N}eural {N}etworks as a {C}ontroller for {E}mergent {S}warm {A}gents}, year = {2024}, eprint = {2410.16175}, note = {arXiv:2410.16175v1} }
PDF
Drones which can swarm and loiter in a certain area cost hundreds of dollars, but mosquitos can do the same and are essentially worthless. To control swarms of low-cost robots, researchers may end up spending countless hours brainstorming robot configurations and policies to ``organically" create behaviors which do not need expensive sensors and perception. Existing research explores the possible emergent behaviors in swarms of robots with only a binary sensor and a simple but hand-picked controller structure. Even agents in this highly limited sensing, actuation, and computational capability class can exhibit relatively complex global behaviors such as aggregation, milling, and dispersal, but finding the local interaction rules that enable more collective behaviors remains a significant challenge. This paper investigates the feasibility of training spiking neural networks to find those local interaction rules that result in particular emergent behaviors. In this paper, we focus on simulating a specific milling behavior already known to be producible using very simple binary sensing and acting agents. To do this, we use evolutionary algorithms to evolve not only the parameters (the weights, biases, and delays) of a spiking neural network, but also its structure. To create a baseline, we also show an evolutionary search strategy over the parameters for the incumbent hand-picked binary controller structure. Our simulations show that spiking neural networks can be evolved in binary sensing agents to form a mill.
Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network
Suman Sapkota
Oct 23 2024 cs.LG cs.CV cs.NE arXiv:2410.16159v1

@misc{2410.16159, author = {Suman Sapkota}, title = {{M}etric as {T}ransform: {E}xploring beyond {A}ffine {T}ransform for {I}nterpretable {N}eural {N}etwork}, year = {2024}, eprint = {2410.16159}, note = {arXiv:2410.16159v1} }
PDF
Artificial Neural Networks of varying architectures are generally paired with affine transformation at the core. However, we find dot product neurons with global influence less interpretable as compared to local influence of euclidean distance (as used in Radial Basis Function Network). In this work, we explore the generalization of dot product neurons to $l^p$-norm, metrics, and beyond. We find that metrics as transform performs similarly to affine transform when used in MultiLayer Perceptron or Convolutional Neural Network. Moreover, we explore various properties of Metrics, compare it with Affine, and present multiple cases where metrics seem to provide better interpretability. We develop an interpretable local dictionary based Neural Networks and use it to understand and reject adversarial examples.
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance
Mostafa Hussien, Mahmoud Afifi, Kim Khoa Nguyen, Mohamed Cheriet
Oct 23 2024 cs.LG cs.AI cs.NE arXiv:2410.16151v1

@misc{2410.16151, author = {Mostafa Hussien and Mahmoud Afifi and Kim Khoa Nguyen and Mohamed Cheriet}, title = {{S}mall {C}ontributions, {S}mall {N}etworks: {E}fficient {N}eural {N}etwork {P}runing {B}ased on {R}elative {I}mportance}, year = {2024}, eprint = {2410.16151}, note = {arXiv:2410.16151v1} }
PDF
Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges due to substantial storage and computational requirements. Neural network pruning has emerged as an effective technique to mitigate these limitations by reducing model size and complexity. In this paper, we introduce an intuitive and interpretable pruning method based on activation statistics, rooted in information theory and statistical analysis. Our approach leverages the statistical properties of neuron activations to identify and remove weights with minimal contributions to neuron outputs. Specifically, we build a distribution of weight contributions across the dataset and utilize its parameters to guide the pruning process. Furthermore, we propose a Pruning-aware Training strategy that incorporates an additional regularization term to enhance the effectiveness of our pruning method. Extensive experiments on multiple datasets and network architectures demonstrate that our method consistently outperforms several baseline and state-of-the-art pruning techniques.
Steering Large Language Models using Conceptors: Improving Addition-Based Activation Engineering
Joris Postmus, Steven Abreu
Oct 23 2024 cs.NE cs.LG arXiv:2410.16314v1

@misc{2410.16314, author = {Joris Postmus and Steven Abreu}, title = {{S}teering {L}arge {L}anguage {M}odels using {C}onceptors: {I}mproving {A}ddition-{B}ased {A}ctivation {E}ngineering}, year = {2024}, eprint = {2410.16314}, note = {arXiv:2410.16314v1} }
PDF
Large language models have transformed AI, yet reliably controlling their outputs remains a challenge. This paper explores activation engineering, where outputs of pre-trained LLMs are controlled by manipulating their activations at inference time. Unlike traditional methods using a single steering vector, we introduce conceptors - mathematical constructs that represent sets of activation vectors as ellipsoidal regions. Conceptors act as soft projection matrices and offer more precise control over complex activation patterns. Our experiments demonstrate that conceptors outperform traditional methods across multiple in-context learning steering tasks. We further use Boolean operations on conceptors that allows for combined steering goals that empirically outperforms combining steering vectors on a set of tasks. These results highlight conceptors as a promising tool for more effective steering of LLMs.
In-the-loop Hyper-Parameter Optimization for LLM-Based Automated Design of Heuristics
Niki van Stein, Diederick Vermetten, Thomas Bäck
Oct 23 2024 cs.NE cs.AI arXiv:2410.16309v1

@misc{2410.16309, author = {Niki van Stein and Diederick Vermetten and Thomas Bäck}, title = {{I}n-the-loop {H}yper-{P}arameter {O}ptimization for {LLM}-{B}ased {A}utomated {D}esign of {H}euristics}, year = {2024}, eprint = {2410.16309}, note = {arXiv:2410.16309v1} }
PDF
Large Language Models (LLMs) have shown great potential in automatically generating and optimizing (meta)heuristics, making them valuable tools in heuristic optimization tasks. However, LLMs are generally inefficient when it comes to fine-tuning hyper-parameters of the generated algorithms, often requiring excessive queries that lead to high computational and financial costs. This paper presents a novel hybrid approach, LLaMEA-HPO, which integrates the open source LLaMEA (Large Language Model Evolutionary Algorithm) framework with a Hyper-Parameter Optimization (HPO) procedure in the loop. By offloading hyper-parameter tuning to an HPO procedure, the LLaMEA-HPO framework allows the LLM to focus on generating novel algorithmic structures, reducing the number of required LLM queries and improving the overall efficiency of the optimization process. We empirically validate the proposed hybrid framework on benchmark problems, including Online Bin Packing, Black-Box Optimization, and the Traveling Salesperson Problem. Our results demonstrate that LLaMEA-HPO achieves superior or comparable performance compared to existing LLM-driven frameworks while significantly reducing computational costs. This work highlights the importance of separating algorithmic innovation and structural code search from parameter tuning in LLM-driven code optimization and offers a scalable approach to improve the efficiency and effectiveness of LLM-based code generation.
Hardware-Software Co-optimised Fast and Accurate Deep Reconfigurable Spiking Inference Accelerator Architecture Design Methodology
Anagha Nimbekar, Prabodh Katti, Chen Li, Bashir M. Al-Hashimi, Amit Acharyya, Bipin Rajendran
Oct 23 2024 cs.NE arXiv:2410.16298v1

@misc{2410.16298, author = {Anagha Nimbekar and Prabodh Katti and Chen Li and Bashir M.~Al-Hashimi and Amit Acharyya and Bipin Rajendran}, title = {{H}ardware-{S}oftware {C}o-optimised {F}ast and {A}ccurate {D}eep {R}econfigurable {S}piking {I}nference {A}ccelerator {A}rchitecture {D}esign {M}ethodology}, year = {2024}, eprint = {2410.16298}, note = {arXiv:2410.16298v1} }
PDF
Spiking Neural Networks (SNNs) have emerged as a promising approach to improve the energy efficiency of machine learning models, as they naturally implement event-driven computations while avoiding expensive multiplication operations. In this paper, we develop a hardware-software co-optimisation strategy to port software-trained deep neural networks (DNN) to reduced-precision spiking models demonstrating fast and accurate inference in a novel event-driven CMOS reconfigurable spiking inference accelerator. Experimental results show that a reduced-precision Resnet-18 and VGG-11 SNN models achieves classification accuracy within 1% of the baseline full-precision DNN model within 8 spike timesteps. We also demonstrate an FPGA prototype implementation of the spiking inference accelerator with a throughput of 38.4 giga operations per second (GOPS) consuming 1.54 Watts on PYNQ-Z2 FPGA. This corresponds to 0.6 GOPS per processing element and 2.25,GOPS/DSP slice, which is 2x and 4.5x higher utilisation efficiency respectively compared to the state-of-the-art. Our co-optimisation strategy can be employed to develop deep reduced precision SNN models and port them to resource-efficient event-driven hardware accelerators for edge applications.

Recent comments