subscribe to arXiv mailings

Field-Aligned Current Structures during the Terrestrial Magnetosphere's Transformation into Alfven Wings and Recovery

Authors: Jason M. H. Beedle, Li-Jen Chen, Jason R. Shuster, Harsha Gurram, Dan J. Gershman, Yuxi Chen, Rachel C. Rice, Brandon L. Burkholder, Akhtar S. Ardakani, Kevin J. Genestreti, Roy B. Torbert

Abstract: On April 24th, 2023, a CME event caused the solar wind to become sub-Alfvenic, leading to the development of an Alfven Wing configuration in the Earth's Magnetosphere. Alfven Wings have previously been observed as cavities of low flow in Jupiter's magnetosphere, but the observing satellites did not have the ability to directly measure the Alfven Wings' current structures. Through in situ measureme… ▽ More On April 24th, 2023, a CME event caused the solar wind to become sub-Alfvenic, leading to the development of an Alfven Wing configuration in the Earth's Magnetosphere. Alfven Wings have previously been observed as cavities of low flow in Jupiter's magnetosphere, but the observing satellites did not have the ability to directly measure the Alfven Wings' current structures. Through in situ measurements made by the Magnetospheric Multiscale (MMS) spacecraft, the April 24th event provides us with the first direct measurements of current structures during an Alfven Wing configuration. We have found two distinct types of current structures associated with the Alfven Wing transformation as well as the magnetosphere recovery. These structures are observed to be significantly more anti-field-aligned and electron-driven than typical magnetopause currents, indicating the disruptions caused to the magnetosphere current system by the Alfven Wing formation. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.08091 [pdf]

Earth's Alfvén wings driven by the April 2023 Coronal Mass Ejection

Authors: Li-Jen Chen, Daniel Gershman, Brandon Burkholder, Yuxi Chen, Menelaos Sarantos, Lan Jian, James Drake, Chuanfei Dong, Harsha Gurram, Jason Shuster, Daniel Graham, Olivier Le Contel, Steven Schwartz, Stephen Fuselier, Hadi Madanian, Craig Pollock, Haoming Liang, Matthew Argall, Richard Denton, Rachel Rice, Jason Beedle, Kevin Genestreti, Akhtar Ardakani, Adam Stanier, Ari Le , et al. (11 additional authors not shown)

Abstract: We report a rare regime of Earth's magnetosphere interaction with sub-Alfvénic solar wind in which the windsock-like magnetosphere transforms into one with Alfvén wings. In the magnetic cloud of a Coronal Mass Ejection (CME) on April 24, 2023, NASA's Magnetospheric Multiscale mission distinguishes the following features: (1) unshocked and accelerated cold CME plasma coming directly against Earth's… ▽ More We report a rare regime of Earth's magnetosphere interaction with sub-Alfvénic solar wind in which the windsock-like magnetosphere transforms into one with Alfvén wings. In the magnetic cloud of a Coronal Mass Ejection (CME) on April 24, 2023, NASA's Magnetospheric Multiscale mission distinguishes the following features: (1) unshocked and accelerated cold CME plasma coming directly against Earth's dayside magnetosphere; (2) dynamical wing filaments representing new channels of magnetic connection between the magnetosphere and foot points of the Sun's erupted flux rope; (3) cold CME ions observed with energized counter-streaming electrons, evidence of CME plasma captured due to reconnection between magnetic-cloud and Alfvén-wing field lines. The reported measurements advance our knowledge of CME interaction with planetary magnetospheres, and open new opportunities to understand how sub-Alfvénic plasma flows impact astrophysical bodies such as Mercury, moons of Jupiter, and exoplanets close to their host stars. △ Less

Submitted 3 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: 14 pages, including 4 figures, Under review in Geophys. Res. Lett

arXiv:2312.16620 [pdf, other]

Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning

Authors: Amin Jalal Aghdasian, Amirhossein Heydarian Ardakani, Kianoush Aqabakee, Farzaneh Abdollahi

Abstract: This paper proposes a novel approach by integrating sensor fusion with deep reinforcement learning, specifically the Soft Actor-Critic (SAC) algorithm, to develop an optimal control policy for self-driving cars. Our system employs a two-branch fusion method for vehicle image and tracking sensor data, leveraging the strengths of residual structures and identity mapping to enhance agent training. Th… ▽ More This paper proposes a novel approach by integrating sensor fusion with deep reinforcement learning, specifically the Soft Actor-Critic (SAC) algorithm, to develop an optimal control policy for self-driving cars. Our system employs a two-branch fusion method for vehicle image and tracking sensor data, leveraging the strengths of residual structures and identity mapping to enhance agent training. Through comprehensive comparisons, we demonstrate the efficacy of information fusion and establish the superiority of our selected algorithm over alternative approaches. Our work advances the field of autonomous driving and demonstrates the potential of reinforcement learning in enabling intelligent vehicle decision-making. △ Less

Submitted 27 December, 2023; originally announced December 2023.

Comments: Accepted Paper on International Conference on Robotics and Mechatronics (ICROM, 2023)

arXiv:2305.18513 [pdf, ps, other]

SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics

Authors: Arash Ardakani, Altan Haan, Shangyin Tan, Doru Thom Popovici, Alvin Cheung, Costin Iancu, Koushik Sen

Abstract: Transformer-based models, such as BERT and ViT, have achieved state-of-the-art results across different natural language processing (NLP) and computer vision (CV) tasks. However, these models are extremely memory intensive during their fine-tuning process, making them difficult to deploy on GPUs with limited memory resources. To address this issue, we introduce a new tool called SlimFit that reduc… ▽ More Transformer-based models, such as BERT and ViT, have achieved state-of-the-art results across different natural language processing (NLP) and computer vision (CV) tasks. However, these models are extremely memory intensive during their fine-tuning process, making them difficult to deploy on GPUs with limited memory resources. To address this issue, we introduce a new tool called SlimFit that reduces the memory requirements of these models by dynamically analyzing their training dynamics and freezing less-contributory layers during fine-tuning. The layers to freeze are chosen using a runtime inter-layer scheduling algorithm. SlimFit adopts quantization and pruning for particular layers to balance the load of dynamic activations and to minimize the memory footprint of static activations, where static activations refer to those that cannot be discarded regardless of freezing. This allows SlimFit to freeze up to 95% of layers and reduce the overall on-device GPU memory usage of transformer-based models such as ViT and BERT by an average of 2.2x, across different NLP and CV benchmarks/datasets such as GLUE, SQuAD 2.0, CIFAR-10, CIFAR-100 and ImageNet with an average degradation of 0.2% in accuracy. For such NLP and CV tasks, SlimFit can reduce up to 3.1x the total on-device memory usage with an accuracy degradation of only up to 0.4%. As a result, while fine-tuning of ViT on ImageNet and BERT on SQuAD 2.0 with a batch size of 128 requires 3 and 2 32GB GPUs respectively, SlimFit enables their fine-tuning on a single 32GB GPU without any significant accuracy degradation. △ Less

Submitted 29 May, 2023; originally announced May 2023.

arXiv:2202.12422 [pdf, other]

Standard Deviation-Based Quantization for Deep Neural Networks

Authors: Amir Ardakani, Arash Ardakani, Brett Meyer, James J. Clark, Warren J. Gross

Abstract: Quantization of deep neural networks is a promising approach that reduces the inference cost, making it feasible to run deep networks on resource-restricted devices. Inspired by existing methods, we propose a new framework to learn the quantization intervals (discrete values) using the knowledge of the network's weight and activation distributions, i.e., standard deviation. Furthermore, we propose… ▽ More Quantization of deep neural networks is a promising approach that reduces the inference cost, making it feasible to run deep networks on resource-restricted devices. Inspired by existing methods, we propose a new framework to learn the quantization intervals (discrete values) using the knowledge of the network's weight and activation distributions, i.e., standard deviation. Furthermore, we propose a novel base-2 logarithmic quantization scheme to quantize weights to power-of-two discrete values. Our proposed scheme allows us to replace resource-hungry high-precision multipliers with simple shift-add operations. According to our evaluations, our method outperforms existing work on CIFAR10 and ImageNet datasets and even achieves better accuracy performance with 3-bit weights and activations when compared to the full-precision models. Moreover, our scheme simultaneously prunes the network's parameters and allows us to flexibly adjust the pruning ratio during the quantization process. △ Less

Submitted 24 February, 2022; originally announced February 2022.

arXiv:2107.01811 [pdf]

doi 10.1016/j.ijleo.2022.169048

Optical amplification of surface plasmon polaritons in a graphene single layer integrated with a random grating

Authors: Abbas Ghasempour Ardakani, Peymaneh Rafieipour

Abstract: In this paper, we design and simulate a terahertz (THz) controllable active plasmonic waveguide structure based on a single graphene layer that is placed on a random silicon grating substrate. Optical gain in the proposed THz active plasmonic waveguide structure is provided by the stimulated emission process in the photoexcited graphene monolayer that leads to the amplification of surface plasmon… ▽ More In this paper, we design and simulate a terahertz (THz) controllable active plasmonic waveguide structure based on a single graphene layer that is placed on a random silicon grating substrate. Optical gain in the proposed THz active plasmonic waveguide structure is provided by the stimulated emission process in the photoexcited graphene monolayer that leads to the amplification of surface plasmon polariton (SPP) waves. We use a random grating substrate to introduce Anderson localization of the SPP waves propagating through the graphene monolayer to enhance their optical amplification at resonant frequencies. It is shown that the enhancement factor of the resonant peaks corresponding to the graphene SPPs can be as high as 175. We also analyze their corresponding field intensity distributions along the graphene monolayer and find out that their intensities and localization positions are different from each other. By investigating the pump dependent and temperature dependent variations of the transmittance of the structure, it is shown that the resonant peak frequencies are blue-shifted by increasing the temperature and the external pump intensity. Also, we show that increasing the ambient temperature by 60 K can dramatically reduce the output amplified intensity by a factor of 70. This property of the proposed graphene-based THz plasmonic waveguide structure makes it useful in temperature sensing applications and on/off switchable laser devices. △ Less

Submitted 4 October, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

arXiv:1907.03003 [pdf]

doi 10.1088/1555-6611/abbe91

Resonant random laser emission from graphene quantum dot doped dye solutions

Authors: Peymaneh Rafieipour, Abbas Ghasempour Ardakani, Fatemeh Daneshmand

Abstract: Graphene quantum dots (GQDs) are more promising than other kinds of semiconductor QDs because of their photostability and biocompatibility in different applications such as bioimaging, biosensing and light emitting diodes (LEDs). In addition, advances in random lasers (RLs) have led to an emerging desire for developing remote sensing and detecting strategies, lightning and imaging systems that are… ▽ More Graphene quantum dots (GQDs) are more promising than other kinds of semiconductor QDs because of their photostability and biocompatibility in different applications such as bioimaging, biosensing and light emitting diodes (LEDs). In addition, advances in random lasers (RLs) have led to an emerging desire for developing remote sensing and detecting strategies, lightning and imaging systems that are far cheaper, more precise and simpler. Although combining GQDs and RLs seems promising for the development of advanced biosensing and bioimaging systems, the RLs fabricated based on GQDs have been rarely studied. Here, we report on the fabrication of dye doped GQDs RLs with resonant feedback that are pumped optically with nanosecond pulses. GQDs, synthesized by the pyrolysis of citric acid, are used as scattering centers in an ethylene glycol solution of rhodamine B dye. It is demonstrated experimentally that discrete lasing modes with subnanometer linewidths appear at pump fluences above the threshold. Furthermore, the dependence of random lasing emission characteristics on the concentration of GQDs and the pump position is investigated experimentally. △ Less

Submitted 5 July, 2019; originally announced July 2019.

Comments: 10 pages

arXiv:1904.08174 [pdf, other]

doi 10.1016/j.euromechflu.2020.02.008

An alternative view on the Bateman-Luke variational principle

Authors: Hamid Alemi Ardakani

Abstract: A new derivation of the Bernoulli equation for water waves in three-dimensional rotating and translating coordinate systems is given. An alternative view on the Bateman-Luke variational principle is presented. The variational principle recovers the boundary value problem governing the motion of potential water waves in a container undergoing prescribed rigid-body motion in three dimensions. A math… ▽ More A new derivation of the Bernoulli equation for water waves in three-dimensional rotating and translating coordinate systems is given. An alternative view on the Bateman-Luke variational principle is presented. The variational principle recovers the boundary value problem governing the motion of potential water waves in a container undergoing prescribed rigid-body motion in three dimensions. A mathematical theory is presented for the problem of three-dimensional interactions between potential surface waves and a floating structure with interior potential fluid sloshing. The complete set of equations of motion for the exterior gravity-driven water waves, and the exact nonlinear hydrodynamic equations of motion for the linear momentum and angular momentum of the floating structure containing fluid, are derived from a second variational principle. △ Less

Submitted 23 May, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

arXiv:1902.02170 [pdf, other]

Metamaterials mimic the black holes: the effects of charge and rotation on the optical properties

Authors: S. H. Hendi, Z. S. Taghadomi, A. Ghasempour Ardakani

Abstract: Motivated by investigation of black hole properties in the lab, some interesting subjects such as analogue gravity and transformation optics are generated. In this paper, we look for the analogies between the geometry of a gravitating system and the optical medium. In addition, we recognize that appropriate metamaterials can be used to mimic the propagation of light in the curved spacetimes and be… ▽ More Motivated by investigation of black hole properties in the lab, some interesting subjects such as analogue gravity and transformation optics are generated. In this paper, we look for the analogies between the geometry of a gravitating system and the optical medium. In addition, we recognize that appropriate metamaterials can be used to mimic the propagation of light in the curved spacetimes and behave like black holes. The resemblance of metamaterials with Kerr and Reissner-Nordström spacetimes is studied. At last, we compare the full-wave numerical calculation of light with its optical limit of geometry. △ Less

Submitted 30 January, 2019; originally announced February 2019.

Comments: 20 pages with 5 captioned figures. Submitted for publication

arXiv:1811.10396 [pdf, other]

Learning to Skip Ineffectual Recurrent Computations in LSTMs

Authors: Arash Ardakani, Zhengyun Ji, Warren J. Gross

Abstract: Long Short-Term Memory (LSTM) is a special class of recurrent neural network, which has shown remarkable successes in processing sequential data. The typical architecture of an LSTM involves a set of states and gates: the states retain information over arbitrary time intervals and the gates regulate the flow of information. Due to the recursive nature of LSTMs, they are computationally intensive t… ▽ More Long Short-Term Memory (LSTM) is a special class of recurrent neural network, which has shown remarkable successes in processing sequential data. The typical architecture of an LSTM involves a set of states and gates: the states retain information over arbitrary time intervals and the gates regulate the flow of information. Due to the recursive nature of LSTMs, they are computationally intensive to deploy on edge devices with limited hardware resources. To reduce the computational complexity of LSTMs, we first introduce a method that learns to retain only the important information in the states by pruning redundant information. We then show that our method can prune over 90% of information in the states without incurring any accuracy degradation over a set of temporal tasks. This observation suggests that a large fraction of the recurrent computations are ineffectual and can be avoided to speed up the process during the inference as they involve noncontributory multiplications/accumulations with zero-valued states. Finally, we introduce a custom hardware accelerator that can perform the recurrent computations using both sparse and dense states. Experimental measurements show that performing the computations using the sparse states speeds up the process and improves energy efficiency by up to 5.2x when compared to implementation results of the accelerator performing the computations using dense states. △ Less

Submitted 29 November, 2018; v1 submitted 9 November, 2018; originally announced November 2018.

Comments: Accepted as a conference paper for presentation at DATE 2019

arXiv:1809.11086 [pdf, other]

Learning Recurrent Binary/Ternary Weights

Authors: Arash Ardakani, Zhengyun Ji, Sean C. Smithson, Brett H. Meyer, Warren J. Gross

Abstract: Recurrent neural networks (RNNs) have shown excellent performance in processing sequence data. However, they are both complex and memory intensive due to their recursive nature. These limitations make RNNs difficult to embed on mobile devices requiring real-time processes with limited hardware resources. To address the above issues, we introduce a method that can learn binary and ternary weights d… ▽ More Recurrent neural networks (RNNs) have shown excellent performance in processing sequence data. However, they are both complex and memory intensive due to their recursive nature. These limitations make RNNs difficult to embed on mobile devices requiring real-time processes with limited hardware resources. To address the above issues, we introduce a method that can learn binary and ternary weights during the training phase to facilitate hardware implementations of RNNs. As a result, using this approach replaces all multiply-accumulate operations by simple accumulations, bringing significant benefits to custom hardware in terms of silicon area and power consumption. On the software side, we evaluate the performance (in terms of accuracy) of our method using long short-term memories (LSTMs) on various sequential models including sequence classification and language modeling. We demonstrate that our method achieves competitive results on the aforementioned tasks while using binary/ternary weights during the runtime. On the hardware side, we present custom hardware for accelerating the recurrent computations of LSTMs with binary/ternary weights. Ultimately, we show that LSTMs with binary/ternary weights can achieve up to 12x memory saving and 10x inference speedup compared to the full-precision implementation on an ASIC platform. △ Less

Submitted 24 January, 2019; v1 submitted 28 September, 2018; originally announced September 2018.

Comments: Published as a conference paper at ICLR 2019

arXiv:1809.10909 [pdf, other]

doi 10.1098/rspa.2018.0642

A variational principle for fluid sloshing with vorticity, dynamically coupled to vessel motion

Authors: H. Alemi Ardakani, T. J. Bridges, F. Gay-Balmaz, Y. Huang, C. Tronci

Abstract: A variational principle is derived for two-dimensional incompressible rotational fluid flow with a free surface in a moving vessel when both the vessel and fluid motion are to be determined. The fluid is represented by a stream function and the vessel motion is represented by a path in the planar Euclidean group. Novelties in the formulation include how the pressure boundary condition is treated,… ▽ More A variational principle is derived for two-dimensional incompressible rotational fluid flow with a free surface in a moving vessel when both the vessel and fluid motion are to be determined. The fluid is represented by a stream function and the vessel motion is represented by a path in the planar Euclidean group. Novelties in the formulation include how the pressure boundary condition is treated, the introduction of a stream function into the Euler-Poincaré variations, the derivation of free surface variations, and how the equations for the vessel path in the Euclidean group, coupled to the fluid motion, are generated automatically. △ Less

Submitted 28 September, 2018; originally announced September 2018.

Comments: 19 pages, 3 figures

MSC Class: 76B99 (Primary) 76B07 (Secondary)

arXiv:1809.06932 [pdf]

doi 10.1126/science.aat2998

Electron-Scale Dynamics of the Diffusion Region during Symmetric Magnetic Reconnection in Space

Authors: R. B. Torbert, J. L. Burch, T. D. Phan, M. Hesse, M. R. Argall, J. Shuster, R. E. Ergun, L. Alm, R. Nakamura, K. Genestreti, D. J. Gershman, W. R. Paterson, D. L. Turner, I. Cohen, B. L. Giles, C. J. Pollock, S. Wang, L. -J. Chen, Julia Stawarz, J. P. Eastwood, K. - J. Hwang, C. Farrugia, I. Dors, H. Vaith, C. Mouikis , et al. (24 additional authors not shown)

Abstract: Magnetic reconnection is an energy conversion process important in many astrophysical contexts including the Earth's magnetosphere, where the process can be investigated in-situ. Here we present the first encounter of a reconnection site by NASA's Magnetospheric Multiscale (MMS) spacecraft in the magnetotail, where reconnection involves symmetric inflow conditions. The unprecedented electron-scale… ▽ More Magnetic reconnection is an energy conversion process important in many astrophysical contexts including the Earth's magnetosphere, where the process can be investigated in-situ. Here we present the first encounter of a reconnection site by NASA's Magnetospheric Multiscale (MMS) spacecraft in the magnetotail, where reconnection involves symmetric inflow conditions. The unprecedented electron-scale plasma measurements revealed (1) super-Alfvenic electron jets reaching 20,000 km/s, (2) electron meandering motion and acceleration by the electric field, producing multiple crescent-shaped structures, (3) spatial dimensions of the electron diffusion region implying a reconnection rate of 0.1-0.2. The well-structured multiple layers of electron populations indicate that, despite the presence of turbulence near the reconnection site, the key electron dynamics appears to be largely laminar. △ Less

Submitted 18 September, 2018; originally announced September 2018.

Comments: 4 pages, 3 figures, and supplementary material

arXiv:1809.02641 [pdf]

Temperature tunable Anderson localization for surface plasmon waves propagating in a graphene single layer placed on a random InAs grating

Authors: Abbas Ghasempour Ardakani, Marzieh Sedaghat Nejad

Abstract: In this paper, we propose a one-dimensional disordered plasmonic structure composed of a graphene single layer placed on a random grating composed of InAs. The propagation of a plasmonic wave through this structure is investigated numerically. By calculation of normalized localization length for systems with different disorder strengths, it is determined whether or not the system is in the localiz… ▽ More In this paper, we propose a one-dimensional disordered plasmonic structure composed of a graphene single layer placed on a random grating composed of InAs. The propagation of a plasmonic wave through this structure is investigated numerically. By calculation of normalized localization length for systems with different disorder strengths, it is determined whether or not the system is in the localized regime. For some frequencies, depending on the disorder level, Anderson localization occurs for plasmonic waves propagating through the graphene layer. Furthermore, the effect of optical loss on the localization length is studied. By calculating the localization length at different temperatures, it is observed that Anderson localization of graphene plasmons is temperature dependent and can be controlled by changing the temperature. In the transmission spectrum for each random realization, there are some resonance peaks which are blue-shifted with increasing the temperature. Finally, the effects of Fermi energy level of the graphene layer and width of air gaps on the individual transmission resonances are examined. △ Less

Submitted 7 September, 2018; originally announced September 2018.

arXiv:1801.01820 [pdf, other]

Design and Implementation of a Polar Codes Blind Detection Scheme

Authors: Carlo Condo, Seyyed Ali Hashemi, Arash Ardakani, Furkan Ercan, Warren J. Gross

Abstract: In blind detection, a set of candidates has to be decoded within a strict time constraint, to identify which transmissions are directed at the user equipment. Blind detection is required by the 3GPP LTE/LTE-Advanced standard, and it will be required in the 5th generation wireless communication standard (5G) as well. Polar codes have been selected for use in 5G: thus, the issue of blind detection o… ▽ More In blind detection, a set of candidates has to be decoded within a strict time constraint, to identify which transmissions are directed at the user equipment. Blind detection is required by the 3GPP LTE/LTE-Advanced standard, and it will be required in the 5th generation wireless communication standard (5G) as well. Polar codes have been selected for use in 5G: thus, the issue of blind detection of polar codes must be addressed. We propose a polar code blind detection scheme where the user ID is transmitted instead of some of the frozen bits. A first, coarse decoding phase helps selecting a subset of candidates that is decoded by a more powerful algorithm: an early stopping criterion is also introduced for the second decoding phase. Simulations results show good missed detection and false alarm rates, along with substantial latency gains thanks to early stopping. We then propose an architecture to implement the devised blind detection scheme, based on a tunable decoder that can be used for both phases. The architecture is synthesized and implementation results are reported for various system parameters. The reported area occupation and latency, obtained in 65 nm CMOS technology, are able to meet 5G requirements, and are guaranteed to meet them with even less resource usage in the latest technology nodes. △ Less

Submitted 4 January, 2018; originally announced January 2018.

Comments: arXiv admin note: text overlap with arXiv:1705.01864

arXiv:1712.03994 [pdf, other]

Multi-Mode Inference Engine for Convolutional Neural Networks

Authors: Arash Ardakani, Carlo Condo, Warren J. Gross

Abstract: During the past few years, interest in convolutional neural networks (CNNs) has risen constantly, thanks to their excellent performance on a wide range of recognition and classification tasks. However, they suffer from the high level of complexity imposed by the high-dimensional convolutions in convolutional layers. Within scenarios with limited hardware resources and tight power and latency const… ▽ More During the past few years, interest in convolutional neural networks (CNNs) has risen constantly, thanks to their excellent performance on a wide range of recognition and classification tasks. However, they suffer from the high level of complexity imposed by the high-dimensional convolutions in convolutional layers. Within scenarios with limited hardware resources and tight power and latency constraints, the high computational complexity of CNNs makes them difficult to be exploited. Hardware solutions have striven to reduce the power consumption using low-power techniques, and to limit the processing time by increasing the number of processing elements (PEs). While most of ASIC designs claim a peak performance of a few hundred giga operations per seconds, their average performance is substantially lower when applied to state-of-the-art CNNs such as AlexNet, VGGNet and ResNet, leading to low resource utilization. Their performance efficiency is limited to less than 55% on average, which leads to unnecessarily high processing latency and silicon area. In this paper, we propose a dataflow which enables to perform both the fully-connected and convolutional computations for any filter/layer size using the same PEs. We then introduce a multi-mode inference engine (MMIE) based on the proposed dataflow. Finally, we show that the proposed MMIE achieves a performance efficiency of more than 84% when performing the computations of the three renown CNNs (i.e., AlexNet, VGGNet and ResNet), outperforming the best architecture in the state-of-the-art in terms of energy consumption, processing latency and silicon area. △ Less

Submitted 11 December, 2017; originally announced December 2017.

arXiv:1709.08023 [pdf]

Ownership Cost Calculations for Distributed Energy Resources Using Uncertainty and Risk Analyses

Authors: S. Ali Pourmousavi, Mahdi Behrangrad, Ali Jahanbani Ardakani, M. Hashem Nehrir

Abstract: Ownership cost calculation plays an important role in optimal operation of distributed energy resources (DERs) and microgrids (MGs) in the future power system, known as smart grid. In this paper, a general framework for ownership cost calculation is proposed using uncertainty and risk analyses. Four ownership cost calculation approaches are introduced and compared based on their associated risk va… ▽ More Ownership cost calculation plays an important role in optimal operation of distributed energy resources (DERs) and microgrids (MGs) in the future power system, known as smart grid. In this paper, a general framework for ownership cost calculation is proposed using uncertainty and risk analyses. Four ownership cost calculation approaches are introduced and compared based on their associated risk values. Finally, the best method is chosen based on a series of simulation results, performed for a typical diesel generator (DiG). Although simulation results are given for a DiG (as commonly used in MGs), the proposed approaches can be applied to other MG components, such as batteries, with slight modifications, as presented in this paper. The analyses and proposed approaches can be useful in MG optimal design, optimal power flow, and market-based operation of the smart grid for accurate operational cost calculations. △ Less

Submitted 23 September, 2017; originally announced September 2017.

Comments: 8 pages, 7 figures, 3 tables

arXiv:1611.01427 [pdf, other]

Sparsely-Connected Neural Networks: Towards Efficient VLSI Implementation of Deep Neural Networks

Authors: Arash Ardakani, Carlo Condo, Warren J. Gross

Abstract: Recently deep neural networks have received considerable attention due to their ability to extract and represent high-level abstractions in data sets. Deep neural networks such as fully-connected and convolutional neural networks have shown excellent performance on a wide range of recognition and classification tasks. However, their hardware implementations currently suffer from large silicon area… ▽ More Recently deep neural networks have received considerable attention due to their ability to extract and represent high-level abstractions in data sets. Deep neural networks such as fully-connected and convolutional neural networks have shown excellent performance on a wide range of recognition and classification tasks. However, their hardware implementations currently suffer from large silicon area and high power consumption due to the their high degree of complexity. The power/energy consumption of neural networks is dominated by memory accesses, the majority of which occur in fully-connected networks. In fact, they contain most of the deep neural network parameters. In this paper, we propose sparsely-connected networks, by showing that the number of connections in fully-connected networks can be reduced by up to 90% while improving the accuracy performance on three popular datasets (MNIST, CIFAR10 and SVHN). We then propose an efficient hardware architecture based on linear-feedback shift registers to reduce the memory requirements of the proposed sparsely-connected networks. The proposed architecture can save up to 90% of memory compared to the conventional implementations of fully-connected neural networks. Moreover, implementation results show up to 84% reduction in the energy consumption of a single neuron of the proposed sparsely-connected networks compared to a single neuron of fully-connected neural networks. △ Less

Submitted 30 March, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

Comments: Published as a conference paper at ICLR 2017

arXiv:1509.08972 [pdf, other]

doi 10.1109/TVLSI.2017.2654298

VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing

Authors: Arash Ardakani, François Leduc-Primeau, Naoya Onizawa, Takahiro Hanyu, Warren J. Gross

Abstract: The hardware implementation of deep neural networks (DNNs) has recently received tremendous attention: many applications in fact require high-speed operations that suit a hardware implementation. However, numerous elements and complex interconnections are usually required, leading to a large area occupation and copious power consumption. Stochastic computing has shown promising results for low-pow… ▽ More The hardware implementation of deep neural networks (DNNs) has recently received tremendous attention: many applications in fact require high-speed operations that suit a hardware implementation. However, numerous elements and complex interconnections are usually required, leading to a large area occupation and copious power consumption. Stochastic computing has shown promising results for low-power area-efficient hardware implementations, even though existing stochastic algorithms require long streams that cause long latencies. In this paper, we propose an integer form of stochastic computation and introduce some elementary circuits. We then propose an efficient implementation of a DNN based on integral stochastic computing. The proposed architecture has been implemented on a Virtex7 FPGA, resulting in 45% and 62% average reductions in area and latency compared to the best reported architecture in literature. We also synthesize the circuits in a 65 nm CMOS technology and we show that the proposed integral stochastic architecture results in up to 21% reduction in energy consumption compared to the binary radix implementation at the same misclassification rate. Due to fault-tolerant nature of stochastic architectures, we also consider a quasi-synchronous implementation which yields 33% reduction in energy consumption w.r.t. the binary radix implementation without any compromise on performance. △ Less

Submitted 24 August, 2016; v1 submitted 29 September, 2015; originally announced September 2015.

Comments: 11 pages, 12 figures

Journal ref: IEEE Transactions on Very Large Scale Integration (VLSI) Systems , vol.PP, no.99, pp.1-12, 2017

arXiv:1507.04149 [pdf, ps, other]

doi 10.1088/2040-8978/17/10/105601

Controlling Anderson localization in disordered heterostructures with Lévy-type distribution

Authors: A. Ghasempour Ardakani, M. Ghasemi Nezhadhaghighi

Abstract: In this paper, we propose a disordered heterostructure in which the distribution of refractive index of one of its constituents follows a Lévy-type distribution characterized by the exponent $α$. For the normal and oblique incidences, the effect of $α$ variation on the localization length is investigated in different frequency ranges. As a result, the controllability of Anderson localization can b… ▽ More In this paper, we propose a disordered heterostructure in which the distribution of refractive index of one of its constituents follows a Lévy-type distribution characterized by the exponent $α$. For the normal and oblique incidences, the effect of $α$ variation on the localization length is investigated in different frequency ranges. As a result, the controllability of Anderson localization can be achieved by changing the exponent $α$ in the disordered structure having heavy tailed distribution. △ Less

Submitted 15 July, 2015; originally announced July 2015.

Comments: 5 pages, 8 figures, accepted in Journal of Optics

arXiv:physics/0403003 [pdf, ps, other]

Mid-Infrared Radiation as a Short-Term Earthquake Precursor

Authors: M. Allameh-Zadeh, A. Ansari, A. Bahraminasab, K. Kaviani, A. Mahdavi Ardakani, H. Mehr-nahad, D. Mehr-shahi, M. D. Niry, M. Reza Rahimi Tabar, S. Tabatabai, N. Taghavinia M. Vesaghi, F. Zamani

Abstract: Recently it has been found by F. Freund that the granite under high pressure undergoes a phase transition from insulator to a p-type semiconductor. This phase transition is a key concept to understanding pre-earthquake phenomena. This effect accompanies with the radiation of the granite in the mid-infrared region. we were able to predict the recent earthquake in the south of Iran by monitoring t… ▽ More Recently it has been found by F. Freund that the granite under high pressure undergoes a phase transition from insulator to a p-type semiconductor. This phase transition is a key concept to understanding pre-earthquake phenomena. This effect accompanies with the radiation of the granite in the mid-infrared region. we were able to predict the recent earthquake in the south of Iran by monitoring this radiation. △ Less

Submitted 29 February, 2004; originally announced March 2004.

Comments: 1 page, 1 figure, short report

Showing 1–21 of 21 results for author: Ardakani, A