subscribe to arXiv mailings

A New Statistical Analysis of the Morphology of Spiral Galaxies

Authors: Junye Wei, Ye Xu, Zehao Lin, Chaojie Hao, Yingjie Li, Dejian Liu, Shuaibo Bian

Abstract: Morphology is the starting point for understanding galaxies. Elmegreen et al. classified spiral galaxies into flocculent, multiple-arm, and grand-design galaxies based on the regularity of their spiral arm structure. With the release of a vast number of clear spiral galaxy images from the Sloan Digital Sky Survey, we conducted a morphological classification of 5093 blue spiral galaxies. A statisti… ▽ More Morphology is the starting point for understanding galaxies. Elmegreen et al. classified spiral galaxies into flocculent, multiple-arm, and grand-design galaxies based on the regularity of their spiral arm structure. With the release of a vast number of clear spiral galaxy images from the Sloan Digital Sky Survey, we conducted a morphological classification of 5093 blue spiral galaxies. A statistical analysis of this sample shows that the fractions of flocculent, multiple-arm, and grand-design galaxies are 38 $\pm$ 1%, 59 $\pm$ 1%, and 3 $\pm$ 1%, respectively. Redshift has no obvious influence on this classification. However, as the bulge size becomes larger, the fraction of multiple-arm galaxies increases, while that of flocculent galaxies decreases. In addition, we performed a statistical analysis of 3958 galaxies with a clear spiral arm structure, finding 82% of these galaxies have two arms in their inner regions. We also found that the majority (74%) of the barred spiral galaxies exhibit the characteristics of two inner spiral arms and multiple outer spiral arms, and there is no barred spiral galaxy in this work with four continuous spiral arms from the inner to the outer regions. These results highlight that the spiral arm structure of the Milky Way, according to the current mainstream view of a four-arm galaxy with continuous arms extending from the inner to outer regions, is quite unique. However, our findings align with the spiral morphology of the Milky Way proposed by Xu et al., in which case our Galaxy can be considered typical. △ Less

Submitted 10 October, 2024; originally announced October 2024.

Comments: 15 pages, 9 figures, 3 tables, accepted for publication in AJ

arXiv:2409.00083 [pdf, other]

doi 10.1145/3675095.3676607

On-device Learning of EEGNet-based Network For Wearable Motor Imagery Brain-Computer Interface

Authors: Sizhen Bian, Pixi Kang, Julian Moosmann, Mengxi Liu, Pietro Bonazzi, Roman Rosipal, Michele Magno

Abstract: Electroencephalogram (EEG)-based Brain-Computer Interfaces (BCIs) have garnered significant interest across various domains, including rehabilitation and robotics. Despite advancements in neural network-based EEG decoding, maintaining performance across diverse user populations remains challenging due to feature distribution drift. This paper presents an effective approach to address this challeng… ▽ More Electroencephalogram (EEG)-based Brain-Computer Interfaces (BCIs) have garnered significant interest across various domains, including rehabilitation and robotics. Despite advancements in neural network-based EEG decoding, maintaining performance across diverse user populations remains challenging due to feature distribution drift. This paper presents an effective approach to address this challenge by implementing a lightweight and efficient on-device learning engine for wearable motor imagery recognition. The proposed approach, applied to the well-established EEGNet architecture, enables real-time and accurate adaptation to EEG signals from unregistered users. Leveraging the newly released low-power parallel RISC-V-based processor, GAP9 from Greeenwaves, and the Physionet EEG Motor Imagery dataset, we demonstrate a remarkable accuracy gain of up to 7.31\% with respect to the baseline with a memory footprint of 15.6 KByte. Furthermore, by optimizing the input stream, we achieve enhanced real-time performance without compromising inference accuracy. Our tailored approach exhibits inference time of 14.9 ms and 0.76 mJ per single inference and 20 us and 0.83 uJ per single update during online training. These findings highlight the feasibility of our method for edge EEG devices as well as other battery-powered wearable AI systems suffering from subject-dependant feature distribution drift. △ Less

Submitted 25 August, 2024; originally announced September 2024.

arXiv:2407.09260 [pdf, other]

Evaluation of Encoding Schemes on Ubiquitous Sensor Signal for Spiking Neural Network

Authors: Sizhen Bian, Elisa Donati, Michele Magno

Abstract: Spiking neural networks (SNNs), a brain-inspired computing paradigm, are emerging for their inference performance, particularly in terms of energy efficiency and latency attributed to the plasticity in signal processing. To deploy SNNs in ubiquitous computing systems, signal encoding of sensors is crucial for achieving high accuracy and robustness. Using inertial sensor readings for gym activity r… ▽ More Spiking neural networks (SNNs), a brain-inspired computing paradigm, are emerging for their inference performance, particularly in terms of energy efficiency and latency attributed to the plasticity in signal processing. To deploy SNNs in ubiquitous computing systems, signal encoding of sensors is crucial for achieving high accuracy and robustness. Using inertial sensor readings for gym activity recognition as a case study, this work comprehensively evaluates four main encoding schemes and deploys the corresponding SNN on the neuromorphic processor Loihi2 for post-deployment encoding assessment. Rate encoding, time-to-first-spike encoding, binary encoding, and delta modulation are evaluated using metrics like average fire rate, signal-to-noise ratio, classification accuracy, robustness, and inference latency and energy. In this case study, the time-to-first-spike encoding required the lowest firing rate (2%) and achieved a comparative accuracy (89%), although it was the least robust scheme against error spikes (over 20% accuracy drop with 0.1 noisy spike rate). Rate encoding with optimal value-to-probability mapping achieved the highest accuracy (91.7%). Binary encoding provided a balance between information reconstruction and noise resistance. Multi-threshold delta modulation showed the best robustness, with only a 0.7% accuracy drop at a 0.1 noisy spike rate. This work serves researchers in selecting the best encoding scheme for SNN-based ubiquitous sensor signal processing, tailored to specific performance requirements. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.06186 [pdf, other]

Earable and Wrist-worn Setup for Accurate Step Counting Utilizing Body-Area Electrostatic Sensing

Authors: Sizhen Bian, Rakita Strahinja, Philipp Schilk, Clénin Marc-André, Silvano Cortesi, Elio Reinschmidt, Kanika Dheman, Michele Magno

Abstract: Step-counting has been widely implemented in wrist-worn devices and is accepted by end users as a quantitative indicator of everyday exercise. However, existing counting approach (mostly on wrist-worn setup) lacks robustness and thus introduces inaccuracy issues in certain scenarios like brief intermittent walking bouts and random arm motions or static arm status while walking (no clear correlatio… ▽ More Step-counting has been widely implemented in wrist-worn devices and is accepted by end users as a quantitative indicator of everyday exercise. However, existing counting approach (mostly on wrist-worn setup) lacks robustness and thus introduces inaccuracy issues in certain scenarios like brief intermittent walking bouts and random arm motions or static arm status while walking (no clear correlation of motion pattern between arm and leg). This paper proposes a low-power step-counting solution utilizing the body area electric field acquired by a novel electrostatic sensing unit, consuming only 87.3 $μ$W of power, hoping to strengthen the robustness of current dominant solution. We designed two wearable devices for on-the-wrist and in-the-ear deployment and collected body-area electric field-derived motion signals from ten volunteers. Four walking scenarios are considered: in the parking lot/shopping center with/without pushing the shopping trolley. The step-counting accuracy from the prototypes shows better accuracy than the commercial wrist-worn devices (e.g.,96% of the wrist- and ear-worn prototype vs. 66% of the Fitbit when walking in the shopping center while pushing a shopping trolley). We finally discussed the potential and limitations of sensing body-area electric fields for wrist-worn and ear-worn step-counting and beyond. △ Less

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.03644 [pdf, other]

On-Device Training Empowered Transfer Learning For Human Activity Recognition

Authors: Pixi Kang, Julian Moosmann, Sizhen Bian, Michele Magno

Abstract: Human Activity Recognition (HAR) is an attractive topic to perceive human behavior and supplying assistive services. Besides the classical inertial unit and vision-based HAR methods, new sensing technologies, such as ultrasound and body-area electric fields, have emerged in HAR to enhance user experience and accommodate new application scenarios. As those sensors are often paired with AI for HAR,… ▽ More Human Activity Recognition (HAR) is an attractive topic to perceive human behavior and supplying assistive services. Besides the classical inertial unit and vision-based HAR methods, new sensing technologies, such as ultrasound and body-area electric fields, have emerged in HAR to enhance user experience and accommodate new application scenarios. As those sensors are often paired with AI for HAR, they frequently encounter challenges due to limited training data compared to the more widely IMU or vision-based HAR solutions. Additionally, user-induced concept drift (UICD) is common in such HAR scenarios. UICD is characterized by deviations in the sample distribution of new users from that of the training participants, leading to deteriorated recognition performance. This paper proposes an on-device transfer learning (ODTL) scheme tailored for energy- and resource-constrained IoT edge devices. Optimized on-device training engines are developed for two representative MCU-level edge computing platforms: STM32F756ZG and GAP9. Based on this, we evaluated the ODTL benefits in three HAR scenarios: body capacitance-based gym activity recognition, QVAR- and ultrasonic-based hand gesture recognition. We demonstrated an improvement of 3.73%, 17.38%, and 3.70% in the activity recognition accuracy, respectively. Besides this, we observed that the RISC-V-based GAP9 achieves 20x and 280x less latency and power consumption than STM32F7 MCU during the ODTL deployment, demonstrating the advantages of employing the latest low-power parallel computing devices for edge tasks. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2407.00354 [pdf, ps, other]

On selection dynamics for a nonlocal phenotype-structured model

Authors: Shen Bian, Jiale Bu

Abstract: This paper is devoted to the analysis of the long-time behavior of a phenotypic-structured model where phenotypic changes do not occur. We give a mathematical description of the process in which the best adapted trait is selected in a given environment created by the total population. It is exhibited that the long-time limit of the unique solution to the nonlocal equation is given by a Dirac mass… ▽ More This paper is devoted to the analysis of the long-time behavior of a phenotypic-structured model where phenotypic changes do not occur. We give a mathematical description of the process in which the best adapted trait is selected in a given environment created by the total population. It is exhibited that the long-time limit of the unique solution to the nonlocal equation is given by a Dirac mass centered at the peak of the fitness within or at the boundary of the region where the initial data is positive. Specially, If the peak of the fitness can't be in the support of the solution, then the infinite time blow-up of the solution occurs near the boundary of the region where the solution is positive. Moreover, our numerical results facilitate a deeper understanding of identifying the position of the centers. △ Less

Submitted 29 June, 2024; originally announced July 2024.

arXiv:2406.11914 [pdf, other]

Initial Investigation of Kolmogorov-Arnold Networks (KANs) as Feature Extractors for IMU Based Human Activity Recognition

Authors: Mengxi Liu, Daniel Geißler, Dominique Nshimyimana, Sizhen Bian, Bo Zhou, Paul Lukowicz

Abstract: In this work, we explore the use of a novel neural network architecture, the Kolmogorov-Arnold Networks (KANs) as feature extractors for sensor-based (specifically IMU) Human Activity Recognition (HAR). Where conventional networks perform a parameterized weighted sum of the inputs at each node and then feed the result into a statically defined nonlinearity, KANs perform non-linear computations rep… ▽ More In this work, we explore the use of a novel neural network architecture, the Kolmogorov-Arnold Networks (KANs) as feature extractors for sensor-based (specifically IMU) Human Activity Recognition (HAR). Where conventional networks perform a parameterized weighted sum of the inputs at each node and then feed the result into a statically defined nonlinearity, KANs perform non-linear computations represented by B-SPLINES on the edges leading to each node and then just sum up the inputs at the node. Instead of learning weights, the system learns the spline parameters. In the original work, such networks have been shown to be able to more efficiently and exactly learn sophisticated real valued functions e.g. in regression or PDE solution. We hypothesize that such an ability is also advantageous for computing low-level features for IMU-based HAR. To this end, we have implemented KAN as the feature extraction architecture for IMU-based human activity recognition tasks, including four architecture variations. We present an initial performance investigation of the KAN feature extractor on four public HAR datasets. It shows that the KAN-based feature extractor outperforms CNN-based extractors on all datasets while being more parameter efficient. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: This paper is under review

arXiv:2406.01646 [pdf, other]

iKAN: Global Incremental Learning with KAN for Human Activity Recognition Across Heterogeneous Datasets

Authors: Mengxi Liu, Sizhen Bian, Bo Zhou, Paul Lukowicz

Abstract: This work proposes an incremental learning (IL) framework for wearable sensor human activity recognition (HAR) that tackles two challenges simultaneously: catastrophic forgetting and non-uniform inputs. The scalable framework, iKAN, pioneers IL with Kolmogorov-Arnold Networks (KAN) to replace multi-layer perceptrons as the classifier that leverages the local plasticity and global stability of spli… ▽ More This work proposes an incremental learning (IL) framework for wearable sensor human activity recognition (HAR) that tackles two challenges simultaneously: catastrophic forgetting and non-uniform inputs. The scalable framework, iKAN, pioneers IL with Kolmogorov-Arnold Networks (KAN) to replace multi-layer perceptrons as the classifier that leverages the local plasticity and global stability of splines. To adapt KAN for HAR, iKAN uses task-specific feature branches and a feature redistribution layer. Unlike existing IL methods that primarily adjust the output dimension or the number of classifier nodes to adapt to new tasks, iKAN focuses on expanding the feature extraction branches to accommodate new inputs from different sensor modalities while maintaining consistent dimensions and the number of classifier outputs. Continual learning across six public HAR datasets demonstrated the iKAN framework's incremental learning performance, with a last performance of 84.9\% (weighted F1 score) and an average incremental performance of 81.34\%, which significantly outperforms the two existing incremental learning methods, such as EWC (51.42\%) and experience replay (59.92\%). △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: This work is submitted to Ubicomp/ISWC24 and is under review

arXiv:2405.16767 [pdf, other]

doi 10.1007/978-3-031-74234-7_4

Oblivious Monitoring for Discrete-Time STL via Fully Homomorphic Encryption

Authors: Masaki Waga, Kotaro Matsuoka, Takashi Suwa, Naoki Matsumoto, Ryotaro Banno, Song Bian, Kohei Suenaga

Abstract: When monitoring a cyber-physical system (CPS) from a remote server, keeping the monitored data secret is crucial, particularly when they contain sensitive information, e.g., biological or location data. Recently, Banno et al. (CAV'22) proposed a protocol for online LTL monitoring that keeps data concealed from the server using Fully Homomorphic Encryption (FHE). We build on this protocol to allow… ▽ More When monitoring a cyber-physical system (CPS) from a remote server, keeping the monitored data secret is crucial, particularly when they contain sensitive information, e.g., biological or location data. Recently, Banno et al. (CAV'22) proposed a protocol for online LTL monitoring that keeps data concealed from the server using Fully Homomorphic Encryption (FHE). We build on this protocol to allow arithmetic operations over encrypted values, e.g., to compute a safety measurement combining distance, velocity, and so forth. Overall, our protocol enables oblivious online monitoring of discrete-time real-valued signals against signal temporal logic (STL) formulas. Our protocol combines two FHE schemes, CKKS and TFHE, leveraging their respective strengths. We employ CKKS to evaluate arithmetic predicates in STL formulas while utilizing TFHE to process them using a DFA derived from the STL formula. We conducted case studies on monitoring blood glucose levels and vehicles' behavior against the Responsibility-Sensitive Safety (RSS) rules. Our results suggest the practical relevance of our protocol. △ Less

Submitted 18 October, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

Comments: Accepted to RV'24

arXiv:2405.11439 [pdf, other]

doi 10.3847/1538-3881/ad4030

On the Structure of the Sagittarius Spiral Arm in the Inner Milky Way

Authors: S. B. Bian, Y. W. Wu, Y. Xu, M. J. Reid, J. J. Li, B. Zhang, K. M. Menten, L. Moscadelli, A. Brunthaler

Abstract: We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G04… ▽ More We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G043.89$-$00.78 are $9.9\pm0.5$, $10.2\pm0.6$, $7.6\pm0.5$, and $7.5\pm0.3$ kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. (2023) that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 14 pages, 5 figures, accepted to AJ

Journal ref: 2024 AJ 167:267

arXiv:2404.15819 [pdf, other]

APACHE: A Processing-Near-Memory Architecture for Multi-Scheme Fully Homomorphic Encryption

Authors: Lin Ding, Song Bian, Penggao He, Yan Xu, Gang Qu, Jiliang Zhang

Abstract: Fully Homomorphic Encryption (FHE) allows one to outsource computation over encrypted data to untrusted servers without worrying about data breaching. Since FHE is known to be extremely computationally-intensive, application-specific accelerators emerged as a powerful solution to narrow the performance gap. Nonetheless, due to the increasing complexities in FHE schemes per se and multi-scheme FHE… ▽ More Fully Homomorphic Encryption (FHE) allows one to outsource computation over encrypted data to untrusted servers without worrying about data breaching. Since FHE is known to be extremely computationally-intensive, application-specific accelerators emerged as a powerful solution to narrow the performance gap. Nonetheless, due to the increasing complexities in FHE schemes per se and multi-scheme FHE algorithm designs in end-to-end privacy-preserving tasks, existing FHE accelerators often face the challenges of low hardware utilization rates and insufficient memory bandwidth. In this work, we present APACHE, a layered near-memory computing hierarchy tailored for multi-scheme FHE acceleration. By closely inspecting the data flow across different FHE schemes, we propose a layered near-memory computing architecture with fine-grained functional unit design to significantly enhance the utilization rates of both computational resources and memory bandwidth. In addition, we propose a multi-scheme operator compiler to efficiently schedule high-level FHE computations across lower-level functional units. In the experiment, we evaluate APACHE on various FHE applications, such as Lola MNIST, HELR, fully-packed bootstrapping, and fully homomorphic processors. The results illustrate that APACHE outperforms the state-of-the-art ASIC FHE accelerators by 2.4x to 19.8x over a variety of operator and application benchmarks. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.14663 [pdf, other]

VLBI with SKA: Possible Arrays and Astrometric Science

Authors: Yingjie Li, Ye Xu, Jingjing Li, Shuaibo Bian, Zehao Lin, Chaojie Hao, Dejian Liu

Abstract: The next generation of very long baseline interferometry (VLBI) is stepping into the era of microarcsecond ($μ$as) astronomy, and pushing astronomy, especially astrometry, to new heights. VLBI with the Square Kilometre Array (SKA), SKA-VLBI, will increase current sensitivity by an order of magnitude, and reach astrometric precision routinely below 10 $μ$as, even challenging 1 $μ$as. This advanceme… ▽ More The next generation of very long baseline interferometry (VLBI) is stepping into the era of microarcsecond ($μ$as) astronomy, and pushing astronomy, especially astrometry, to new heights. VLBI with the Square Kilometre Array (SKA), SKA-VLBI, will increase current sensitivity by an order of magnitude, and reach astrometric precision routinely below 10 $μ$as, even challenging 1 $μ$as. This advancement allows precise parallax and proper motion measurements of various celestial objects. Such improvements can be used to study objects (including isolated objects, and binary or multiple systems) in different stellar stages (such as star formation, main-sequence stars, asymptotic giant branch stars, pulsars, black holes, white dwarfs, etc.), unveil the structure and evolution of complex systems (such as the Milky Way), benchmark the international celestial reference frame, and reveal cosmic expansion. Furthermore, the theory of general relativity can also be tested with SKA-VLBI using precise measurements of light deflection under the gravitational fields of different solar system objects and the perihelion precession of solar system objects. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: 41 pages, 12 figures, 4 tables. Accepted to RAA (Review)

arXiv:2403.07915 [pdf, other]

CycloWatt: An Affordable, TinyML-enhanced IoT Device Revolutionizing Cycling Power Metrics

Authors: Victor Luder, Sizhen Bian, Michele Magno

Abstract: Cycling power measurement is an indispensable metric with profound implications for cyclists' performance and fitness levels. It empowers riders with real-time feedback, supports precise training regimen planning, mitigates injury risks, and enhances muscular development. Despite these advantages, the widespread adoption of cycling power meters has been hampered by their prohibitive cost and deplo… ▽ More Cycling power measurement is an indispensable metric with profound implications for cyclists' performance and fitness levels. It empowers riders with real-time feedback, supports precise training regimen planning, mitigates injury risks, and enhances muscular development. Despite these advantages, the widespread adoption of cycling power meters has been hampered by their prohibitive cost and deployment complexity. This paper pioneers a groundbreaking approach to power measurement in cycling, prioritizing affordability and user-friendliness. To achieve this goal, we introduce a cutting-edge Internet of Things (IoT) device that seamlessly integrates force signals with inertial sensor data while leveraging the power of edge machine learning techniques. In-field experimental evaluations demonstrate that our prototype can estimate power with remarkable accuracy, boasting a Mean Absolute Error (MAE) of only 12.29 Watts (4.1\%). Notably, our design emphasizes energy efficiency, operating in a low-power mode that consumes a mere 50 milliwatts and offers an exceptional battery life of up to 25.8 hours in always-on active mode. With an ultra-low latency of 4.33 milliseconds for data processing and inference, our system ensures real-time power estimation during cycling activities. Incorporating IoT concepts and devices, this paper marks a significant milestone in developing cost-effective and accurate cycling power meters. △ Less

Submitted 27 February, 2024; originally announced March 2024.

arXiv:2403.03655

Kronos: A Secure and Generic Sharding Blockchain Consensus with Optimized Overhead

Authors: Yizhong Liu, Andi Liu, Yuan Lu, Zhuocheng Pan, Yinuo Li, Jianwei Liu, Song Bian, Mauro Conti

Abstract: Sharding enhances blockchain scalability by dividing the network into shards, each managing specific unspent transaction outputs or accounts. As an introduced new transaction type, cross-shard transactions pose a critical challenge to the security and efficiency of sharding blockchains. Currently, there is a lack of a generic sharding consensus pattern that achieves both security and low overhead.… ▽ More Sharding enhances blockchain scalability by dividing the network into shards, each managing specific unspent transaction outputs or accounts. As an introduced new transaction type, cross-shard transactions pose a critical challenge to the security and efficiency of sharding blockchains. Currently, there is a lack of a generic sharding consensus pattern that achieves both security and low overhead. In this paper, we present Kronos, a secure sharding blockchain consensus achieving optimized overhead. In particular, we propose a new secure sharding consensus pattern, based on a buffer managed jointly by shard members. Valid transactions are transferred to the payee via the buffer, while invalid ones are rejected through happy or unhappy paths. Kronos is proved to achieve security with atomicity under malicious clients with optimal intra-shard overhead $kB$ ($k$ for involved shard number and $B$ for a Byzantine fault tolerance (BFT) cost). Besides, we propose secure cross-shard certification methods based on batch certification and reliable cross-shard transfer. The former combines hybrid trees or vector commitments, while the latter integrates erasure coding. Handling $b$ transactions, Kronos is proved to achieve reliability with low cross-shard overhead $O(n b λ)$ ($n$ for shard size and $λ$ for the security parameter). Notably, Kronos imposes no restrictions on BFT and does not rely on time assumptions, offering optional constructions in various modules. We implement Kronos using two prominent BFT protocols: asynchronous Speeding Dumbo and partial synchronous Hotstuff. Extensive experiments demonstrate Kronos scales the consensus nodes to thousands, achieving a substantial throughput of 320 ktx/sec with 2.0 sec latency. Compared with the past solutions, Kronos outperforms, achieving up to a 12* improvement in throughput and a 50% reduction in latency. △ Less

Submitted 12 September, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

Comments: The algorithms in Section 4 contain defects and inaccurate descriptions that require correction

arXiv:2403.01345 [pdf, other]

ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation

Authors: Siyuan Bian, Jiefeng Li, Jiasheng Tang, Cewu Lu

Abstract: Accurate human shape recovery from a monocular RGB image is a challenging task because humans come in different shapes and sizes and wear different clothes. In this paper, we propose ShapeBoost, a new human shape recovery framework that achieves pixel-level alignment even for rare body shapes and high accuracy for people wearing different types of clothes. Unlike previous approaches that rely on t… ▽ More Accurate human shape recovery from a monocular RGB image is a challenging task because humans come in different shapes and sizes and wear different clothes. In this paper, we propose ShapeBoost, a new human shape recovery framework that achieves pixel-level alignment even for rare body shapes and high accuracy for people wearing different types of clothes. Unlike previous approaches that rely on the use of PCA-based shape coefficients, we adopt a new human shape parameterization that decomposes the human shape into bone lengths and the mean width of each part slice. This part-based parameterization technique achieves a balance between flexibility and validity using a semi-analytical shape reconstruction algorithm. Based on this new parameterization, a clothing-preserving data augmentation module is proposed to generate realistic images with diverse body shapes and accurate annotations. Experimental results show that our method outperforms other state-of-the-art methods in diverse body shape situations as well as in varied clothing situations. △ Less

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.12160 [pdf, other]

Probing the nature of rotation in the Pleiades, Alpha Persei, and Hyades clusters

Authors: C. J. Hao, Y. Xu, L. G. Hou, S. B. Bian, Z. H. Lin, Y. J. Li, Y. W. Dong, D. J. Liu

Abstract: Unraveling the internal kinematics of open clusters is crucial for understanding their formation and evolution. However, there is a dearth of research on this topic, primarily due to the lack of high-quality kinematic data. Using the exquisite-precision astrometric parameters and radial velocities provided by Gaia data release 3, we investigate the internal rotation in three of the most nearby and… ▽ More Unraveling the internal kinematics of open clusters is crucial for understanding their formation and evolution. However, there is a dearth of research on this topic, primarily due to the lack of high-quality kinematic data. Using the exquisite-precision astrometric parameters and radial velocities provided by Gaia data release 3, we investigate the internal rotation in three of the most nearby and best-studied open clusters, namely the Pleiades, Alpha Persei, and Hyades clusters. Statistical analyses of the residual motions of the member stars clearly indicate the presence of three-dimensional rotation in the three clusters. The mean rotation velocities of the Pleiades, Alpha Persei, and Hyades clusters within their tidal radii are estimated to be 0.24 (0.04), 0.43 (0.08), and 0.09 (0.03) km s-1, respectively. Similar to the Praesepe cluster that we have studied before, the rotation of the member stars within the tidal radii of these three open clusters can be well interpreted by Newton's theorem. No expansion or contraction is detected in the three clusters either. Furthermore, we find that the mean rotation velocity of open clusters may be positively correlated with the cluster mass, and the rotation is likely to diminish as open clusters age. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: 23 pages, 17 figures, Accepted for publication in ApJ

arXiv:2402.10365 [pdf, other]

doi 10.3390/electronics13040720

Deep Spectral Meshes: Multi-Frequency Facial Mesh Processing with Graph Neural Networks

Authors: Robert Kosk, Richard Southern, Lihua You, Shaojun Bian, Willem Kokke, Greg Maguire

Abstract: With the rising popularity of virtual worlds, the importance of data-driven parametric models of 3D meshes has grown rapidly. Numerous applications, such as computer vision, procedural generation, and mesh editing, vastly rely on these models. However, current approaches do not allow for independent editing of deformations at different frequency levels. They also do not benefit from representing d… ▽ More With the rising popularity of virtual worlds, the importance of data-driven parametric models of 3D meshes has grown rapidly. Numerous applications, such as computer vision, procedural generation, and mesh editing, vastly rely on these models. However, current approaches do not allow for independent editing of deformations at different frequency levels. They also do not benefit from representing deformations at different frequencies with dedicated representations, which would better expose their properties and improve the generated meshes' geometric and perceptual quality. In this work, spectral meshes are introduced as a method to decompose mesh deformations into low-frequency and high-frequency deformations. These features of low- and high-frequency deformations are used for representation learning with graph convolutional networks. A parametric model for 3D facial mesh synthesis is built upon the proposed framework, exposing user parameters that control disentangled high- and low-frequency deformations. Independent control of deformations at different frequencies and generation of plausible synthetic examples are mutually exclusive objectives. A Conditioning Factor is introduced to leverage these objectives. Our model takes further advantage of spectral partitioning by representing different frequency levels with disparate, more suitable representations. Low frequencies are represented with standardised Euclidean coordinates, and high frequencies with a normalised deformation representation (DR). This paper investigates applications of our proposed approach in mesh reconstruction, mesh interpolation, and multi-frequency editing. It is demonstrated that our method improves the overall quality of generated meshes on most datasets when considering both the $L_1$ norm and perceptual Dihedral Angle Mesh Error (DAME) metrics. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: 26 pages, 10 figures, journal article

MSC Class: 68T10; 68T45; 68U05 ACM Class: I.5.4; I.5.1; I.3.5; I.3.7; I.4.5; I.4.2; I.5.1; I.5.2

Journal ref: Electronics. 2024; 13(4):720

arXiv:2401.17525 [pdf, other]

Molecular Bubble and Outflow in S Mon Revealed by Multiband Datasets

Authors: Dejian Liu, Ye Xu, YingJie Li, Zehao Lin, Chaojie Hao, WenJin Yang, Jingjing Li, Xinrong Liu, Yiwei Dong, Shuaibo Bian, and Deyun Kong

Abstract: We identify a molecular bubble, and study the star formation and its feedback in the S Mon region, using multiple molecular lines, young stellar objects (YSOs), and infrared data. We revisit the distance to S Mon, ~722+/-9 pc, using Gaia Data Release 3 parallaxes of the associated Class II YSOs. The bubble may be mainly driven by a massive binary system (namely 15 Mon), the primary of which is an… ▽ More We identify a molecular bubble, and study the star formation and its feedback in the S Mon region, using multiple molecular lines, young stellar objects (YSOs), and infrared data. We revisit the distance to S Mon, ~722+/-9 pc, using Gaia Data Release 3 parallaxes of the associated Class II YSOs. The bubble may be mainly driven by a massive binary system (namely 15 Mon), the primary of which is an O7V-type star. An outflow is detected in the shell of the bubble, suggesting ongoing star formation activities in the vicinity of the bubble. The total wind energy of the massive binary star is three orders of magnitude higher than the sum of the observed turbulent energy in the molecular gas and the kinetic energy of the bubble, indicating that stellar winds help to maintain the turbulence in the S Mon region and drive the bubble. We conclude that the stellar winds of massive stars have an impact on their surrounding environment. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 34 pages,19 figures, 5 tables, Accepted for publication in ApJ

arXiv:2401.12230 [pdf, other]

Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native

Authors: Yao Lu, Song Bian, Lequn Chen, Yongjun He, Yulong Hui, Matthew Lentz, Beibin Li, Fei Liu, Jialin Li, Qi Liu, Rui Liu, Xiaoxuan Liu, Lin Ma, Kexin Rong, Jianguo Wang, Yingjun Wu, Yongji Wu, Huanchen Zhang, Minjia Zhang, Qizhen Zhang, Tianyi Zhou, Danyang Zhuo

Abstract: In this paper, we investigate the intersection of large generative AI models and cloud-native computing architectures. Recent large models such as ChatGPT, while revolutionary in their capabilities, face challenges like escalating costs and demand for high-end GPUs. Drawing analogies between large-model-as-a-service (LMaaS) and cloud database-as-a-service (DBaaS), we describe an AI-native computin… ▽ More In this paper, we investigate the intersection of large generative AI models and cloud-native computing architectures. Recent large models such as ChatGPT, while revolutionary in their capabilities, face challenges like escalating costs and demand for high-end GPUs. Drawing analogies between large-model-as-a-service (LMaaS) and cloud database-as-a-service (DBaaS), we describe an AI-native computing paradigm that harnesses the power of both cloud-native technologies (e.g., multi-tenancy and serverless computing) and advanced machine learning runtime (e.g., batched LoRA inference). These joint efforts aim to optimize costs-of-goods-sold (COGS) and improve resource accessibility. The journey of merging these two domains is just at the beginning and we hope to stimulate future research and development in this area. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2401.06959 [pdf, other]

Quantifying energy landscape of oscillatory systems: Explosion, pre-solution, and diffusion decomposition

Authors: Shirui Bian, Ruisong Zhou, Wei Lin, Chunhe Li

Abstract: The energy landscape theory finds its both extensive and intensive application in studying stochastic dynamics of physical and biological systems. Although the weighted summation of the Gaussian approximation (WSGA) approach has been proposed for quantifying the energy landscape in multistable systems by solving the diffusion equation approximately from moment equations, we are still lacking an ac… ▽ More The energy landscape theory finds its both extensive and intensive application in studying stochastic dynamics of physical and biological systems. Although the weighted summation of the Gaussian approximation (WSGA) approach has been proposed for quantifying the energy landscape in multistable systems by solving the diffusion equation approximately from moment equations, we are still lacking an accurate approach for quantifying the energy landscape of the periodic oscillatory systems. To address this challenge, we propose an approach, called the diffusion decomposition of the Gaussian approximation (DDGA). Using typical oscillatory systems as examples, we demonstrate the efficacy of the proposed DDGA in quantifying the energy landscape of oscillatory systems and corresponding stochastic dynamics, in comparison with existing approaches. By further applying the DDGA to a high-dimensional cell cycle network, we are able to uncover more intricate biological mechanisms in cell cycle, which cannot be discerned using previously developed approaches. △ Less

Submitted 12 January, 2024; originally announced January 2024.

Comments: 13 pages, 4 figures

arXiv:2401.06000 [pdf, other]

Body-Area Capacitive or Electric Field Sensing for Human Activity Recognition and Human-Computer Interaction: A Comprehensive Survey

Authors: Sizhen Bian, Mengxi Liu, Bo Zhou, Paul Lukowicz, Michele Magno

Abstract: Due to the fact that roughly sixty percent of the human body is essentially composed of water, the human body is inherently a conductive object, being able to, firstly, form an inherent electric field from the body to the surroundings and secondly, deform the distribution of an existing electric field near the body. Body-area capacitive sensing, also called body-area electric field sensing, is bec… ▽ More Due to the fact that roughly sixty percent of the human body is essentially composed of water, the human body is inherently a conductive object, being able to, firstly, form an inherent electric field from the body to the surroundings and secondly, deform the distribution of an existing electric field near the body. Body-area capacitive sensing, also called body-area electric field sensing, is becoming a promising alternative for wearable devices to accomplish certain tasks in human activity recognition and human-computer interaction. Over the last decade, researchers have explored plentiful novel sensing systems backed by the body-area electric field. On the other hand, despite the pervasive exploration of the body-area electric field, a comprehensive survey does not exist for an enlightening guideline. Moreover, the various hardware implementations, applied algorithms, and targeted applications result in a challenging task to achieve a systematic overview of the subject. This paper aims to fill in the gap by comprehensively summarizing the existing works on body-area capacitive sensing so that researchers can have a better view of the current exploration status. To this end, we first sorted the explorations into three domains according to the involved body forms: body-part electric field, whole-body electric field, and body-to-body electric field, and enumerated the state-of-art works in the domains with a detailed survey of the backed sensing tricks and targeted applications. We then summarized the three types of sensing frontends in circuit design, which is the most critical part in body-area capacitive sensing, and analyzed the data processing pipeline categorized into three kinds of approaches. Finally, we described the challenges and outlooks of body-area electric sensing. △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.11318 [pdf, other]

Domain Invariant Learning for Gaussian Processes and Bayesian Exploration

Authors: Xilong Zhao, Siyuan Bian, Yaoyun Zhang, Yuliang Zhang, Qinying Gu, Xinbing Wang, Chenghu Zhou, Nanyang Ye

Abstract: Out-of-distribution (OOD) generalization has long been a challenging problem that remains largely unsolved. Gaussian processes (GP), as popular probabilistic model classes, especially in the small data regime, presume strong OOD generalization abilities. Surprisingly, their OOD generalization abilities have been under-explored before compared with other lines of GP research. In this paper, we iden… ▽ More Out-of-distribution (OOD) generalization has long been a challenging problem that remains largely unsolved. Gaussian processes (GP), as popular probabilistic model classes, especially in the small data regime, presume strong OOD generalization abilities. Surprisingly, their OOD generalization abilities have been under-explored before compared with other lines of GP research. In this paper, we identify that GP is not free from the problem and propose a domain invariant learning algorithm for Gaussian processes (DIL-GP) with a min-max optimization on the likelihood. DIL-GP discovers the heterogeneity in the data and forces invariance across partitioned subsets of data. We further extend the DIL-GP to improve Bayesian optimization's adaptability on changing environments. Numerical experiments demonstrate the superiority of DIL-GP for predictions on several synthetic and real-world datasets. We further demonstrate the effectiveness of the DIL-GP Bayesian optimization method on a PID parameters tuning experiment for a quadrotor. The full version and source code are available at: https://github.com/Billzxl/DIL-GP. △ Less

Submitted 18 December, 2023; originally announced December 2023.

Comments: Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

arXiv:2312.09854 [pdf]

Q-Segment: Segmenting Images In-Sensor for Vessel-Based Medical Diagnosis

Authors: Pietro Bonazzi, Yawei Li, Sizhen Bian, Michele Magno

Abstract: This paper addresses the growing interest in deploying deep learning models directly in-sensor. We present "Q-Segment", a quantized real-time segmentation algorithm, and conduct a comprehensive evaluation on a low-power edge vision platform with an in-sensors processor, the Sony IMX500. One of the main goals of the model is to achieve end-to-end image segmentation for vessel-based medical diagnosi… ▽ More This paper addresses the growing interest in deploying deep learning models directly in-sensor. We present "Q-Segment", a quantized real-time segmentation algorithm, and conduct a comprehensive evaluation on a low-power edge vision platform with an in-sensors processor, the Sony IMX500. One of the main goals of the model is to achieve end-to-end image segmentation for vessel-based medical diagnosis. Deployed on the IMX500 platform, Q-Segment achieves ultra-low inference time in-sensor only 0.23 ms and power consumption of only 72mW. We compare the proposed network with state-of-the-art models, both float and quantized, demonstrating that the proposed solution outperforms existing networks on various platforms in computing efficiency, e.g., by a factor of 75x compared to ERFNet. The network employs an encoder-decoder structure with skip connections, and results in a binary accuracy of 97.25% and an Area Under the Receiver Operating Characteristic Curve (AUC) of 96.97% on the CHASE dataset. We also present a comparison of the IMX500 processing core with the Sony Spresense, a low-power multi-core ARM Cortex-M microcontroller, and a single-core ARM Cortex-M4 showing that it can achieve in-sensor processing with end-to-end low latency (17 ms) and power concumption (254mW). This research contributes valuable insights into edge-based image segmentation, laying the foundation for efficient algorithms tailored to low-power environments. △ Less

Submitted 4 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

arXiv:2312.00425 [pdf, other]

Retina : Low-Power Eye Tracking with Event Camera and Spiking Hardware

Authors: Pietro Bonazzi, Sizhen Bian, Giovanni Lippolis, Yawei Li, Sadique Sheik, Michele Magno

Abstract: This paper introduces a neuromorphic methodology for eye tracking, harnessing pure event data captured by a Dynamic Vision Sensor (DVS) camera. The framework integrates a directly trained Spiking Neuron Network (SNN) regression model and leverages a state-of-the-art low power edge neuromorphic processor - Speck, collectively aiming to advance the precision and efficiency of eye-tracking systems. F… ▽ More This paper introduces a neuromorphic methodology for eye tracking, harnessing pure event data captured by a Dynamic Vision Sensor (DVS) camera. The framework integrates a directly trained Spiking Neuron Network (SNN) regression model and leverages a state-of-the-art low power edge neuromorphic processor - Speck, collectively aiming to advance the precision and efficiency of eye-tracking systems. First, we introduce a representative event-based eye-tracking dataset, "Ini-30", which was collected with two glass-mounted DVS cameras from thirty volunteers. Then,a SNN model, based on Integrate And Fire (IAF) neurons, named "Retina", is described , featuring only 64k parameters (6.63x fewer than the latest) and achieving pupil tracking error of only 3.24 pixels in a 64x64 DVS input. The continous regression output is obtained by means of convolution using a non-spiking temporal 1D filter slided across the output spiking layer. Finally, we evaluate Retina on the neuromorphic processor, showing an end-to-end power between 2.89-4.8 mW and a latency of 5.57-8.01 mS dependent on the time window. We also benchmark our model against the latest event-based eye-tracking method, "3ET", which was built upon event frames. Results show that Retina achieves superior precision with 1.24px less pupil centroid error and reduced computational complexity with 35 times fewer MAC operations. We hope this work will open avenues for further investigation of close-loop neuromorphic solutions and true event-based training pursuing edge performance. △ Less

Submitted 17 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

arXiv:2311.01831 [pdf, other]

Universal Multi-modal Multi-domain Pre-trained Recommendation

Authors: Wenqi Sun, Ruobing Xie, Shuqing Bian, Wayne Xin Zhao, Jie Zhou

Abstract: There is a rapidly-growing research interest in modeling user preferences via pre-training multi-domain interactions for recommender systems. However, Existing pre-trained multi-domain recommendations mostly select the item texts to be bridges across domains, and simply explore the user behaviors in target domains. Hence, they ignore other informative multi-modal item contents (e.g., visual inform… ▽ More There is a rapidly-growing research interest in modeling user preferences via pre-training multi-domain interactions for recommender systems. However, Existing pre-trained multi-domain recommendations mostly select the item texts to be bridges across domains, and simply explore the user behaviors in target domains. Hence, they ignore other informative multi-modal item contents (e.g., visual information), and also lack of thorough consideration of user behaviors from all interactive domains. To address these issues, in this paper, we propose to pre-train universal multi-modal item content presentation for multi-domain recommendation, called UniM^2Rec, which could smoothly learn the multi-modal item content presentations and the multi-modal user preferences from all domains. With the pre-trained multi-domain recommendation model, UniM^2Rec could be efficiently and effectively transferred to new target domains in practice. Extensive experiments conducted on five real-world datasets in target domains demonstrate the superiority of the proposed method over existing competitive methods, especially for the real-world recommendation scenarios that usually struggle with seriously missing or noisy item contents. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01057 [pdf]

Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO

Authors: Julian Moosmann, Pietro Bonazzi, Yawei Li, Sizhen Bian, Philipp Mayer, Luca Benini, Michele Magno

Abstract: Smart glasses are rapidly gaining advanced functionality thanks to cutting-edge computing technologies, accelerated hardware architectures, and tiny AI algorithms. Integrating AI into smart glasses featuring a small form factor and limited battery capacity is still challenging when targeting full-day usage for a satisfactory user experience. This paper illustrates the design and implementation of… ▽ More Smart glasses are rapidly gaining advanced functionality thanks to cutting-edge computing technologies, accelerated hardware architectures, and tiny AI algorithms. Integrating AI into smart glasses featuring a small form factor and limited battery capacity is still challenging when targeting full-day usage for a satisfactory user experience. This paper illustrates the design and implementation of tiny machine-learning algorithms exploiting novel low-power processors to enable prolonged continuous operation in smart glasses. We explore the energy- and latency-efficient of smart glasses in the case of real-time object detection. To this goal, we designed a smart glasses prototype as a research platform featuring two microcontrollers, including a novel milliwatt-power RISC-V parallel processor with a hardware accelerator for visual AI, and a Bluetooth low-power module for communication. The smart glasses integrate power cycling mechanisms, including image and audio sensing interfaces. Furthermore, we developed a family of novel tiny deep-learning models based on YOLO with sub-million parameters customized for microcontroller-based inference dubbed TinyissimoYOLO v1.3, v5, and v8, aiming at benchmarking object detection with smart glasses for energy and latency. Evaluations on the prototype of the smart glasses demonstrate TinyissimoYOLO's 17ms inference latency and 1.59mJ energy consumption per inference while ensuring acceptable detection accuracy. Further evaluation reveals an end-to-end latency from image capturing to the algorithm's prediction of 56ms or equivalently 18 fps, with a total power consumption of 62.9mW, equivalent to a 9.3 hours of continuous run time on a 154mAh battery. These results outperform MCUNet (TinyNAS+TinyEngine), which runs a simpler task (image classification) at just 7.3 fps per second. △ Less

Submitted 3 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.08680 [pdf, other]

An Efficient Resilient MPC Scheme via Constraint Tightening against Cyberattacks: Application to Vehicle Cruise Control

Authors: Milad Farsi, Shuhao Bian, Nasser L. Azad, Xiaobing Shi, Andrew Walenstein

Abstract: We propose a novel framework for designing a resilient Model Predictive Control (MPC) targeting uncertain linear systems under cyber attack. Assuming a periodic attack scenario, we model the system under Denial of Service (DoS) attack, also with measurement noise, as an uncertain linear system with parametric and additive uncertainty. To detect anomalies, we employ a Kalman filter-based approach.… ▽ More We propose a novel framework for designing a resilient Model Predictive Control (MPC) targeting uncertain linear systems under cyber attack. Assuming a periodic attack scenario, we model the system under Denial of Service (DoS) attack, also with measurement noise, as an uncertain linear system with parametric and additive uncertainty. To detect anomalies, we employ a Kalman filter-based approach. Then, through our observations of the intensity of the launched attack, we determine a range of possible values for the system matrices, as well as establish bounds of the additive uncertainty for the equivalent uncertain system. Leveraging a recent constraint tightening robust MPC method, we present an optimization-based resilient algorithm. Accordingly, we compute the uncertainty bounds and corresponding constraints offline for various attack magnitudes. Then, this data can be used efficiently in the MPC computations online. We demonstrate the effectiveness of the developed framework on the Adaptive Cruise Control (ACC) problem. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: To Appear in ICINCO 2023

arXiv:2308.12489 [pdf, other]

Kinematics of the Local Spiral Structure Revealed by Young Stars in \emph{Gaia}~DR3

Authors: Dejian Liu, Ye Xu, Chaojie Hao, Shuaibo Bian, Zehao Lin, Yingjie Li, Jingjing Li

Abstract: Using young open clusters and O--B2-type stars in~\emph{Gaia}~DR3, we investigate the kinematics of the local spiral structure. In general, the young sources in the outer spiral arms may present larger peculiar motions than those in the inner spiral arms. The young open clusters appear to have smaller peculiar motions than the O--B2-type stars, and the sources in both the Perseus and Local Arms ma… ▽ More Using young open clusters and O--B2-type stars in~\emph{Gaia}~DR3, we investigate the kinematics of the local spiral structure. In general, the young sources in the outer spiral arms may present larger peculiar motions than those in the inner spiral arms. The young open clusters appear to have smaller peculiar motions than the O--B2-type stars, and the sources in both the Perseus and Local Arms may show an inward motion toward the Galactic center and rotate slower than Galactic rotation. Meanwhile, the sources in the Carina Arm may move in the opposite direction from the Sun to the Galactic center and rotate marginally faster than Galactic rotation. In addition, using young open clusters and O--B2-type stars, we have improved the distance estimations of kinematic methods for several regions near the Sun. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Comments: 13 pages, 10 figures, Accepted for publication in The Astrophysical Journal Supplement Series

arXiv:2308.10484 [pdf, other]

doi 10.3847/1538-4365/acde81

Distributions and Physical Properties of Molecular Clouds in the Third Galactic Quadrant: $l$ = [219.75, 229.75]$^\circ$ and $b$ = [-5.25, 5.25]$^\circ$

Authors: Yiwei Dong, Yan Sun, Ye Xu, Zehao Lin, Shuaibo Bian, Chaojie Hao, Dejian Liu, Yingjie Li, Ji Yang, Yang Su, Xin Zhou, Shaobo Zhang, Qing-Zeng Yan, Zhiwei Chen

Abstract: We present the results of an unbiased $^{12}$CO/$^{13}$CO/C$^{18}$O ($J$ = 1-0) survey in a portion of the third Galactic quadrant (TGQ): $l$ = [219.75, 229.75]$^\circ$ and $b$ = [-5.25, 5.25]$^\circ$. The high-resolution and high-sensitivity data sets help to unravel the distributions and physical properties of the molecular clouds (MCs) in the mapped area. In the LSR velocity range from -1 to 85… ▽ More We present the results of an unbiased $^{12}$CO/$^{13}$CO/C$^{18}$O ($J$ = 1-0) survey in a portion of the third Galactic quadrant (TGQ): $l$ = [219.75, 229.75]$^\circ$ and $b$ = [-5.25, 5.25]$^\circ$. The high-resolution and high-sensitivity data sets help to unravel the distributions and physical properties of the molecular clouds (MCs) in the mapped area. In the LSR velocity range from -1 to 85 km/s, the molecular material successfully traces the Local, Perseus, and Outer arms. In the TGQ, the Outer arm appears to be more prominent than that in the second Galactic quadrant (SGQ), but the Perseus arm is not as conspicuous as that in the SGQ. A total of 1,502 $^{12}$CO, 570 $^{13}$CO, and 53 C$^{18}$O molecular structures are identified, spanning over $\sim2$ and $\sim6$ orders of magnitude in size and mass, respectively. Tight mass-radius correlations and virial parameter-mass anticorrelations are observable. Yet, it seems that no clear correlations between velocity dispersion and effective radius can be found over the full dynamic range. The vertical distribution of the MCs renders evident pictures of the Galactic warp and flare. △ Less

Submitted 21 August, 2023; originally announced August 2023.

Comments: 22 pages, 13 figures, 7 tables (with machine-readable versions), published in ApJS

Journal ref: ApJS 268 1 (2023)

arXiv:2308.03514 [pdf, other]

Worker Activity Recognition in Manufacturing Line Using Near-body Electric Field

Authors: Sungho Suh, Vitor Fortes Rey, Sizhen Bian, Yu-Chi Huang, Jože M. Rožanec, Hooman Tavakoli Ghinani, Bo Zhou, Paul Lukowicz

Abstract: Manufacturing industries strive to improve production efficiency and product quality by deploying advanced sensing and control systems. Wearable sensors are emerging as a promising solution for achieving this goal, as they can provide continuous and unobtrusive monitoring of workers' activities in the manufacturing line. This paper presents a novel wearable sensing prototype that combines IMU and… ▽ More Manufacturing industries strive to improve production efficiency and product quality by deploying advanced sensing and control systems. Wearable sensors are emerging as a promising solution for achieving this goal, as they can provide continuous and unobtrusive monitoring of workers' activities in the manufacturing line. This paper presents a novel wearable sensing prototype that combines IMU and body capacitance sensing modules to recognize worker activities in the manufacturing line. To handle these multimodal sensor data, we propose and compare early, and late sensor data fusion approaches for multi-channel time-series convolutional neural networks and deep convolutional LSTM. We evaluate the proposed hardware and neural network model by collecting and annotating sensor data using the proposed sensing prototype and Apple Watches in the testbed of the manufacturing line. Experimental results demonstrate that our proposed methods achieve superior performance compared to the baseline methods, indicating the potential of the proposed approach for real-world applications in manufacturing industries. Furthermore, the proposed sensing prototype with a body capacitive sensor and feature fusion method improves by 6.35%, yielding a 9.38% higher macro F1 score than the proposed sensing prototype without a body capacitive sensor and Apple Watch data, respectively. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2308.03198 [pdf, other]

Re-imagining the Future of Forest Management -- An Age-Dependent Approach towards Harvesting

Authors: Shuyang Bian, Yuanyuan Xie, Flora Zhang

Abstract: Facing the drastic climate changes, current strategies for enhancing carbon dioxide stocks need to be thoroughly honed. To address the problem, we first built a carbon sequestration growth model driven by growth rate dependency (GRDM). We abstracted the carbon cycling system into the process of photosynthesis, the humidity fluctuation, and the original storage of carbon in the trees. In the photos… ▽ More Facing the drastic climate changes, current strategies for enhancing carbon dioxide stocks need to be thoroughly honed. To address the problem, we first built a carbon sequestration growth model driven by growth rate dependency (GRDM). We abstracted the carbon cycling system into the process of photosynthesis, the humidity fluctuation, and the original storage of carbon in the trees. In the photosynthesis model, we considered various factors, including transition rate of absorption and organic matter production. We also designed an Economic Return Evaluation Model (EREM) to estimate the optimal distribution of trees in the forest based on the utility function. Maximizing the utility brought by the amount of carbon storage, we derived the equation for profit optimization with the constraints of total economic expenses allowed. To assess its performance, we took an object-oriented approach, simulated an ideal forest by placing instances of trees and plotted a time-dependent forest composition graph. After proper normalization of climate and economic data, we also make predictions for 169 worldwide forest-covered countries. Our model further suggests high sensitivity and robustness with a similar trend of overall utility when environmental aridity or proportion of harvested woods are varied. Finally, we apply the model to Georgia temperate deciduous forest, and we evaluate the carbon storage ability to adjust the Red Spruce based on available biological literature research. We recognize that while the model is preliminary in its failure to identify a diverse array of variables, it has encapsulated key features of idealized forests. △ Less

Submitted 6 August, 2023; originally announced August 2023.

MSC Class: 92-10

arXiv:2308.00787 [pdf, other]

doi 10.1145/3594738.3611369

Evaluating Spiking Neural Network On Neuromorphic Platform For Human Activity Recognition

Authors: Sizhen Bian, Michele Magno

Abstract: Energy efficiency and low latency are crucial requirements for designing wearable AI-empowered human activity recognition systems, due to the hard constraints of battery operations and closed-loop feedback. While neural network models have been extensively compressed to match the stringent edge requirements, spiking neural networks and event-based sensing are recently emerging as promising solutio… ▽ More Energy efficiency and low latency are crucial requirements for designing wearable AI-empowered human activity recognition systems, due to the hard constraints of battery operations and closed-loop feedback. While neural network models have been extensively compressed to match the stringent edge requirements, spiking neural networks and event-based sensing are recently emerging as promising solutions to further improve performance due to their inherent energy efficiency and capacity to process spatiotemporal data in very low latency. This work aims to evaluate the effectiveness of spiking neural networks on neuromorphic processors in human activity recognition for wearable applications. The case of workout recognition with wrist-worn wearable motion sensors is used as a study. A multi-threshold delta modulation approach is utilized for encoding the input sensor data into spike trains to move the pipeline into the event-based approach. The spikes trains are then fed to a spiking neural network with direct-event training, and the trained model is deployed on the research neuromorphic platform from Intel, Loihi, to evaluate energy and latency efficiency. Test results show that the spike-based workouts recognition system can achieve a comparable accuracy (87.5\%) comparable to the popular milliwatt RISC-V bases multi-core processor GAP8 with a traditional neural network ( 88.1\%) while achieving two times better energy-delay product (0.66 \si{\micro\joule\second} vs. 1.32 \si{\micro\joule\second}). △ Less

Submitted 1 August, 2023; originally announced August 2023.

arXiv:2307.11449 [pdf]

AIGC Empowering Telecom Sector White Paper_chinese

Authors: Ye Ouyang, Yaqin Zhang, Xiaozhou Ye, Yunxin Liu, Yong Song, Yang Liu, Sen Bian, Zhiyong Liu

Abstract: In the global craze of GPT, people have deeply realized that AI, as a transformative technology and key force in economic and social development, will bring great leaps and breakthroughs to the global industry and profoundly influence the future world competition pattern. As the builder and operator of information and communication infrastructure, the telecom sector provides infrastructure support… ▽ More In the global craze of GPT, people have deeply realized that AI, as a transformative technology and key force in economic and social development, will bring great leaps and breakthroughs to the global industry and profoundly influence the future world competition pattern. As the builder and operator of information and communication infrastructure, the telecom sector provides infrastructure support for the development of AI, and even takes the lead in the implementation of AI applications. How to enable the application of AIGC (GPT) and implement AIGC in the telecom sector are questions that telecom practitioners must ponder and answer. Through the study of GPT, a typical representative of AIGC, the authors have analyzed how GPT empowers the telecom sector in the form of scenarios, discussed the gap between the current GPT general model and telecom services, proposed for the first time a Telco Augmented Cognition capability system, provided answers to how to construct a telecom service GPT in the telecom sector, and carried out various practices. Our counterparts in the industry are expected to focus on collaborative innovation around telecom and AI, build an open and shared innovation ecosystem, promote the deep integration of AI and telecom sector, and accelerate the construction of next-generation information infrastructure, in an effort to facilitate the digital transformation of the economy and society. △ Less

Submitted 23 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

arXiv:2307.09045 [pdf]

6G Network Operation Support System

Authors: Ye Ouyang, Yaqin Zhang, Xiaozhou Ye, Yunxin Liu, Xidong Wang, Jie Sun, Yang Liu, Shoufeng Wang, Sen Bian, Yun Li

Abstract: 6G is the next-generation intelligent and integrated digital information infrastructure, characterized by ubiquitous interconnection, native intelligence, multi-dimensional perception, global coverage, green and low-carbon, native network security, etc. 6G will realize the transition from serving people and people-things communication to supporting the efficient connection of intelligent agents, a… ▽ More 6G is the next-generation intelligent and integrated digital information infrastructure, characterized by ubiquitous interconnection, native intelligence, multi-dimensional perception, global coverage, green and low-carbon, native network security, etc. 6G will realize the transition from serving people and people-things communication to supporting the efficient connection of intelligent agents, and comprehensively leading the digital, intelligent and green transformation of the economy and the society. As the core support system for mobile communication network, 6G OSS needs to achieve high-level network automation, intelligence and digital twinning capabilities to achieve end-to-end autonomous network operation and maintenance, support the operation of typical 6G business scenarios and play a greater social responsibility in the fields of environment, society, and governance (ESG).This paper provides a detailed introduction to the overall vision, potential key technologies, and functional architecture of 6G OSS . It also presents an evolutionary roadmap and technological prospects for the OSS from 5G to 6G. △ Less

Submitted 25 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

Comments: 103 pages, 20 figures, 52 references (chinese version)

arXiv:2307.07813 [pdf]

TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for Gaze Estimation

Authors: Pietro Bonazzi, Thomas Ruegg, Sizhen Bian, Yawei Li, Michele Magno

Abstract: Intelligent edge vision tasks encounter the critical challenge of ensuring power and latency efficiency due to the typically heavy computational load they impose on edge platforms.This work leverages one of the first "AI in sensor" vision platforms, IMX500 by Sony, to achieve ultra-fast and ultra-low-power end-to-end edge vision applications. We evaluate the IMX500 and compare it to other edge pla… ▽ More Intelligent edge vision tasks encounter the critical challenge of ensuring power and latency efficiency due to the typically heavy computational load they impose on edge platforms.This work leverages one of the first "AI in sensor" vision platforms, IMX500 by Sony, to achieve ultra-fast and ultra-low-power end-to-end edge vision applications. We evaluate the IMX500 and compare it to other edge platforms, such as the Google Coral Dev Micro and Sony Spresense, by exploring gaze estimation as a case study. We propose TinyTracker, a highly efficient, fully quantized model for 2D gaze estimation designed to maximize the performance of the edge vision systems considered in this study. TinyTracker achieves a 41x size reduction (600Kb) compared to iTracker [1] without significant loss in gaze estimation accuracy (maximum of 0.16 cm when fully quantized). TinyTracker's deployment on the Sony IMX500 vision sensor results in end-to-end latency of around 19ms. The camera takes around 17.9ms to read, process and transmit the pixels to the accelerator. The inference time of the network is 0.86ms with an additional 0.24 ms for retrieving the results from the sensor. The overall energy consumption of the end-to-end system is 4.9 mJ, including 0.06 mJ for inference. The end-to-end study shows that IMX500 is 1.7x faster than CoralMicro (19ms vs 34.4ms) and 7x more power efficient (4.9mJ VS 34.2mJ) △ Less

Submitted 20 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

Journal ref: IEEE Sensors, Conference, Lecture, Vienna, 2023

arXiv:2306.16870 [pdf, ps, other]

The aggregation-diffusion equation with the intermediate exponent

Authors: Shen Bian, Jiale Bu

Abstract: We consider a Keller-Segel model with non-linear porous medium type diffusion and nonlocal attractive power law interaction, focusing on potentials that are less singular than Newtonian interaction. Here, the nonlinear diffusion is chosen to be $\frac{2d}{d+2s}<m<2-\frac{2s}{d}$ in which case the steady states are compactly supported. We analyse under which conditions on the initial data the regim… ▽ More We consider a Keller-Segel model with non-linear porous medium type diffusion and nonlocal attractive power law interaction, focusing on potentials that are less singular than Newtonian interaction. Here, the nonlinear diffusion is chosen to be $\frac{2d}{d+2s}<m<2-\frac{2s}{d}$ in which case the steady states are compactly supported. We analyse under which conditions on the initial data the regime that attractive forces are stronger than diffusion occurs and classify the global existence and finite time blow-up of solutions. It is shown that there is a threshold value which is characterized by the optimal constant of a variant of Hardy-Littlewood-Sobolev inequality such that the solution will exist globally if the initial data is below the threshold, while the solution blows up in finite time when the initial data is above the threshold. △ Less

Submitted 29 June, 2023; originally announced June 2023.

arXiv:2305.18371 [pdf, other]

ColibriUAV: An Ultra-Fast, Energy-Efficient Neuromorphic Edge Processing UAV-Platform with Event-Based and Frame-Based Cameras

Authors: Sizhen Bian, Lukas Schulthess, Georg Rutishauser, Alfio Di Mauro, Luca Benini, Michele Magno

Abstract: The interest in dynamic vision sensor (DVS)-powered unmanned aerial vehicles (UAV) is raising, especially due to the microsecond-level reaction time of the bio-inspired event sensor, which increases robustness and reduces latency of the perception tasks compared to a RGB camera. This work presents ColibriUAV, a UAV platform with both frame-based and event-based cameras interfaces for efficient per… ▽ More The interest in dynamic vision sensor (DVS)-powered unmanned aerial vehicles (UAV) is raising, especially due to the microsecond-level reaction time of the bio-inspired event sensor, which increases robustness and reduces latency of the perception tasks compared to a RGB camera. This work presents ColibriUAV, a UAV platform with both frame-based and event-based cameras interfaces for efficient perception and near-sensor processing. The proposed platform is designed around Kraken, a novel low-power RISC-V System on Chip with two hardware accelerators targeting spiking neural networks and deep ternary neural networks.Kraken is capable of efficiently processing both event data from a DVS camera and frame data from an RGB camera. A key feature of Kraken is its integrated, dedicated interface with a DVS camera. This paper benchmarks the end-to-end latency and power efficiency of the neuromorphic and event-based UAV subsystem, demonstrating state-of-the-art event data with a throughput of 7200 frames of events per second and a power consumption of 10.7 \si{\milli\watt}, which is over 6.6 times faster and a hundred times less power-consuming than the widely-used data reading approach through the USB interface. The overall sensing and processing power consumption is below 50 mW, achieving latency in the milliseconds range, making the platform suitable for low-latency autonomous nano-drones as well. △ Less

Submitted 27 May, 2023; originally announced May 2023.

arXiv:2305.17594 [pdf, other]

doi 10.1109/MetroInd4.0IoT57462.2023.10180177

Fully Automatic Gym Exercises Recording: An IoT Solution

Authors: Sizhen Bian, Alexander Rupp, Michele Magno

Abstract: In recent years, working out in the gym has gotten increasingly more data-focused and many gym enthusiasts are recording their exercises to have a better overview of their historical gym activities and to make a better exercise plan for the future. As a side effect, this recording process has led to a lot of time spent painstakingly operating these apps by plugging in used types of equipment and r… ▽ More In recent years, working out in the gym has gotten increasingly more data-focused and many gym enthusiasts are recording their exercises to have a better overview of their historical gym activities and to make a better exercise plan for the future. As a side effect, this recording process has led to a lot of time spent painstakingly operating these apps by plugging in used types of equipment and repetitions. This project aims to automate this process using an Internet of Things (IoT) approach. Specifically, beacons with embedded ultra-low-power inertial measurement units (IMUs) are attached to the types of equipment to recognize the usage and transmit the information to gym-goers and managers. We have created a small ecosystem composed of beacons, a gateway, smartwatches, android/iPhone applications, a firebase cloud server, and a dashboard, all communicating over a mixture of Bluetooth and Wifi to distribute collected data from machines to users and gym managers in a compact and meaningful way. The system we have implemented is a working prototype of a bigger end goal and is supposed to initialize progress toward a smarter, more efficient, and still privacy-respect gym environment in the future. A small-scale real-life test shows 94.6\% accuracy in user gym session recording, which can reach up to 100\% easily with a more suitable assembling of the beacons. This promising result shows the potential of a fully automatic exercise recording system, which enables comprehensive monitoring and analysis of the exercise sessions and frees the user from manual recording. The estimated battery life of the beacon is 400 days with a 210 mAh coin battery. We also discussed the shortcoming of the current demonstration system and the future work for a reliable and ready-to-deploy automatic gym workout recording system. △ Less

Submitted 27 May, 2023; originally announced May 2023.

arXiv:2305.17482 [pdf, other]

Federated Empirical Risk Minimization via Second-Order Method

Authors: Song Bian, Zhao Song, Junze Yin

Abstract: Many convex optimization problems with important applications in machine learning are formulated as empirical risk minimization (ERM). There are several examples: linear and logistic regression, LASSO, kernel regression, quantile regression, $p$-norm regression, support vector machines (SVM), and mean-field variational inference. To improve data privacy, federated learning is proposed in machine l… ▽ More Many convex optimization problems with important applications in machine learning are formulated as empirical risk minimization (ERM). There are several examples: linear and logistic regression, LASSO, kernel regression, quantile regression, $p$-norm regression, support vector machines (SVM), and mean-field variational inference. To improve data privacy, federated learning is proposed in machine learning as a framework for training deep learning models on the network edge without sharing data between participating nodes. In this work, we present an interior point method (IPM) to solve a general ERM problem under the federated learning setting. We show that the communication complexity of each iteration of our IPM is $\tilde{O}(d^{3/2})$, where $d$ is the dimension (i.e., number of features) of the dataset. △ Less

Submitted 27 May, 2023; originally announced May 2023.

arXiv:2305.08590 [pdf, other]

NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation

Authors: Jiefeng Li, Siyuan Bian, Qi Liu, Jiasheng Tang, Fan Wang, Cewu Lu

Abstract: With the progress of 3D human pose and shape estimation, state-of-the-art methods can either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases. However, they cannot obtain robustness and mesh-image alignment at the same time. In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robu… ▽ More With the progress of 3D human pose and shape estimation, state-of-the-art methods can either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases. However, they cannot obtain robustness and mesh-image alignment at the same time. In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robustness to occlusions and obtain pixel-aligned accuracy. NIKI can learn from both the forward and inverse processes with invertible networks. In the inverse process, the model separates the error from the plausible 3D pose manifold for a robust 3D human pose estimation. In the forward process, we enforce the zero-error boundary conditions to improve the sensitivity to reliable joint positions for better mesh-image alignment. Furthermore, NIKI emulates the analytical inverse kinematics algorithms with the twist-and-swing decomposition for better interpretability. Experiments on standard and occlusion-specific benchmarks demonstrate the effectiveness of NIKI, where we exhibit robust and well-aligned results simultaneously. Code is available at https://github.com/Jeff-sjtu/NIKI △ Less

Submitted 15 May, 2023; originally announced May 2023.

Comments: CVPR 2023

arXiv:2305.03899 [pdf, other]

NL-CS Net: Deep Learning with Non-Local Prior for Image Compressive Sensing

Authors: Shuai Bian, Shouliang Qi, Chen Li, Yudong Yao, Yueyang Teng

Abstract: Deep learning has been applied to compressive sensing (CS) of images successfully in recent years. However, existing network-based methods are often trained as the black box, in which the lack of prior knowledge is often the bottleneck for further performance improvement. To overcome this drawback, this paper proposes a novel CS method using non-local prior which combines the interpretability of t… ▽ More Deep learning has been applied to compressive sensing (CS) of images successfully in recent years. However, existing network-based methods are often trained as the black box, in which the lack of prior knowledge is often the bottleneck for further performance improvement. To overcome this drawback, this paper proposes a novel CS method using non-local prior which combines the interpretability of the traditional optimization methods with the speed of network-based methods, called NL-CS Net. We unroll each phase from iteration of the augmented Lagrangian method solving non-local and sparse regularized optimization problem by a network. NL-CS Net is composed of the up-sampling module and the recovery module. In the up-sampling module, we use learnable up-sampling matrix instead of a predefined one. In the recovery module, patch-wise non-local network is employed to capture long-range feature correspondences. Important parameters involved (e.g. sampling matrix, nonlinear transforms, shrinkage thresholds, step size, $etc.$) are learned end-to-end, rather than hand-crafted. Furthermore, to facilitate practical implementation, orthogonal and binary constraints on the sampling matrix are simultaneously adopted. Extensive experiments on natural images and magnetic resonance imaging (MRI) demonstrate that the proposed method outperforms the state-of-the-art methods while maintaining great interpretability and speed. △ Less

Submitted 5 May, 2023; originally announced May 2023.

Comments: 21pages,6figures

ACM Class: I.4.7

arXiv:2304.10690 [pdf, other]

doi 10.3847/1538-4357/acc45c

What Does the Milky Way Look Like?

Authors: Y. Xu, C. J. Hao, D. J. Liu, Z. H. Lin, S. B. Bian, L. G. Hou, J. J. Li, Y. J. Li

Abstract: In spite of much work, the overall spiral structure morphology of the Milky Way remains somewhat uncertain. In the last two decades, accurate distance measurements have provided us with an opportunity to solve this issue. Using the precise locations of very young objects, for the first time, we propose that our galaxy has a multiple-arm morphology that consists of two-arm symmetry (the Perseus and… ▽ More In spite of much work, the overall spiral structure morphology of the Milky Way remains somewhat uncertain. In the last two decades, accurate distance measurements have provided us with an opportunity to solve this issue. Using the precise locations of very young objects, for the first time, we propose that our galaxy has a multiple-arm morphology that consists of two-arm symmetry (the Perseus and Norma Arms) in the inner parts and that extends to the outer parts, where there are several long, irregular arms (the Centaurus, Sagittarius, Carina, Outer, and Local Arms). △ Less

Submitted 20 April, 2023; originally announced April 2023.

Comments: 11 pages, 9 figures, ApJ, 947, 54

arXiv:2304.05690 [pdf, other]

HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery

Authors: Jiefeng Li, Siyuan Bian, Chao Xu, Zhicun Chen, Lixin Yang, Cewu Lu

Abstract: Recovering whole-body mesh by inferring the abstract pose and shape parameters from visual content can obtain 3D bodies with realistic structures. However, the inferring process is highly non-linear and suffers from image-mesh misalignment, resulting in inaccurate reconstruction. In contrast, 3D keypoint estimation methods utilize the volumetric representation to achieve pixel-level accuracy but m… ▽ More Recovering whole-body mesh by inferring the abstract pose and shape parameters from visual content can obtain 3D bodies with realistic structures. However, the inferring process is highly non-linear and suffers from image-mesh misalignment, resulting in inaccurate reconstruction. In contrast, 3D keypoint estimation methods utilize the volumetric representation to achieve pixel-level accuracy but may predict unrealistic body structures. To address these issues, this paper presents a novel hybrid inverse kinematics solution, HybrIK, that integrates the merits of 3D keypoint estimation and body mesh recovery in a unified framework. HybrIK directly transforms accurate 3D joints to body-part rotations via twist-and-swing decomposition. The swing rotations are analytically solved with 3D joints, while the twist rotations are derived from visual cues through neural networks. To capture comprehensive whole-body details, we further develop a holistic framework, HybrIK-X, which enhances HybrIK with articulated hands and an expressive face. HybrIK-X is fast and accurate by solving the whole-body pose with a one-stage model. Experiments demonstrate that HybrIK and HybrIK-X preserve both the accuracy of 3D joints and the realistic structure of the parametric human model, leading to pixel-aligned whole-body mesh recovery. The proposed method significantly surpasses the state-of-the-art methods on various benchmarks for body-only, hand-only, and whole-body scenarios. Code and results can be found at https://jeffli.site/HybrIK-X/ △ Less

Submitted 12 April, 2023; originally announced April 2023.

Comments: An eXpressive extension of HybrIK [arXiv:2011.14672], supports SMPL-X. arXiv admin note: substantial text overlap with arXiv:2011.14672

arXiv:2303.04811 [pdf, other]

Naive Bayes Classifiers over Missing Data: Decision and Poisoning

Authors: Song Bian, Xiating Ouyang, Zhiwei Fan, Paraschos Koutris

Abstract: We study the certifiable robustness of ML classifiers on dirty datasets that could contain missing values. A test point is certifiably robust for an ML classifier if the classifier returns the same prediction for that test point, regardless of which cleaned version (among exponentially many) of the dirty dataset the classifier is trained on. In this paper, we show theoretically that for Naive Baye… ▽ More We study the certifiable robustness of ML classifiers on dirty datasets that could contain missing values. A test point is certifiably robust for an ML classifier if the classifier returns the same prediction for that test point, regardless of which cleaned version (among exponentially many) of the dirty dataset the classifier is trained on. In this paper, we show theoretically that for Naive Bayes Classifiers (NBC) over dirty datasets with missing values: (i) there exists an efficient polynomial time algorithm to decide whether multiple input test points are all certifiably robust over a dirty dataset; and (ii) the data poisoning attack, which aims to make all input test points certifiably non-robust by inserting missing cells to the clean dataset, is in polynomial time for single test points but NP-complete for multiple test points. Extensive experiments demonstrate that our algorithms are efficient and outperform existing baselines. △ Less

Submitted 28 May, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: 22 pages, 10 figures

Journal ref: ICML 2024

arXiv:2302.09490 [pdf, ps, other]

The aggregation-diffusion equation with energy critical exponent

Authors: Shen Bian

Abstract: We consider a Keller-Segel model with non-linear porous medium type diffusion and nonlocal attractive power law interaction, focusing on potentials that are less singular than Newtonian interaction. Here, the nonlinear diffusion is chosen to be $m=\frac{2d}{d+2s}$ in such a way that the associated free energy is conformal invariant and there is a family of stationary solutions… ▽ More We consider a Keller-Segel model with non-linear porous medium type diffusion and nonlocal attractive power law interaction, focusing on potentials that are less singular than Newtonian interaction. Here, the nonlinear diffusion is chosen to be $m=\frac{2d}{d+2s}$ in such a way that the associated free energy is conformal invariant and there is a family of stationary solutions $U(x)=c\left(\fracλ{λ^2+|x-x_0|^2}\right)^{\frac{d+2s}{2}}$ for any constant $c$ and some $λ>0, x_0 \in \R^d.$ We analyze under which conditions on the initial data the regime that attractive forces are stronger than diffusion occurs and classify the global existence and finite time blow-up of dynamical solutions by virtue of stationary solutions. Precisely, solutions exist globally in time if the $L^m$ norm of the initial data $\|u_0\|_{L^m(\R^d)}$ is less than the $L^m$ norm of stationary solutions $\|U(x)\|_{L^m(\R^d)}$. Whereas there are blowing-up solutions for $\|u_0\|_{L^m(\R^d)}>\|U(x)\|_{L^m(\R^d)}$. △ Less

Submitted 19 February, 2023; originally announced February 2023.

arXiv:2302.07957 [pdf, other]

ColibriES: A Milliwatts RISC-V Based Embedded System Leveraging Neuromorphic and Neural Networks Hardware Accelerators for Low-Latency Closed-loop Control Applications

Authors: Georg Rutishauser, Robin Hunziker, Alfio Di Mauro, Sizhen Bian, Luca Benini, Michele Magno

Abstract: End-to-end event-based computation has the potential to push the envelope in latency and energy efficiency for edge AI applications. Unfortunately, event-based sensors (e.g., DVS cameras) and neuromorphic spike-based processors (e.g., Loihi) have been designed in a decoupled fashion, thereby missing major streamlining opportunities. This paper presents ColibriES, the first-ever neuromorphic hardwa… ▽ More End-to-end event-based computation has the potential to push the envelope in latency and energy efficiency for edge AI applications. Unfortunately, event-based sensors (e.g., DVS cameras) and neuromorphic spike-based processors (e.g., Loihi) have been designed in a decoupled fashion, thereby missing major streamlining opportunities. This paper presents ColibriES, the first-ever neuromorphic hardware embedded system platform with dedicated event-sensor interfaces and full processing pipelines. ColibriES includes event and frame interfaces and data processing, aiming at efficient and long-life embedded systems in edge scenarios. ColibriES is based on the Kraken system-on-chip and contains a heterogeneous parallel ultra-low power (PULP) processor, frame-based and event-based camera interfaces, and two hardware accelerators for the computation of both event-based spiking neural networks and frame-based ternary convolutional neural networks. This paper explores and accurately evaluates the performance of event data processing on the example of gesture recognition on ColibriES, as the first step of full-system evaluation. In our experiments, we demonstrate a chip energy consumption of 7.7 \si{\milli\joule} and latency of 164.5 \si{\milli\second} of each inference with the DVS Gesture event data set as an example for closed-loop data processing, showcasing the potential of ColibriES for battery-powered applications such as wearable devices and UAVs that require low-latency closed-loop control. △ Less

Submitted 15 February, 2023; originally announced February 2023.

arXiv:2301.05748 [pdf, other]

doi 10.1109/IGSC55832.2022.9969370

Exploring Automatic Gym Workouts Recognition Locally On Wearable Resource-Constrained Devices

Authors: Sizhen Bian, Xiaying Wang, Tommaso Polonelli, Michele Magno

Abstract: Automatic gym activity recognition on energy- and resource-constrained wearable devices removes the human-interaction requirement during intense gym sessions - like soft-touch tapping and swiping. This work presents a tiny and highly accurate residual convolutional neural network that runs in milliwatt microcontrollers for automatic workouts classification. We evaluated the inference performance o… ▽ More Automatic gym activity recognition on energy- and resource-constrained wearable devices removes the human-interaction requirement during intense gym sessions - like soft-touch tapping and swiping. This work presents a tiny and highly accurate residual convolutional neural network that runs in milliwatt microcontrollers for automatic workouts classification. We evaluated the inference performance of the deep model with quantization on three resource-constrained devices: two microcontrollers with ARM-Cortex M4 and M7 core from ST Microelectronics, and a GAP8 system on chip, which is an open-sourced, multi-core RISC-V computing platform from GreenWaves Technologies. Experimental results show an accuracy of up to 90.4% for eleven workouts recognition with full precision inference. The paper also presents the trade-off performance of the resource-constrained system. While keeping the recognition accuracy (88.1%) with minimal loss, each inference takes only 3.2 ms on GAP8, benefiting from the 8 RISC-V cluster cores. We measured that it features an execution time that is 18.9x and 6.5x faster than the Cortex-M4 and Cortex-M7 cores, showing the feasibility of real-time on-board workouts recognition based on the described data set with 20 Hz sampling rate. The energy consumed for each inference on GAP8 is 0.41 mJ compared to 5.17 mJ on Cortex-M4 and 8.07 mJ on Cortex-M7 with the maximum clock. It can lead to longer battery life when the system is battery-operated. We also introduced an open data set composed of fifty sessions of eleven gym workouts collected from ten subjects that is publicly available. △ Less

Submitted 13 January, 2023; originally announced January 2023.

arXiv:2301.02654 [pdf, other]

Does compressing activations help model parallel training?

Authors: Song Bian, Dacheng Li, Hongyi Wang, Eric P. Xing, Shivaram Venkataraman

Abstract: Large-scale Transformer models are known for their exceptional performance in a range of tasks, but training them can be difficult due to the requirement for communication-intensive model parallelism. One way to improve training speed is to compress the message size in communication. Previous approaches have primarily focused on compressing gradients in a data parallelism setting, but compression… ▽ More Large-scale Transformer models are known for their exceptional performance in a range of tasks, but training them can be difficult due to the requirement for communication-intensive model parallelism. One way to improve training speed is to compress the message size in communication. Previous approaches have primarily focused on compressing gradients in a data parallelism setting, but compression in a model-parallel setting is an understudied area. We have discovered that model parallelism has fundamentally different characteristics than data parallelism. In this work, we present the first empirical study on the effectiveness of compression methods for model parallelism. We implement and evaluate three common classes of compression algorithms - pruning-based, learning-based, and quantization-based - using a popular Transformer training framework. We evaluate these methods across more than 160 settings and 8 popular datasets, taking into account different hyperparameters, hardware, and both fine-tuning and pre-training stages. We also provide analysis when the model is scaled up. Finally, we provide insights for future development of model parallelism compression algorithms. △ Less

Submitted 6 January, 2023; originally announced January 2023.

Comments: 16 pages, 5 figures

arXiv:2211.05541 [pdf, other]

doi 10.1145/3544794.3558462

Non-contact, real-time eye blink detection with capacitive sensing

Authors: Mengxi Liu, Sizhen Bian, Paul Lukowicz

Abstract: This work described a novel non-contact, wearable, real-time eye blink detection solution based on capacitive sensing technology. A low-cost and low-power consumption capacitive sensing prototype was developed and deployed on a pair of standard glasses with a copper electrode attached to the glass frame. The eye blink action will cause the capacitance variation between the electrode and the eyelid… ▽ More This work described a novel non-contact, wearable, real-time eye blink detection solution based on capacitive sensing technology. A low-cost and low-power consumption capacitive sensing prototype was developed and deployed on a pair of standard glasses with a copper electrode attached to the glass frame. The eye blink action will cause the capacitance variation between the electrode and the eyelid. Thus by monitoring the capacitance variation caused oscillating frequency shift signal, the eye blink can be abstracted by a simple comparison of the raw frequency signal with a customized threshold. The feasibility and robustness of the proposed solution were demonstrated in five scenarios performed by eight volunteers with an average precision of 92\% and recall of 94\%. △ Less

Submitted 10 November, 2022; originally announced November 2022.

Comments: 4 pages, 5 figures

arXiv:2210.14794 [pdf, other]

The Contribution of Human Body Capacitance/Body-Area Electric Field To Individual and Collaborative Activity Recognition

Authors: Sizhen Bian, Vitor Fortes Rey, Siyu Yuan, Paul Lukowicz

Abstract: The current dominated wearable body motion sensor is IMU. This work presented an alternative wearable motion-sensing approach: human body capacitance (HBC, also commonly defined as body-area electric field). While being less robust in tracking the posture and trajectory, HBC has two properties that make it an attractive. First, the deployment of the sensing node on the being tracked body part is n… ▽ More The current dominated wearable body motion sensor is IMU. This work presented an alternative wearable motion-sensing approach: human body capacitance (HBC, also commonly defined as body-area electric field). While being less robust in tracking the posture and trajectory, HBC has two properties that make it an attractive. First, the deployment of the sensing node on the being tracked body part is not a requirement for HBC sensing approach. Second, HBC is sensitive to the body's interaction with its surroundings, including both touching and being in the immediate proximity of people and objects. We first described the sensing principle for HBC, sensor architecture and implementation, and methods for evaluation. We then presented two case studies demonstrating the usefulness of HBC as a complement/alternative to IMUs. First, we explored the exercise recognition and repetition counting of seven machine-free leg-only exercises and eleven general gym workouts with the signal source of HBC and IMU. The HBC sensing shows significant advantages over the IMU signals in classification(0.89 vs 0.78 in F-score) and counting(0.982 vs 0.938 in accuracy) of the leg-only exercises. For the general gym workouts, HBC only shows recognition improvement for certain workouts like adductor where legs alone complete the movement. And it also supplies better results over the IMU for workouts counting(0.800 vs. 0.756 when wearing the sensors on the wrist). In the second case, we tried to recognize actions related to manipulating objects and physical collaboration between users by using a wrist-worn HBC sensing unit. We detected collaboration between the users with 0.69 F-score when receiving data from a single user and 0.78 when receiving data from both users. The capacitive sensor can improve the recognition of collaborative activities with an F-score over a single wrist accelerometer approach by 16\%. △ Less

Submitted 26 October, 2022; originally announced October 2022.

Comments: 30 Pages, 35 Figures

Showing 1–50 of 105 results for author: Bian, S