-
A Benchmark for Cross-Domain Argumentative Stance Classification on Social Media
Authors:
Jiaqing Yuan,
Ruijie Xi,
Munindar P. Singh
Abstract:
Argumentative stance classification plays a key role in identifying authors' viewpoints on specific topics. However, generating diverse pairs of argumentative sentences across various domains is challenging. Existing benchmarks often come from a single domain or focus on a limited set of topics. Additionally, manual annotation for accurate labeling is time-consuming and labor-intensive. To address…
▽ More
Argumentative stance classification plays a key role in identifying authors' viewpoints on specific topics. However, generating diverse pairs of argumentative sentences across various domains is challenging. Existing benchmarks often come from a single domain or focus on a limited set of topics. Additionally, manual annotation for accurate labeling is time-consuming and labor-intensive. To address these challenges, we propose leveraging platform rules, readily available expert-curated content, and large language models to bypass the need for human annotation. Our approach produces a multidomain benchmark comprising 4,498 topical claims and 30,961 arguments from three sources, spanning 21 domains. We benchmark the dataset in fully supervised, zero-shot, and few-shot settings, shedding light on the strengths and limitations of different methodologies. We release the dataset and code in this study at hidden for anonymity.
△ Less
Submitted 11 October, 2024;
originally announced October 2024.
-
Outlier Detection Bias Busted: Understanding Sources of Algorithmic Bias through Data-centric Factors
Authors:
Xueying Ding,
Rui Xi,
Leman Akoglu
Abstract:
The astonishing successes of ML have raised growing concern for the fairness of modern methods when deployed in real world settings. However, studies on fairness have mostly focused on supervised ML, while unsupervised outlier detection (OD), with numerous applications in finance, security, etc., have attracted little attention. While a few studies proposed fairness-enhanced OD algorithms, they re…
▽ More
The astonishing successes of ML have raised growing concern for the fairness of modern methods when deployed in real world settings. However, studies on fairness have mostly focused on supervised ML, while unsupervised outlier detection (OD), with numerous applications in finance, security, etc., have attracted little attention. While a few studies proposed fairness-enhanced OD algorithms, they remain agnostic to the underlying driving mechanisms or sources of unfairness. Even within the supervised ML literature, there exists debate on whether unfairness stems solely from algorithmic biases (i.e. design choices) or from the biases encoded in the data on which they are trained. To close this gap, this work aims to shed light on the possible sources of unfairness in OD by auditing detection models under different data-centric factors. By injecting various known biases into the input data -- as pertain to sample size disparity, under-representation, feature measurement noise, and group membership obfuscation -- we find that the OD algorithms under the study all exhibit fairness pitfalls, although differing in which types of data bias they are more susceptible to. Most notable of our study is to demonstrate that OD algorithm bias is not merely a data bias problem. A key realization is that the data properties that emerge from bias injection could as well be organic -- as pertain to natural group differences w.r.t. sparsity, base rate, variance, and multi-modality. Either natural or biased, such data properties can give rise to unfairness as they interact with certain algorithmic design choices.
△ Less
Submitted 24 August, 2024;
originally announced August 2024.
-
Optimizing Hybrid Ferromagnetic Metal-Ferrimagnetic Insulator Spin-Hall Nano-Oscillators: A Micromagnetic Study
Authors:
Robert Xi,
Ya-An Lai,
Andrew D. Kent
Abstract:
Spin-Hall nano-oscillators (SHNO) are nanoscale spintronic devices that generate high-frequency (GHz) microwave signals useful for various applications such as neuromorphic computing and creating Ising systems. Recent research demonstrated that hybrid SHNOs consisting of a ferromagnetic metal (permalloy) and lithium aluminum ferrite (LAFO), a ferrimagnetic insulator, thin films have advantages in…
▽ More
Spin-Hall nano-oscillators (SHNO) are nanoscale spintronic devices that generate high-frequency (GHz) microwave signals useful for various applications such as neuromorphic computing and creating Ising systems. Recent research demonstrated that hybrid SHNOs consisting of a ferromagnetic metal (permalloy) and lithium aluminum ferrite (LAFO), a ferrimagnetic insulator, thin films have advantages in having lower auto-oscillation threshold currents ($I_{\text{th}}$) and generating larger microwave output power, making this hybrid structure an attractive candidate for spintronic applications. It is essential to understand how the tunable material properties of LAFO, e.g., its thickness, perpendicular magnetic anisotropy ($K_u$), and saturation magnetization ($M_s$), affect magnetic dynamics in hybrid SHNOs. We investigate the change in $I_{\text{th}}$ and the output power of the device as the LAFO parameters vary. We find the $I_{\text{th}}$ does not depend strongly on these parameters, but the output power has a highly nonlinear dependence on $M_s$ and $K_u$. We further investigate the nature of the excited spin-wave modes as a function of $K_u$ and determine a critical value of $K_u$ above which propagating spin-waves are excited. Our simulation results provide a roadmap for designing hybrid SHNOs to achieve targeted spin excitation characteristics.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models
Authors:
Jing Yang,
Runping Xi,
Yingxin Lai,
Xun Lin,
Zitong Yu
Abstract:
Diffusion-based personalized visual content generation technologies have achieved significant breakthroughs, allowing for the creation of specific objects by just learning from a few reference photos. However, when misused to fabricate fake news or unsettling content targeting individuals, these technologies could cause considerable societal harm. To address this problem, current methods generate…
▽ More
Diffusion-based personalized visual content generation technologies have achieved significant breakthroughs, allowing for the creation of specific objects by just learning from a few reference photos. However, when misused to fabricate fake news or unsettling content targeting individuals, these technologies could cause considerable societal harm. To address this problem, current methods generate adversarial samples by adversarially maximizing the training loss, thereby disrupting the output of any personalized generation model trained with these samples. However, the existing methods fail to achieve effective defense and maintain stealthiness, as they overlook the intrinsic properties of diffusion models. In this paper, we introduce a novel Dual-Domain Anti-Personalization framework (DDAP). Specifically, we have developed Spatial Perturbation Learning (SPL) by exploiting the fixed and perturbation-sensitive nature of the image encoder in personalized generation. Subsequently, we have designed a Frequency Perturbation Learning (FPL) method that utilizes the characteristics of diffusion models in the frequency domain. The SPL disrupts the overall texture of the generated images, while the FPL focuses on image details. By alternating between these two methods, we construct the DDAP framework, effectively harnessing the strengths of both domains. To further enhance the visual quality of the adversarial samples, we design a localization module to accurately capture attentive areas while ensuring the effectiveness of the attack and avoiding unnecessary disturbances in the background. Extensive experiments on facial benchmarks have shown that the proposed DDAP enhances the disruption of personalized generation models while also maintaining high quality in adversarial samples, making it more effective in protecting privacy in practical applications.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention
Authors:
Jiahao Lyu,
Minghua Zhao,
Jing Hu,
Runtao Xi,
Xuewen Huang,
Shuangli Du,
Cheng Shi,
Tian Ma
Abstract:
With the widespread deployment of video surveillance devices and the demand for intelligent system development, video anomaly detection (VAD) has become an important part of constructing intelligent surveillance systems. Expanding the discriminative boundary between normal and abnormal events to enhance performance is the common goal and challenge of VAD. To address this problem, we propose a Bidi…
▽ More
With the widespread deployment of video surveillance devices and the demand for intelligent system development, video anomaly detection (VAD) has become an important part of constructing intelligent surveillance systems. Expanding the discriminative boundary between normal and abnormal events to enhance performance is the common goal and challenge of VAD. To address this problem, we propose a Bidirectional Skip-frame Prediction (BiSP) network based on a dual-stream autoencoder, from the perspective of learning the intra-domain disparity between different features. The BiSP skips frames in the training phase to achieve the forward and backward frame prediction respectively, and in the testing phase, it utilizes bidirectional consecutive frames to co-predict the same intermediate frames, thus expanding the degree of disparity between normal and abnormal events. The BiSP designs the variance channel attention and context spatial attention from the perspectives of movement patterns and object scales, respectively, thus ensuring the maximization of the disparity between normal and abnormal in the feature extraction and delivery with different dimensions. Extensive experiments from four benchmark datasets demonstrate the effectiveness of the proposed BiSP, which substantially outperforms state-of-the-art competing methods.
△ Less
Submitted 23 July, 2024; v1 submitted 22 July, 2024;
originally announced July 2024.
-
Enhancing HNSW Index for Real-Time Updates: Addressing Unreachable Points and Performance Degradation
Authors:
Wentao Xiao,
Yueyang Zhan,
Rui Xi,
Mengshu Hou,
Jianming Liao
Abstract:
The approximate nearest neighbor search (ANNS) is a fundamental and essential component in data mining and information retrieval, with graph-based methodologies demonstrating superior performance compared to alternative approaches. Extensive research efforts have been dedicated to improving search efficiency by developing various graph-based indices, such as HNSW (Hierarchical Navigable Small Worl…
▽ More
The approximate nearest neighbor search (ANNS) is a fundamental and essential component in data mining and information retrieval, with graph-based methodologies demonstrating superior performance compared to alternative approaches. Extensive research efforts have been dedicated to improving search efficiency by developing various graph-based indices, such as HNSW (Hierarchical Navigable Small World). However, the performance of HNSW and most graph-based indices become unacceptable when faced with a large number of real-time deletions, insertions, and updates. Furthermore, during update operations, HNSW can result in some data points becoming unreachable, a situation we refer to as the `unreachable points phenomenon'. This phenomenon could significantly affect the search accuracy of the graph in certain situations.
To address these issues, we present efficient measures to overcome the shortcomings of HNSW, specifically addressing poor performance over long periods of delete and update operations and resolving the issues caused by the unreachable points phenomenon. Our proposed MN-RU algorithm effectively improves update efficiency and suppresses the growth rate of unreachable points, ensuring better overall performance and maintaining the integrity of the graph. Our results demonstrate that our methods outperform existing approaches. Furthermore, since our methods are based on HNSW, they can be easily integrated with existing indices widely used in the industrial field, making them practical for future real-world applications. Code is available at \url{https://github.com/xwt1/MN-RU.git}
△ Less
Submitted 15 July, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Salient Object Detection From Arbitrary Modalities
Authors:
Nianchang Huang,
Yang Yang,
Ruida Xi,
Qiang Zhang,
Jungong Han,
Jin Huang
Abstract:
Toward desirable saliency prediction, the types and numbers of inputs for a salient object detection (SOD) algorithm may dynamically change in many real-life applications. However, existing SOD algorithms are mainly designed or trained for one particular type of inputs, failing to be generalized to other types of inputs. Consequentially, more types of SOD algorithms need to be prepared in advance…
▽ More
Toward desirable saliency prediction, the types and numbers of inputs for a salient object detection (SOD) algorithm may dynamically change in many real-life applications. However, existing SOD algorithms are mainly designed or trained for one particular type of inputs, failing to be generalized to other types of inputs. Consequentially, more types of SOD algorithms need to be prepared in advance for handling different types of inputs, raising huge hardware and research costs. Differently, in this paper, we propose a new type of SOD task, termed Arbitrary Modality SOD (AM SOD). The most prominent characteristics of AM SOD are that the modality types and modality numbers will be arbitrary or dynamically changed. The former means that the inputs to the AM SOD algorithm may be arbitrary modalities such as RGB, depths, or even any combination of them. While, the latter indicates that the inputs may have arbitrary modality numbers as the input type is changed, e.g. single-modality RGB image, dual-modality RGB-Depth (RGB-D) images or triple-modality RGB-Depth-Thermal (RGB-D-T) images. Accordingly, a preliminary solution to the above challenges, ı.e. a modality switch network (MSN), is proposed in this paper. In particular, a modality switch feature extractor (MSFE) is first designed to extract discriminative features from each modality effectively by introducing some modality indicators, which will generate some weights for modality switching. Subsequently, a dynamic fusion module (DFM) is proposed to adaptively fuse features from a variable number of modalities based on a novel Transformer structure. Finally, a new dataset, named AM-XD, is constructed to facilitate research on AM SOD. Extensive experiments demonstrate that our AM SOD method can effectively cope with changes in the type and number of input modalities for robust salient object detection.
△ Less
Submitted 9 May, 2024; v1 submitted 6 May, 2024;
originally announced May 2024.
-
Moral Sparks in Social Media Narratives
Authors:
Ruijie Xi,
Munindar P. Singh
Abstract:
There is increasing interest in building computational models of moral reasoning by people to enable effective interaction by Artificial Intelligence (AI) agents. We examine interactions on social media to understand human moral judgments in real-life ethical scenarios. Specifically, we examine posts from a popular Reddit subreddit (i.e., a subcommunity) called r/AmITheAsshole, where authors and c…
▽ More
There is increasing interest in building computational models of moral reasoning by people to enable effective interaction by Artificial Intelligence (AI) agents. We examine interactions on social media to understand human moral judgments in real-life ethical scenarios. Specifically, we examine posts from a popular Reddit subreddit (i.e., a subcommunity) called r/AmITheAsshole, where authors and commenters share their moral judgments on who (i.e., which participant of the described scenario) is blameworthy. To investigate the underlying reasoning influencing moral judgments, we focus on excerpts-which we term moral sparks-from original posts that some commenters include to indicate what motivates their judgments. To this end, we examine how (1) events activating social commonsense and (2) linguistic signals affect the identified moral sparks and their subsequent judgments. By examining over 24672 posts and 175988 comments, we find that event-related negative character traits (e.g., immature and rude) attract attention and stimulate blame, implying a dependent relationship between character traits and moral values. Specifically, we focus on causal graphs involving events (c-events) that activate social commonsense. We observe that c-events are perceived with varying levels of informativeness, influencing moral spark and judgment assignment in distinct ways. This observation is reinforced by examining linguistic features describing semantically similar c-events. Moreover, language influencing commenters' cognitive processes enhances the probability of an excerpt becoming a moral spark, while factual and concrete descriptions tend to inhibit this effect.
△ Less
Submitted 21 April, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Disturbance Rejection Control for Autonomous Trolley Collection Robots with Prescribed Performance
Authors:
Rui-Dong Xi,
Liang Lu,
Xue Zhang,
Xiao Xiao,
Bingyi Xia,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Trajectory tracking control of autonomous trolley collection robots (ATCR) is an ambitious work due to the complex environment, serious noise and external disturbances. This work investigates a control scheme for ATCR subjecting to severe environmental interference. A kinematics model based adaptive sliding mode disturbance observer with fast convergence is first proposed to estimate the lumped di…
▽ More
Trajectory tracking control of autonomous trolley collection robots (ATCR) is an ambitious work due to the complex environment, serious noise and external disturbances. This work investigates a control scheme for ATCR subjecting to severe environmental interference. A kinematics model based adaptive sliding mode disturbance observer with fast convergence is first proposed to estimate the lumped disturbances. On this basis, a robust controller with prescribed performance is proposed using a backstepping technique, which improves the transient performance and guarantees fast convergence. Simulation outcomes have been provided to illustrate the effectiveness of the proposed control scheme.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
mmHawkeye: Passive UAV Detection with a COTS mmWave Radar
Authors:
Jia Zhang,
Xin Na,
Rui Xi,
Yimiao Sun,
Yuan He
Abstract:
Small Unmanned Aerial Vehicles (UAVs) are becoming potential threats to security-sensitive areas and personal privacy. A UAV can shoot photos at height, but how to detect such an uninvited intruder is an open problem. This paper presents mmHawkeye, a passive approach for UAV detection with a COTS millimeter wave (mmWave) radar. mmHawkeye doesn't require prior knowledge of the type, motions, and fl…
▽ More
Small Unmanned Aerial Vehicles (UAVs) are becoming potential threats to security-sensitive areas and personal privacy. A UAV can shoot photos at height, but how to detect such an uninvited intruder is an open problem. This paper presents mmHawkeye, a passive approach for UAV detection with a COTS millimeter wave (mmWave) radar. mmHawkeye doesn't require prior knowledge of the type, motions, and flight trajectory of the UAV, while exploiting the signal feature induced by the UAV's periodic micro-motion (PMM) for long-range accurate detection. The design is therefore effective in dealing with low-SNR and uncertain reflected signals from the UAV. mmHawkeye can further track the UAV's position with dynamic programming and particle filtering, and identify it with a Long Short-Term Memory (LSTM) based detector. We implement mmHawkeye on a commercial mmWave radar and evaluate its performance under varied settings. The experimental results show that mmHawkeye has a detection accuracy of 95.8% and can realize detection at a range up to 80m.
△ Less
Submitted 12 August, 2023;
originally announced August 2023.
-
A Survey of mmWave-based Human Sensing: Technology, Platform and Applications
Authors:
Jia Zhang,
Rui Xi,
Yuan He,
Yimiao Sun,
Xiuzhen Guo,
Weiguo Wang,
Xin Na,
Yunhao Liu,
Zhenguo Shi,
Tao Gu
Abstract:
With the rapid development of the Internet of Things (IoT) and the rise of 5G communication networks and automatic driving, millimeter wave (mmWave) sensing is emerging and starts impacting our life and workspace. mmWave sensing can sense humans and objects in a contactless way, providing fine-grained sensing ability. In the past few years, many mmWave sensing techniques have been proposed and app…
▽ More
With the rapid development of the Internet of Things (IoT) and the rise of 5G communication networks and automatic driving, millimeter wave (mmWave) sensing is emerging and starts impacting our life and workspace. mmWave sensing can sense humans and objects in a contactless way, providing fine-grained sensing ability. In the past few years, many mmWave sensing techniques have been proposed and applied in various human sensing applications (e.g., human localization, gesture recognition, and vital monitoring). We discover the need of a comprehensive survey to summarize the technology, platforms and applications of mmWave-based human sensing. In this survey, we first present the mmWave hardware platforms and some key techniques of mmWave sensing. We then provide a comprehensive review of existing mmWave-based human sensing works. Specifically, we divide existing works into four categories according to the sensing granularity: human tracking and localization, motion recognition, biometric measurement and human imaging. Finally, we discuss the potential research challenges and present future directions in this area.
△ Less
Submitted 6 August, 2023;
originally announced August 2023.
-
Feature screening for clustering analysis
Authors:
Changhu Wang,
Zihao Chen,
Ruibin Xi
Abstract:
In this paper, we consider feature screening for ultrahigh dimensional clustering analyses. Based on the observation that the marginal distribution of any given feature is a mixture of its conditional distributions in different clusters, we propose to screen clustering features by independently evaluating the homogeneity of each feature's mixture distribution. Important cluster-relevant features h…
▽ More
In this paper, we consider feature screening for ultrahigh dimensional clustering analyses. Based on the observation that the marginal distribution of any given feature is a mixture of its conditional distributions in different clusters, we propose to screen clustering features by independently evaluating the homogeneity of each feature's mixture distribution. Important cluster-relevant features have heterogeneous components in their mixture distributions and unimportant features have homogeneous components. The well-known EM-test statistic is used to evaluate the homogeneity. Under general parametric settings, we establish the tail probability bounds of the EM-test statistic for the homogeneous and heterogeneous features, and further show that the proposed screening procedure can achieve the sure independent screening and even the consistency in selection properties. Limiting distribution of the EM-test statistic is also obtained for general parametric distributions. The proposed method is computationally efficient, can accurately screen for important cluster-relevant features and help to significantly improve clustering, as demonstrated in our extensive simulation and real data analyses.
△ Less
Submitted 2 February, 2024; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Elongated Physiological Structure Segmentation via Spatial and Scale Uncertainty-aware Network
Authors:
Yinglin Zhang,
Ruiling Xi,
Huazhu Fu,
Dave Towey,
RuiBin Bai,
Risa Higashita,
Jiang Liu
Abstract:
Robust and accurate segmentation for elongated physiological structures is challenging, especially in the ambiguous region, such as the corneal endothelium microscope image with uneven illumination or the fundus image with disease interference. In this paper, we present a spatial and scale uncertainty-aware network (SSU-Net) that fully uses both spatial and scale uncertainty to highlight ambiguous…
▽ More
Robust and accurate segmentation for elongated physiological structures is challenging, especially in the ambiguous region, such as the corneal endothelium microscope image with uneven illumination or the fundus image with disease interference. In this paper, we present a spatial and scale uncertainty-aware network (SSU-Net) that fully uses both spatial and scale uncertainty to highlight ambiguous regions and integrate hierarchical structure contexts. First, we estimate epistemic and aleatoric spatial uncertainty maps using Monte Carlo dropout to approximate Bayesian networks. Based on these spatial uncertainty maps, we propose the gated soft uncertainty-aware (GSUA) module to guide the model to focus on ambiguous regions. Second, we extract the uncertainty under different scales and propose the multi-scale uncertainty-aware (MSUA) fusion module to integrate structure contexts from hierarchical predictions, strengthening the final prediction. Finally, we visualize the uncertainty map of final prediction, providing interpretability for segmentation results. Experiment results show that the SSU-Net performs best on cornea endothelial cell and retinal vessel segmentation tasks. Moreover, compared with counterpart uncertainty-based methods, SSU-Net is more accurate and robust.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Extracting Incidents, Effects, and Requested Advice from MeToo Posts
Authors:
Vaibhav Garg,
Jiaqing Yuan,
Rujie Xi,
Munindar P. Singh
Abstract:
Survivors of sexual harassment frequently share their experiences on social media, revealing their feelings and emotions and seeking advice. We observed that on Reddit, survivors regularly share long posts that describe a combination of (i) a sexual harassment incident, (ii) its effect on the survivor, including their feelings and emotions, and (iii) the advice being sought. We term such posts MeT…
▽ More
Survivors of sexual harassment frequently share their experiences on social media, revealing their feelings and emotions and seeking advice. We observed that on Reddit, survivors regularly share long posts that describe a combination of (i) a sexual harassment incident, (ii) its effect on the survivor, including their feelings and emotions, and (iii) the advice being sought. We term such posts MeToo posts, even though they may not be so tagged and may appear in diverse subreddits. A prospective helper (such as a counselor or even a casual reader) must understand a survivor's needs from such posts. But long posts can be time-consuming to read and respond to.
Accordingly, we address the problem of extracting key information from a long MeToo post. We develop a natural language-based model to identify sentences from a post that describe any of the above three categories.
On ten-fold cross-validation of a dataset, our model achieves a macro F1 score of 0.82.
In addition, we contribute MeThree, a dataset comprising 8,947 labeled sentences extracted from Reddit posts. We apply the LIWC-22 toolkit on MeThree to understand how different language patterns in sentences of the three categories can reveal differences in emotional tone, authenticity, and other aspects.
△ Less
Submitted 19 March, 2023;
originally announced March 2023.
-
The blame game: Understanding blame assignment in social media
Authors:
Ruijie Xi,
Munindar P. Singh
Abstract:
Cognitive and psychological studies on morality have proposed underlying linguistic and semantic factors. However, laboratory experiments in the philosophical literature often lack the nuances and complexity of real life. This paper examines how well the findings of these cognitive studies generalize to a corpus of over 30,000 narratives of tense social situations submitted to a popular social med…
▽ More
Cognitive and psychological studies on morality have proposed underlying linguistic and semantic factors. However, laboratory experiments in the philosophical literature often lack the nuances and complexity of real life. This paper examines how well the findings of these cognitive studies generalize to a corpus of over 30,000 narratives of tense social situations submitted to a popular social media forum. These narratives describe interpersonal moral situations or misgivings; other users judge from the post whether the author (protagonist) or the opposing side (antagonist) is morally culpable. Whereas previous work focuses on predicting the polarity of normative behaviors, we extend and apply natural language processing (NLP) techniques to understand the effects of descriptions of the people involved in these posts. We conduct extensive experiments to investigate the effect sizes of features to understand how they affect the assignment of blame on social media. Our findings show that aggregating psychology theories enables understanding real-life moral situations. Moreover, our results suggest that there exist biases in blame assignment on social media, such as males are more likely to receive blame no matter whether they are protagonists or antagonists.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
Morality in the mundane: Categorizing moral reasoning in real-life social situations
Authors:
Ruijie Xi,
Munindar P. Singh
Abstract:
Moral reasoning reflects how people acquire and apply moral rules in particular situations. With increasingly social interactions happening online, social media data provides an unprecedented opportunity to assess in-the-wild moral reasoning. We investigate the commonsense aspects of morality in ordinary matters empirically. To this end, we examine data from a Reddit subcommunity (i.e., a subreddi…
▽ More
Moral reasoning reflects how people acquire and apply moral rules in particular situations. With increasingly social interactions happening online, social media data provides an unprecedented opportunity to assess in-the-wild moral reasoning. We investigate the commonsense aspects of morality in ordinary matters empirically. To this end, we examine data from a Reddit subcommunity (i.e., a subreddit) where an author may describe their behavior in a situation to seek comments about whether that behavior was appropriate. Other users comment to provide judgments and reasoning. We focus on the novel problem of understanding the moral reasoning implicit in user comments about the propriety of an author's behavior. Especially, we explore associations between the common elements of the indicated reasoning and the extractable social factors. Our results suggest the reasoning depends on the author's gender and the topic of a post, such as when expressing anger emotion and using sensible words (e.g., f-ck, hell, and damn) in work-related situations. Moreover, we find that the commonly expressed semantics also depends on commenters' interests.
△ Less
Submitted 26 July, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
Network Analysis of Count Data from Mixed Populations
Authors:
Junjie Tang,
Changhu Wang,
Feiyi Xiao,
Ruibin Xi
Abstract:
In applications such as gene regulatory network analysis based on single-cell RNA sequencing data, samples often come from a mixture of different populations and each population has its own unique network. Available graphical models often assume that all samples are from the same population and share the same network. One has to first cluster the samples and use available methods to infer the netw…
▽ More
In applications such as gene regulatory network analysis based on single-cell RNA sequencing data, samples often come from a mixture of different populations and each population has its own unique network. Available graphical models often assume that all samples are from the same population and share the same network. One has to first cluster the samples and use available methods to infer the network for every cluster separately. However, this two-step procedure ignores uncertainty in the clustering step and thus could lead to inaccurate network estimation. Motivated by these applications, we consider the mixture Poisson log-normal model for network inference of count data from mixed populations. The latent precision matrices of the mixture model correspond to the networks of different populations and can be jointly estimated by maximizing the lasso-penalized log-likelihood. Under rather mild conditions, we show that the mixture Poisson log-normal model is identifiable and has the positive definite Fisher information matrix. Consistency of the maximum lasso-penalized log-likelihood estimator is also established. To avoid the intractable optimization of the log-likelihood, we develop an algorithm called VMPLN based on the variational inference method. Comprehensive simulation and real single-cell RNA sequencing data analyses demonstrate the superior performance of VMPLN.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
Wi-attack: Cross-technology Impersonation Attack against iBeacon Services
Authors:
Xin Na,
Xiuzhen Guo,
Yuan He,
Rui Xi
Abstract:
iBeacon protocol is widely deployed to provide location-based services. By receiving its BLE advertisements, nearby devices can estimate the proximity to the iBeacon or calculate indoor positions. However, the open nature of these advertisements brings vulnerability to impersonation attacks. Such attacks could lead to spam, unreliable positioning, and even security breaches. In this paper, we prop…
▽ More
iBeacon protocol is widely deployed to provide location-based services. By receiving its BLE advertisements, nearby devices can estimate the proximity to the iBeacon or calculate indoor positions. However, the open nature of these advertisements brings vulnerability to impersonation attacks. Such attacks could lead to spam, unreliable positioning, and even security breaches. In this paper, we propose Wi-attack, revealing the feasibility of using WiFi devices to conduct impersonation attacks on iBeacon services. Different from impersonation attacks using BLE compatible hardware, Wi-attack is not restricted by broadcasting intervals and is able to impersonate multiple iBeacons at the same time. Effective attacks can be launched on iBeacon services without modifications to WiFi hardware or firmware. To enable direct communication from WiFi to BLE, we use the digital emulation technique of cross technology communication. To enhance the packet reception along with its stability, we add redundant packets to eliminate cyclic prefix error entirely. The emulation provides an iBeacon packet reception rate up to 66.2%. We conduct attacks on three iBeacon services scenarios, point deployment, multilateration, and fingerprint-based localization. The evaluation results show that Wi-attack can bring an average distance error of more than 20 meters on fingerprint-based localization using only 3 APs.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
Single-cell gene regulatory network analysis for mixed cell populations with applications to COVID-19 single cell data
Authors:
Junjie Tang,
Changhu Wang,
Feiyi Xiao,
Ruibin Xi
Abstract:
Gene regulatory network (GRN) refers to the complex network formed by regulatory interactions between genes in living cells. In this paper, we consider inferring GRNs in single cells based on single cell RNA sequencing (scRNA-seq) data. In scRNA-seq, single cells are often profiled from mixed populations and their cell identities are unknown. A common practice for single cell GRN analysis is to fi…
▽ More
Gene regulatory network (GRN) refers to the complex network formed by regulatory interactions between genes in living cells. In this paper, we consider inferring GRNs in single cells based on single cell RNA sequencing (scRNA-seq) data. In scRNA-seq, single cells are often profiled from mixed populations and their cell identities are unknown. A common practice for single cell GRN analysis is to first cluster the cells and infer GRNs for every cluster separately. However, this two-step procedure ignores uncertainty in the clustering step and thus could lead to inaccurate estimation of the networks. To address this problem, we propose to model scRNA-seq by the mixture multivariate Poisson log-normal (MPLN) distribution. The precision matrices of the MPLN are the GRNs of different cell types and can be jointly estimated by maximizing MPLN's lasso-penalized log-likelihood. We show that the MPLN model is identifiable and the resulting penalized log-likelihood estimator is consistent. To avoid the intractable optimization of the MPLN's log-likelihood, we develop an algorithm called VMPLN based on the variational inference method. Comprehensive simulation and real scRNA-seq data analyses reveal that VMPLN performs better than the state-of-the-art single cell GRN methods.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Theoretical study on the interfacial instability of a spherical droplet subject to vertical vibration
Authors:
Yikai Li,
Kun Wu,
Dehua Liu,
Ru Xi
Abstract:
Interfacial instability would be aroused on a spherical liquid droplet when it is subject to external vertical vibration. In this paper, a linear analysis was conducted on this instability problem. The polar-angle dependent acceleration in the spherical coordinate is strongly coupled with the temporal and spatial component of the surface deformation displacement, which gives a recursion equation t…
▽ More
Interfacial instability would be aroused on a spherical liquid droplet when it is subject to external vertical vibration. In this paper, a linear analysis was conducted on this instability problem. The polar-angle dependent acceleration in the spherical coordinate is strongly coupled with the temporal and spatial component of the surface deformation displacement, which gives a recursion equation that implicitly expresses the dispersion relation between the growth rate and spherical mode numbers. The unstable regions (or unstable tongues) for the inviscid fluids considering latitudinal mode (longitudinal mode number m = 0) were derived and presented in the parameter plane. Compared with the solution of the spherical Faraday instability under radial vibration acceleration, the regions of harmonic unstable tongues for the mono-directional vibration case is much narrowed and the subharmonic unstable tongues almost become straight lines. The analysis shows that the latitudinal waves emerging on the spherical droplet surface ought to oscillate harmonically instead of subharmonically, which is opposite to the results for the case under radial vibration acceleration. A corresponding experiment of a liquid droplet lying on a vertically vibrating plate was conducted and the observations substantiate our theoretical predictions.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Observation of topologically enabled complete polarization conversion
Authors:
Fujia Chen,
Zhen Gao,
Li Zhang,
Qiaolu Chen,
Qinghui Yan,
Rui Xi,
Liqiao Jing,
Erping Li,
Wenyan Yin,
Hongsheng Chen,
Yihao Yang
Abstract:
Exploiting topological ideas has been a major theme in modern photonics, which provides unprecedented opportunities to design photonic devices with robustness against defects and flaws. While most previous works in topological photonics have focused on band theory, recent theoretical advances extend the topological concepts to the analysis of scattering matrices and suggest a topological route to…
▽ More
Exploiting topological ideas has been a major theme in modern photonics, which provides unprecedented opportunities to design photonic devices with robustness against defects and flaws. While most previous works in topological photonics have focused on band theory, recent theoretical advances extend the topological concepts to the analysis of scattering matrices and suggest a topological route to complete polarization conversion (CPC), a unique photonic phenomenon without an electronic counterpart. Here, we report on the experimental observation of the topological effect in reflection matrices of a photonic crystal slab, enabling CPC between two linear polarizations over a wide range of frequencies. Using angle-resolved reflection measurements, we observe CPC occurring at vortex singularities of reflection coefficients in momentum space, verifying the topological nature of CPC. In addition, the topological effect also guarantees the spin-preserved reflection of a circularly polarized wave. Remarkably, we experimentally establish a connection between two seemingly unrelated topological phenomena--CPC and bound states in the continuum (BICs): BICs lie on the critical coupling curves that define the condition for CPC. Our work paves the way to exploring the topological properties in scattering matrices for controlling light polarization and spin and creating robust photonic devices.
△ Less
Submitted 26 November, 2021;
originally announced December 2021.
-
Gene regulatory network in single cells based on the Poisson log-normal model
Authors:
Feiyi Xiao,
Junjie Tang,
Huaying Fang,
Ruibin Xi
Abstract:
Gene regulatory network inference is crucial for understanding the complex molecular interactions in various genetic and environmental conditions. The rapid development of single-cell RNA sequencing (scRNA-seq) technologies unprecedentedly enables gene regulatory networks inference at the single cell resolution. However, traditional graphical models for continuous data, such as Gaussian graphical…
▽ More
Gene regulatory network inference is crucial for understanding the complex molecular interactions in various genetic and environmental conditions. The rapid development of single-cell RNA sequencing (scRNA-seq) technologies unprecedentedly enables gene regulatory networks inference at the single cell resolution. However, traditional graphical models for continuous data, such as Gaussian graphical models, are inappropriate for network inference of scRNA-seq's count data. Here, we model the scRNA-seq data using the multivariate Poisson log-normal (PLN) distribution and represent the precision matrix of the latent normal distribution as the regulatory network. We propose to first estimate the latent covariance matrix using a moment estimator and then estimate the precision matrix by minimizing the lasso-penalized D-trace loss function. We establish the convergence rate of the covariance matrix estimator and further establish the convergence rates and the sign consistency of the proposed PLNet estimator of the precision matrix in the high dimensional setting. The performance of PLNet is evaluated and compared with available methods using simulation and gene regulatory network analysis of scRNA-seq data.
△ Less
Submitted 7 November, 2021;
originally announced November 2021.
-
Topological chiral edge states in deep-subwavelength valley photonic metamaterials
Authors:
Rui Xi,
Qiaolu Chen,
Qinghui Yan,
Li Zhang,
Fujia Chen,
Ying Li,
Hongsheng Chen,
Yihao Yang
Abstract:
Topological valley photonics has emerged as a new frontier in photonics with many promising applications. Previous valley boundary transport relies on kink states at internal boundaries between two topologically distinct domains. However, recent studies have revealed a novel class of topological chiral edge states (CESs) at external boundaries of valley materials, which have remained elusive in ph…
▽ More
Topological valley photonics has emerged as a new frontier in photonics with many promising applications. Previous valley boundary transport relies on kink states at internal boundaries between two topologically distinct domains. However, recent studies have revealed a novel class of topological chiral edge states (CESs) at external boundaries of valley materials, which have remained elusive in photonics. Here, we propose and experimentally demonstrate the topological CESs in valley photonic metamaterials (VPMMs) by accurately tuning on-site edge potentials. Moreover, the VPMMs work at deep-subwavelength scales. Thus, the supported CESs are highly confined and self-guiding without relying on a cladding layer to prevent leakage radiation. Via direct near-field measurements, we observe the bulk bandgap, the edge dispersions, and the robust edge transport passing through sharp corners, which are hallmarks of the CESs. Our work paves a way to explore novel topological edge states in valley photonics and sheds light on robust and miniaturized photonic devices.
△ Less
Submitted 23 August, 2021;
originally announced September 2021.
-
Unconventional Weyl exceptional contours in non-Hermitian photonic continua
Authors:
Qinghui Yan,
Qiaolu Chen,
Li Zhang,
Rui Xi,
Hongsheng Chen,
Yihao Yang
Abstract:
Unconventional Weyl points with topological charges higher than 1 can transform into various complex unconventional Weyl exceptional contours under non-Hermitian perturbations. However, theoretical studies of these exceptional contours have been limited to tight-binding models. Here, we propose to realize unconventional Weyl exceptional contours in photonic continua -- non-Hermitian anisotropic ch…
▽ More
Unconventional Weyl points with topological charges higher than 1 can transform into various complex unconventional Weyl exceptional contours under non-Hermitian perturbations. However, theoretical studies of these exceptional contours have been limited to tight-binding models. Here, we propose to realize unconventional Weyl exceptional contours in photonic continua -- non-Hermitian anisotropic chiral plasma, based on ab initio calculation by Maxwell's equations. By perturbing in-plane permittivity, an unconventional Weyl point can transform into a quadratic Weyl exceptional circle, a Type-I Weyl exceptional chain with one chain point, a Type-II Weyl exceptional chain with two chain points, or other forms. Realistic metamaterials with effective constitutive parameters are proposed to implement these unconventional Weyl exceptional contours. Our work paves a way toward exploration of exotic physics of unconventional Weyl exceptional contours in non-Hermitian topological photonic continua.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
Acoustic non-Hermitian skin effect from twisted winding topology
Authors:
Li Zhang,
Yihao Yang,
Yong Ge,
Yi-jun Guan,
Qiaolu Chen,
Qinghui Yan,
Fujia Chen,
Rui Xi,
Yuanzhen Li,
Ding Jia,
Shou-qi Yuan,
Hong-xiang Sun,
Hongsheng Chen,
Baile Zhang
Abstract:
The recently discovered non-Hermitian skin effect (NHSE) manifests the breakdown of current classification of topological phases in energy-nonconservative systems, and necessitates the introduction of non-Hermitian band topology. So far, all NHSE observations are based on one type of non-Hermitian band topology, in which the complex energy spectrum winds along a closed loop. As recently characteri…
▽ More
The recently discovered non-Hermitian skin effect (NHSE) manifests the breakdown of current classification of topological phases in energy-nonconservative systems, and necessitates the introduction of non-Hermitian band topology. So far, all NHSE observations are based on one type of non-Hermitian band topology, in which the complex energy spectrum winds along a closed loop. As recently characterized along a synthetic dimension on a photonic platform, non-Hermitian band topology can exhibit almost arbitrary windings in momentum space, but their actual phenomena in real physical systems remain unclear. Here, we report the experimental realization of NHSE in a one-dimensional (1D) non-reciprocal acoustic crystal. With direct acoustic measurement, we demonstrate that a twisted winding, whose topology consists of two oppositely oriented loops in contact rather than a single loop, will dramatically change the NHSE, following previous predictions of unique features such as the bipolar localization and the Bloch point for a Bloch-wave-like extended state. This work reveals previously unnoticed features of NHSE, and provides the observation of physical phenomena originating from complex non-Hermitian winding topology.
△ Less
Submitted 9 November, 2021; v1 submitted 18 April, 2021;
originally announced April 2021.
-
Photonic topological valley-locked waveguides
Authors:
Qiaolu Chen,
Li Zhang,
Qinghui Yan,
Rui Xi,
Hongsheng Chen,
Yihao Yang
Abstract:
Topological valley kink states have become a significant research frontier with considerable intriguing applications such as robust on-chip communications and topological lasers. Unlike guided modes with adjustable widths in most conventional waveguides, the valley kink states are usually highly confined around the domain walls and thus lack the mode width degree of freedom (DOF), posing a serious…
▽ More
Topological valley kink states have become a significant research frontier with considerable intriguing applications such as robust on-chip communications and topological lasers. Unlike guided modes with adjustable widths in most conventional waveguides, the valley kink states are usually highly confined around the domain walls and thus lack the mode width degree of freedom (DOF), posing a serious limitation to potential device applications. Here, by adding a photonic crystal (PhC) featuring a Dirac point between two valley PhCs with opposite valley-Chern numbers, we design and experimentally demonstrate topological valley-locked waveguides (TVLWs) with tunable mode widths. The photoinc TVLWs could find unique applications, such as high-energy-capacity topological channel intersections, valley-locked energy concentrators, and topological cavities with designable confinement, as verified numerically and experimentally. The TVLWs with width DOF could be beneficial to interface with the exsisting photonic waveguides and devices, and serve as a novel platform for practical use of topological lasing, field enhancement, on-chip communicaitons, and high-capacity energy transport.
△ Less
Submitted 6 October, 2020;
originally announced November 2020.
-
A robust statistical method for Genome-wide association analysis of human copy number variation
Authors:
Han Wang,
Changhu Wang,
Linjie Wu,
Ruibin Xi
Abstract:
Conducting genome-wide association studies (GWAS) in copy number variation (CNV) level is a field where few people involves and little statistical progresses have been achieved, traditional methods suffer from many problems such as batch effects, heterogeneity across genome, leading to low power or high false discovery rate. We develop a new robust method to find disease-risking regions related to…
▽ More
Conducting genome-wide association studies (GWAS) in copy number variation (CNV) level is a field where few people involves and little statistical progresses have been achieved, traditional methods suffer from many problems such as batch effects, heterogeneity across genome, leading to low power or high false discovery rate. We develop a new robust method to find disease-risking regions related to CNV's disproportionately distributed between case and control samples, even if there are batch effects between them, our test formula is robust to such effects. We propose a new empirical Bayes rule to deal with overfitting when estimating parameters during testing, this rule can be extended to the field of model selection, it can be more efficient compared with traditional methods when there are too much potential models to be specified. We also give solid theoretical guarantees for our proposed method, and demonstrate the effectiveness by simulation and realdata analysis.
△ Less
Submitted 15 November, 2020;
originally announced November 2020.
-
Community Detection by $L_0$-penalized Graph Laplacian
Authors:
Chong Chen,
Ruibin Xi,
Nan Lin
Abstract:
Community detection in network analysis aims at partitioning nodes in a network into $K$ disjoint communities. Most currently available algorithms assume that $K$ is known, but choosing a correct $K$ is generally very difficult for real networks. In addition, many real networks contain outlier nodes not belonging to any community, but currently very few algorithm can handle networks with outliers.…
▽ More
Community detection in network analysis aims at partitioning nodes in a network into $K$ disjoint communities. Most currently available algorithms assume that $K$ is known, but choosing a correct $K$ is generally very difficult for real networks. In addition, many real networks contain outlier nodes not belonging to any community, but currently very few algorithm can handle networks with outliers. In this paper, we propose a novel model free tightness criterion and an efficient algorithm to maximize this criterion for community detection. This tightness criterion is closely related with the graph Laplacian with $L_0$ penalty. Unlike most community detection methods, our method does not require a known $K$ and can properly detect communities in networks with outliers.
Both theoretical and numerical properties of the method are analyzed. The theoretical result guarantees that, under the degree corrected stochastic block model, even for networks with outliers, the maximizer of the tightness criterion can extract communities with small misclassification rates even when the number of communities grows to infinity as the network size grows. Simulation study shows that the proposed method can recover true communities more accurately than other methods. Applications to a college football data and a yeast protein-protein interaction data also reveal that the proposed method performs significantly better.
△ Less
Submitted 30 June, 2017;
originally announced June 2017.
-
Differential Network Analysis via the Lasso Penalized D-Trace Loss
Authors:
Huili Yuan,
Ruibin Xi,
Chong Chen,
Minghua Deng
Abstract:
Biological networks often change under different environmental and genetic conditions. Understanding how these networks change becomes an important problem in biological studies. In this paper, we model the network change as the difference of two precision matrices and propose a novel loss function for estimating the precision matrix difference. Under a new irrepresentability condition, we show th…
▽ More
Biological networks often change under different environmental and genetic conditions. Understanding how these networks change becomes an important problem in biological studies. In this paper, we model the network change as the difference of two precision matrices and propose a novel loss function for estimating the precision matrix difference. Under a new irrepresentability condition, we show that the new loss function with the lasso penalty can give consistent estimates in high-dimensional setting for sub-Gaussian and polynomial-tailed distributions. An efficient algorithm is developed based on the alternating direction method to solve the optimization problem. Simulation studies and a real data analysis about colorectal cancer show that the proposed method outperforms other available methods.
△ Less
Submitted 28 May, 2017; v1 submitted 30 November, 2015;
originally announced November 2015.