Order-aware Interactive Segmentation

Authors: Bin Wang, Anwesa Choudhuri, Meng Zheng, Zhongpai Gao, Benjamin Planche, Andong Deng, Qin Liu, Terrence Chen, Ulas Bagci, Ziyan Wu

Abstract: Interactive segmentation aims to accurately segment target objects with minimal user interactions. However, current methods often fail to accurately separate target objects from the background, due to a limited understanding of order, the relative depth between objects in a scene. To address this issue, we propose OIS: order-aware interactive segmentation, where we explicitly encode the relative depth between objects into order maps. We introduce a novel order-aware attention, where the order maps seamlessly guide the user interactions (in the form of clicks) to attend to the image features. We further present an object-aware attention module to incorporate a strong object-level understanding to better differentiate objects with similar order. Our approach allows both dense and sparse integration of user clicks, enhancing both accuracy and efficiency as compared to prior works. Experimental results demonstrate that OIS achieves state-of-the-art performance, improving mIoU after one click by 7.61 on the HQSeg44K dataset and 1.32 on the DAVIS dataset as compared to the previous state-of-the-art SegNext, while also doubling inference speed compared to current leading methods. The project page is https://ukaukaaaa.github.io/projects/OIS/index.html

Submitted 17 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

Comments: An interactive demo is available on the project page: https://ukaukaaaa.github.io/projects/OIS/index.html

arXiv:2410.07006 [pdf]

The Mitochondrial Genome of Cathaya argyrophylla Reaches 18.99 Mb: Analysis of Super-Large Mitochondrial Genomes in Pinaceae

Authors: Kerui Huang, Wenbo Xu, Haoliang Hu, Xiaolong Jiang, Lei Sun, Wenyan Zhao, Binbin Long, Shaogang Fan, Zhibo Zhou, Ping Mo, Xiaocheng Jiang, Jianhong Tian, Aihua Deng, Peng Xie, Yun Wang

Abstract: Mitochondrial genomes in the Pinaceae family are notable for their large size and structural complexity. In this study, we sequenced and analyzed the mitochondrial genome of Cathaya argyrophylla, an endangered and endemic Pinaceae species, uncovering a genome size of 18.99 Mb, making it the largest mitochondrial genome reported to date. To investigate the mechanisms behind this exceptional size, we conducted comparative analyses with other Pinaceae species possessing both large and small mitochondrial genomes, as well as with other gymnosperms. We focused on repeat sequences, transposable element activity, RNA editing events, chloroplast-derived sequence transfers (mtpts), and sequence homology with nuclear genomes. Our findings indicate that while Cathaya argyrophylla and other extremely large Pinaceae mitochondrial genomes contain substantial amounts of repeat sequences and show increased activity of LINEs and LTR retrotransposons, these factors alone do not fully account for the genome expansion. Notably, we observed a significant incorporation of chloroplast-derived sequences in Cathaya argyrophylla and other large mitochondrial genomes, suggesting that extensive plastid-to-mitochondrial DNA transfer may play a crucial role in genome enlargement. Additionally, large mitochondrial genomes exhibited distinct patterns of RNA editing and limited similarity with nuclear genomes compared to smaller genomes.
These results suggest that the massive mitochondrial genomes in Pinaceae are likely the result of multiple contributing factors, including repeat sequences, transposon activity, and extensive plastid sequence incorporation. Our study enhances the understanding of mitochondrial genome evolution in plants and provides valuable genetic information for the conservation and study of Cathaya argyrophylla.

Submitted 9 October, 2024; originally announced October 2024.

Comments: 22 pages, 9 figures

arXiv:2409.17576 [pdf, other]

ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition

Authors: Shen Li, Jianqing Xu, Jiaying Wu, Miao Xiong, Ailin Deng, Jiazhen Ji, Yuge Huang, Wenjie Feng, Shouhong Ding, Bryan Hooi

Abstract: Synthetic face recognition (SFR) aims to generate synthetic face datasets that mimic the distribution of real face data, which allows for training face recognition models in a privacy-preserving manner. Despite the remarkable potential of diffusion models in image generation, current diffusion-based SFR models struggle with generalization to real-world faces. To address this limitation, we outline three key objectives for SFR: (1) promoting diversity across identities (inter-class diversity), (2) ensuring diversity within each identity by injecting various facial attributes (intra-class diversity), and (3) maintaining identity consistency within each identity group (intra-class identity preservation). Inspired by these goals, we introduce a diffusion-fueled SFR model termed $\text{ID}^3$. $\text{ID}^3$ employs an ID-preserving loss to generate diverse yet identity-consistent facial appearances. Theoretically, we show that minimizing this loss is equivalent to maximizing the lower bound of an adjusted conditional log-likelihood over ID-preserving data. This equivalence motivates an ID-preserving sampling algorithm, which operates over an adjusted gradient vector field, enabling the generation of fake face recognition datasets that approximate the distribution of real-world faces. Extensive experiments across five challenging benchmarks validate the advantages of $\text{ID}^3$.

Submitted 26 September, 2024; originally announced September 2024.

Comments: Accepted to NeurIPS 2024

arXiv:2408.13399 [pdf, other]

Transforming Location Retrieval at Airbnb: A Journey from Heuristics to Reinforcement Learning

Authors: Dillon Davis, Huiji Gao, Weiwei Guo, Thomas Legrand, Malay Haldar, Alex Deng, Han Zhao, Liwei He, Sanjeev Katariya

Abstract: The Airbnb search system grapples with many unique challenges as it continues to evolve. We oversee a marketplace that is nuanced by geography, diversity of homes, and guests with a variety of preferences. Crafting an efficient search system that can accommodate diverse guest needs, while showcasing relevant homes lies at the heart of Airbnb's success. Airbnb search has many challenges that parallel other recommendation and search systems but it has a unique information retrieval problem, upstream of ranking, called location retrieval. It requires defining a topological map area that is relevant to the searched query for homes listing retrieval. The purpose of this paper is to demonstrate the methodology, challenges, and impact of building a machine learning based location retrieval product from the ground up. Despite the lack of suitable, prevalent machine learning based approaches, we tackle cold start, generalization, differentiation and algorithmic bias. We detail the efficacy of heuristics, statistics, machine learning, and reinforcement learning approaches to solve these challenges, particularly for systems that are often unexplored by current literature.

Submitted 23 August, 2024; originally announced August 2024.

arXiv:2408.12748 [pdf, other]

SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection

Authors: Mengya Hu, Rui Xu, Deren Lei, Yaxi Li, Mingyu Wang, Emily Ching, Eslam Kamal, Alex Deng

Abstract: Large language models (LLMs) are highly capable but face latency challenges in real-time applications, such as conducting online hallucination detection. To overcome this issue, we propose a novel framework that leverages a small language model (SLM) classifier for initial detection, followed by an LLM as a constrained reasoner to generate detailed explanations for detected hallucinated content. This study optimizes real-time interpretable hallucination detection by introducing effective prompting techniques that align LLM-generated explanations with SLM decisions. Empirical experiment results demonstrate its effectiveness, thereby enhancing the overall user experience.

Submitted 22 August, 2024; originally announced August 2024.

Comments: preprint under review

arXiv:2406.17746 [pdf, other]

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Authors: USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, Jaydeep Borkar, Christopher A. Choquette-Choo, Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra

Abstract: Memorization in language models is typically treated as a homogenous phenomenon, neglecting the specifics of the memorized data. We instead model memorization as the effect of a set of complex factors that describe each sample and relate it to the model and corpus. To build intuition around these factors, we break memorization down into a taxonomy: recitation of highly duplicated sequences, reconstruction of inherently predictable sequences, and recollection of sequences that are neither. We demonstrate the usefulness of our taxonomy by using it to construct a predictive model for memorization. By analyzing dependencies and inspecting the weights of the predictive model, we find that different factors influence the likelihood of memorization differently depending on the taxonomic category.

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2402.18396 [pdf, other]

Deep Confident Steps to New Pockets: Strategies for Docking Generalization

Authors: Gabriele Corso, Arthur Deng, Benjamin Fry, Nicholas Polizzi, Regina Barzilay, Tommi Jaakkola

Abstract: Accurate blind docking has the potential to lead to new biological breakthroughs, but for this promise to be realized, docking methods must generalize well across the proteome. Existing benchmarks, however, fail to rigorously assess generalizability. Therefore, we develop DockGen, a new benchmark based on the ligand-binding domains of proteins, and we show that existing machine learning-based docking models have very weak generalization abilities. We carefully analyze the scaling laws of ML-based docking and show that, by scaling data and model size, as well as integrating synthetic data strategies, we are able to significantly increase the generalization capacity and set new state-of-the-art performance across benchmarks. Further, we propose Confidence Bootstrapping, a new training paradigm that solely relies on the interaction between diffusion and confidence models and exploits the multi-resolution generation process of diffusion models. We demonstrate that Confidence Bootstrapping significantly improves the ability of ML-based docking methods to dock to unseen protein classes, edging closer to accurate and generalizable blind docking methods.

Submitted 28 February, 2024; originally announced February 2024.

Journal ref: International Conference on Learning Representations 2024

arXiv:2402.15300 [pdf, other]

Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding

Authors: Ailin Deng, Zhirui Chen, Bryan Hooi

Abstract: Large Vision-Language Models (LVLMs) are susceptible to object hallucinations, an issue in which their generated text contains non-existent objects, greatly limiting their reliability and practicality. Current approaches often rely on the model's token likelihoods or other internal information, instruction tuning on additional datasets, or incorporating complex external tools. We first perform empirical analysis on sentence-level LVLM hallucination, finding that CLIP similarity to the image acts as a stronger and more robust indicator of hallucination compared to token likelihoods. Motivated by this, we introduce our CLIP-Guided Decoding (CGD) approach, a straightforward but effective training-free approach to reduce object hallucination at decoding time. CGD uses CLIP to guide the model's decoding process by enhancing visual grounding of generated text with the image. Experiments demonstrate that CGD effectively mitigates object hallucination across multiple LVLM families while preserving the utility of text generation. Codes are available at https://github.com/d-ailin/CLIP-Guided-Decoding.

Submitted 23 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

Comments: Code URL: https://github.com/d-ailin/CLIP-Guided-Decoding

arXiv:2401.01505 [pdf, other]

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

Authors: Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

Abstract: Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval. However, this task has not been explored due to the lack of relevant datasets and the challenging nature it presents. Most datasets for video question answering (VideoQA) focus mainly on general and coarse-grained understanding of daily-life videos, which is not applicable to sports scenarios requiring professional action understanding and fine-grained motion analysis. In this paper, we introduce the first dataset, named Sports-QA, specifically designed for the sports VideoQA task. The Sports-QA dataset includes various types of questions, such as descriptions, chronologies, causalities, and counterfactual conditions, covering multiple sports. Furthermore, to address the characteristics of the sports VideoQA task, we propose a new Auto-Focus Transformer (AFT) capable of automatically focusing on particular scales of temporal information for question answering. We conduct extensive experiments on Sports-QA, including baseline studies and the evaluation of different methods. The results demonstrate that our AFT achieves state-of-the-art performance.

Submitted 14 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

arXiv:2312.08341 [pdf, other]

Coordinating mobile network coverage and vehicle routing: a double column generation approach

Authors: Adam Deng, Alexandre Jacquillat

Abstract: The emergence of 5G technologies opens opportunities to support mission-critical activities with high-speed Internet coverage. This paper defines a joint job-emitting vehicle routing problem with time windows to coordinate the operations of mission-oriented vehicles ("mission vehicles") and mobile emitters ("emitting vehicles"). This problem exhibits a joint vehicle routing structure, with coupling constraints to ensure that each job is supported by appropriate network coverage. We solve it via an exact and finite double column generation algorithm: pricing problems generate vehicle paths dynamically, and a master problem coordinates the operations of mission vehicles and emitting vehicles to ensure appropriate network coverage for each job. We propose several acceleration strategies to strengthen the algorithm's computational performance. Computational results show the scalability of the proposed methodology. Specifically, the methodology cuts runtimes by over 95% in small-scale instances as compared to an explicit formulation, and scales to large-scale instances involving over 50 jobs. From a practical standpoint, results highlight the benefits of dynamically coordinating mission vehicles and emitting vehicles, thus suggesting opportunities to support emerging 5G technologies with dedicated routing algorithms.

Submitted 13 December, 2023; originally announced December 2023.

Comments: 36 pages, 8 figures, 7 tables

arXiv:2312.02935 [pdf, other]

From Augmentation to Decomposition: A New Look at CUPED in 2023

Authors: Alex Deng, Luke Hagar, Nathaniel Stevens, Tatiana Xifara, Lo-Hua Yuan, Amit Gandhi

Abstract: Ten years ago, CUPED (Controlled Experiments Utilizing Pre-Experiment Data) mainstreamed the idea of variance reduction leveraging pre-experiment covariates. Since its introduction, it has been implemented, extended, and modernized by major online experimentation platforms. Many researchers and practitioners often interpret CUPED as a regression adjustment. In this article, we clarify its similarities and differences to regression adjustment and present CUPED as a more general augmentation framework which is closer to the spirit of the 2013 paper. We show that the augmentation view naturally leads to cleaner developments of variance reduction beyond simple average metrics, including ratio metrics and percentile metrics. Moreover, the augmentation view can go beyond using pre-experiment data and leverage in-experiment data, leading to significantly larger variance reduction. We further introduce metric decomposition using approximate null augmentation (ANA) as a mental model for in-experiment variance reduction. We study it under both a Bayesian framework and a frequentist optimal proxy metric framework. Metric decomposition arises naturally in conversion funnels, so this work has broad applicability.

Submitted 5 December, 2023; originally announced December 2023.

arXiv:2309.16424 [pdf, other]

doi 10.1145/3583780.3615015

Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News Detection

Authors: Jiaying Wu, Shen Li, Ailin Deng, Miao Xiong, Bryan Hooi

Abstract: Despite considerable advances in automated fake news detection, due to the timely nature of news, it remains a critical open question how to effectively predict the veracity of news articles based on limited fact-checks. Existing approaches typically follow a "Train-from-Scratch" paradigm, which is fundamentally bounded by the availability of large-scale annotated data. While expressive pre-trained language models (PLMs) have been adapted in a "Pre-Train-and-Fine-Tune" manner, the inconsistency between pre-training and downstream objectives also requires costly task-specific supervision. In this paper, we propose "Prompt-and-Align" (P&A), a novel prompt-based paradigm for few-shot fake news detection that jointly leverages the pre-trained knowledge in PLMs and the social context topology. Our approach mitigates label scarcity by wrapping the news article in a task-related textual prompt, which is then processed by the PLM to directly elicit task-specific knowledge. To supplement the PLM with social context without inducing additional training overheads, motivated by empirical observation on user veracity consistency (i.e., social users tend to consume news of the same veracity type), we further construct a news proximity graph among news articles to capture the veracity-consistent signals in shared readerships, and align the prompting predictions along the graph edges in a confidence-informed manner.
Extensive experiments on three real-world benchmarks demonstrate that P&A sets new states-of-the-art for few-shot fake news detection performance by significant margins.

Submitted 28 September, 2023; originally announced September 2023.

Comments: Accepted to CIKM 2023 (Full Paper)

arXiv:2308.03278 [pdf]

Key Gene Mining in Transcriptional Regulation for Specific Biological Processes with Small Sample Sizes Using Multi-network pipeline Transformer

Authors: Kerui Huang, Jianhong Tian, Lei Sun, Li Zeng, Peng Xie, Aihua Deng, Ping Mo, Zhibo Zhou, Ming Jiang, Yun Wang, Xiaocheng Jiang

Abstract: Gene mining is an important topic in the field of life sciences, but traditional machine learning methods cannot consider the regulatory relationships between genes. Deep learning methods perform poorly in small sample sizes. This study proposed a deep learning method, called TransGeneSelector, that can mine critical regulatory genes involved in certain life processes using a small-sample transcriptome dataset. The method combines a WGAN-GP data augmentation network, a sample filtering network, and a Transformer classifier network, which successfully classified the state (germinating or dry seeds) of Arabidopsis thaliana seed in a dataset of 79 samples, showing performance comparable to that of Random Forests. Further, through the use of SHapley Additive exPlanations method, TransGeneSelector successfully mined genes involved in seed germination. Through the construction of gene regulatory networks and the enrichment analysis of KEGG, as well as RT-qPCR quantitative analysis, it was confirmed that these genes are at a more upstream regulatory level than those Random Forests mined, and the top 11 genes that were uniquely mined by TransGeneSelector were found to be related to the KAI2 signaling pathway, which is of great regulatory importance for germination-related genes. This study provides a practical tool for life science researchers to mine key genes from transcriptome data.

Submitted 6 August, 2023; originally announced August 2023.

Comments: 34 pages, 6 figures

arXiv:2307.01920 [pdf, other]

doi 10.1109/DSLW53931.2022.9820497

Siamese Learning-based Monarch Butterfly Localization

Authors: Sara Shoouri, Mingyu Yang, Gordy Carichner, Yuyang Li, Ehab A. Hamed, Angela Deng, Delbert A. Green II, Inhee Lee, David Blaauw, Hun-Seok Kim

Abstract: A new GPS-less, daily localization method is proposed with deep learning sensor fusion that uses daylight intensity and temperature sensor data for Monarch butterfly tracking. Prior methods suffer from the location-independent day length during the equinox, resulting in high localization errors around that date. This work proposes a new Siamese learning-based localization model that improves the accuracy and reduces the bias of daily Monarch butterfly localization using light and temperature measurements. To train and test the proposed algorithm, we use $5658$ daily measurement records collected through a data measurement campaign involving 306 volunteers across the U.S., Canada, and Mexico from 2018 to 2020. This model achieves a mean absolute error of $1.416^\circ$ in latitude and $0.393^\circ$ in longitude coordinates outperforming the prior method.

Submitted 4 July, 2023; originally announced July 2023.

Comments: 2022 IEEE Data Science and Learning Workshop (DSLW)

arXiv:2306.04590 [pdf, other]

Proximity-Informed Calibration for Deep Neural Networks

Authors: Miao Xiong, Ailin Deng, Pang Wei Koh, Jiaying Wu, Shen Li, Jianqing Xu, Bryan Hooi

Abstract: Confidence calibration is central to providing accurate and interpretable uncertainty estimates, especially under safety-critical scenarios. However, we find that existing calibration algorithms often overlook the issue of *proximity bias*, a phenomenon where models tend to be more overconfident in low proximity data (i.e., data lying in the sparse region of the data distribution) compared to high proximity samples, and thus suffer from inconsistent miscalibration across different proximity samples. We examine the problem over 504 pretrained ImageNet models and observe that: 1) Proximity bias exists across a wide variety of model architectures and sizes; 2) Transformer-based models are relatively more susceptible to proximity bias than CNN-based models; 3) Proximity bias persists even after performing popular calibration algorithms like temperature scaling; 4) Models tend to overfit more heavily on low proximity samples than on high proximity samples. Motivated by the empirical findings, we propose ProCal, a plug-and-play algorithm with a theoretical guarantee to adjust sample confidence based on proximity. To further quantify the effectiveness of calibration algorithms in mitigating proximity bias, we introduce proximity-informed expected calibration error (PIECE) with theoretical analysis. We show that ProCal is effective in addressing proximity bias and improving calibration on balanced, long-tail, and distribution-shift settings under four metrics over various model architectures.
We believe our findings on proximity bias will guide the development of *fairer and better-calibrated* models, contributing to the broader pursuit of trustworthy AI. Our code is available at: https://github.com/MiaoXiong2320/ProximityBias-Calibration.

Submitted 17 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: The paper is accepted by NeurIPS 2023. The code is available at: https://github.com/MiaoXiong2320/ProximityBias-Calibration

arXiv:2305.01481 [pdf, other]

Great Models Think Alike: Improving Model Reliability via Inter-Model Latent Agreement

Authors: Ailin Deng, Miao Xiong, Bryan Hooi

Abstract: Reliable application of machine learning is of primary importance to the practical deployment of deep learning methods. A fundamental challenge is that models are often unreliable due to overconfidence. In this paper, we estimate a model's reliability by measuring *the agreement between its latent space, and the latent space of a foundation model*. However, it is challenging to measure the agreement between two different latent spaces due to their incoherence, e.g., arbitrary rotations and different dimensionality. To overcome this incoherence issue, we design a *neighborhood agreement measure* between latent spaces and find that this agreement is surprisingly well-correlated with the reliability of a model's predictions. Further, we show that fusing neighborhood agreement into a model's predictive confidence in a post-hoc way significantly improves its reliability. Theoretical analysis and extensive experiments on failure detection across various datasets verify the effectiveness of our method on both in-distribution and out-of-distribution settings.

Submitted 2 May, 2023; originally announced May 2023.

Comments: ICML 2023

arXiv:2304.14860 [pdf]

Electron-infrared phonon coupling in ABC trilayer graphene

Authors: Xiaozhou Zan, Xiangdong Guo, Aolin Deng, Zhiheng Huang, Le Liu, Fanfan Wu, Yalong Yuan, Jiaojiao Zhao, Yalin Peng, Lu Li, Yangkun Zhang, Xiuzhen Li, Jundong Zhu, Jingwei Dong, Dongxia Shi, Wei Yang, Xiaoxia Yang, Zhiwen Shi, Luojun Du, Qing Dai, Guangyu Zhang

Abstract: Stacking order plays a crucial role in determining the crystal symmetry and has significant impacts on electronic, optical, magnetic, and topological properties. Electron-phonon coupling, which is central to a wide range of intriguing quantum phenomena, is expected to be intricately connected with stacking order. Understanding the stacking order-dependent electron-phonon coupling is essential for understanding peculiar physical phenomena associated with electron-phonon coupling, such as superconductivity and charge density waves. In this study, we investigate the effect of stacking order on electron-infrared phonon coupling in graphene trilayers. By using gate-tunable Raman spectroscopy and excitation frequency-dependent near-field infrared nanoscopy, we show that rhombohedral ABC-stacked trilayer graphene has a significantly stronger electron-infrared phonon coupling strength than the Bernal ABA-stacked trilayer graphene. Our findings provide novel insights into the superconductivity and other fundamental physical properties of rhombohedral ABC-stacked trilayer graphene, and can enable nondestructive and high-throughput imaging of trilayer graphene stacking order using Raman scattering.

Submitted 28 April, 2023; originally announced April 2023.

arXiv:2304.07775 [pdf, other]

Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

Authors: Wenke Xia, Xingjian Li, Andong Deng, Haoyi Xiong, Dejing Dou, Di Hu

Abstract: Cross-modal distillation has been widely used to transfer knowledge across different modalities, enriching the representation of the target unimodal one. Recent studies closely relate the temporal synchronization between vision and sound to the semantic consistency for cross-modal distillation. However, such semantic consistency from the synchronization is hard to guarantee in unconstrained videos, due to the irrelevant modality noise and differentiated semantic correlation. To this end, we first propose a *Modality Noise Filter* (MNF) module to erase the irrelevant noise in the teacher modality with cross-modal context. After this purification, we then design a *Contrastive Semantic Calibration* (CSC) module to adaptively distill useful knowledge for the target modality, by referring to the differentiated sample-wise semantic correlation in a contrastive fashion. Extensive experiments show that our method brings a performance boost compared with other distillation methods in both visual action recognition and video retrieval tasks. We also extend to the audio tagging task to demonstrate the generalization of our method. The source code is available at https://github.com/GeWu-Lab/cross-modal-distillation.

Submitted 27 April, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

arXiv:2303.13505 [pdf, other]

A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition

Authors: Andong Deng, Taojiannan Yang, Chen Chen

Abstract: The goal of building a benchmark (suite of datasets) is to provide a unified protocol for fair evaluation and thus facilitate the evolution of a specific area. Nonetheless, we point out that existing protocols of action recognition could yield partial evaluations due to several limitations. To comprehensively probe the effectiveness of spatiotemporal representation learning, we introduce BEAR, a new BEnchmark on video Action Recognition. BEAR is a collection of 18 video datasets grouped into 5 categories (anomaly, gesture, daily, sports, and instructional), which covers a diverse set of real-world applications. With BEAR, we thoroughly evaluate 6 common spatiotemporal models pre-trained by both supervised and self-supervised learning. We also report transfer performance via standard finetuning, few-shot finetuning, and unsupervised domain adaptation. Our observation suggests that current state-of-the-art cannot solidly guarantee high performance on datasets close to real-world applications, and we hope BEAR can serve as a fair and challenging evaluation benchmark to gain insights on building next-generation spatiotemporal learners. Our dataset, code, and models are released at: https://github.com/AndongDeng/BEAR

Submitted 18 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: ICCV 2023

arXiv:2303.06322 [pdf]

doi 10.1088/2752-5724/acbecd

Quick Identification of ABC Trilayer Graphene at Nanoscale Resolution via a Near-field Optical Route

Authors: Peiyue Shen, Xianliang Zhou, Jiajun Chen, Aolin Deng, Bosai Lyu, Zhichun Zhang, Shuo Lou, Saiqun Ma, Binbin Wei, Zhiwen Shi

Abstract: ABC-stacked trilayer graphene has exhibited a variety of correlated phenomena owing to its relatively flat bands and gate-tunable bandgap. However, convenient methods are still lacking for identifying ABC graphene with nanometer-scale resolution. Here we demonstrate that the scanning near-field optical microscope (SNOM) working in ambient conditions can provide quick recognition of ABC trilayer graphene with no ambiguity and excellent resolution (~20 nm). The recognition is based on the difference in near-field infrared (IR) responses between the ABA and ABC trilayers. We show that at most frequencies, the response of the ABC trilayer is weaker than that of the ABA trilayer. However, near the graphene phonon frequency (~1585 cm⁻¹), ABC's response increases dramatically when gated and exhibits a narrow and sharp Fano-shape resonant line, whereas the ABA trilayer is largely featureless. Consequently, the IR contrast between ABC and ABA becomes reversed and can even be striking (ABC/ABA~3) near the graphene phonon frequency. The observed near-field IR features can serve as a golden rule to quickly distinguish ABA and ABC trilayers with no ambiguity, which could largely advance the exploration of correlation physics in ABC-stacked trilayer graphene.

Submitted 11 March, 2023; originally announced March 2023.

Journal ref: Mater. Futures 2 (2023) 015301

arXiv:2302.02628 [pdf, other]

Trust, but Verify: Using Self-Supervised Probing to Improve Trustworthiness

Authors: Ailin Deng, Shen Li, Miao Xiong, Zhirui Chen, Bryan Hooi

Abstract: Trustworthy machine learning is of primary importance to the practical deployment of deep learning models. While state-of-the-art models achieve astonishingly good performance in terms of accuracy, recent literature reveals that their predictive confidence scores unfortunately cannot be trusted: e.g., they are often overconfident when wrong predictions are made, even for obvious outliers. In this paper, we introduce a new approach of self-supervised probing, which enables us to check and mitigate the overconfidence issue for a trained model, thereby improving its trustworthiness. We provide a simple yet effective framework, which can be flexibly applied to existing trustworthiness-related methods in a plug-and-play manner. Extensive experiments on three trustworthiness-related tasks (misclassification detection, calibration and out-of-distribution detection) across various benchmarks verify the effectiveness of our proposed probing framework.

Submitted 6 February, 2023; originally announced February 2023.

Comments: European Conference on Computer Vision 2022

arXiv:2301.02444 [pdf, other]

High-Performance Deterministic Concurrency using Lingua Franca

Authors: Christian Menard, Marten Lohstroh, Soroush Bateni, Matthew Chorlian, Arthur Deng, Peter Donovan, Clément Fournier, Shaokai Lin, Felix Suchert, Tassilo Tanneberger, Hokeun Kim, Jeronimo Castrillon, Edward A. Lee

Abstract: Actor frameworks and similar reactive programming techniques are widely used for building concurrent systems. They promise to be efficient and scale well to a large number of cores or nodes in a distributed system. However, they also expose programmers to nondeterminism, which often makes implementations hard to understand, debug, and test. The recently proposed reactor model is a promising alternative that enables efficient deterministic concurrency. In this paper, we show that determinacy implies a loss in neither expressivity nor performance. To show this, we evaluate Lingua Franca (LF), a reactor-oriented coordination language that equips mainstream programming languages with a concurrency model that automatically takes advantage of opportunities to exploit parallelism that do not introduce nondeterminism. Our implementation of the Savina benchmark suite demonstrates that, in terms of execution time, the runtime performance of LF programs even exceeds popular and highly optimized actor frameworks. We compare against Akka and CAF, which LF outperforms by 1.86x and 1.42x, respectively.

Submitted 9 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

arXiv:2212.11366 [pdf, ps, other]

Statistical Challenges in Online Controlled Experiments: A Review of A/B Testing Methodology

Authors: Nicholas Larsen, Jonathan Stallrich, Srijan Sengupta, Alex Deng, Ron Kohavi, Nathaniel Stevens

Abstract: The rise of internet-based services and products in the late 1990s brought about an unprecedented opportunity for online businesses to engage in large scale data-driven decision making. Over the past two decades, organizations such as Airbnb, Alibaba, Amazon, Baidu, Booking, Alphabet's Google, LinkedIn, Lyft, Meta's Facebook, Microsoft, Netflix, Twitter, Uber, and Yandex have invested tremendous resources in online controlled experiments (OCEs) to assess the impact of innovation on their customers and businesses. Running OCEs at scale has presented a host of challenges requiring solutions from many domains. In this paper we review challenges that require new statistical methodologies to address them. In particular, we discuss the practice and culture of online experimentation, as well as its statistics literature, placing the current methodologies within their relevant statistical lineages and providing illustrative examples of OCE applications. Our goal is to raise academic statisticians' awareness of these new research opportunities to increase collaboration between academia and the online industry.

Submitted 19 October, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

arXiv:2212.08072 [pdf]

Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRs

Authors: Zeljko Kraljevic, Dan Bean, Anthony Shek, Rebecca Bendayan, Harry Hemingway, Joshua Au Yeung, Alexander Deng, Alfie Baston, Jack Ross, Esther Idowu, James T Teo, Richard J Dobson

Abstract: Background: Electronic Health Records hold detailed longitudinal information about each patient's health status and general clinical history, a large portion of which is stored within the unstructured text. Existing approaches focus mostly on structured data and a subset of single-domain outcomes. We explore how temporal modelling of patients from free text and structured data, using deep generative transformers, can be used to forecast a wide range of future disorders, substances, procedures or findings. Methods: We present Foresight, a novel transformer-based pipeline that uses named entity recognition and linking tools to convert document text into structured, coded concepts, followed by providing probabilistic forecasts for future medical events such as disorders, substances, procedures and findings. We processed the entire free-text portion from three different hospital datasets totalling 811336 patients covering both physical and mental health. Findings: On tests in two UK hospitals (King's College Hospital, South London and Maudsley) and the US MIMIC-III dataset, precision@10 of 0.68, 0.76 and 0.88 was achieved for forecasting the next disorder in a patient timeline, while precision@10 of 0.80, 0.81 and 0.91 was achieved for forecasting the next biomedical concept. Foresight was also validated on 34 synthetic patient timelines by five clinicians and achieved relevancy of 97% for the top forecasted candidate disorder. As a generative model, it can forecast follow-on biomedical concepts for as many steps as required. Interpretation: Foresight is a general-purpose model for biomedical concept modelling that can be used for real-world risk forecasting, virtual trials and clinical research to study the progression of disorders, simulate interventions and counterfactuals, and educational purposes.

Submitted 24 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.
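
For readers unfamiliar with the precision@10 figures quoted in the Foresight abstract above, the following is a minimal sketch of the common information-retrieval definition of precision@k: the fraction of the top k ranked candidates that are relevant. The abstract does not state the exact definition or evaluation code used in the paper, so this is an assumption. The function name `precision_at_k` and the example concept labels are hypothetical and are not taken from Foresight.

```python
from typing import Sequence, Set


def precision_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """Fraction of the top-k ranked candidates that are in the relevant set.

    This is the common information-retrieval definition; the Foresight paper
    may define precision@10 differently (assumption, not confirmed here).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked[:k]
    # Count how many of the first k candidates are relevant.
    hits = sum(1 for item in top_k if item in relevant)
    return hits / k


# Hypothetical forecast: candidate next concepts, ordered by model score.
forecast = ["hypertension", "asthma", "diabetes", "anaemia"]
# Hypothetical ground truth: concepts that actually occur next.
observed = {"hypertension", "diabetes"}

score = precision_at_k(forecast, observed, k=4)
```

Here two of the four ranked candidates are relevant, so `score` is 0.5. Note that dividing by k (rather than by the number of candidates returned) penalises a model that returns fewer than k candidates, which is one of several conventions in use.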

arXiv:2212.01560 [pdf]

High-resolution and reliable automatic target recognition based on photonic ISAR imaging system with explainable deep learning

Authors: Xiuting Zou, Anyi Deng, Yiheng Hu, Shiyu Hua, Linbo Zhang, Shaofu Xu, Weiwen Zou

Abstract: Automatic target recognition (ATR) based on inverse synthetic aperture radar (ISAR) images, which is extensively utilized to surveil the environment in military and civil fields, must be high-precision and reliable. Photonic technologies' advantage of broad bandwidth enables ISAR systems to realize high-resolution imaging, which favors high-performance ATR. Deep learning (DL) algorithms have achieved excellent recognition accuracies. However, the lack of interpretability of DL algorithms raises a troubling problem of credibility. In this paper, we exploit the inner relationship between a photonic ISAR imaging system and the behaviors of a convolutional neural network (CNN) to deeply comprehend the intelligent recognition. Specifically, we manipulate the physical imaging process and analyze network outputs, the relevance between the ISAR image and network output, and the visualization of features in the network output layer. Consequently, broader imaging bandwidths and appropriate imaging angles lead to more detailed structural and contour features and a larger discrepancy among ISAR images of different targets, which helps the CNN recognize and distinguish objects according to physical laws. Then, based on the photonic ISAR imaging system and the explainable CNN, we accomplish a high-accuracy and reliable ATR. To the best of our knowledge, there is no precedent of explaining the DL algorithms by exploring the influence of the physical process of data generation on network behaviors. It is anticipated that this work can not only inspire the accomplishment of a high-performance ATR but also bring new insights to explore network behaviors and thus achieve better intelligent abilities.

Submitted 3 December, 2022; originally announced December 2022.

arXiv:2211.16466 [pdf, other]

Birds of a Feather Trust Together: Knowing When to Trust a Classifier via Adaptive Neighborhood Aggregation

Authors: Miao Xiong, Shen Li, Wenjie Feng, Ailin Deng, Jihai Zhang, Bryan Hooi

Abstract: How do we know when the predictions made by a classifier can be trusted? This is a fundamental problem that also has immense practical applicability, especially in safety-critical areas such as medicine and autonomous driving. The de facto approach of using the classifier's softmax outputs as a proxy for trustworthiness suffers from the over-confidence issue, while the most recent works incur problems such as additional retraining cost and an accuracy versus trustworthiness trade-off. In this work, we argue that the trustworthiness of a classifier's prediction for a sample is highly associated with two factors: the sample's neighborhood information and the classifier's output. To combine the best of both worlds, we design a model-agnostic post-hoc approach, NeighborAgg, to leverage these two essential sources of information via an adaptive neighborhood aggregation. Theoretically, we show that NeighborAgg is a generalized version of a one-hop graph convolutional network, inheriting the powerful modeling ability to capture the varying similarity between samples within each class. We also extend our approach to the closely related task of mislabel detection and provide a theoretical coverage guarantee to bound the false negative. Empirically, extensive experiments on image and tabular benchmarks verify our theory and suggest that NeighborAgg outperforms other methods, achieving state-of-the-art trustworthiness performance.

Submitted 29 November, 2022; originally announced November 2022.

Comments: Published in Transactions on Machine Learning Research (TMLR) 2022

Journal ref: Transactions on Machine Learning Research 08/2022

arXiv:2211.09310 [pdf, other]

Language-Assisted Deep Learning for Autistic Behaviors Recognition

Authors: Andong Deng, Taojiannan Yang, Chen Chen, Qian Chen, Leslie Neely, Sakiko Oyama

Abstract: Correctly recognizing the behaviors of children with Autism Spectrum Disorder (ASD) is of vital importance for the diagnosis of autism and timely early intervention. However, the observation and recording during the treatment from the parents of autistic children may not be accurate and objective. In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent. Existing human action recognition models can now achieve persuasive performance on challenging activity datasets, e.g. daily activity and sports activity. However, problem behaviors in children with ASD are very different from these general activities, and recognizing these problem behaviors via computer vision is less studied. In this paper, we first evaluate a strong baseline for action recognition, i.e. Video Swin Transformer, on two autism behaviors datasets (SSBD and ESBD) and show that it can achieve high accuracy and outperform the previous methods by a large margin, demonstrating the feasibility of vision-based problem behaviors recognition. Moreover, we propose language-assisted training to further enhance the action recognition performance. Specifically, we develop a two-branch multimodal deep learning framework by incorporating the "freely available" language description for each type of problem behavior. Experimental results demonstrate that incorporating additional language supervision can bring an obvious performance boost for the autism problem behaviors recognition task as compared to using the video information only (i.e. 3.49% improvement on ESBD and 1.46% on SSBD).

Submitted 4 January, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

Comments: Smart Health Journal

arXiv:2210.16373 [pdf, other]

Continuous Attribution of Episodical Outcomes for More Efficient and Targeted Online Measurement

Authors: Alex Deng, Michelle Du, Anna Matlin

Abstract: Online experimentation platforms collect user feedback at low cost and large scale. Some systems even support real-time or near real-time data processing, and can update metrics and statistics continuously. Many commonly used metrics, such as clicks and page views, can be observed without much delay. However, many important signals can only be observed after several hours or days, with noise adding up over the duration of the episode. When episodical outcomes follow a complex sequence of user-product interactions, it is difficult to understand which interactions lead to the final outcome. There is no obvious attribution logic for us to associate a positive or negative outcome back to the actions and choices we made at different times. This attribution logic is critical to unlocking more targeted and efficient measurement at a finer granularity that could eventually lead to the full capability of reinforcement learning. In this paper, we borrow the idea of Causal Surrogacy to model a long-term outcome using leading indicators that are incrementally observed and apply it as the value function to track the progress towards the final outcome and attribute incrementally to various user-product interaction steps. Applying this approach to the guest booking metric at Airbnb resulted in significant variance reductions of 50% to 85%, while aligning well with the booking metric itself. Continuous attribution allows us to assign a utility score to each product page-view, and this score can be flexibly further aggregated to a variety of units of interest, such as searches and listings. We provide multiple real-world applications of attribution to illustrate its versatility.

Submitted 28 October, 2022; originally announced October 2022.

arXiv:2205.13965 [pdf]

Catalytic growth of ultralong graphene nanoribbons on insulating substrates

Authors: Bosai Lyu, Jiajun Chen, Shuo Lou, Can Li, Lu Qiu, Wengen Ouyang, Jingxu Xie, Izaac Mitchell, Tongyao Wu, Aolin Deng, Cheng Hu, Xianliang Zhou, Peiyue Shen, Saiqun Ma, Zhenghan Wu, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, Qi Liang, Jinfeng Jia, Michael Urbakh, Oded Hod, Feng Ding, Shiyong Wang, Zhiwen Shi

Abstract: Graphene nanoribbons (GNRs) with widths of a few nanometres are promising candidates for future nano-electronic applications due to their structurally tunable bandgaps, ultrahigh carrier mobilities, and exceptional stability. However, the direct growth of micrometre-long GNRs on insulating substrates, which is essential for the fabrication of nano-electronic devices, remains an immense challenge. Here, we report the epitaxial growth of GNRs on an insulating hexagonal boron nitride (h-BN) substrate through nanoparticle-catalysed chemical vapor deposition (CVD). Ultra-narrow GNRs with lengths of up to 10 μm are synthesized. Remarkably, the as-grown GNRs are crystallographically aligned with the h-BN substrate, forming one-dimensional (1D) moiré superlattices. Scanning tunnelling microscopy reveals an average width of 2 nm and a typical bandgap of ~1 eV for similar GNRs grown on conducting graphite substrates. Fully atomistic computational simulations support the experimental results and reveal a competition between the formation of GNRs and carbon nanotubes (CNTs) during the nucleation stage, and van der Waals sliding of the GNRs on the h-BN substrate throughout the growth stage. Our study provides a scalable, single-step method for growing micrometre-long narrow GNRs on insulating substrates, thus opening a route to explore the performance of high-quality GNR devices and the fundamental physics of 1D moiré superlattices.

Submitted 27 May, 2022; originally announced May 2022.

arXiv:2204.02665 [pdf, other]

Microjoule-level mid-infrared femtosecond pulse generation in hollow-core fibres

Authors: Ang Deng, Trivikramarao Gavara, Muhammad Rosdi Abu Hassan, Md Imran Hasan, Wonkeun Chang

Abstract: We demonstrate a fibre-based approach that generates mid-infrared femtosecond pulses in the 3-4 μm spectral region with microjoule-level single pulse energy. This is realised in a piece of gas-filled antiresonant hollow-core fibre that is pumped by a two-micron light source. A rapid variation of the dispersion near a structural resonance of the fibre creates a phase-matching point in the mid-infrared, which mediates the frequency-down conversion. We generate femtosecond pulses centred at 3.16 μm wavelength with the pulse energy of more than 1 μJ, achieving the conversion efficiency as high as 9.4%. The wavelength of the radiation is determined solely by the dielectric wall thickness of the cladding elements, while the yield is subject to other experimental parameters. This, combined with high power-handling capability of hollow-core fibres, makes it possible to power scale the mid-infrared output by either increasing the pulse energy or repetition rate of the pump. The technique presents a new pathway to build an all-fibre-based mid-infrared supercontinuum source, which promises to be a powerful new tool for ultrahigh sensitivity molecular spectroscopy.

Submitted 6 April, 2022; originally announced April 2022.

arXiv:2203.15332 [pdf, other]

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Authors: Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu

Abstract: Multimodal learning helps to comprehensively understand the world, by integrating different senses. Accordingly, multiple input modalities are expected to boost model performance, but we actually find that they are not fully exploited even when the multimodal model outperforms its uni-modal counterpart. Specifically, in this paper we point out that existing multimodal discriminative models, in which a uniform objective is designed for all modalities, can leave uni-modal representations under-optimized because another modality dominates in some scenarios, e.g., sound in a blowing-wind event, vision in a drawing-picture event, etc. To alleviate this optimization imbalance, we propose on-the-fly gradient modulation to adaptively control the optimization of each modality, via monitoring the discrepancy of their contribution towards the learning objective. Further, an extra Gaussian noise that changes dynamically is introduced to avoid a possible generalization drop caused by gradient modulation. As a result, we achieve considerable improvement over common fusion methods on different multimodal tasks, and this simple strategy can also boost existing multimodal methods, which illustrates its efficacy and versatility. The source code is available at https://github.com/GeWu-Lab/OGM-GE_CVPR2022.

Submitted 29 March, 2022; originally announced March 2022.

Comments: Accepted by CVPR 2022 (ORAL)

arXiv:2203.04668 [pdf, other]

Towards Inadequately Pre-trained Models in Transfer Learning

Authors: Andong Deng, Xingjian Li, Di Hu, Tianyang Wang, Haoyi Xiong, Chengzhong Xu

Abstract: Pre-training has been a popular learning paradigm in the deep learning era, especially in annotation-insufficient scenarios. Better ImageNet pre-trained models have been demonstrated, from the perspective of architecture, by previous research to have better transferability to downstream tasks. However, in this paper, we found that during the same pre-training process, models at middle epochs, which are inadequately pre-trained, can outperform fully trained models when used as feature extractors (FE), while the fine-tuning (FT) performance still grows with the source performance. This reveals that there is not a solid positive correlation between top-1 accuracy on ImageNet and the transferring result on target data. Based on the contradictory phenomenon between FE and FT that a better feature extractor fails to be fine-tuned better accordingly, we conduct comprehensive analyses on features before the softmax layer to provide insightful explanations. Our discoveries suggest that, during pre-training, models tend to first learn spectral components corresponding to large singular values and the residual components contribute more when fine-tuning.

Submitted 16 August, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

Comments: Accepted by ICCV'2023

arXiv:2112.13299 [pdf, other]

Zero to Hero: Exploiting Null Effects to Achieve Variance Reduction in Experiments with One-sided Triggering

Authors: Alex Deng, Lo-Hua Yuan, Naoya Kanai, Alexandre Salama-Manteau

Abstract: In online experiments where the intervention is only exposed, or "triggered", for a small subset of the population, it is critical to use variance reduction techniques to estimate treatment effects with sufficient precision to inform business decisions. Trigger-dilute analysis is often used in these situations, and reduces the sampling variance of overall intent-to-treat (ITT) effects by an order of magnitude equal to the inverse of the triggering rate; for example, a triggering rate of $5\%$ corresponds to roughly a $20x$ reduction in variance. To apply trigger-dilute analysis, one needs to know experimental subjects' triggering counterfactual statuses, i.e., the counterfactual behavior of subjects under both treatment and control conditions. In this paper, we propose an unbiased ITT estimator with reduced variance applicable for experiments where the triggering counterfactual status is only observed in the treatment group. Our method is based on the efficiency augmentation idea of CUPED and draws upon identification frameworks from the principal stratification and instrumental variables literature. The unbiasedness of our estimation approach relies on a testable assumption that the augmentation term used for covariate adjustment equals zero in expectation. Unlike traditional covariate adjustment or principal score modeling approaches, our estimator can incorporate both pre-experiment and in-experiment observations.
We demonstrate through a real-world experiment and simulations that our estimator can remain unbiased and achieve precision improvements as large as if triggering status were fully observed, and in some cases can even outperform trigger-dilute analysis.

Submitted 31 January, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

Comments: To be published in WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining

arXiv:2112.08255 [pdf]

doi 10.1016/j.cplett.2022.139474

Atomic bonding and electrical characteristics of two-dimensional graphene/boron nitride van der Waals heterostuctures with manufactured defects via binding energy and bond-charge model

Authors: Jiannan Wang, Liangjing Ge, Anlin Deng, Hongrong Qiu, Hanze Li, Yunhu Zhu, Maolin Bo

Abstract: We used the binding energy-bond-charge model to study the atomic bonding and electrical properties of the two-dimensional graphene/BN van der Waals heterostructure. We manipulated its atomic bonding and electrical properties by manufacturing defects. We discovered that this process yielded a band structure with a flat band, i.e., a horizontal band structure without dispersion at the Fermi level. Thus, our research is significant because it is the first report on this flat band of defect graphene/BN van der Waals heterostructures.

Submitted 20 December, 2021; v1 submitted 15 December, 2021; originally announced December 2021.

arXiv:2112.03649 [pdf, other]

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Authors: Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu

Abstract: Anomaly detection in surveillance videos is challenging and important for ensuring public security. Different from pixel-based anomaly detection methods, pose-based methods utilize highly-structured skeleton data, which decreases the computational burden and also avoids the negative impact of background noise. However, unlike pixel-based methods, which could directly exploit explicit motion features such as optical flow, pose-based methods suffer from the lack of alternative dynamic representation. In this paper, a novel Motion Embedder (ME) is proposed to provide a pose motion representation from the probability perspective. Furthermore, a novel task-specific Spatial-Temporal Transformer (STT) is deployed for self-supervised pose sequence reconstruction. These two modules are then integrated into a unified framework for pose regularity learning, which is referred to as Motion Prior Regularity Learner (MoPRL). MoPRL achieves the state-of-the-art performance by an average improvement of 4.7% AUC on several challenging datasets. Extensive experiments validate the versatility of each proposed module.

Submitted 7 December, 2021; v1 submitted 7 December, 2021; originally announced December 2021.

arXiv:2106.06947 [pdf, other]

Graph Neural Network-Based Anomaly Detection in Multivariate Time Series

Authors: Ailin Deng, Bryan Hooi

Abstract: Given high-dimensional time series data (e.g., sensor data), how can we detect anomalous events, such as system faults and attacks? More challengingly, how can we do this in a way that captures complex inter-sensor relationships, and detects and explains anomalies which deviate from these relationships? Recently, deep learning approaches have enabled improvements in anomaly detection in high-dimensional datasets; however, existing methods do not explicitly learn the structure of existing relationships between variables, or use them to predict the expected behavior of time series. Our approach combines a structure learning approach with graph neural networks, additionally using attention weights to provide explainability for the detected anomalies. Experiments on two real-world sensor datasets with ground truth anomalies show that our method detects anomalies more accurately than baseline approaches, accurately captures correlations between sensors, and allows users to deduce the root cause of a detected anomaly.

Submitted 13 June, 2021; originally announced June 2021.

Comments: Accepted at AAAI Conference on Artificial Intelligence (AAAI), 2021

arXiv:2105.14705 [pdf, ps, other]

The equivalence of the Delta method and the cluster-robust variance estimator for the analysis of clustered randomized experiments

Authors: Alex Deng, Jiannan Lu, Wen Qin

Abstract: It often happens that the same problem presents itself to different communities and the solutions proposed or adopted by those communities are different. We take the case of the variance estimation of the population average treatment effect in cluster-randomized experiments. The econometrics literature promotes the cluster-robust variance estimator (Athey and Imbens, 2017), which can be dated back to the study of linear regression with clustered residuals (Liang and Zeger, 1986). The A/B testing or online experimentation literature promotes the delta method (Kohavi et al., 2010, Deng et al., 2017, 2018), which tackles the variance estimation of the ATE estimator directly using large sample theory. The two methods are seemingly different as the former begins with a regression setting at the individual unit level and the latter is semi-parametric with only i.i.d. assumptions on the clusters. Both methods are widely used in practice. It begs the question for their connection and comparison. In this paper we prove they are equivalent and in the canonical implementation they should give exactly the same result.

Submitted 31 May, 2021; originally announced May 2021.

arXiv:2104.09186 [pdf]

doi 10.1088/0256-307X/38/5/056301

Fano resonance enabled infrared nano-imaging of local strain in bilayer graphene

Authors: Jing Du, Bosai Lyu, Wanfei Shan, Jiajun Chen, Xianliang Zhou, Jingxu Xie, Aolin Deng, Cheng Hu, Qi Liang, Guibai Xie, Xiaojun Li, Weidong Luo, Zhiwen Shi

Abstract: Detection of local strain at the nanometer scale with high sensitivity remains challenging. Here we report near-field infrared nano-imaging of local strains in bilayer graphene through probing strain-induced shifts of phonon frequency. As a non-polar crystal, intrinsic bilayer graphene possesses little infrared response at its transverse optical (TO) phonon frequency. The reported optical detection of local strain is enabled by applying a vertical electrical field that breaks the symmetry of the two graphene layers and introduces finite electrical dipole moment to graphene phonon. The activated phonon further interacts with continuum electronic transitions, and generates a strong Fano resonance. The resulting Fano resonance features a very sharp near-field infrared scattering peak, which leads to an extraordinary sensitivity of ~0.002% for the strain detection. Our studies demonstrate the first nano-scale near-field Fano resonance, provide a new way to probe local strains with high sensitivity in non-polar crystals, and open exciting possibilities for studying strain-induced rich phenomena.

Submitted 19 April, 2021; originally announced April 2021.

Journal ref: Chin. Phys. Lett., 2021 38 (5): 056301

arXiv:2103.16097 [pdf]

Quantum Capacitance Induced Non-Local Electrostatic Gating Effect in Graphene

Authors: Aolin Deng, Cheng Hu, Peiyue Shen, Xingdong Luo, Jiajun Chen, Bosai Lyu, Kenji Watanabe, Takashi Taniguchi, Qi Liang, Jie Ma, Zhiwen Shi

Abstract: Electrostatic gating lies at the heart of modern FET-based integrated circuits. Usually, the gate electrode has to be placed very close to the conduction channel, typically a few nanometers, in order to achieve efficient tunability. However, remote control of a FET device through a gate electrode placed far away is always highly desired, because it not only reduces the complexity of device fabrication, but also enables designing novel devices with new functionalities. Here, a non-local gating effect in graphene using both near-field optical nano-imaging and electrical transport measurement is reported. With assistance of absorbed water molecules, the charge density of graphene can be efficiently tuned by a local-gate placed over 30 μm away. The observed non-local gating effect is initially driven by an in-plane electric field established between graphene regions with different charge densities due to the quantum capacitance near the Dirac point in graphene. The nonlocality is further amplified and largely enhanced by absorbed water molecules through screening the in-plane electric field and expending the transition length. This research reveals a novel non-local phenomenon of Dirac electrons, and paves the way for designing electronic devices with remote-control using 2D materials with small density of states.

Submitted 30 March, 2021; originally announced March 2021.

arXiv:2007.15240 [pdf, other]

doi 10.1145/3394171.3413635

Action2Motion: Conditioned Generation of 3D Human Motions

Authors: Chuan Guo, Xinxin Zuo, Sen Wang, Shihao Zou, Qingyao Sun, Annan Deng, Minglun Gong, Li Cheng

Abstract: Action recognition is a relatively established task, where given an input sequence of human motion, the goal is to predict its action category. This paper, on the other hand, considers a relatively new problem, which could be thought of as an inverse of action recognition: given a prescribed action type, we aim to generate plausible human motion sequences in 3D. Importantly, the set of generated motions are expected to maintain its diversity to be able to explore the entire action-conditioned motion space; meanwhile, each sampled sequence faithfully resembles a natural human body articulation dynamics. Motivated by these objectives, we follow the physics law of human kinematics by adopting the Lie Algebra theory to represent the natural human motions; we also propose a temporal Variational Auto-Encoder (VAE) that encourages a diverse sampling of the motion space. A new 3D human motion dataset, HumanAct12, is also constructed. Empirical experiments over three distinct human motion datasets (including ours) demonstrate the effectiveness of our approach.

Submitted 30 July, 2020; originally announced July 2020.

Comments: 13 pages, ACM MultiMedia 2020

arXiv:2007.12634 [pdf, other]

All-optical density downramp injection in electron-driven plasma wakefield accelerators

Authors: D. Ullmann, P. Scherkl, A. Knetsch, T. Heinemann, A. Sutherland, A. F. Habib, O. S. Karger, A. Beaton, G. G. Manahan, A. Deng, G. Andonian, M. D. Litos, B. D. OShea, D. L. Bruhwiler, J. R. Cary, M. J. Hogan, V. Yakimenko, J. B. Rosenzweig, B. Hidding

Abstract: Injection of well-defined, high-quality electron populations into plasma waves is a key challenge of plasma wakefield accelerators. Here, we report on the first experimental demonstration of plasma density downramp injection in an electron-driven plasma wakefield accelerator, which can be controlled and tuned in all-optical fashion by mJ-level laser pulses. The laser pulse is directed across the path of the plasma wave before its arrival, where it generates a local plasma density spike in addition to the background plasma by tunnelling ionization of a high ionization threshold gas component. This density spike distorts the plasma wave during the density downramp, causing plasma electrons to be injected into the plasma wave. By tuning the laser pulse energy and shape, highly flexible plasma density spike profiles can be designed, enabling dark current free, versatile production of high-quality electron beams. This in turn permits creation of unique injected beam configurations such as counter-oscillating twin beamlets.

Submitted 24 July, 2020; originally announced July 2020.

arXiv:2006.13443 [pdf]

Hardware-irrelevant parallel processing system

Authors: Xiuting Zou, Shaofu Xu, Anyi Deng, Rui Wang, Weiwen Zou

Abstract: Parallel processing technology has been a primary tool for achieving high-speed, high-accuracy, and broadband processing for many years across modern information systems and data processing such as optical and radar, synthetic aperture radar imaging, digital beam forming, and digital filtering systems. However, hardware deviations in a parallel processing system (PPS) severely degrade system performance and pose an urgent challenge. We propose a hardware-irrelevant PPS of which the performance is unaffected by hardware deviations. In this system, an embedded convolutional recurrent autoencoder (CRAE), which learns inherent system patterns as well as acquires and removes adverse effects brought by hardware deviations, is adopted. We implement a hardware-irrelevant PPS into a parallel photonic sampling system to accomplish a high-performance analog-to-digital conversion for microwave signals with high frequency and broad bandwidth. Under one system state, a category of signals with two different mismatch degrees is utilized to train the CRAE, which can then compensate for mismatches in various categories of signals with multiple mismatch degrees under random system states. Our approach is extensively applicable to achieving hardware-irrelevant PPSs which are either discrete or integrated in photonic, electric, and other fields.

Submitted 23 June, 2020; originally announced June 2020.

arXiv:1910.08601 [pdf, other]

doi 10.1103/PhysRevLett.124.044802

Single-Shot Characterization of High Transformer Ratio Wakefields in Nonlinear Plasma Acceleration

Authors: Ryan Roussel, Gerard Andonian, Walter Lynn, Kunal Sanwalka, River Robles, Claire Hansel, Aihua Deng, Gerard Lawler, James Rosenzweig

Abstract: Plasma wakefields can enable very high accelerating gradients for frontier high energy particle accelerators, in excess of 10 GeV/m. To overcome limits on total acceleration achievable, specially shaped drive beams can be used in both linear and nonlinear plasma wakefield accelerators (PWFA), to increase the transformer ratio, implying that the drive beam deceleration is minimized relative to acceleration obtained in the wake. In this Letter, we report the results of a nonlinear PWFA, high transformer ratio experiment using high-charge, longitudinally asymmetric drive beams in a plasma cell. An emittance exchange process is used to generate variable drive current profiles, in conjunction with a long (multiple plasma wavelength) witness beam. The witness beam is energy-modulated by the wakefield, yielding a response that contains detailed spectral information in a single-shot measurement. Using these methods, we generate a variety of beam profiles and characterize the wakefields, directly observing beam-loaded transformer ratios up to R=7.8. Furthermore, a spectrally-based reconstruction technique, validated by 3D particle-in-cell simulations, is introduced to obtain the drive beam current profile from the decelerating wake data.

Submitted 18 October, 2019; originally announced October 2019.

Journal ref: Phys. Rev. Lett. 124, 044802 (2020)

arXiv:1910.03788 [pdf, other]

doi 10.1145/3447548.3467129

On Post-Selection Inference in A/B Tests

Authors: Alex Deng, Yicheng Li, Jiannan Lu, Vivek Ramamurthy

Abstract: When interpreting A/B tests, we typically focus only on the statistically significant results and take them at face value. This practice, termed post-selection inference in the statistical literature, may negatively affect both point estimation and uncertainty quantification, and therefore hinder trustworthy decision making in A/B testing. To address this issue, in this paper we explore two seemingly unrelated paths, one based on supervised machine learning and the other on empirical Bayes, and propose post-selection inferential approaches that combine the strengths of both. Through large-scale simulated and empirical examples, we demonstrate that our proposed methodologies stand out among other existing ones in both reducing post-selection biases and improving confidence interval coverage rates, and discuss how they can be conveniently adjusted to real-life scenarios.

Submitted 30 May, 2021; v1 submitted 9 October, 2019; originally announced October 2019.

arXiv:1910.02767 [pdf, other]

doi 10.1103/PhysRevB.101.041407

Reflection Phase Shift of One-dimensional Plasmon Polaritons in Carbon Nanotubes

Authors: Xingdong Luo, Cheng Hu, Bosai Lyu, Liu Yang, Xianliang Zhou, Aolin Deng, Ji-Hun Kang, Zhiwen Shi

Abstract: We investigated, both experimentally and theoretically, the reflection phase shift (RPS) of one-dimensional plasmon polaritons. We launched 1D plasmon polaritons in carbon nanotubes and probed the plasmon interference pattern using the scanning near-field optical microscopy (SNOM) technique, through which a non-zero phase shift was observed. We further developed a theory to understand the nonzero phase shift of 1D polaritons, and found that the RPS can be understood by considering the evanescent field beyond the nanotube end. Interestingly, our theory shows a strong dependence of RPS on polariton wavelength and nanotube diameter, which is in stark contrast to 2D plasmon polaritons in graphene where the RPS is a constant. In the short wave region, the RPS of 1D polaritons only depends on a dimensionless variable -- the ratio between polariton wavelength and nanotube diameter. These results provide fundamental insights into the reflection of polaritons in 1D systems, and could facilitate the design of ultrasmall 1D polaritonic devices, such as resonators and interferometers.

Submitted 27 December, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

Journal ref: Phys. Rev. B 101, 041407 (2020)

arXiv:1908.09263 [pdf]

Plasma-photonic spatiotemporal synchronization of relativistic electron and laser beams

Authors: Paul Scherkl, Alexander Knetsch, Thomas Heinemann, Andrew Sutherland, Ahmad Fahim Habib, Oliver Karger, Daniel Ullmann, Andrew Beaton, Gavin Kirwan, Grace Manahan, Yunfeng Xi, Aihua Deng, Michael Dennis Litos, Brendan D. OShea, Selina Z. Green, Christine I. Clarke, Gerard Andonian, Ralph Assmann, Dino A. Jaroszynski, David L. Bruhwiler, Jonathan Smith, John R. Cary, Mark J. Hogan, Vitaly Yakimenko, James B. Rosenzweig, et al. (1 additional author not shown)

Abstract: Modern particle accelerators and their applications increasingly rely on precisely coordinated interactions of intense charged particle and laser beams. Femtosecond-scale synchronization alongside micrometre-scale spatial precision are essential e.g. for pump-probe experiments, seeding and diagnostics of advanced light sources and for plasma-based accelerators. State-of-the-art temporal or spatial diagnostics typically operate with low-intensity beams to avoid material damage at high intensity. As such, we present a plasma-based approach, which allows measurement of both temporal and spatial overlap of high-intensity beams directly at their interaction point. It exploits amplification of plasma afterglow arising from the passage of an electron beam through a laser-generated plasma filament. The corresponding photon yield carries the spatiotemporal signature of the femtosecond-scale dynamics, yet can be observed as a visible light signal on microsecond-millimetre scales.

Submitted 25 August, 2019; originally announced August 2019.

arXiv:1907.00875 [pdf]

Electron bunch generation from a plasma photocathode

Authors: Aihua Deng, Oliver Karger, Thomas Heinemann, Alexander Knetsch, Paul Scherkl, Grace Gloria Manahan, Andrew Beaton, Daniel Ullmann, Gregor Wittig, Ahmad Fahim Habib, Yunfeng Xi, Mike Dennis Litos, Brendan D. O'Shea, Spencer Gessner, Christine I. Clarke, Selina Z. Green, Carl Andreas Lindstrøm, Erik Adli, Rafal Zgadzaj, Mike C. Downer, Gerard Andonian, Alex Murokh, David Leslie Bruhwiler, John R. Cary, Mark J. Hogan, et al. (3 additional authors not shown)

Abstract: Plasma waves generated in the wake of intense, relativistic laser or particle beams can accelerate electron bunches to giga-electronvolt (GeV) energies in centimetre-scale distances. This allows the realization of compact accelerators having emerging applications, ranging from modern light sources such as the free-electron laser (FEL) to energy frontier lepton colliders. In a plasma wakefield accelerator, such multi-gigavolt-per-metre (GV m$^{-1}$) wakefields can accelerate witness electron bunches that are either externally injected or captured from the background plasma. Here we demonstrate optically triggered injection and acceleration of electron bunches, generated in a multi-component hydrogen and helium plasma employing a spatially aligned and synchronized laser pulse. This "plasma photocathode" decouples injection from wake excitation by liberating tunnel-ionized helium electrons directly inside the plasma cavity, where these cold electrons are then rapidly boosted to relativistic velocities. The injection regime can be accessed via optical density down-ramp injection, is highly tunable and paves the way to generation of electron beams with unprecedented low transverse emittance, high current and 6D-brightness. This experimental path opens numerous prospects for transformative plasma wakefield accelerator applications based on ultra-high brightness beams.

Submitted 1 July, 2019; originally announced July 2019.

Comments: Alternative title: Generation and acceleration of electron bunches from a plasma photocathode

arXiv:1803.06336 [pdf, ps, other]

Applying the Delta method in metric analytics: A practical guide with novel ideas

Authors: Alex Deng, Ulf Knoblich, Jiannan Lu

Abstract: During the last decade, the information technology industry has adopted a data-driven culture, relying on online metrics to measure and monitor business performance. Under the setting of big data, the majority of such metrics approximately follow normal distributions, opening up potential opportunities to model them directly without extra model assumptions and solve big data problems via closed-form formulas using distributed algorithms at a fraction of the cost of simulation-based procedures like bootstrap. However, certain attributes of the metrics, such as their corresponding data generating processes and aggregation levels, pose numerous challenges for constructing trustworthy estimation and inference procedures. Motivated by four real-life examples in metric development and analytics for large-scale A/B testing, we provide a practical guide to applying the Delta method, one of the most important tools from the classic statistics literature, to address the aforementioned challenges. We emphasize the central role of the Delta method in metric analytics by highlighting both its classic and novel applications.

Submitted 12 September, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

arXiv:1707.05919 [pdf]

doi 10.1103/PhysRevB.97.115426

$Γ$-valley assisted intervalley scattering on monolayer and bilayer WS2 revealed by time-resolved Kerr rotation spectroscopy

Authors: Huimin Su, Aiying Deng, Zhiheng Zhen, Jun-Feng Dai

Abstract: We investigated the valley depolarization and carrier relaxation process of monolayer and bilayer WS2 at 10 K by using time-resolved Kerr rotation (TRKR) and differential reflectance measurement simultaneously. Two decay processes extracted from TRKR signals were observed on both monolayer and bilayer WS2. In monolayer WS2, the initial ultrafast decay component (less than 1 ps) was interpreted as the stimulated emission or Pauli-blocking of electrons and holes in one valley by comparing with carrier decay process. The relatively slow component was around 4 ps under low excitation energy (less than 2.21 eV) and then increases with excitation energies, which approaches a saturation value of 15 ps. The onset excitation energy of 2.21 eV suggests the $Γ$-valley assisted intervalley scattering between the K and K' valley play a critical part during the valley depolarization process under the off-resonance excitation condition. By contrast, the slow decay component (48 ps) of bilayer WS2 is comparable with the carrier lifetime (58 ps). It is attributed to irreversible scattering processes from K (or K') to $Γ$ valley due to its characteristic of indirect band gap semiconductor.

Submitted 18 July, 2017; originally announced July 2017.

Journal ref: Phys. Rev. B 97, 115426 (2018)

arXiv:1707.00401 [pdf, other]

Spectra of Digraph Transformations

Authors: Aiping Deng, Alexander Kelmans

Abstract: For a digraph D and three parameters x, y, z in {0,1,+,-} we define the digraph D^(x,y,z) and call it the (x,y,z)-transformation of D. We show that for every r-regular digraph D the adjacency characteristic polynomial A(t, D^(x,y,z)) of (x,y,z)-transformation of D is uniquely defined by r and the adjacency characteristic polynomial A(t, D) of digraph D and we give a description of this function A(t, D^(x,y,z)) = F(r, A(t, D)). We also obtain similar results for some non-regular digraphs, namely, for so-called digraph-functions and their inverse. Also using the (x,y,z)-transformations of digraphs, we give various new constructions of non-isomorphic adjacency cospectral digraphs.

Submitted 3 July, 2017; originally announced July 2017.

Journal ref: Linear Algebra and its Applications, 439 (2013) 106-132

Showing 1–50 of 60 results for author: Deng, A