-
Attentive-based Multi-level Feature Fusion for Voice Disorder Diagnosis
Authors:
Lipeng Shen,
Yifan Xiong,
Dongyue Guo,
Wei Mo,
Lingyu Yu,
Hui Yang,
Yi Lin
Abstract:
Voice disorders negatively impact the quality of daily life in various ways. However, accurately recognizing the category of pathological features from raw audio remains a considerable challenge due to limited data. A promising method to handle this issue is extracting multi-level pathological information from speech in a comprehensive manner by fusing features in the latent space. In this paper, a novel framework is designed to explore high-quality feature fusion for effective and generalized detection performance. Specifically, the proposed model follows a two-stage training paradigm: (1) ECAPA-TDNN and Wav2vec 2.0, which have shown remarkable effectiveness in various domains, are employed to learn universal pathological information from raw audio; (2) an attentive fusion module is designed to establish the interaction between the pathological features projected by ECAPA-TDNN and Wav2vec 2.0, respectively, and to guide the multi-layer fusion, after which the entire model is jointly fine-tuned from the pre-trained features on the automatic voice pathology detection task. Finally, comprehensive experiments on the FEMH and SVD datasets demonstrate that the proposed framework outperforms competitive baselines, achieving accuracies of 90.51% and 87.68%.
Submitted 7 October, 2024;
originally announced October 2024.
-
Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
Authors:
Qin Liu,
Wenjie Mo,
Terry Tong,
Jiashu Xu,
Fei Wang,
Chaowei Xiao,
Muhao Chen
Abstract:
The advancement of Large Language Models (LLMs) has significantly impacted various domains, including Web search, healthcare, and software development. However, as these models scale, they become more vulnerable to cybersecurity risks, particularly backdoor attacks. By exploiting the potent memorization capacity of LLMs, adversaries can easily inject backdoors into LLMs by manipulating a small portion of training data, leading to malicious behaviors in downstream applications whenever the hidden backdoor is activated by the pre-defined triggers. Moreover, emerging learning paradigms like instruction tuning and reinforcement learning from human feedback (RLHF) exacerbate these risks as they rely heavily on crowdsourced data and human feedback, which are not fully controlled. In this paper, we present a comprehensive survey of emerging backdoor threats to LLMs that appear during LLM development or inference, and cover recent advancement in both defense and detection strategies for mitigating backdoor threats to LLMs. We also outline key challenges in addressing these threats, highlighting areas for future research.
Submitted 30 September, 2024;
originally announced September 2024.
-
Rethinking Backdoor Detection Evaluation for Language Models
Authors:
Jun Yan,
Wenjie Jacky Mo,
Xiang Ren,
Robin Jia
Abstract:
Backdoor attacks, in which a model behaves maliciously when given an attacker-specified trigger, pose a major security risk for practitioners who depend on publicly released language models. Backdoor detection methods aim to detect whether a released model contains a backdoor, so that practitioners can avoid such vulnerabilities. While existing backdoor detection methods have high accuracy in detecting backdoored models on standard benchmarks, it is unclear whether they can robustly identify backdoors in the wild. In this paper, we examine the robustness of backdoor detectors by manipulating different factors during backdoor planting. We find that the success of existing methods highly depends on how intensely the model is trained on poisoned data during backdoor planting. Specifically, backdoors planted with either more aggressive or more conservative training are significantly more difficult to detect than the default ones. Our results highlight a lack of robustness of existing backdoor detectors and the limitations in current benchmark construction.
Submitted 31 August, 2024;
originally announced September 2024.
-
Timing Recovery for Non-Orthogonal Multiple Access with Asynchronous Clocks
Authors:
Qingxin Lu,
Haide Wang,
Wenxuan Mo,
Ji Zhou,
Weiping Liu,
Changyuan Yu
Abstract:
A passive optical network (PON) based on non-orthogonal multiple access (NOMA) meets low latency and high capacity. In the NOMA-PON, the asynchronous clocks between the strong and weak optical network units (ONUs) cause the timing error and phase noise on the signal of the weak ONU. The theoretical derivation shows that the timing error and phase noise can be independently compensated. In this Letter, we propose a timing recovery (TR) algorithm based on an absolute timing error detector (Abs TED) and a pilot-based carrier phase recovery (CPR) to eliminate the timing error and phase noise separately. An experiment for 25G NOMA-PON is set up to verify the feasibility of the proposed algorithms. The weak ONU can achieve the 20% soft-decision forward error correction limit after compensating for timing error and phase noise. In conclusion, the proposed TR and the pilot-based CPR show great potential for the NOMA-PON.
Submitted 13 September, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
3D Vision and Language Pretraining with Large-Scale Synthetic Data
Authors:
Dejie Yang,
Zhu Xu,
Wentao Mo,
Qingchao Chen,
Siyuan Huang,
Yang Liu
Abstract:
3D Vision-Language Pre-training (3D-VLP) aims to provide a pre-trained model that can bridge 3D scenes with natural language, an important technique for embodied intelligence. However, current 3D-VLP datasets are hindered by limited scene-level diversity and insufficient fine-grained annotations (only 1.2K scenes and 280K textual annotations in ScanScribe), primarily due to the labor-intensive nature of collecting and annotating 3D scenes. To overcome these obstacles, we construct SynVL3D, a comprehensive synthetic scene-text corpus with 10K indoor scenes and 1M descriptions at object, view, and room levels, which has the advantages of diverse scene data, rich textual descriptions, multi-grained 3D-text associations, and low collection cost. Utilizing the rich annotations in SynVL3D, we pre-train a simple and unified Transformer for aligning 3D and language with multi-grained pretraining tasks. Moreover, we propose a synthetic-to-real domain adaptation in the downstream task fine-tuning process to address the domain shift. Through extensive experiments, we verify the effectiveness of our model design by achieving state-of-the-art performance on downstream tasks including visual grounding, dense captioning, and question answering.
Submitted 8 July, 2024;
originally announced July 2024.
-
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Authors:
Fei Wang,
Xingyu Fu,
James Y. Huang,
Zekun Li,
Qin Liu,
Xiaogeng Liu,
Mingyu Derek Ma,
Nan Xu,
Wenxuan Zhou,
Kai Zhang,
Tianyi Lorena Yan,
Wenjie Jacky Mo,
Hsiang-Hui Liu,
Pan Lu,
Chunyuan Li,
Chaowei Xiao,
Kai-Wei Chang,
Dan Roth,
Sheng Zhang,
Hoifung Poon,
Muhao Chen
Abstract:
We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs. MuirBench consists of 12 diverse multi-image tasks (e.g., scene understanding, ordering) that involve 10 categories of multi-image relations (e.g., multiview, temporal relations). Comprising 11,264 images and 2,600 multiple-choice questions, MuirBench is created in a pairwise manner, where each standard instance is paired with an unanswerable variant that has minimal semantic differences, to enable a reliable assessment. Evaluated on 20 recent multimodal LLMs, our results reveal that even the best-performing models like GPT-4o and Gemini Pro find it challenging to solve MuirBench, achieving 68.0% and 49.3% in accuracy. Open-source multimodal LLMs trained on single images can hardly generalize to multi-image questions, hovering below 33.3% in accuracy. These results highlight the importance of MuirBench in encouraging the community to develop multimodal LLMs that can look beyond a single image, suggesting potential pathways for future improvements.
Submitted 1 July, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Efficient Arbitrated Quantum Digital Signature with Multi-Receiver Verification
Authors:
Siyu Xiong,
Bangying Tang,
Hui Han,
Jinquan Huang,
Mingqiang Bai,
Fangzhao Li,
Wanrong Yu,
Zhiwen Mo,
Bo Liu
Abstract:
Quantum digital signature is used to authenticate the identity of the signer with information-theoretic security, while providing non-forgery and non-repudiation services. In traditional multi-receiver quantum digital signature schemes without an arbitrator, the transferability of a one-to-one signature is always required to achieve unforgeability, at the cost of complicated implementation and heavy key consumption. In this article, we propose an arbitrated quantum digital signature scheme in which the signature can be verified by multiple receivers simultaneously while the transferability of the signature is still kept. Our scheme can be readily applied to various quantum secure networks, owing to the proposed efficient signature calculation procedure with low secure key consumption and low computational complexity, which employs a one-time universal hashing algorithm and a one-time pad encryption scheme. The evaluation results show that our scheme uses at least two orders of magnitude less key than existing signature schemes with transferability when signing files of the same length with the same number of receivers and security parameter settings.
Submitted 11 June, 2024;
originally announced June 2024.
-
Minimax Regret Learning for Data with Heterogeneous Subgroups
Authors:
Weibin Mo,
Weijing Tang,
Songkai Xue,
Yufeng Liu,
Ji Zhu
Abstract:
Modern complex datasets often consist of various sub-populations. To develop robust and generalizable methods in the presence of sub-population heterogeneity, it is important to guarantee a uniform learning performance instead of an average one. In many applications, prior information is often available on which sub-population or group the data points belong to. Given the observed groups of data, we develop a min-max-regret (MMR) learning framework for general supervised learning, which aims to minimize the worst-group regret. Motivated by the regret-based decision-theoretic framework, the proposed MMR is distinguished from the value-based or risk-based robust learning methods in the existing literature. The regret criterion features several robustness and invariance properties simultaneously. In terms of generalizability, we develop the theoretical guarantee for the worst-case regret over a super-population of the meta data, which incorporates the observed sub-populations, their mixtures, as well as other unseen sub-populations that could be approximated by the observed ones. We demonstrate the effectiveness of our method through extensive simulation studies and an application to kidney transplantation data from hundreds of transplant centers.
Submitted 2 May, 2024;
originally announced May 2024.
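The difference between a worst-group regret criterion and a worst-group risk criterion, as described in the abstract above, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the toy risk values, and in particular the use of the best risk among the listed candidates as a stand-in for each group's best achievable risk. It is not the authors' estimator.

```python
from typing import Dict, List


def group_regrets(risks: List[float], best_risks: List[float]) -> List[float]:
    """Regret per group: risk of a model minus the best risk for that group."""
    return [r - b for r, b in zip(risks, best_risks)]


def select_minimax_regret(candidates: Dict[str, List[float]]) -> str:
    """Pick the candidate whose worst-group regret is smallest.

    `candidates` maps a (hypothetical) model name to its per-group risks.
    The per-group optimum is approximated by the best candidate in each group.
    """
    n_groups = len(next(iter(candidates.values())))
    best = [min(r[g] for r in candidates.values()) for g in range(n_groups)]
    return min(candidates, key=lambda m: max(group_regrets(candidates[m], best)))


def select_minimax_risk(candidates: Dict[str, List[float]]) -> str:
    """Risk-based baseline: pick the candidate whose worst-group risk is smallest."""
    return min(candidates, key=lambda m: max(candidates[m]))


# Toy numbers: group 2 is intrinsically hard (no candidate gets below 0.60).
toy = {"a": [0.10, 0.62], "b": [0.25, 0.60]}
print(select_minimax_regret(toy), select_minimax_risk(toy))
```

In this toy case the two criteria disagree: the risk-based rule is dominated by the intrinsically hard group, while the regret-based rule discounts the difficulty that no candidate can avoid.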
-
Dynamic Prompt Optimizing for Text-to-Image Generation
Authors:
Wenyi Mo,
Tianyu Zhang,
Yalong Bai,
Bing Su,
Ji-Rong Wen,
Qing Yang
Abstract:
Text-to-image generative models, specifically those based on diffusion models like Imagen and Stable Diffusion, have made substantial advancements. Recently, there has been a surge of interest in the delicate refinement of text prompts. Users assign weights or alter the injection time steps of certain words in the text prompts to improve the quality of generated images. However, the success of fine-control prompts depends on the accuracy of the text prompts and the careful selection of weights and time steps, which requires significant manual intervention. To address this, we introduce the Prompt Auto-Editing (PAE) method. Besides refining the original prompts for image generation, we further employ an online reinforcement learning strategy to explore the weights and injection time steps of each word, yielding dynamic fine-control prompts. The reward function during training encourages the model to consider aesthetic score, semantic consistency, and user preferences. Experimental results demonstrate that our proposed method effectively improves the original prompts, generating visually more appealing images while maintaining semantic alignment. Code is available at https://github.com/Mowenyii/PAE.
Submitted 5 April, 2024;
originally announced April 2024.
-
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
Authors:
Wentao Mo,
Yang Liu
Abstract:
In 3D Visual Question Answering (3D VQA), the scarcity of fully annotated data and limited visual content diversity hamper generalization to novel scenes and 3D concepts (e.g., only around 800 scenes are utilized in the ScanQA and SQA datasets). Current approaches resort to supplementing 3D reasoning with 2D information. However, these methods face challenges: either they use top-down 2D views that introduce overly complex and sometimes question-irrelevant visual clues, or they rely on globally aggregated scene/image-level representations from 2D VLMs, losing the fine-grained vision-language correlations. To overcome these limitations, our approach utilizes a question-conditional 2D view selection procedure, pinpointing semantically relevant 2D inputs for crucial visual clues. We then integrate this 2D knowledge into the 3D-VQA system via a two-branch Transformer structure. This structure, featuring a Twin-Transformer design, compactly combines 2D and 3D modalities and captures fine-grained correlations between modalities, allowing them to mutually augment each other. Integrating the mechanisms above, we present BridgeQA, which offers a fresh perspective on multi-modal transformer-based architectures for 3D-VQA. Experiments validate that BridgeQA achieves state-of-the-art performance on 3D-VQA datasets and significantly outperforms existing solutions. Code is available at https://github.com/matthewdm0816/BridgeQA.
Submitted 24 February, 2024;
originally announced February 2024.
-
Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations
Authors:
Wenjie Mo,
Jiashu Xu,
Qin Liu,
Jiongxiao Wang,
Jun Yan,
Chaowei Xiao,
Muhao Chen
Abstract:
Existing studies in backdoor defense have predominantly focused on the training phase, overlooking the critical aspect of test-time defense. This gap becomes particularly pronounced in the context of Large Language Models (LLMs) deployed as Web Services, which typically offer only black-box access, rendering training-time defenses impractical. To bridge this gap, our work introduces defensive demonstrations, an innovative backdoor defense strategy for black-box large language models. Our method involves identifying the task and retrieving task-relevant demonstrations from an uncontaminated pool. These demonstrations are then combined with user queries and presented to the model during testing, without requiring any modifications/tuning to the black-box model or insights into its internal mechanisms. Defensive demonstrations are designed to counteract the adverse effects of triggers, aiming to recalibrate and correct the behavior of poisoned models during test-time evaluations. Extensive experiments show that defensive demonstrations are effective in defending against both instance-level and instruction-level backdoor attacks, not only rectifying the behavior of poisoned models but also surpassing existing baselines in most scenarios.
Submitted 16 November, 2023;
originally announced November 2023.
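The retrieve-then-prepend flow summarized in the abstract above (select task-relevant demonstrations from a clean pool, combine them with the user query, send the result to the black-box model) can be sketched as follows. This is only an illustrative sketch: the word-overlap retrieval, the `Input:`/`Label:` prompt template, and all names are assumptions chosen for brevity, not the paper's retrieval method or prompt format.

```python
from typing import List, Tuple

Demo = Tuple[str, str]  # (input text, trusted label) from an uncontaminated pool


def retrieve_demonstrations(query: str, pool: List[Demo], k: int = 2) -> List[Demo]:
    """Rank clean demonstrations by word overlap with the query (a toy retriever)."""
    query_words = set(query.lower().split())

    def overlap(demo: Demo) -> int:
        return len(query_words & set(demo[0].lower().split()))

    # sorted() is stable, so ties keep their original pool order.
    return sorted(pool, key=overlap, reverse=True)[:k]


def build_prompt(query: str, pool: List[Demo], k: int = 2) -> str:
    """Prepend the retrieved demonstrations to the user query.

    The returned string is what would be sent to the black-box model;
    the model itself is never modified or inspected.
    """
    blocks = [f"Input: {x}\nLabel: {y}" for x, y in retrieve_demonstrations(query, pool, k)]
    blocks.append(f"Input: {query}\nLabel:")
    return "\n\n".join(blocks)


clean_pool = [
    ("the film was great", "positive"),
    ("terrible plot and acting", "negative"),
    ("the weather is cold", "neutral"),
]
print(build_prompt("the movie was great", clean_pool))
```

A real deployment would likely replace the word-overlap scorer with a stronger retriever; the structure of the defense (clean pool, retrieval, prompt assembly) stays the same.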
-
On Stable Rationality of Polytopes
Authors:
Simen Westbye Moe
Abstract:
Nicaise and Ottem introduced the notion of (stably) rational polytopes and studied it using a combinatorial description of the motivic volume. In this framework, we ask whether being non-stably rational is preserved under inclusions. We prove this holds for a large class of polytopes, leading to a combinatorial strategy for studying stable rationality of hypersurfaces in toric varieties. As a result, we obtain new bounds for non-stably rational hypersurfaces in projective space, improving the ones given by Schreieder when the field has characteristic 0. We also obtain similar bounds for double covers of projective space and some new classes of non-stably rational varieties in products of projective space.
Submitted 2 November, 2023;
originally announced November 2023.
-
Predicting Three Types of Freezing of Gait Events Using Deep Learning Models
Authors:
Wen Tao Mo,
Jonathan H. Chan
Abstract:
Freezing of gait is a Parkinson's Disease symptom that episodically leaves a patient unable to step or turn while walking. While medical experts have discovered various triggers and alleviating actions for freezing of gait, the underlying causes and prediction models are still being explored today. Current freezing of gait prediction models that utilize machine learning achieve high sensitivity and specificity in freezing of gait predictions based on time-series data; however, these models do not specify the type of freezing of gait event. We develop various deep learning models using the transformer encoder architecture plus Bidirectional LSTM layers and different feature sets to predict the three different types of freezing of gait events. The best-performing model achieves a score of 0.427 on testing data, which would rank in the top 5 in Kaggle's Freezing of Gait prediction competition, hosted by The Michael J. Fox Foundation. However, we also recognize overfitting to the training data, which could potentially be reduced through pseudo-labelling on additional data and model architecture simplification.
Submitted 10 October, 2023;
originally announced October 2023.
-
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling
Authors:
Quanyi Li,
Zhenghao Peng,
Lan Feng,
Zhizheng Liu,
Chenda Duan,
Wenjie Mo,
Bolei Zhou
Abstract:
Large-scale driving datasets such as Waymo Open Dataset and nuScenes substantially accelerate autonomous driving research, especially for perception tasks such as 3D detection and trajectory forecasting. Since the driving logs in these datasets contain HD maps and detailed object annotations which accurately reflect the real-world complexity of traffic behaviors, we can harvest a massive number of complex traffic scenarios and recreate their digital twins in simulation. Compared to the hand-crafted scenarios often used in existing simulators, data-driven scenarios collected from the real world can facilitate many research opportunities in machine learning and autonomous driving. In this work, we present ScenarioNet, an open-source platform for large-scale traffic scenario modeling and simulation. ScenarioNet defines a unified scenario description format and collects a large-scale repository of real-world traffic scenarios from the heterogeneous data in various driving datasets including Waymo, nuScenes, Lyft L5, and nuPlan datasets. These scenarios can be further replayed and interacted with in multiple views from Bird-Eye-View layout to realistic 3D rendering in MetaDrive simulator. This provides a benchmark for evaluating the safety of autonomous driving stacks in simulation before their real-world deployment. We further demonstrate the strengths of ScenarioNet on large-scale scenario generation, imitation learning, and reinforcement learning in both single-agent and multi-agent settings. Code, demo videos, and website are available at https://metadriverse.github.io/scenarionet.
Submitted 30 October, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
A Causal View of Entity Bias in (Large) Language Models
Authors:
Fei Wang,
Wenjie Mo,
Yiwei Wang,
Wenxuan Zhou,
Muhao Chen
Abstract:
Entity bias widely affects pretrained (large) language models, causing them to rely on (biased) parametric knowledge to make unfaithful predictions. Although causality-inspired methods have shown great potential to mitigate entity bias, it is hard to precisely estimate the parameters of underlying causal models in practice. The rise of black-box LLMs also makes the situation even worse, because of their inaccessible parameters and uncalibrated logits. To address these problems, we propose a specific structured causal model (SCM) whose parameters are comparatively easier to estimate. Building upon this SCM, we propose causal intervention techniques to mitigate entity bias for both white-box and black-box settings. The proposed causal intervention perturbs the original entity with neighboring entities. This intervention reduces specific biasing information pertaining to the original entity while still preserving sufficient semantic information from similar entities. Under the white-box setting, our training-time intervention improves OOD performance of PLMs on relation extraction (RE) and machine reading comprehension (MRC) by 5.7 points and by 9.1 points, respectively. Under the black-box setting, our in-context intervention effectively reduces the entity-based knowledge conflicts of GPT-3.5, achieving up to 20.5 points of improvement of exact match accuracy on MRC and up to 17.6 points of reduction in memorization ratio on RE. Our code is available at https://github.com/luka-group/Causal-View-of-Entity-Bias.
Submitted 26 October, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Comparison of Two Search Criteria for Lattice-based Kernel Approximation
Authors:
Frances Y. Kuo,
Weiwen Mo,
Dirk Nuyens,
Ian H. Sloan,
Abirami Srikumar
Abstract:
The kernel interpolant in a reproducing kernel Hilbert space is optimal in the worst-case sense among all approximations of a function using the same set of function values. In this paper, we compare two search criteria to construct lattice point sets for use in lattice-based kernel approximation. The first candidate, $\mathcal{P}_n^*$, is based on the power function that appears in machine learning literature. The second, $\mathcal{S}_n^*$, is a search criterion used for generating lattices for approximation using truncated Fourier series. We find that the empirical difference in error between the lattices constructed using $\mathcal{P}_n^*$ and $\mathcal{S}_n^*$ is marginal. The criterion $\mathcal{S}_n^*$ is preferred as it is computationally more efficient and has a proven error bound.
Submitted 4 April, 2023;
originally announced April 2023.
-
Feature Tracks are not Zero-Mean Gaussian
Authors:
Stephanie Tsuei,
Wenjie Mo,
Stefano Soatto
Abstract:
In state estimation algorithms that use feature tracks as input, it is customary to assume that the errors in feature track positions are zero-mean Gaussian. Using a combination of calibrated camera intrinsics, ground-truth camera pose, and depth images, it is possible to compute ground-truth positions for feature tracks extracted using an image processing algorithm. We find that feature track errors are not zero-mean Gaussian and that the distribution of errors is conditional on the type of motion, the speed of motion, and the image processing algorithm used to extract the tracks.
Submitted 24 March, 2023;
originally announced March 2023.
-
PASTA: Pessimistic Assortment Optimization
Authors:
Juncheng Dong,
Weibin Mo,
Zhengling Qi,
Cong Shi,
Ethan X. Fang,
Vahid Tarokh
Abstract:
We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimization, the problem of insufficient data coverage is likely to occur in the offline dataset. Therefore, designing a provably efficient offline learning algorithm becomes a significant challenge. To this end, we propose an algorithm referred to as Pessimistic ASsortment opTimizAtion (PASTA for short) designed based on the principle of pessimism, that can correctly identify the optimal assortment by only requiring the offline data to cover the optimal assortment under general settings. In particular, we establish a regret bound for the offline assortment optimization problem under the celebrated multinomial logit model. We also propose an efficient computational procedure to solve our pessimistic assortment optimization problem. Numerical studies demonstrate the superiority of the proposed method over the existing baseline method.
Submitted 7 February, 2023;
originally announced February 2023.
-
An optimized online filter stack spectrometer
Authors:
Jia-xing Wen,
Ge Ma,
Ming-hai Yu,
Yu-chi Wu,
Yong-hong Yan,
Shao-yi Wang,
Huai-zhong Gao,
Lu-shan Wang,
Yu-gang Zhou,
Qiang Li,
Yue Yang,
Fang Tan,
Xiao-hui Zhang,
Jie Zhang,
Wen-bo Mo,
Jing-qin Su,
Wei-min Zhou,
Yu-qiu Gu,
Zong-qing Zhao,
Ming Zeng
Abstract:
The spectrum of laser-plasma-generated X-rays is very important because it characterizes electron dynamics and is useful for applications. With the forthcoming high-repetition-rate laser-plasma experiments, there is a rising demand for online diagnosis of the X-ray spectrum. In this paper, scintillators and silicon PIN diodes are used to build a wideband online filter stack spectrometer. A genetic algorithm is used to optimize the arrangement of the X-ray sensors and filters by minimizing the condition number of the response matrix; according to the numerical experiments, this significantly decreases the unfolding error. The detector responses are quantitatively calibrated by irradiating the scintillator and PIN diode with different nuclides and comparing the measured gamma-ray peaks. Finally, a 15-channel spectrometer prototype has been implemented. The X-ray detector, front-end electronics, and back-end electronics are integrated into the prototype, which can determine the spectrum at a 1 kHz repetition rate.
Submitted 29 January, 2023;
originally announced January 2023.
-
MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning
Authors:
Jiangmeng Li,
Wenwen Qiang,
Yanan Zhang,
Wenyi Mo,
Changwen Zheng,
Bing Su,
Hui Xiong
Abstract:
As a successful approach to self-supervised learning, contrastive learning aims to learn invariant information shared among distortions of the input sample. While contrastive learning has yielded continuous advancements in sampling strategy and architecture design, it still suffers from two persistent defects: the interference of task-irrelevant information and sample inefficiency, which are related to the recurring existence of trivial constant solutions. From the perspective of dimensional analysis, we find that dimensional redundancy and dimensional confounder are the intrinsic issues behind these phenomena, and provide experimental evidence to support our viewpoint. We further propose a simple yet effective approach, MetaMask, short for the dimensional Mask learned by Meta-learning, to learn representations against dimensional redundancy and confounder. MetaMask adopts the redundancy-reduction technique to tackle the dimensional redundancy issue and introduces a dimensional mask to reduce the gradient effects of specific dimensions containing the confounder. The mask is trained with a meta-learning paradigm whose objective is to improve the performance of masked representations on a typical self-supervised task. We provide theoretical analyses to prove that MetaMask can obtain tighter risk bounds for downstream classification compared to typical contrastive methods. Empirically, our method achieves state-of-the-art performance on various benchmarks.
Submitted 9 August, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Constructing Embedded Lattice-based Algorithms for Multivariate Function Approximation with a Composite Number of Points
Authors:
Frances Y. Kuo,
Weiwen Mo,
Dirk Nuyens
Abstract:
We approximate $d$-variate periodic functions in weighted Korobov spaces with general weight parameters using $n$ function values at lattice points. We do not limit $n$ to be a prime number, as in currently available literature, but allow any number of points, including powers of $2$, thus providing the fundamental theory for construction of embedded lattice sequences. Our results are constructive in that we provide a component-by-component algorithm which constructs a suitable generating vector for a given number of points or even a range of numbers of points. It does so without needing to construct the index set on which the functions will be represented. The resulting generating vector can then be used to approximate functions in the underlying weighted Korobov space. We analyse the approximation error in the worst-case setting under both the $L_2$ and $L_{\infty}$ norms. Our component-by-component construction under the $L_2$ norm achieves the best possible rate of convergence for lattice-based algorithms, and the theory can be applied to lattice-based kernel methods and splines. Depending on the value of the smoothness parameter $α$, we propose two variants of the search criterion in the construction under the $L_{\infty}$ norm, extending previous results which hold only for product-type weight parameters and prime $n$. We also provide a theoretical upper bound showing that embedded lattice sequences are essentially as good as lattice rules with a fixed value of $n$. Under some standard assumptions on the weight parameters, the worst-case error bound is independent of $d$.
Submitted 2 September, 2022;
originally announced September 2022.
-
Supporting Vision-Language Model Inference with Confounder-pruning Knowledge Prompt
Authors:
Jiangmeng Li,
Wenyi Mo,
Wenwen Qiang,
Bing Su,
Changwen Zheng,
Hui Xiong,
Ji-Rong Wen
Abstract:
Vision-language models are pre-trained by aligning image-text pairs in a common space to deal with open-set visual concepts. To boost the transferability of the pre-trained models, recent works adopt fixed or learnable prompts, i.e., classification weights are synthesized from natural language describing task-relevant categories, to reduce the gap between tasks in the training and test phases. However, how and what prompts can improve inference performance remains unclear. In this paper, we explicitly clarify the importance of including semantic information in prompts, while existing prompting methods generate prompts without exploring the semantic information of textual labels. Manually constructing prompts with rich semantics requires domain expertise and is extremely time-consuming. To cope with this issue, we propose a semantic-aware prompt learning method, namely CPKP, which retrieves an ontological knowledge graph by treating the textual label as a query to extract task-relevant semantic information. CPKP further introduces a double-tier confounder-pruning procedure to refine the derived semantic information. The graph-tier confounders are gradually identified and phased out, inspired by the principle of Granger causality. The feature-tier confounders are demolished by following the maximum entropy principle in information theory. Empirically, the evaluations demonstrate the effectiveness of CPKP, e.g., with two shots, CPKP outperforms the manual-prompt method by 4.64% and the learnable-prompt method by 1.09% on average, and the superiority of CPKP in domain generalization compared to benchmark approaches. Our implementation is available at https://github.com/Mowenyii/CPKP.
Submitted 23 March, 2024; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Record Capacity-Reach of C band IM/DD Optical Systems over Dispersion-Uncompensated Links
Authors:
Haide Wang,
Ji Zhou,
Jinlong Wei,
Wenxuan Mo,
Yuanhua Feng,
Weiping Liu,
Changyuan Yu,
Zhaohui Li
Abstract:
We experimentally demonstrate a C band 100Gbit/s intensity modulation and direct detection entropy-loaded multi-rate Nyquist-subcarrier modulation signal over 100km dispersion-uncompensated link. A record capacity-reach of 10Tbit/s$\times$km is achieved.
Submitted 12 December, 2021;
originally announced February 2022.
-
Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules
Authors:
Weibin Mo,
Zhengling Qi,
Yufeng Liu
Abstract:
We thank the editors for the opportunity to take part in this discussion and the discussants for their insightful comments and thoughtful contributions. We also want to congratulate Kallus (2020) for his inspiring work in improving the efficiency of policy learning by retargeting. Motivated by the discussion in Dukes and Vansteelandt (2020), we first point out interesting connections and distinctions between our work and Kallus (2020) in Section 1. In particular, the assumptions and sources of variation considered in these two papers lead to different research problems with different scopes and focuses. In Section 2, following the discussions in Li et al. (2020); Liang and Zhao (2020), we also consider the efficient policy evaluation problem when some data from the testing distribution are available at the training stage. We show that under the assumption that the sample sizes from training and testing grow at the same order, efficient value function estimates can deliver competitive performance. We further show some connections of these estimates with existing literature. However, when the testing sample size available for training grows at a slower order, efficient value function estimates may no longer perform well. In contrast, the requirement on the testing sample size for DRITR is not as strong as that of efficient policy evaluation using the combined data. Finally, we highlight the general applicability and usefulness of DRITR in Section 3.
Submitted 17 October, 2021;
originally announced October 2021.
-
Efficient Learning of Optimal Individualized Treatment Rules for Heteroscedastic or Misspecified Treatment-Free Effect Models
Authors:
Weibin Mo,
Yufeng Liu
Abstract:
Recent development in data-driven decision science has seen great advances in individualized decision making. Given data with individual covariates, treatment assignments and outcomes, researchers can search for the optimal individualized treatment rule (ITR) that maximizes the expected outcome. Existing methods typically require initial estimation of some nuisance models. The double robustness property that can protect from misspecification of either the treatment-free effect or the propensity score has been widely advocated. However, when model misspecification exists, a doubly robust estimate can be consistent but may suffer from downgraded efficiency. Other than potential misspecified nuisance models, most existing methods do not account for the potential problem when the variance of outcome is heterogeneous among covariates and treatment. We observe that such heteroscedasticity can greatly affect the estimation efficiency of the optimal ITR. In this paper, we demonstrate that the consequences of misspecified treatment-free effect and heteroscedasticity can be unified as a covariate-treatment dependent variance of residuals. To improve efficiency of the estimated ITR, we propose an Efficient Learning (E-Learning) framework for finding an optimal ITR in the multi-armed treatment setting. We show that the proposed E-Learning is optimal among a regular class of semiparametric estimates that can allow treatment-free effect misspecification. In our simulation study, E-Learning demonstrates its effectiveness if one of or both misspecified treatment-free effect and heteroscedasticity exist. Our analysis of a Type 2 Diabetes Mellitus (T2DM) observational study also suggests the improved efficiency of E-Learning.
Submitted 6 September, 2021;
originally announced September 2021.
-
Diagnostics for ultrashort X-ray pulses using silicon trackers
Authors:
Jiaxing Wen,
Minghai Yu,
Yuchi Wu,
Ming Zeng,
Bo Zhang,
Jirong Cang,
Yuge Zhang,
Ge Ma,
Yue Yang,
Wenbo Mo,
Zongqing Zhao
Abstract:
The spectrum of laser-plasma-generated X-rays is very important: it characterizes electron dynamics in the plasma and is fundamental for applications. However, the accuracy and efficiency of existing methods for diagnosing the spectrum of laser-plasma-based X-ray pulses are not very high, especially in the range of several hundred keV. In this study, a new method based on electron track detection to measure the spectrum of laser-plasma-produced X-ray pulses is proposed and demonstrated. Laser-plasma-generated X-rays are scattered in a multi-pixel silicon tracker. Energies and scattering directions of Compton electrons can be extracted from the response of the detector, and then the spectrum of X-rays can be reconstructed. Simulations indicate that the energy resolution of this method is approximately 20% for X-rays from 200 to 550 keV for a silicon-on-insulator pixel detector with 12 $\rm μ$m pixel pitch and 500 $\rm μ$m depletion region thickness. The results of a proof-of-principle experiment based on a Timepix3 detector are also shown.
Submitted 2 September, 2021; v1 submitted 3 May, 2021;
originally announced May 2021.
-
The Massive and Distant Clusters of WISE Survey. VIII. Radio Activity in Massive Galaxy Clusters
Authors:
Wenli Mo,
Anthony Gonzalez,
Mark Brodwin,
Bandon Decker,
Peter Eisenhardt,
Emily Moravec,
S. A. Stanford,
Daniel Stern,
Dominika Wylezalek
Abstract:
We present a study of the central radio activity of galaxy clusters at high redshift. Using a large sample of galaxy clusters at $0.7<z<1.5$ from the Massive and Distant Clusters of {\it WISE} Survey and the Faint Images of the Radio Sky at Twenty-Centimeters $1.4$~GHz catalog, we measure the fraction of clusters containing a radio source within the central $500$~kpc, which we term the cluster radio-active fraction, and the fraction of cluster galaxies within the central $500$~kpc exhibiting radio emission. We find tentative ($2.25σ$) evidence that the cluster radio-active fraction increases with cluster richness, while the fraction of cluster galaxies that are radio-luminous ($L_{1.4~\mathrm{GHz}}\geq10^{25}$~W~Hz$^{-1}$) does not correlate with richness at a statistically significant level. Compared to that calculated at $0 < z < 0.6$, the cluster radio-active fraction at $0.7 < z < 1.5$ increases by a factor of $10$. This fraction is also dependent on the radio luminosity. Clusters at higher redshift are much more likely to host a radio source of luminosity $L_{1.4~\mathrm{GHz}}\gtrsim10^{26}$~W~Hz$^{-1}$ than are lower redshift clusters. We compare the fraction of radio-luminous cluster galaxies to the fraction measured in a field environment. For $0.7<z<1.5$, we find that both the cluster and field radio-luminous galaxy fractions increase with stellar mass, though at fixed stellar mass, cluster galaxies are roughly $2$ times more likely to be radio-luminous than field galaxies.
Submitted 17 August, 2020;
originally announced August 2020.
-
The Massive and Distant Clusters of WISE Survey IX: High Radio Activity in a Merging Cluster
Authors:
Emily Moravec,
Anthony Gonzalez,
Simon Dicker,
Stacey Alberts,
Mark Brodwin,
Tracy Clarke,
Thomas Connor,
Bandon Decker,
Mark Devlin,
Peter Eisenhardt,
Brian Mason,
Wenli Mo,
Tony Mroczkowski,
Alexandra Pope,
Charles Romero,
Craig Sarazin,
Jonathan Sievers,
Spencer Stanford,
Daniel Stern,
Dominika Wylezalek,
Fernando Zago
Abstract:
We present a multi-wavelength investigation of the radio galaxy population in the galaxy cluster MOO J1506+5137 at $z$=1.09$\pm$0.03, which in previous work we identified as having multiple complex radio sources. The combined dataset used in this work includes data from the Low-Frequency Array Two-metre Sky Survey (LoTSS), NSF's Karl G. Jansky Very Large Array (VLA), the Robert C. Byrd Green Bank Telescope (GBT), the Spitzer Space Telescope, and the Dark Energy Camera Legacy Survey (DECaLS). We find that there are five radio sources which are all located within 500 kpc ($\sim$1$^{\prime}$) of the cluster center and have radio luminosities $P_{\mathrm{1.4GHz}}$ > 1.6$\times$10$^{24}$ W Hz$^{-1}$. The typical host galaxies are among the highest stellar mass galaxies in the cluster. The exceptional radio activity among the massive galaxy population appears to be linked to the dynamical state of the cluster. The galaxy distribution suggests an ongoing merger, with a subgroup found to the northwest of the main cluster. Further, two of the five sources are classified as bent-tail sources with one being a potential wide-angle tail (WAT)/hybrid morphology radio source (HyMoRS) indicating a dynamic environment. The cluster also lies in a region of the mass-richness plane occupied by other merging clusters in the Massive and Distant Clusters of WISE Survey (MaDCoWS). The data suggest that during the merger phase radio activity can be dramatically enhanced, which would contribute to the observed trend of increased radio activity in clusters with increasing redshift.
Submitted 20 January, 2021; v1 submitted 26 June, 2020;
originally announced June 2020.
-
Learning Optimal Distributionally Robust Individualized Treatment Rules
Authors:
Weibin Mo,
Zhengling Qi,
Yufeng Liu
Abstract:
Recent development in data-driven decision science has seen great advances in individualized decision making. Given data with individual covariates, treatment assignments and outcomes, policy makers seek the best individualized treatment rule (ITR) that maximizes the expected outcome, known as the value function. Many existing methods assume that the training and testing distributions are the same. However, the estimated optimal ITR may have poor generalizability when the training and testing distributions are not identical. In this paper, we consider the problem of finding an optimal ITR from a restricted ITR class where there are some unknown covariate changes between the training and testing distributions. We propose a novel distributionally robust ITR (DR-ITR) framework that maximizes the worst-case value function across the values under a set of underlying distributions that are "close" to the training distribution. The resulting DR-ITR can guarantee reasonably good performance across all such distributions. We further propose a calibrating procedure that tunes the DR-ITR adaptively to a small amount of calibration data from a target population. Our numerical studies show that the calibrated DR-ITR enjoys better generalizability than the standard ITR.
Submitted 26 June, 2020;
originally announced June 2020.
-
The Massive and Distant Clusters of WISE Survey VII: The Environments and Properties of Radio Galaxies in Clusters at z~1
Authors:
Emily Moravec,
Anthony H. Gonzalez,
Daniel Stern,
Tracy Clarke,
Mark Brodwin,
Bandon Decker,
Peter R. M. Eisenhardt,
Wenli Mo,
Alexandra Pope,
Spencer A. Stanford,
Dominika Wylezalek
Abstract:
We present the results from a study with NSF's Karl G. Jansky Very Large Array (VLA) to determine the radio morphologies of extended radio sources and the properties of their host galaxies in 50 massive galaxy clusters at z~1. We find a majority of the radio morphologies to be Fanaroff-Riley (FR) type IIs. By analyzing the infrared counterparts of the radio sources, we find that ~40% of the host galaxies are the candidate brightest cluster galaxy (BCG) and ~83% are consistent with being one of the top six most massive galaxies in the cluster. We investigate the role of environmental factors on the radio-loud AGN population by examining correlations between environmental and radio-galaxy properties. We find that the highest stellar mass hosts ($M_{*} \gtrsim$ 4$\times 10^{11} M_{\odot}$) are confined to the cluster center and host compact jets. There is evidence for an increase in the size of the jets with cluster-centric radius, which may be attributed to the decreased ICM pressure confinement with increasing radius. Besides this correlation, there are no other significant correlations between the properties of the radio-AGN (luminosity, morphology, or size) and environmental properties (cluster richness and location within the cluster). The fact that there are more AGN in the cluster environment than the field at this epoch, combined with the lack of strong correlation between galaxy and environmental properties, argues that the cluster environment fosters radio activity but does not solely drive the evolution of these sources at this redshift.
Submitted 21 November, 2019;
originally announced November 2019.
-
The Massive and Distant Clusters of WISE Survey VI: Stellar Mass Fractions of a Sample of High-Redshift Infrared-selected Clusters
Authors:
Bandon Decker,
Mark Brodwin,
Zubair Abdulla,
Anthony H. Gonzalez,
Daniel P. Marrone,
Christine O'Donnell,
S. A. Stanford,
Dominika Wylezalek,
John E. Carlstrom,
Peter R. M. Eisenhardt,
Adam Mantz,
Wenli Mo,
Emily Moravec,
Daniel Stern,
Greg Aldering,
Matthew L. N. Ashby,
Kyle Boone,
Brian Hayden,
Nikhel Gupta,
Michael A. McDonald
Abstract:
We present measurements of the stellar mass fractions ($f_\star$) for a sample of high-redshift ($0.93 \le z \le 1.32$) infrared-selected galaxy clusters from the Massive and Distant Clusters of WISE Survey (MaDCoWS) and compare them to the stellar mass fractions of Sunyaev-Zel'dovich (SZ) effect-selected clusters in a similar mass and redshift range from the South Pole Telescope (SPT)-SZ Survey. We do not find a significant difference in mean $f_\star$ between the two selection methods, though we do find an unexpectedly large range in $f_\star$ for the SZ-selected clusters. In addition, we measure the luminosity function of the MaDCoWS clusters and find $m^*= 19.41\pm0.07$, similar to other studies of clusters at or near our redshift range. Finally, we present SZ detections and masses for seven MaDCoWS clusters and new spectroscopic redshifts for five MaDCoWS clusters. One of these new clusters, MOO J1521+0452 at $z=1.31$, is the most distant MaDCoWS cluster confirmed to date.
Submitted 26 March, 2019;
originally announced March 2019.
-
The Massive and Distant Clusters of WISE Survey V: Extended Radio Sources in Massive Galaxy Clusters at z~1
Authors:
Emily Moravec,
Anthony H. Gonzalez,
Daniel Stern,
Mark Brodwin,
Tracy Clarke,
Bandon Decker,
Peter R. M. Eisenhardt,
Wenli Mo,
Christine O'Donnell,
Alexandra Pope,
Spencer A. Stanford,
Dominika Wylezalek
Abstract:
We present the results from a pilot study with the Karl G. Jansky Very Large Array (JVLA) to determine the radio morphologies of extended radio sources and the properties of their host-galaxies in 10 massive galaxy clusters at z~1, an epoch in which clusters are assembling rapidly. These clusters are drawn from a parent sample of WISE-selected galaxy clusters that were cross-correlated with the VLA Faint Images of the Radio Sky at Twenty-Centimeters survey (FIRST) to identify extended radio sources within 1$^{\prime}$ of the cluster centers. Out of the ten targeted sources, six are FR II sources, one is an FR I source, and three sources have undetermined morphologies. Eight radio sources have associated Spitzer data, 75% presenting infrared counterparts. A majority of these counterparts are consistent with being massive galaxies. The angular extent of the FR sources exhibits a strong correlation with the cluster-centric radius, which warrants further investigation with a larger sample.
Submitted 3 December, 2018;
originally announced December 2018.
-
The Massive and Distant Clusters of WISE Survey IV: The Distribution of Active Galactic Nuclei in Galaxy Clusters at $z\sim1$
Authors:
Wenli Mo,
Anthony Gonzalez,
Daniel Stern,
Mark Brodwin,
Bandon Decker,
Peter Eisenhardt,
Emily Moravec,
S. A. Stanford,
Dominika Wylezalek
Abstract:
We present an analysis of the radial distribution of Active Galactic Nuclei (AGN) in $2300$ galaxy clusters from the Massive and Distant Clusters of {\it WISE} Survey (MaDCoWS). MaDCoWS provides the largest coverage of the extragalactic sky for a cluster sample at $z\sim1$. We use literature catalogs of AGN selected via optical, mid-infrared (MIR), and radio data, and by optical-to-MIR (OIR) color. Stacking the radial distribution of AGN within $6\arcmin$ of the centers of MaDCoWS galaxy clusters, we find a distinct overdensity of AGN within $1\arcmin$ of the galaxy cluster center for AGN of all selection methods. The fraction of red galaxies that host AGN as a function of clustercentric distance is, however, dependent on the AGN selection. The fraction of red galaxies in cluster environments that host AGN selected by optical signatures or blue OIR color is at a deficit compared to the field, while MIR-selected and red OIR color AGN are enhanced in the centers of clusters when compared to field levels. The radio-selected AGN fraction is more than $2.5$ times that of the field, implying that the centers of clusters are conducive to the triggering of radio emission in AGN. We do not find a statistically significant change in the AGN fraction as a function of cluster richness. We also investigate the correlation of central radio activity with other AGN in galaxy clusters. Clusters with radio activity have more central AGN than radio-inactive clusters, implying that central cluster radio activity and AGN triggering may be linked.
Submitted 5 November, 2018;
originally announced November 2018.
-
The Massive and Distant Clusters of WISE Survey. I: Survey Overview and a Catalog of >2000 Galaxy Clusters at z~1
Authors:
Anthony H. Gonzalez,
Daniel P. Gettings,
Mark Brodwin,
Peter R. M. Eisenhardt,
S. Adam Stanford,
Dominika Wylezalek,
Bandon Decker,
Daniel P. Marrone,
Emily Moravec,
Christine O'Donnell,
Brian Stalder,
Daniel Stern,
Zubair Abdulla,
Gillen Brown,
John Carlstrom,
Kenneth C. Chambers,
Brian Hayden,
Yen-Ting Lin,
Eugene Magnier,
Frank Masci,
Adam B. Mantz,
Michael McDonald,
Wenli Mo,
Saul Perlmutter,
Edward L. Wright
, et al. (1 additional author not shown)
Abstract:
We present the Massive and Distant Clusters of WISE Survey (MaDCoWS), a search for galaxy clusters at 0.7<z<1.5 based upon data from the Wide-field Infrared Survey Explorer (WISE) mission. MaDCoWS is the first cluster survey capable of discovering massive clusters at these redshifts over the full extragalactic sky. The search is divided into two regions -- the region of the extragalactic sky covered by Pan-STARRS (Dec>-30 degrees) and the remainder of the southern extragalactic sky at Dec<-30 degrees for which shallower optical data from SuperCOSMOS Sky Survey are available. In this paper we describe the search algorithm, characterize the sample, and present the first MaDCoWS data release -- catalogs of the 2433 highest amplitude detections in the WISE--Pan-STARRS region and the 250 highest amplitude detections in the WISE--SuperCOSMOS region. A total of 1723 of the detections from the WISE--Pan-STARRS sample have also been observed with the Spitzer Space Telescope, providing photometric redshifts and richnesses, and an additional 64 detections within the WISE--SuperCOSMOS region also have photometric redshifts and richnesses. Spectroscopic redshifts for 38 MaDCoWS clusters with IRAC photometry demonstrate that the photometric redshifts have an uncertainty of $σ_z/(1+z)\sim0.036$. Combining the richness measurements with Sunyaev-Zel'dovich observations of MaDCoWS clusters, we also present a preliminary mass-richness relation that can be used to infer the approximate mass distribution of the full sample. The estimated median mass for the WISE--Pan-STARRS catalog is $M_{500}=1.6^{+0.7}_{-0.8}\times10^{14} \mathrm{M}_\odot$, with the Sunyaev-Zel'dovich data confirming that we detect clusters with masses up to $M_{500}\sim5\times10^{14} \mathrm{M}_\odot$ $(M_{200}\sim10^{15} \mathrm{M}_\odot)$.
Submitted 20 December, 2018; v1 submitted 18 September, 2018;
originally announced September 2018.
-
Discovery of a Very Large (~20 kpc) Galaxy at z=3.72
Authors:
Kyoung-Soo Lee,
Arjun Dey,
Thomas Matheson,
Ke Shi,
Chao-Ling Hung,
Rui Xue,
Hanae Inami,
Yun Huang,
Khee-Gan Lee,
Matthew L. N. Ashby,
Buell Jannuzi,
Naveen Reddy,
Sungryong Hong,
Wenli Mo,
Nicola Malavasi
Abstract:
We report the discovery and spectroscopic confirmation of a very large star-forming Lyman Break galaxy, G6025, at z_spec=3.721+/-0.003. In the rest-frame ~2100A, G6025 subtends ~24 kpc in physical extent when measured from the 1.5-sigma isophote, in agreement with the parametric size measurements which yield the half-light radius of 4.9+/-0.5 kpc and the semi-major axis of 12.5+/-0.1 kpc. G6025 is also very UV-luminous (~5L*(z~4)) and young (~140+/-60 Myr). Despite its unusual size and luminosity, the stellar population parameters and dust reddening (M_star~M*(z~4), and E(B-V)=0.18+/-0.05), estimated from the integrated light, are similar to those of smaller galaxies at comparable redshifts. The ground-based morphology and spectroscopy show two dominant components, both located off-center, embedded in more diffuse emission. We speculate that G6025 may be a scaled-up version of chain galaxies seen in deep HST imaging, or alternatively, a nearly equal-mass merger involving two super-L* galaxies in its early stage. G6025 lies close to but not within a known massive protocluster at z=3.78. We find four companions within 6 Mpc from G6025, two of which lie within 1.6 Mpc. While the limited sensitivity of the existing spectroscopy does not allow us to robustly characterize the local environment of G6025, it likely resides in a locally overdense environment. The luminosity, size, and youth of G6025 make it uniquely suited to study the early formation of massive galaxies in the universe.
Submitted 27 July, 2018; v1 submitted 19 March, 2018;
originally announced March 2018.
-
Autostacker: A Compositional Evolutionary Learning System
Authors:
Boyuan Chen,
Harvey Wu,
Warren Mo,
Ishanu Chattopadhyay,
Hod Lipson
Abstract:
We introduce an automatic machine learning (AutoML) modeling architecture called Autostacker, which combines an innovative hierarchical stacking architecture and an Evolutionary Algorithm (EA) to perform efficient parameter search. Neither prior domain knowledge about the data nor feature preprocessing is needed. Using EA, Autostacker quickly evolves candidate pipelines with high predictive accuracy. These pipelines can be used as is or as a starting point for human experts to build on. Autostacker finds innovative combinations and structures of machine learning models, rather than selecting a single model and optimizing its hyperparameters. Compared with other AutoML systems on fifteen datasets, Autostacker achieves state-of-the-art or competitive performance both in terms of test accuracy and time cost.
Submitted 1 March, 2018;
originally announced March 2018.
-
IDCS J1426.5+3508: Weak Lensing Analysis of a Massive Galaxy Cluster at $z=1.75$
Authors:
Wenli Mo,
Anthony H. Gonzalez,
M. James Jee,
Richard Massey,
Jason Rhodes,
Mark Brown,
Peter Eisenhardt,
Daniel P. Marrone,
S. A. Stanford,
Gregory R. Zeimann
Abstract:
We present a weak lensing study of the galaxy cluster IDCS J1426.5+3508 at $z=1.75$, which is the highest redshift strong lensing cluster known and the most distant cluster for which a weak lensing analysis has been undertaken. Using F160W, F814W, and F606W observations with the Hubble Space Telescope, we detect tangential shear at $2σ$ significance. Fitting a Navarro-Frenk-White mass profile to the shear with a theoretical median mass-concentration relation, we derive a mass $M_{200,\mathrm{crit}}=2.3^{+2.1}_{-1.4}\times10^{14}$ M$_{\odot}$. This mass is consistent with previous mass estimates from the Sunyaev-Zel'dovich (SZ) effect, X-ray, and strong lensing. The cluster lies on the local SZ-weak lensing mass scaling relation observed at low redshift, indicative of minimal evolution in this relation.
Submitted 28 January, 2016;
originally announced January 2016.
-
A Measurement of the Millimeter Emission and the Sunyaev-Zel'dovich Effect Associated with Low-Frequency Radio Sources
Authors:
Megan B. Gralla,
Devin Crichton,
Tobias A. Marriage,
Wenli Mo,
Paula Aguirre,
Graeme E. Addison,
V. Asboth,
Nick Battaglia,
James Bock,
J. Richard Bond,
Mark J. Devlin,
Rolando Dunner,
Amir Hajian,
Mark Halpern,
Matt Hilton,
Adam D. Hincks,
Renee A. Hlozek,
Kevin M. Huffenberger,
John P. Hughes,
R. J. Ivison,
Arthur Kosowsky,
Yen-Ting Lin,
Danica Marsden,
Felipe Menanteau,
Kavilan Moodley
, et al. (16 additional authors not shown)
Abstract:
We present a statistical analysis of the millimeter-wavelength properties of 1.4 GHz-selected sources and a detection of the Sunyaev-Zel'dovich (SZ) effect associated with the halos that host them. The Atacama Cosmology Telescope (ACT) has conducted a survey at 148 GHz, 218 GHz and 277 GHz along the celestial equator. Using samples of radio sources selected at 1.4 GHz from FIRST and NVSS, we measure the stacked 148, 218 and 277 GHz flux densities for sources with 1.4 GHz flux densities ranging from 5 to 200 mJy. At these flux densities, the radio source population is dominated by active galactic nuclei (AGN), with both steep and flat spectrum populations, which have combined radio-to-millimeter spectral indices ranging from 0.5 to 0.95, reflecting the prevalence of steep spectrum sources at high flux densities and the presence of flat spectrum sources at lower flux densities. The thermal SZ effect associated with the halos that host the AGN is detected at the 5$σ$ level through its spectral signature. When we compare the SZ effect with weak lensing measurements of radio galaxies, we find that the relation between the two is consistent with that measured by Planck for local bright galaxies. We present a detection of the SZ effect in some of the lowest mass halos (average $M_{200}\approx10^{13}$M$_{\odot}h_{70}^{-1}$) studied to date. This detection is particularly important in the context of galaxy evolution models, as it confirms that galaxies with radio AGN also typically support hot gaseous halos. With Herschel observations, we show that the SZ detection is not significantly contaminated by dust. We show that 5 mJy$<S_{1.4}<$200 mJy radio sources contribute $\ell(\ell+1)C_{\ell}/(2π)=0.37\pm0.03μ$K$^2$ to the angular power spectrum at $\ell=3000$ at 148 GHz, after accounting for the SZ effect associated with their host halos.
Submitted 23 October, 2014; v1 submitted 30 October, 2013;
originally announced October 2013.
-
NEOWISE Studies of Asteroids with Sloan Photometry: Preliminary Results
Authors:
A. Mainzer,
J. Masiero,
T. Grav,
J. Bauer,
D. J. Tholen,
R. S. McMillan,
E. Wright,
T. Spahr,
R. M. Cutri,
R. Walker,
W. Mo,
J. Watkins,
E. Hand,
C. Maleszewski
Abstract:
We have combined the NEOWISE and Sloan Digital Sky Survey data to study the albedos of 24,353 asteroids with candidate taxonomic classifications derived using Sloan photometry. We find a wide range of moderate to high albedos for candidate S-type asteroids that are analogous to the S-complex defined by previous spectrophotometrically-based taxonomic systems. The candidate C-type asteroids, while generally very dark, have a tail of higher albedos that overlaps the S types. The albedo distribution for asteroids with a photometrically derived Q classification is extremely similar to those of the S types. Asteroids with similar colors to (4) Vesta have higher albedos than the S types, and most have orbital elements similar to known Vesta family members. Finally, we show that the relative reflectance at 3.4 and 4.6 $μ$m is higher for D-type asteroids and suggest that their red visible and near-infrared spectral slope extends out to these wavelengths. Understanding the relationship between size, albedo, and taxonomic classification is complicated by the fact that the objects with classifications were selected from the visible/near-infrared Sloan Moving Object Catalog, which is biased against fainter asteroids, including those with lower albedos.
Submitted 22 October, 2011;
originally announced October 2011.
-
NEOWISE Studies of Spectrophotometrically Classified Asteroids: Preliminary Results
Authors:
A. Mainzer,
T. Grav,
J. Masiero,
J. Bauer,
E. Hand,
D. Tholen,
R. S. McMillan,
T. Spahr,
R. M. Cutri,
E. Wright,
J. Watkins,
W. Mo,
C. Maleszewski
Abstract:
The NEOWISE dataset offers the opportunity to study the variations in albedo for asteroid classification schemes based on visible and near-infrared observations for a large sample of minor planets. We have determined the albedos for nearly 1900 asteroids classified by the Tholen, Bus and Bus-DeMeo taxonomic classification schemes. We find that the S-complex spans a broad range of bright albedos, partially overlapping the low albedo C-complex at small sizes. As expected, the X-complex covers a wide range of albedos. The multi-wavelength infrared coverage provided by NEOWISE allows determination of the reflectivity at 3.4 and 4.6 $μ$m relative to the visible albedo. The direct computation of the reflectivity at 3.4 and 4.6 $μ$m enables a new means of comparing the various taxonomic classes. Although C, B, D and T asteroids all have similarly low visible albedos, the D and T types can be distinguished from the C and B types by examining their relative reflectance at 3.4 and 4.6 $μ$m. All of the albedo distributions are strongly affected by selection biases against small, low albedo objects, as all objects selected for taxonomic classification were chosen according to their visible light brightness. Due to these strong selection biases, we are unable to determine whether or not there are correlations between size, albedo and space weathering. We argue that the current set of classified asteroids makes any such correlations difficult to verify. A sample of taxonomically classified asteroids drawn without significant albedo bias is needed in order to perform such an analysis.
Submitted 29 September, 2011;
originally announced September 2011.
-
NEOWISE Observations of Near-Earth Objects: Preliminary Results
Authors:
A. Mainzer,
T. Grav,
J. Bauer,
J. Masiero,
R. S. McMillan,
R. M. Cutri,
R. Walker,
E. Wright,
P. Eisenhardt,
D. J. Tholen,
T. Spahr,
R. Jedicke,
L. Denneau,
E. DeBaun,
D. Elsbury,
T. Gautier,
S. Gomillion,
E. Hand,
W. Mo,
J. Watkins,
A. Wilkins,
G. L. Bryngelson,
A. Del Pino Molina,
S. Desai,
M. Gómez Camus
, et al. (12 additional authors not shown)
Abstract:
With the NEOWISE portion of the \emph{Wide-field Infrared Survey Explorer} (WISE) project, we have carried out a highly uniform survey of the near-Earth object (NEO) population at thermal infrared wavelengths ranging from 3 to 22 $μ$m, allowing us to refine estimates of their numbers, sizes, and albedos. The NEOWISE survey detected NEOs the same way whether they were previously known or not, subject to the availability of ground-based follow-up observations, resulting in the discovery of more than 130 new NEOs. The survey's uniformity in sensitivity, observing cadence, and image quality has permitted extrapolation of the 428 near-Earth asteroids (NEAs) detected by NEOWISE during the fully cryogenic portion of the WISE mission to the larger population. We find that there are 981$\pm$19 NEAs larger than 1 km and 20,500$\pm$3000 NEAs larger than 100 m. We show that the Spaceguard goal of detecting 90% of all 1 km NEAs has been met, and that the cumulative size distribution is best represented by a broken power law with a slope of 1.32$\pm$0.14 below 1.5 km. This power law slope produces $\sim13,200\pm$1,900 NEAs with $D>$140 m. Although previous studies predict another break in the cumulative size distribution below $D\sim$50-100 m, resulting in an increase in the number of NEOs in this size range and smaller, we did not detect enough objects to comment on this increase. The overall numbers for the NEA population between 100-1000 m are lower than previous estimates. The numbers of near-Earth comets will be the subject of future work.
Submitted 29 September, 2011;
originally announced September 2011.
-
Thermopower and thermal conductivity of superconducting perovskite $MgCNi_3$
Authors:
S. Y. Li,
W. Q. Mo,
M. Yu,
W. H. Zheng,
C. H. Wang,
Y. M. Xiong,
R. Fan,
H. S. Yang,
B. M. Wu,
L. Z. Cao,
X. H. Chen
Abstract:
The thermopower and thermal conductivity of superconducting perovskite $MgCNi_3$ ($T_c \approx$ 8 K) have been studied. The thermopower is negative from room temperature to 10 K. Combined with the negative Hall coefficient reported previously, the negative thermopower definitely indicates that the carrier in $MgCNi_3$ is electron-type. The nonlinear temperature dependence of thermopower below 150 K is explained by the electron-phonon interaction renormalization effects. The thermal conductivity is of the order for intermetallics, larger than that of borocarbides and smaller than $MgB_2$. In the normal state, the electronic contribution to the total thermal conductivity is slightly larger than the lattice contribution. The transverse magnetoresistance of $MgCNi_3$ is also measured. It is found that the classical Kohler's rule is valid above 50 K. An electronic crossover occurs at $T^* \sim 50 K$, resulting in the abnormal behavior of resistivity, thermopower, and magnetoresistance below 50 K.
Submitted 1 December, 2001; v1 submitted 16 July, 2001;
originally announced July 2001.
-
Normal state resistivity, upper critical field and Hall effect in superconducting perovskite $MgCNi_3$
Authors:
S. Y. Li,
R. Fan,
X. H. Chen,
C. H. Wang,
W. Q. Mo,
K. Q. Ruan,
Y. M. Xiong,
X. G. Luo,
H. T. Zhang,
L. Li,
Z. Sun,
L. Z. Cao
Abstract:
The normal state resistivity, upper critical field $H_{c2}$ and Hall coefficient $R_H$ in superconducting perovskite $MgCNi_3$ ($T_c \approx 8 K$) have been studied. Above 70 K, $ρ(T)$ fits well the curve predicted by Bloch-Grüneisen theory, consistent with electron-phonon scattering. $H_{c2}(0)$ was estimated to be about 15.0 Tesla within the weak-coupling BCS theory, and the superconducting coherence length $ξ(0)$ is approximately 47 Å. $R_H$ of $MgCNi_3$ is negative for the whole temperature range which definitely indicates that the carrier in $MgCNi_3$ is electron-type. $R_H$ is temperature independent between $T_c$ and $\sim$ 140 K. Above $\sim$ 140 K, the magnitude of $R_H$ decreases as temperature rises. At T = 100 K, the carrier density is $1.0 \times 10^{22}/cm^3$, which is comparable with that in perovskite $(Ba,K)BiO_3$, and less than that of the metallic binary $MgB_2$.
Submitted 1 June, 2001; v1 submitted 27 April, 2001;
originally announced April 2001.