-
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Authors:
Terry Yue Zhuo,
Minh Chien Vu,
Jenny Chim,
Han Hu,
Wenhao Yu,
Ratnadira Widyasari,
Imam Nur Bani Yusuf,
Haolan Zhan,
Junda He,
Indraneil Paul,
Simon Brunner,
Chen Gong,
Thong Hoang,
Armel Randy Zebaze,
Xiaoheng Hong,
Wen-Ding Li,
Jean Kaddour,
Ming Xu,
Zhihan Zhang,
Prateek Yadav,
Naman Jain,
Alex Gu,
Zhoujun Cheng,
Jiawei Liu,
Qian Liu
, et al. (8 additional authors not shown)
Abstract:
Task automation has been greatly empowered by the recent advances in Large Language Models (LLMs) via Python code, with tasks ranging from software engineering development to general-purpose reasoning. While current benchmarks have shown that LLMs can solve tasks using programs like human developers, the majority of their evaluations are limited to short and self-contained algorithmic tasks or standalone function calls. Solving challenging and practical tasks requires the capability of utilizing diverse function calls as tools to efficiently implement functionalities like data analysis and web development. In addition, using multiple tools to solve a task needs compositional reasoning by accurately understanding complex instructions. Fulfilling both of these characteristics can pose a great challenge for LLMs. To assess how well LLMs can solve challenging and practical tasks via programs, we introduce BigCodeBench, a benchmark that challenges LLMs to invoke multiple function calls as tools from 139 libraries and 7 domains for 1,140 fine-grained tasks. To evaluate LLMs rigorously, each task encompasses 5.6 test cases with an average branch coverage of 99%. In addition, we propose a natural-language-oriented variant of BigCodeBench, BigCodeBench-Instruct, that automatically transforms the original docstrings into short instructions only with essential information. Our extensive evaluation of 60 LLMs shows that LLMs are not yet capable of following complex instructions to use function calls precisely, with scores up to 60%, significantly lower than the human performance of 97%. The results underscore the need for further advancements in this area.
Submitted 7 October, 2024; v1 submitted 22 June, 2024;
originally announced June 2024.
-
StarCoder 2 and The Stack v2: The Next Generation
Authors:
Anton Lozhkov,
Raymond Li,
Loubna Ben Allal,
Federico Cassano,
Joel Lamy-Poirier,
Nouamane Tazi,
Ao Tang,
Dmytro Pykhtar,
Jiawei Liu,
Yuxiang Wei,
Tianyang Liu,
Max Tian,
Denis Kocetkov,
Arthur Zucker,
Younes Belkada,
Zijian Wang,
Qian Liu,
Dmitry Abulkhanov,
Indraneil Paul,
Zhuang Li,
Wen-Ding Li,
Megan Risdal,
Jia Li,
Jian Zhu,
Terry Yue Zhuo
, et al. (41 additional authors not shown)
Abstract:
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data sources, such as GitHub pull requests, Kaggle notebooks, and code documentation. This results in a training set that is 4x larger than the first StarCoder dataset. We train StarCoder2 models with 3B, 7B, and 15B parameters on 3.3 to 4.3 trillion tokens and thoroughly evaluate them on a comprehensive set of Code LLM benchmarks. We find that our small model, StarCoder2-3B, outperforms other Code LLMs of similar size on most benchmarks, and also outperforms StarCoderBase-15B. Our large model, StarCoder2-15B, significantly outperforms other models of comparable size. In addition, it matches or outperforms CodeLlama-34B, a model more than twice its size. Although DeepSeekCoder-33B is the best-performing model at code completion for high-resource languages, we find that StarCoder2-15B outperforms it on math and code reasoning benchmarks, as well as several low-resource languages. We make the model weights available under an OpenRAIL license and ensure full transparency regarding the training data by releasing the SoftWare Heritage persistent IDentifiers (SWHIDs) of the source code data.
Submitted 29 February, 2024;
originally announced February 2024.
-
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Authors:
Terry Yue Zhuo,
Armel Zebaze,
Nitchakarn Suppattarachai,
Leandro von Werra,
Harm de Vries,
Qian Liu,
Niklas Muennighoff
Abstract:
The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods. However, it remains unclear which methods provide the best cost-performance trade-off at different model scales. We introduce Astraios, a suite of 28 instruction-tuned OctoCoder models using 7 tuning methods and 4 model sizes up to 16 billion parameters. Through investigations across 5 tasks and 8 different datasets encompassing both code comprehension and code generation tasks, we find that FFT generally leads to the best downstream performance across all scales, and PEFT methods differ significantly in their efficacy based on the model scale. LoRA usually offers the most favorable trade-off between cost and performance. Further investigation into the effects of these methods on both model robustness and code security reveals that larger models tend to demonstrate reduced robustness and less security. Lastly, we explore the relationships among updated parameters, cross-entropy loss, and task performance. We find that the tuning effectiveness observed in small models generalizes well to larger models, and the validation loss in instruction tuning can be a reliable indicator of overall downstream performance.
Submitted 1 January, 2024;
originally announced January 2024.
-
The BigCode Project Governance Card
Authors:
BigCode collaboration,
Sean Hughes,
Harm de Vries,
Jennifer Robinson,
Carlos Muñoz Ferrandis,
Loubna Ben Allal,
Leandro von Werra,
Jennifer Ding,
Sebastien Paquet,
Yacine Jernite
Abstract:
This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support transparency by providing relevant information about choices that were made during the project to the broader public, and to serve as an example of intentional governance of an open research project that future endeavors can leverage to shape their own approach. The first section, Project Structure, covers the project organization, its stated goals and values, its internal decision processes, and its funding and resources. The second section, Data and Model Governance, covers decisions relating to the questions of data subject consent, privacy, and model release.
Submitted 6 December, 2023;
originally announced December 2023.
-
RepoFusion: Training Code Models to Understand Your Repository
Authors:
Disha Shrivastava,
Denis Kocetkov,
Harm de Vries,
Dzmitry Bahdanau,
Torsten Scholak
Abstract:
Despite the huge success of Large Language Models (LLMs) in coding assistants like GitHub Copilot, these models struggle to understand the context present in the repository (e.g., imports, parent classes, files with similar names, etc.), thereby producing inaccurate code completions. This effect is more pronounced when using these assistants for repositories that the model has not seen during training, such as proprietary software or work-in-progress code projects. Recent work has shown the promise of using context from the repository during inference. In this work, we extend this idea and propose RepoFusion, a framework to train models to incorporate relevant repository context. Experiments on single-line code completion show that our models trained with repository context significantly outperform much larger code models such as CodeGen-16B-multi (~73× larger) and closely match the performance of the ~70× larger StarCoderBase model that was trained with the Fill-in-the-Middle objective. We find these results to be a novel and compelling demonstration of the gains that training with repository context can bring. We carry out extensive ablation studies to investigate the impact of design choices such as context type, number of contexts, context length, and initialization within our framework. Lastly, we release Stack-Repo, a dataset of 200 Java repositories with permissive licenses and near-deduplicated files that are augmented with three types of repository contexts. Additionally, we are making available the code and trained checkpoints for our work. Our released resources can be found at https://huggingface.co/RepoFusion.
Submitted 19 June, 2023;
originally announced June 2023.
-
StarCoder: may the source be with you!
Authors:
Raymond Li,
Loubna Ben Allal,
Yangtian Zi,
Niklas Muennighoff,
Denis Kocetkov,
Chenghao Mou,
Marc Marone,
Christopher Akiki,
Jia Li,
Jenny Chim,
Qian Liu,
Evgenii Zheltonozhskii,
Terry Yue Zhuo,
Thomas Wang,
Olivier Dehaene,
Mishig Davaadorj,
Joel Lamy-Poirier,
João Monteiro,
Oleh Shliazhko,
Nicolas Gontier,
Nicholas Meade,
Armel Zebaze,
Ming-Ho Yee,
Logesh Kumar Umapathi,
Jian Zhu
, et al. (42 additional authors not shown)
Abstract:
The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large collection of permissively licensed GitHub repositories with inspection tools and an opt-out process. We fine-tuned StarCoderBase on 35B Python tokens, resulting in the creation of StarCoder. We perform the most comprehensive evaluation of Code LLMs to date and show that StarCoderBase outperforms every open Code LLM that supports multiple programming languages and matches or outperforms the OpenAI code-cushman-001 model. Furthermore, StarCoder outperforms every model that is fine-tuned on Python, can be prompted to achieve 40% pass@1 on HumanEval, and still retains its performance on other programming languages. We take several important steps towards a safe open-access model release, including an improved PII redaction pipeline and a novel attribution tracing tool, and make the StarCoder models publicly available under a more commercially viable version of the Open Responsible AI Model license.
Submitted 13 December, 2023; v1 submitted 9 May, 2023;
originally announced May 2023.
-
The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents
Authors:
Xing Han Lu,
Siva Reddy,
Harm de Vries
Abstract:
We introduce the StatCan Dialogue Dataset consisting of 19,379 conversation turns between agents working at Statistics Canada and online users looking for published data tables. The conversations stem from genuine intents, are held in English or French, and lead to agents retrieving one of over 5000 complex data tables. Based on this dataset, we propose two tasks: (1) automatic retrieval of relevant tables based on an ongoing conversation, and (2) automatic generation of appropriate agent responses at each turn. We investigate the difficulty of each task by establishing strong baselines. Our experiments on a temporal data split reveal that all models struggle to generalize to future conversations, as we observe a significant drop in performance across both tasks when we move from the validation to the test set. In addition, we find that response generation models struggle to decide when to return a table. Considering that the tasks pose significant challenges to existing models, we encourage the community to develop models for our task, which can be directly used to help knowledge workers find relevant tables for live chat users.
Submitted 4 April, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
SantaCoder: don't reach for the stars!
Authors:
Loubna Ben Allal,
Raymond Li,
Denis Kocetkov,
Chenghao Mou,
Christopher Akiki,
Carlos Munoz Ferrandis,
Niklas Muennighoff,
Mayank Mishra,
Alex Gu,
Manan Dey,
Logesh Kumar Umapathi,
Carolyn Jane Anderson,
Yangtian Zi,
Joel Lamy Poirier,
Hailey Schoelkopf,
Sergey Troshin,
Dmitry Abulkhanov,
Manuel Romero,
Michael Lappert,
Francesco De Toni,
Bernardo García del Río,
Qian Liu,
Shamik Bose,
Urvashi Bhattacharyya,
Terry Yue Zhuo
, et al. (16 additional authors not shown)
Abstract:
The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigating better preprocessing methods for the training data. We train 1.1B parameter models on the Java, JavaScript, and Python subsets of The Stack and evaluate them on the MultiPL-E text-to-code benchmark. We find that more aggressive filtering of near-duplicates can further boost performance and, surprisingly, that selecting files from repositories with 5+ GitHub stars deteriorates performance significantly. Our best model outperforms previous open-source multilingual code generation models (InCoder-6.7B and CodeGen-Multi-2.7B) in both left-to-right generation and infilling on the Java, JavaScript, and Python portions of MultiPL-E, despite being a substantially smaller model. All models are released under an OpenRAIL license at https://hf.co/bigcode.
Submitted 24 February, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
The Stack: 3 TB of permissively licensed source code
Authors:
Denis Kocetkov,
Raymond Li,
Loubna Ben Allal,
Jia Li,
Chenghao Mou,
Carlos Muñoz Ferrandis,
Yacine Jernite,
Margaret Mitchell,
Sean Hughes,
Thomas Wolf,
Dzmitry Bahdanau,
Leandro von Werra,
Harm de Vries
Abstract:
Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for code understanding and generation. To stimulate open and responsible research on LLMs for code, we introduce The Stack, a 3.1 TB dataset consisting of permissively licensed source code in 30 programming languages. We describe how we collect the full dataset, construct a permissively licensed subset, present a data governance plan, discuss limitations, and show promising results on text2code benchmarks by training 350M-parameter decoders on different Python subsets. We find that (1) near-deduplicating the data significantly boosts performance across all experiments, and (2) it is possible to match previously reported HumanEval and MBPP performance using only permissively licensed data. We make the dataset available at https://hf.co/BigCode, provide a tool called "Am I in The Stack" (https://hf.co/spaces/bigcode/in-the-stack) for developers to search The Stack for copies of their code, and provide a process for code to be removed from the dataset by following the instructions at https://www.bigcode-project.org/docs/about/the-stack/.
Submitted 20 November, 2022;
originally announced November 2022.
-
The Power of Prompt Tuning for Low-Resource Semantic Parsing
Authors:
Nathan Schucher,
Siva Reddy,
Harm de Vries
Abstract:
Prompt tuning has recently emerged as an effective method for adapting pre-trained language models to a number of language understanding and generation tasks. In this paper, we investigate prompt tuning for semantic parsing -- the task of mapping natural language utterances onto formal meaning representations. On the low-resource splits of Overnight and TOPv2, we find that a prompt tuned T5-xl significantly outperforms its fine-tuned counterpart, as well as strong GPT-3 and BART baselines. We also conduct ablation studies across different model scales and target representations, finding that, with increasing model scale, prompt tuned T5 models improve at generating target representations that are far from the pre-training distribution.
Submitted 1 April, 2022; v1 submitted 16 October, 2021;
originally announced October 2021.
-
TopiOCQA: Open-domain Conversational Question Answering with Topic Switching
Authors:
Vaibhav Adlakha,
Shehzaad Dhuliawala,
Kaheer Suleman,
Harm de Vries,
Siva Reddy
Abstract:
In a conversational question answering scenario, a questioner seeks to extract information about a topic through a series of interdependent questions and answers. As the conversation progresses, they may switch to related topics, a phenomenon commonly observed in information-seeking search sessions. However, current datasets for conversational question answering are limiting in two ways: 1) they do not contain topic switches; and 2) they assume the reference text for the conversation is given, i.e., the setting is not open-domain. We introduce TopiOCQA (pronounced Tapioca), an open-domain conversational dataset with topic switches on Wikipedia. TopiOCQA contains 3,920 conversations with information-seeking questions and free-form answers. On average, a conversation in our dataset spans 13 question-answer turns and involves four topics (documents). TopiOCQA poses a challenging test-bed for models, where efficient retrieval is required on multiple turns of the same conversation, in conjunction with constructing valid responses using conversational history. We evaluate several baselines by combining state-of-the-art document retrieval methods with neural reader models. Our best model achieves F1 of 55.8, falling short of human performance by 14.2 points, indicating the difficulty of our dataset. Our dataset and code are available at https://mcgill-nlp.github.io/topiocqa
Submitted 20 February, 2022; v1 submitted 2 October, 2021;
originally announced October 2021.
-
DuoRAT: Towards Simpler Text-to-SQL Models
Authors:
Torsten Scholak,
Raymond Li,
Dzmitry Bahdanau,
Harm de Vries,
Chris Pal
Abstract:
Recent neural text-to-SQL models can effectively translate natural language questions to corresponding SQL queries on unseen databases. Working mostly on the Spider dataset, researchers have proposed increasingly sophisticated solutions to the problem. Contrary to this trend, in this paper we focus on simplifications. We begin by building DuoRAT, a re-implementation of the state-of-the-art RAT-SQL model that, unlike RAT-SQL, uses only relation-aware or vanilla transformers as the building blocks. We perform several ablation experiments using DuoRAT as the baseline model. Our experiments confirm the usefulness of some techniques and point out the redundancy of others, including structural SQL features and features that link the question with the schema.
Submitted 10 September, 2021; v1 submitted 21 October, 2020;
originally announced October 2020.
-
Towards Ecologically Valid Research on Language User Interfaces
Authors:
Harm de Vries,
Dzmitry Bahdanau,
Christopher Manning
Abstract:
Language User Interfaces (LUIs) could improve human-machine interaction for a wide variety of tasks, such as playing music, getting insights from databases, or instructing domestic robots. In contrast to traditional hand-crafted approaches, recent work attempts to build LUIs in a data-driven way using modern deep learning methods. To satisfy the data needs of such learning algorithms, researchers have constructed benchmarks that emphasize the quantity of collected data at the cost of its naturalness and relevance to real-world LUI use cases. As a consequence, research findings on such benchmarks might not be relevant for developing practical LUIs. The goal of this paper is to bootstrap the discussion around this issue, which we refer to as the benchmarks' low ecological validity. To this end, we describe what we deem an ideal methodology for machine learning research on LUIs and categorize five common ways in which recent benchmarks deviate from it. We give concrete examples of the five kinds of deviations and their consequences. Lastly, we offer a number of recommendations as to how to increase the ecological validity of machine learning research on LUIs.
Submitted 28 July, 2020;
originally announced July 2020.
-
Generative Compositional Augmentations for Scene Graph Prediction
Authors:
Boris Knyazev,
Harm de Vries,
Cătălina Cangea,
Graham W. Taylor,
Aaron Courville,
Eugene Belilovsky
Abstract:
Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges in this task due to a long tail data distribution. Current scene graph generation models are trained on a tiny fraction of the distribution corresponding to the most frequent compositions, e.g. <cup, on, table>. However, test images might contain zero- and few-shot compositions of objects and relationships, e.g. <cup, on, surfboard>. Despite each of the object categories and the predicate (e.g. 'on') being frequent in the training data, the models often fail to properly understand such unseen or rare compositions. To improve generalization, it is natural to attempt increasing the diversity of the training distribution. However, in the graph domain this is non-trivial. To that end, we propose a method to synthesize rare yet plausible scene graphs by perturbing real ones. We then propose and empirically study a model based on conditional generative adversarial networks (GANs) that allows us to generate visual features of perturbed scene graphs and learn from them in a joint fashion. When evaluated on the Visual Genome dataset, our approach yields marginal, but consistent improvements in zero- and few-shot metrics. We analyze the limitations of our approach indicating promising directions for future research.
Submitted 1 October, 2021; v1 submitted 11 July, 2020;
originally announced July 2020.
-
Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation
Authors:
Boris Knyazev,
Harm de Vries,
Cătălina Cangea,
Graham W. Taylor,
Aaron Courville,
Eugene Belilovsky
Abstract:
Scene graph generation (SGG) aims to predict graph-structured descriptions of input images, in the form of objects and relationships between them. This task is becoming increasingly useful for progress at the interface of vision and language. Here, it is important - yet challenging - to perform well on novel (zero-shot) or rare (few-shot) compositions of objects and relationships. In this paper, we identify two key issues that limit such generalization. Firstly, we show that the standard loss used in this task is unintentionally a function of scene graph density. This leads to the neglect of individual edges in large sparse graphs during training, even though these contain diverse few-shot examples that are important for generalization. Secondly, the frequency of relationships can create a strong bias in this task, such that a blind model predicting the most frequent relationship achieves good performance. Consequently, some state-of-the-art models exploit this bias to improve results. We show that such models can suffer the most in their ability to generalize to rare compositions, evaluating two different models on the Visual Genome dataset and its more recent, improved version, GQA. To address these issues, we introduce a density-normalized edge loss, which provides more than a two-fold improvement in certain generalization metrics. Compared to other works in this direction, our enhancements require only a few lines of code and no added computational cost. We also highlight the difficulty of accurately evaluating models using existing metrics, especially on zero/few shots, and introduce a novel weighted metric.
Submitted 17 August, 2020; v1 submitted 17 May, 2020;
originally announced May 2020.
-
CLOSURE: Assessing Systematic Generalization of CLEVR Models
Authors:
Dzmitry Bahdanau,
Harm de Vries,
Timothy J. O'Donnell,
Shikhar Murty,
Philippe Beaudoin,
Yoshua Bengio,
Aaron Courville
Abstract:
The CLEVR dataset of natural-looking questions about 3D-rendered scenes has recently received much attention from the research community. A number of models have been proposed for this task, many of which achieved very high accuracies of around 97-99%. In this work, we study how systematic the generalization of such models is, that is, to what extent they are capable of handling novel combinations of known linguistic constructs. To this end, we test models' understanding of referring expressions based on matching object properties (such as "another cube that is the same size as the brown cube") in novel contexts. Our experiments on the thereby constructed CLOSURE benchmark show that state-of-the-art models often do not exhibit systematicity after being trained on CLEVR. Surprisingly, we find that an explicitly compositional Neural Module Network model also generalizes badly on CLOSURE, even when it has access to the ground-truth programs at test time. We improve the NMN's systematic generalization by developing a novel Vector-NMN module architecture with vector-valued inputs and outputs. Lastly, we investigate how much few-shot transfer learning can help models that are pretrained on CLEVR to adapt to CLOSURE. Our few-shot learning experiments contrast the adaptation behavior of the models with intermediate discrete programs with that of the end-to-end continuous models.
Submitted 17 October, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Single-cell eQTLGen Consortium: a personalized understanding of disease
Authors:
Monique G. P. van der Wijst,
Dylan H. de Vries,
Hilde E. Groot,
Gosia Trynka,
Chung-Chau Hon,
Martijn C. Nawijn,
Youssef Idaghdour,
Pim van der Harst,
Chun J. Ye,
Joseph Powell,
Fabian J. Theis,
Ahmed Mahfouz,
Matthias Heinig,
Lude Franke
Abstract:
In recent years, functional genomics approaches combining genetic information with bulk RNA-sequencing data have identified the downstream expression effects of disease-associated genetic risk factors through so-called expression quantitative trait locus (eQTL) analysis. Single-cell RNA-sequencing creates enormous opportunities for mapping eQTLs across different cell types and in dynamic processes, many of which are obscured when using bulk methods. The enormous increase in throughput and reduction in cost per cell now allow this technology to be applied to large-scale population genetics studies. Therefore, we have founded the single-cell eQTLGen consortium (sc-eQTLGen), aimed at pinpointing disease-causing genetic variants and identifying the cellular contexts in which they affect gene expression. Ultimately, this information can enable development of personalized medicine. Here, we outline the goals, approach, potential utility and early proofs-of-concept of the sc-eQTLGen consortium. We also provide a set of study design considerations for future single-cell eQTL studies.
Submitted 27 September, 2019;
originally announced September 2019.
-
Transferable MARTINI Model of Poly(ethylene Oxide)
Authors:
Fabian Grunewald,
Giulia Rossi,
Alex H. de Vries,
Siewert J. Marrink,
Luca Monticelli
Abstract:
Motivated by the deficiencies of the previous MARTINI models of poly(ethylene oxide) (PEO), we present a new model featuring a high degree of transferability. The model is parametrized on (a) a set of 8 free energies of transfer of dimethoxyethane (PEO dimer) from water to solvents of varying polarity; (b) the radius of gyration in water at high dilution; and (c) matching angle and dihedral distributions from atomistic simulations. We demonstrate that our model behaves well in five different areas of application: (1) it produces accurate densities and phase behavior of small PEO oligomers and water mixtures; (2) it yields chain dimensions in good agreement with the experiment in three different solvents (water, diglyme, and benzene) over a broad range of molecular weights (1.2 kg/mol to 21 kg/mol); (3) it reproduces qualitatively the structural features of lipid bilayers containing PEGylated lipids in the brush and mushroom regime; (4) it is able to reproduce the phase behavior of several PEO-based nonionic surfactants in water; and (5) it can be combined with the existing MARTINI PS to model PS/PEO block copolymers. Overall, the new PEO model outperforms previous models and features a high degree of transferability.
Submitted 14 January, 2019;
originally announced January 2019.
-
Systematic Generalization: What Is Required and Can It Be Learned?
Authors:
Dzmitry Bahdanau,
Shikhar Murty,
Michael Noukhovitch,
Thien Huu Nguyen,
Harm de Vries,
Aaron Courville
Abstract:
Numerous models for grounded language understanding have been recently proposed, including (i) generic models that can be easily adapted to any given task and (ii) intuitively appealing modular models that require background knowledge to be instantiated. We compare both types of models in how much they lend themselves to a particular form of systematic generalization. Using a synthetic VQA test, we evaluate which models are capable of reasoning about all possible object pairs after training on only a small subset of them. Our findings show that the generalization of modular models is much more systematic and that it is highly sensitive to the module layout, i.e. to how exactly the modules are connected. We furthermore investigate if modular models that generalize well could be made more end-to-end by learning their layout and parametrization. We find that end-to-end methods from prior work often learn inappropriate layouts or parametrizations that do not facilitate systematic generalization. Our results suggest that, in addition to modularity, systematic generalization in language understanding may require explicit regularizers or priors.
Submitted 21 April, 2019; v1 submitted 30 November, 2018;
originally announced November 2018.
-
Visual Reasoning with Multi-hop Feature Modulation
Authors:
Florian Strub,
Mathieu Seurin,
Ethan Perez,
Harm de Vries,
Jérémie Mary,
Philippe Preux,
Aaron Courville,
Olivier Pietquin
Abstract:
Recent breakthroughs in computer vision and natural language processing have spurred interest in challenging multi-modal tasks such as visual question-answering and visual dialogue. For such tasks, one successful approach is to condition image-based convolutional network computation on language via Feature-wise Linear Modulation (FiLM) layers, i.e., per-channel scaling and shifting. We propose to generate the parameters of FiLM layers going up the hierarchy of a convolutional network in a multi-hop fashion rather than all at once, as in prior work. By alternating between attending to the language input and generating FiLM layer parameters, this approach is better able to scale to settings with longer input sequences such as dialogue. We demonstrate that multi-hop FiLM generation achieves state-of-the-art results on the short input sequence task ReferIt, on par with single-hop FiLM generation, while also significantly outperforming prior state-of-the-art and single-hop FiLM generation on the GuessWhat?! visual dialogue task.
Submitted 12 October, 2018; v1 submitted 3 August, 2018;
originally announced August 2018.
-
Talk the Walk: Navigating New York City through Grounded Dialogue
Authors:
Harm de Vries,
Kurt Shuster,
Dhruv Batra,
Devi Parikh,
Jason Weston,
Douwe Kiela
Abstract:
We introduce "Talk The Walk", the first large-scale dialogue dataset grounded in action and perception. The task involves two agents (a "guide" and a "tourist") that communicate via natural language in order to achieve a common goal: having the tourist navigate to a given target location. The task and dataset, which are described in detail, are challenging and their full solution is an open problem that we pose to the community. We (i) focus on the task of tourist localization and develop the novel Masked Attention for Spatial Convolutions (MASC) mechanism that allows for grounding tourist utterances into the guide's map, (ii) show it yields significant improvements for both emergent and natural language communication, and (iii) using this method, we establish non-trivial baselines on the full task.
Submitted 23 December, 2018; v1 submitted 9 July, 2018;
originally announced July 2018.
-
FiLM: Visual Reasoning with a General Conditioning Layer
Authors:
Ethan Perez,
Florian Strub,
Harm de Vries,
Vincent Dumoulin,
Aaron Courville
Abstract:
We introduce a general-purpose conditioning method for neural networks called FiLM: Feature-wise Linear Modulation. FiLM layers influence neural network computation via a simple, feature-wise affine transformation based on conditioning information. We show that FiLM layers are highly effective for visual reasoning - answering image-related questions which require a multi-step, high-level process - a task which has proven difficult for standard deep learning methods that do not explicitly model reasoning. Specifically, we show on visual reasoning tasks that FiLM layers 1) halve state-of-the-art error for the CLEVR benchmark, 2) modulate features in a coherent manner, 3) are robust to ablations and architectural modifications, and 4) generalize well to challenging, new data from few examples or even zero-shot.
Submitted 18 December, 2017; v1 submitted 22 September, 2017;
originally announced September 2017.
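The feature-wise affine transformation described in the FiLM abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `film` and the hand-set `gamma` and `beta` values are assumptions made for illustration, whereas in the paper these parameters are predicted from the conditioning input (for example, a question).

```python
from typing import List, Sequence


def film(features: Sequence[Sequence[float]],
         gamma: Sequence[float],
         beta: Sequence[float]) -> List[List[float]]:
    """Apply a per-channel scale (gamma) and shift (beta) to a feature map.

    `features` holds one inner sequence of activations per channel.
    Hypothetical helper for illustration only; gamma and beta would
    normally be produced by a network that reads the conditioning input.
    """
    if not (len(features) == len(gamma) == len(beta)):
        raise ValueError("need one gamma and one beta per channel")
    return [[g * x + b for x in channel]
            for channel, g, b in zip(features, gamma, beta)]


# Two channels with two activations each; gamma and beta are hand-set here.
features = [[1.0, 2.0], [3.0, 4.0]]
modulated = film(features, gamma=[2.0, 0.5], beta=[0.0, 1.0])
print(modulated)  # [[2.0, 4.0], [2.5, 3.0]]
```

Every activation in a channel shares the same scale and shift, which is what makes the modulation feature-wise rather than element-wise.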
-
Learning Visual Reasoning Without Strong Priors
Authors:
Ethan Perez,
Harm de Vries,
Florian Strub,
Vincent Dumoulin,
Aaron Courville
Abstract:
Achieving artificial visual reasoning - the ability to answer image-related questions which require a multi-step, high-level process - is an important step towards artificial general intelligence. This multi-modal task requires learning a question-dependent, structured reasoning process over images from language. Standard deep learning approaches tend to exploit biases in the data rather than learn this underlying structure, while leading methods learn to visually reason successfully but are hand-crafted for reasoning. We show that a general-purpose, Conditional Batch Normalization approach achieves state-of-the-art results on the CLEVR Visual Reasoning benchmark with a 2.4% error rate. We outperform the next best end-to-end method (4.5%) and even methods that use extra supervision (3.1%). We probe our model to shed light on how it reasons, showing it has learned a question-dependent, multi-step process. Previous work has operated under the assumption that visual reasoning calls for a specialized architecture, but we show that a general architecture with proper conditioning can learn to visually reason effectively.
Submitted 18 December, 2017; v1 submitted 10 July, 2017;
originally announced July 2017.
-
Modulating early visual processing by language
Authors:
Harm de Vries,
Florian Strub,
Jérémie Mary,
Hugo Larochelle,
Olivier Pietquin,
Aaron Courville
Abstract:
It is commonly assumed that language refers to high-level visual concepts while leaving low-level visual processing unaffected. This view dominates the current literature in computational models for language-vision tasks, where visual and linguistic input are mostly processed independently before being fused into a single representation. In this paper, we deviate from this classic pipeline and propose to modulate the entire visual processing by linguistic input. Specifically, we condition the batch normalization parameters of a pretrained residual network (ResNet) on a language embedding. This approach, which we call MOdulated RESnet (MODERN), significantly improves strong baselines on two visual question answering tasks. Our ablation study shows that modulating from the early stages of the visual processing is beneficial.
Submitted 18 December, 2017; v1 submitted 2 July, 2017;
originally announced July 2017.
-
End-to-end optimization of goal-driven and visually grounded dialogue systems
Authors:
Florian Strub,
Harm de Vries,
Jeremie Mary,
Bilal Piot,
Aaron Courville,
Olivier Pietquin
Abstract:
End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision is too simplistic to render the intrinsic planning problem inherent to dialogue as well as its grounded nature, making the context of a dialogue larger than the sole history. This is why only chit-chat and question answering tasks have been addressed so far using end-to-end architectures. In this paper, we introduce a Deep Reinforcement Learning method to optimize visually grounded task-oriented dialogues, based on the policy gradient algorithm. This approach is tested on a dataset of 120k dialogues collected through Mechanical Turk and provides encouraging results at solving both the problem of generating natural dialogues and the task of discovering a specific object in a complex picture.
Submitted 15 March, 2017;
originally announced March 2017.
-
GuessWhat?! Visual object discovery through multi-modal dialogue
Authors:
Harm de Vries,
Florian Strub,
Sarath Chandar,
Olivier Pietquin,
Hugo Larochelle,
Aaron Courville
Abstract:
We introduce GuessWhat?!, a two-player guessing game as a testbed for research on the interplay of computer vision and dialogue systems. The goal of the game is to locate an unknown object in a rich image scene by asking a sequence of questions. Higher-level image understanding, like spatial reasoning and language grounding, is required to solve the proposed task. Our key contribution is the collection of a large-scale dataset consisting of 150K human-played games with a total of 800K visual question-answer pairs on 66K images. We explain our design decisions in collecting the dataset and introduce the oracle and questioner tasks that are associated with the two players of the game. We prototyped deep learning models to establish initial baselines of the introduced tasks.
Submitted 6 February, 2017; v1 submitted 23 November, 2016;
originally announced November 2016.
-
Theano: A Python framework for fast computation of mathematical expressions
Authors:
The Theano Development Team,
Rami Al-Rfou,
Guillaume Alain,
Amjad Almahairi,
Christof Angermueller,
Dzmitry Bahdanau,
Nicolas Ballas,
Frédéric Bastien,
Justin Bayer,
Anatoly Belikov,
Alexander Belopolsky,
Yoshua Bengio,
Arnaud Bergeron,
James Bergstra,
Valentin Bisson,
Josh Bleecher Snyder,
Nicolas Bouchard,
Nicolas Boulanger-Lewandowski,
Xavier Bouthillier,
Alexandre de Brébisson,
Olivier Breuleux,
Pierre-Luc Carrier,
Kyunghyun Cho,
Jan Chorowski,
Paul Christiano
, et al. (88 additional authors not shown)
Abstract:
Theano is a Python library that allows one to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano has been actively and continuously developed since 2008; multiple frameworks have been built on top of it, and it has been used to produce many state-of-the-art machine learning models.
The present article is structured as follows. Section I provides an overview of the Theano software and its community. Section II presents the principal features of Theano and how to use them, and compares them with other similar projects. Section III focuses on recently-introduced functionalities and improvements. Section IV compares the performance of Theano against Torch7 and TensorFlow on several machine learning models. Section V discusses current limitations of Theano and potential ways of improving it.
Submitted 9 May, 2016;
originally announced May 2016.
-
Can deep learning help you find the perfect match?
Authors:
Harm de Vries,
Jason Yosinski
Abstract:
Is he/she my type or not? The answer to this question depends on the personal preferences of the one asking it. The individual process of obtaining a full answer may generally be difficult and time consuming, but often an approximate answer can be obtained simply by looking at a photo of the potential match. Such approximate answers based on visual cues can be produced in a fraction of a second, a phenomenon that has led to a series of recently successful dating apps in which users rate others positively or negatively using primarily a single photo. In this paper we explore using convolutional networks to create a model of an individual's personal preferences based on rated photos. This introduced task is difficult due to the large number of variations in profile pictures and the noise in attractiveness labels. Toward this task we collect a dataset comprised of $9364$ pictures and binary labels for each. We compare performance of convolutional models trained in three ways: first directly on the collected dataset, second with features transferred from a network trained to predict gender, and third with features transferred from a network trained on ImageNet. Our findings show that ImageNet features transfer best, producing a model that attains $68.1\%$ accuracy on the test set and is moderately successful at predicting matches.
Submitted 20 June, 2015; v1 submitted 2 May, 2015;
originally announced May 2015.
-
Equilibrated adaptive learning rates for non-convex optimization
Authors:
Yann N. Dauphin,
Harm de Vries,
Yoshua Bengio
Abstract:
Parameter-specific adaptive learning rate methods are computationally efficient ways to reduce the ill-conditioning problems encountered when training large deep networks. Following recent work that strongly suggests that most of the critical points encountered when training such networks are saddle points, we find how considering the presence of negative eigenvalues of the Hessian could help us design better suited adaptive learning rate schemes. We show that the popular Jacobi preconditioner has undesirable behavior in the presence of both positive and negative curvature, and present theoretical and empirical evidence that the so-called equilibration preconditioner is comparatively better suited to non-convex problems. We introduce a novel adaptive learning rate scheme, called ESGD, based on the equilibration preconditioner. Our experiments show that ESGD performs as well as or better than RMSProp in terms of convergence speed, always clearly improving over plain stochastic gradient descent.
Submitted 29 August, 2015; v1 submitted 15 February, 2015;
originally announced February 2015.
-
Measurement of the two-photon exchange contribution to the elastic $e^{\pm}p$ scattering cross sections at the VEPP-3 storage ring
Authors:
I. A. Rachek,
J. Arrington,
V. F. Dmitriev,
V. V. Gauzshtein,
R. E. Gerasimov,
A. V. Gramolin,
R. J. Holt,
V. V. Kaminskiy,
B. A. Lazarenko,
S. I. Mishnev,
N. Yu. Muchnoi,
V. V. Neufeld,
D. M. Nikolenko,
R. Sh. Sadykov,
Yu. V. Shestakov,
V. N. Stibunov,
D. K. Toporkov,
H. de Vries,
S. A. Zevakov,
V. N. Zhilich
Abstract:
The ratio of the elastic $e^+ p$ to $e^- p$ scattering cross sections has been measured precisely, allowing the determination of the two-photon exchange contribution to these processes. This neglected contribution is believed to be the cause of the discrepancy between the Rosenbluth and polarization transfer methods of measuring the proton electromagnetic form factors. The experiment was performed at the VEPP-3 storage ring at beam energies of 1.6 and 1.0 GeV and at lepton scattering angles between $15^\circ$ and $105^\circ$. The data obtained show evidence of a significant two-photon exchange effect. The results are compared with several theoretical predictions.
Submitted 12 February, 2015; v1 submitted 26 November, 2014;
originally announced November 2014.
-
The Katowice problem and autohomeomorphisms of $ω^*$
Authors:
David Chodounsky,
Alan Dow,
Klaas Pieter Hart,
Harm de Vries
Abstract:
We show that the existence of a homeomorphism between $ω_0^*$ and $ω_1^*$ entails the existence of a non-trivial autohomeomorphism of $ω_0^*$.
Submitted 6 August, 2015; v1 submitted 15 July, 2013;
originally announced July 2013.
-
Measurement of the two-photon exchange contribution in elastic $ep$ scattering at VEPP-3
Authors:
A. V. Gramolin,
J. Arrington,
L. M. Barkov,
V. F. Dmitriev,
V. V. Gauzshtein,
R. A. Golovin,
R. J. Holt,
V. V. Kaminsky,
B. A. Lazarenko,
S. I. Mishnev,
N. Yu. Muchnoi,
V. V. Neufeld,
D. M. Nikolenko,
I. A. Rachek,
R. Sh. Sadykov,
Yu. V. Shestakov,
V. N. Stibunov,
D. K. Toporkov,
H. de Vries,
S. A. Zevakov,
V. N. Zhilich
Abstract:
We report on the status of the Novosibirsk experiment on a precision measurement of the ratio $R$ of the elastic $e^+ p$ and $e^- p$ scattering cross sections. Such measurements determine the two-photon exchange effect in elastic electron-proton scattering. The experiment is conducted at the VEPP-3 storage ring using a hydrogen internal gas target. The ratio $R$ is measured with a beam energy of 1.6 GeV (electron/positron scattering angles are $θ= 55^{\circ}$ to $75^{\circ}$ and $θ= 15^{\circ}$ to $25^{\circ}$) and 1 GeV ($θ= 65^{\circ}$ to $105^{\circ}$). We briefly describe the experimental method, paying special attention to the radiative corrections. Some preliminary results are presented.
Submitted 22 December, 2011;
originally announced December 2011.
-
Modeling the Time Variability of SDSS Stripe 82 Quasars as a Damped Random Walk
Authors:
C. L. MacLeod,
Ž. Ivezić,
C. S. Kochanek,
S. Kozłowski,
B. C. Kelly,
E. Bullock,
A. Kimball,
B. Sesar,
D. Westman,
K. Brooks,
R. Gibson,
A. C. Becker,
W. H. de Vries
Abstract:
We model the time variability of ~9,000 spectroscopically confirmed quasars in SDSS Stripe 82 as a damped random walk. Using 2.7 million photometric measurements collected over 10 years, we confirm the results of Kelly et al. (2009) and Kozłowski et al. (2010) that this model can explain quasar light curves at an impressive fidelity level (0.01-0.02 mag). The damped random walk model provides a simple, fast [O(N) for N data points], and powerful statistical description of quasar light curves by a characteristic time scale (tau) and an asymptotic rms variability on long time scales (SF_inf). We searched for correlations between these two variability parameters and physical parameters such as luminosity and black hole mass, and rest-frame wavelength. We find that tau increases with increasing wavelength with a power law index of 0.17, remains nearly constant with redshift and luminosity, and increases with increasing black hole mass with power law index of 0.21+/-0.07. The amplitude of variability is anti-correlated with the Eddington ratio, which suggests a scenario where optical fluctuations are tied to variations in the accretion rate. The radio-loudest quasars have systematically larger variability amplitudes by about 30%, when corrected for the other observed trends, while the distribution of their characteristic time scale is indistinguishable from that of the full sample. We do not detect any statistically robust differences in the characteristic time scale and variability amplitude between the full sample and the small subsample of quasars detected by ROSAT. Our results provide a simple quantitative framework for generating mock quasar light curves, such as currently used in LSST image simulations. (abridged)
Submitted 21 August, 2010; v1 submitted 1 April, 2010;
originally announced April 2010.
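The abstract above notes that the damped random walk offers a fast, O(N) way to generate mock quasar light curves from a time scale (tau) and a long-term variability amplitude (SF_inf). The following is a minimal sketch of that idea, not the authors' code. The function name `simulate_drw` and its parameters are hypothetical, and the relation SF_inf = sqrt(2) * sigma between the asymptotic structure function and the stationary standard deviation is the usual damped-random-walk convention, assumed here rather than taken from the abstract.

```python
import math
import random
from typing import List, Sequence


def simulate_drw(times: Sequence[float], tau: float, sf_inf: float,
                 mean: float = 0.0, seed: int = 0) -> List[float]:
    """Draw one mock light curve from a damped random walk in O(N) time.

    times  : observation epochs in non-decreasing order (same units as tau)
    tau    : characteristic damping time scale
    sf_inf : asymptotic structure function; sigma = sf_inf / sqrt(2) is an
             assumed convention, not stated in the abstract above
    """
    if not times:
        return []
    rng = random.Random(seed)
    sigma = sf_inf / math.sqrt(2.0)
    # Start from the stationary distribution of the process.
    values = [mean + rng.gauss(0.0, sigma)]
    for t_prev, t_next in zip(times, times[1:]):
        dt = t_next - t_prev
        if dt < 0:
            raise ValueError("times must be non-decreasing")
        decay = math.exp(-dt / tau)
        # Exact conditional update: the previous deviation decays toward the
        # mean, and fresh noise keeps the stationary variance at sigma**2.
        step_sd = sigma * math.sqrt(1.0 - decay * decay)
        deviation = decay * (values[-1] - mean) + rng.gauss(0.0, step_sd)
        values.append(mean + deviation)
    return values


# Irregularly sampled epochs in days; all parameter values are illustrative.
epochs = [0.0, 3.0, 10.0, 40.0, 365.0]
curve = simulate_drw(epochs, tau=200.0, sf_inf=0.2, mean=19.0, seed=1)
print(len(curve))  # 5
```

Because each point depends only on the previous one, the update handles uneven sampling directly and costs one pass over the epochs.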
-
The Transitional Stripped-Envelope SN 2008ax: Spectral Evolution and Evidence for Large Asphericity
Authors:
R. Chornock,
A. V. Filippenko,
W. Li,
G. H. Marion,
R. J. Foley,
M. Modjaz,
M. Rafelski,
G. D. Becker,
W. H. de Vries,
P. Garnavich,
R. A. Jorgenson,
D. K. Lynch,
A. L. Malec,
E. C. Moran,
M. T. Murphy,
R. J. Rudy,
R. W. Russell,
J. M. Silverman,
T. N. Steele,
A. Stockton,
A. M. Wolfe,
C. E. Woodward
Abstract:
Supernova (SN) 2008ax in NGC 4490 was discovered within hours after shock breakout, presenting the rare opportunity to study a core-collapse SN beginning with the initial envelope-cooling phase immediately following shock breakout. We present an extensive sequence of optical and near-infrared spectra, as well as three epochs of optical spectropolarimetry. Our initial spectra, taken two days after shock breakout, are dominated by hydrogen Balmer lines at high velocity. However, by maximum light, He I lines dominated the optical and near-infrared spectra, which closely resembled those of normal Type Ib supernovae (SNe Ib) such as SN 1999ex. This spectroscopic transition defines Type IIb supernovae, but the strong similarity of SN 2008ax to normal SNe Ib beginning near maximum light, including an absorption feature near 6270A due to H-alpha at high velocities, suggests that many objects classified as SNe Ib in the literature may have ejected similar amounts of hydrogen as SN 2008ax, roughly a few x 0.01 M_sun. Early-time spectropolarimetry (6 and 9 days after shock breakout) revealed strong line polarization modulations of 3.4% across H-alpha, indicating the presence of large asphericities in the outer ejecta. The continuum shares a common polarization angle with the hydrogen, helium, and oxygen lines, while the calcium and iron absorptions are oriented at different angles. This is clear evidence of deviations from axisymmetry even in the outer ejecta. Intrinsic continuum polarization of 0.64% only nine days after shock breakout shows that the outer layers of the ejecta were quite aspherical. A single epoch of late-time spectropolarimetry, as well as the shapes of the nebular line profiles, demonstrate that asphericities extended from the outermost layers all the way down to the center of this SN. [Abridged]
Submitted 7 July, 2011; v1 submitted 16 January, 2010;
originally announced January 2010.
-
Roadmap for selected key measurements of LHCb
Authors:
The LHCb Collaboration,
B. Adeva,
M. Adinolfi,
A. Affolder,
Z. Ajaltouni,
J. Albrecht,
F. Alessio,
M. Alexander,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
J. Amoraal,
J. Anderson,
O. Aquines Gutierrez,
L. Arrabito,
M. Artuso,
E. Aslanides,
G. Auriemma,
S. Bachmann,
Y. Bagaturia,
D. S. Bailey,
V. Balagura,
W. Baldini,
MdC. Barandela Pazos
, et al. (487 additional authors not shown)
Abstract:
Six of the key physics measurements that will be made by the LHCb experiment, concerning CP asymmetries and rare B decays, are discussed in detail. The "road map" towards the precision measurements is presented, including the use of control channels and other techniques to understand the performance of the detector with the first data from the LHC.
Submitted 23 November, 2010; v1 submitted 18 December, 2009;
originally announced December 2009.
-
LSST Science Book, Version 2.0
Authors:
LSST Science Collaboration,
Paul A. Abell,
Julius Allison,
Scott F. Anderson,
John R. Andrew,
J. Roger P. Angel,
Lee Armus,
David Arnett,
S. J. Asztalos,
Tim S. Axelrod,
Stephen Bailey,
D. R. Ballantyne,
Justin R. Bankert,
Wayne A. Barkhouse,
Jeffrey D. Barr,
L. Felipe Barrientos,
Aaron J. Barth,
James G. Bartlett,
Andrew C. Becker,
Jacek Becla,
Timothy C. Beers,
Joseph P. Bernstein,
Rahul Biswas,
Michael R. Blanton,
Joshua S. Bloom
, et al. (223 additional authors not shown)
Abstract:
A survey that can cover the sky in optical bands over wide fields to faint magnitudes with a fast cadence will enable many of the exciting science opportunities of the next decade. The Large Synoptic Survey Telescope (LSST) will have an effective aperture of 6.7 meters and an imaging camera with field of view of 9.6 deg^2, and will be devoted to a ten-year imaging survey over 20,000 deg^2 south of +15 deg. Each pointing will be imaged 2000 times with fifteen second exposures in six broad bands from 0.35 to 1.1 microns, to a total point-source depth of r~27.5. The LSST Science Book describes the basic parameters of the LSST hardware, software, and observing plans. The book discusses educational and outreach opportunities, then goes on to describe a broad range of science that LSST will revolutionize: mapping the inner and outer Solar System, stellar populations in the Milky Way and nearby galaxies, the structure of the Milky Way disk and halo and other objects in the Local Volume, transient and variable objects both at low and high redshift, and the properties of normal and active galaxies at low and high redshift. It then turns to far-field cosmological topics, exploring properties of supernovae to z~1, strong and weak lensing, the large-scale distribution of galaxies and baryon oscillations, and how these different probes may be combined to constrain cosmological models and the physics of dark energy.
Submitted 1 December, 2009;
originally announced December 2009.
-
Investigation of the Exclusive ^{3}He(e,e'pn)p Reaction
Authors:
D. G. Middleton,
J. R. M. Annand,
M. Ases Antelo,
C. Ayerbe,
P. Barneo,
D. Baumann,
J. Bermuth,
J. Bernauer,
H. P. Blok,
D. Bosnar,
R. Böhm,
M. Ding,
M. O. Distler,
J. Friedrich,
J. García Llongo,
D. I. Glazier,
J. Golak,
W. Glöckle,
P. Grabmayr,
T. Hehl,
J. Heim,
W. H. A. Hesselink,
E. Jans,
H. Kamada,
G. Jover Mañas
, et al. (24 additional authors not shown)
Abstract:
Cross sections for the ^{3}He(e,e'pn)p reaction were measured for the first time at energy transfers of 220 and 270 MeV for several momentum transfers ranging from 300 to 450 MeV/c. Cross sections are presented as a function of the momentum of the recoil proton and the momentum transfer. Continuum Faddeev calculations using the Argonne V18 and Bonn-B nucleon-nucleon potentials overestimate the measured cross sections by a factor of 5 at low recoil proton momentum, with the discrepancy becoming much smaller at higher recoil momentum.
Submitted 6 March, 2009;
originally announced March 2009.
-
Radio Detection of Radio-Quiet Galaxies
Authors:
J. A. Hodge,
R. H. Becker,
R. L. White,
W. H. de Vries
Abstract:
We investigate the radio emission of ~185,000 quiescent (optically unclassifiable) galaxies selected from the Sloan Digital Sky Survey (SDSS). By median-stacking FIRST cutouts centered on the optically-selected sources, we are able to reach flux densities down to the 10s of microJy. The quiescent galaxy sample is composed of two subgroups inhabiting vastly different regimes: those targeted for the SDSS MAIN Galaxy Sample (~55%), and those targeted for the Luminous Red Galaxy (LRG) sample (~45%). To investigate the star-formation rates (SFRs) of these quiescent galaxies, we calibrate a radio-SFR conversion using a third sample of star-forming galaxies. Comparing this SFR-indicator with indicators in the optical and UV, we derive conflicting SFR estimates for the MAIN sample quiescent galaxies. These radio-derived SFRs intersect those calculated using the 4000-Angstrom break (D4000) around an SFR of 1 Msun/yr and agree to within a factor of 3 over the range of SFRs. However, we find that the radio-derived SFRs are too high relative to the SFRs estimated for similar populations of galaxies using analysis of UV emission, implying either contamination of the radio by Active Galactic Nuclei (AGN) or incomplete dust modeling. If AGN activity is dominant in these galaxies, then a relation between AGN radio luminosity and galaxy mass is required to explain the observed trends. For the LRGs, on the other hand, we find the radio luminosity to be independent of SFR as derived from D4000, indicating an AGN component dominates their radio emission. AGN-based radio emission often implies the existence of radio jets, providing evidence of a mechanism for low-level feedback in these quiescent LRGs. (Abridged)
Submitted 25 June, 2008;
originally announced June 2008.
-
From Shock Breakout to Peak and Beyond: Extensive Panchromatic Observations of the Type Ib Supernova 2008D associated with Swift X-ray Transient 080109
Authors:
Maryam Modjaz,
W. Li,
N. Butler,
R. Chornock,
D. Perley,
S. Blondin,
J. S. Bloom,
A. V. Filippenko,
R. P. Kirshner,
D. Kocevski,
D. Poznanski,
M. Hicken,
R. J. Foley,
G. S. Stringfellow,
P. Berlind,
D. Barrado y Navascues,
C. H. Blake,
H. Bouy,
W. R. Brown,
P. Challis,
H. Chen,
W. H. de Vries,
P. Dufour,
E. Falco,
A. Friedman
, et al. (16 additional authors not shown)
Abstract:
We present extensive early photometric (ultraviolet through near-infrared) and spectroscopic (optical and near-infrared) data on supernova (SN) 2008D as well as X-ray data analysis on the associated Swift/X-ray transient (XRT) 080109. Our data span a time range of 5 hours before the detection of the X-ray transient to 150 days after its detection, and detailed analysis allowed us to derive constraints on the nature of the SN and its progenitor; throughout we draw comparisons with results presented in the literature and find several key aspects that differ. We show that the X-ray spectrum of XRT 080109 can be fit equally well by an absorbed power law or a superposition of about equal parts of both power law and blackbody. Our data first established that SN 2008D is a spectroscopically normal SN Ib (i.e., showing conspicuous He lines), and show that SN 2008D had a relatively long rise time of 18 days and a modest optical peak luminosity. The early-time light curves of the SN are dominated by a cooling stellar envelope (for Δt ~ 0.1-4 days, most pronounced in the blue bands) followed by ^56Ni decay. We construct a reliable measurement of the bolometric output for this stripped-envelope SN, and, combined with estimates of E_K and M_ej from the literature, estimate the stellar radius R_star of its probable Wolf-Rayet progenitor. According to the model of Waxman et al. and of Chevalier & Fransson, we derive R_star^{W07} = 1.2 +/- 0.7 R_sun and R_star^{CF08} = 12 +/- 7 R_sun, respectively; the latter being more in line with typical WN stars. Spectra obtained at 3 and 4 months after maximum light show double-peaked oxygen lines that we associate with departures from spherical symmetry, as has been suggested for the inner ejecta of a number of SN Ib cores.
Submitted 17 June, 2009; v1 submitted 15 May, 2008;
originally announced May 2008.
-
Star-Formation in Low Radio Luminosity AGN from the Sloan Digital Sky Survey
Authors:
W. H. de Vries,
J. A. Hodge,
R. H. Becker,
R. L. White,
D. J. Helfand
Abstract:
We investigate faint radio emission from low- to high-luminosity Active Galactic Nuclei (AGN) selected from the Sloan Digital Sky Survey (SDSS). Their radio properties are inferred by co-adding large ensembles of radio image cut-outs from the FIRST survey, as almost all of the sources are individually undetected. We correlate the median radio flux densities against a range of other sample properties, including median values for redshift, [OIII] luminosity, emission line ratios, and the strength of the 4000A break. We detect a strong trend for sources that are actively undergoing star-formation to have excess radio emission beyond the ~10^28 ergs/s/Hz level found for sources without any discernible star-formation. Furthermore, this additional radio emission correlates well with the strength of the 4000A break in the optical spectrum, and may be used to assess the age of the star-forming component. We examine two subsamples, one containing the systems with emission line ratios most like star-forming systems, and one with the sources that have characteristic AGN ratios. This division also separates the mechanism responsible for the radio emission (star-formation vs. AGN). For both cases we find a strong, almost identical, correlation between [OIII] and radio luminosity, with the AGN sample extending toward lower, and the star-formation sample toward higher luminosities. A clearer separation between the two subsamples is seen as a function of the central velocity dispersion of the host galaxy. For systems with similar redshifts and velocity dispersions, the star-formation subsample is brighter than the AGN in the radio by an order of magnitude. This underlines the notion that the radio emission in star-forming systems can dominate the emission associated with the AGN.
Submitted 16 April, 2007;
originally announced April 2007.
-
Image Ellipticity from Atmospheric Aberrations
Authors:
W. H. de Vries,
S. S. Olivier,
S. J. Asztalos,
L. J. Rosenberg,
K. L. Baker
Abstract:
We investigate the ellipticity of the point-spread function (PSF) produced by imaging an unresolved source with a telescope, subject to the effects of atmospheric turbulence. It is important to quantify these effects in order to understand the errors in shape measurements of astronomical objects, such as those used to study weak gravitational lensing of field galaxies. The PSF modeling involves either a Fourier transform of the phase information in the pupil plane or a ray-tracing approach, which has the advantage of requiring fewer computations than the Fourier transform. Using a standard method, involving the Gaussian weighted second moments of intensity, we then calculate the ellipticity of the PSF patterns. We find significant ellipticity for the instantaneous patterns (up to more than 10%). Longer exposures, which we approximate by combining multiple (N) images from uncorrelated atmospheric realizations, yield progressively lower ellipticity (as 1 / sqrt(N)). We also verify that the measured ellipticity does not depend on the sampling interval in the pupil plane using the Fourier method. However, we find that the results using the ray-tracing technique do depend on the pupil sampling interval, representing a gradual breakdown of the geometric approximation at high spatial frequencies. Therefore, ray tracing is generally not an accurate method of modeling PSF ellipticity induced by atmospheric turbulence unless some additional procedure is implemented to correctly account for the effects of high spatial frequency aberrations. The Fourier method, however, can be used directly to accurately model PSF ellipticity, which can give insights into errors in the statistics of field galaxy shapes used in studies of weak gravitational lensing.
Submitted 6 March, 2007;
originally announced March 2007.
-
First measurements of the ^16O(e,e'pn)^14N reaction
Authors:
D. G. Middleton,
J. R. M. Annand,
C. Barbieri,
P. Barneo,
P. Bartsch,
D. Bauman,
J. Bermuth,
D. Bosnar,
H. P. Blok,
R. Bohm,
M. Ding,
M. O. Distler,
D. Elsner,
J. Friedrich,
C. Giusti,
D. I. Glazier,
P. Grabmayr,
S. Grozinger,
T. Hehl,
J. Heim,
W. H. A. Hesselink,
E. Jans,
F. Klein,
M. Kohl,
L. Lapikas
, et al. (17 additional authors not shown)
Abstract:
This paper reports on the first measurement of the ^16O(e,e'pn)^14N reaction. Data were measured in kinematics centred on a super-parallel geometry at energy and momentum transfers of 215 MeV and 316 MeV/c. The experimental resolution was sufficient to distinguish groups of states in the residual nucleus but not good enough to separate individual states. The data show a strong dependence on missing momentum and this dependence appears to be different for two groups of states in the residual nucleus. Theoretical calculations of the reaction using the Pavia code do not reproduce the shape or the magnitude of the data.
Submitted 24 January, 2007;
originally announced January 2007.
-
Star formation in the hosts of GHz peaked spectrum and compact steep spectrum radio galaxies
Authors:
A. Labiano,
C. P. O'Dea,
P. D. Barthel,
W. H. de Vries,
S. A. Baum
Abstract:
AIMS: Search for star formation regions in the hosts of potentially young radio galaxies (Gigahertz Peaked Spectrum and Compact Steep Spectrum sources). METHODS: Near-UV imaging with the Hubble Space Telescope Advanced Camera for Surveys. RESULTS: We find near-UV light which could be the product of recent star formation in eight of the nine observed sources, though other explanations are not currently ruled out. The UV luminosities of the GPS and CSS sources are similar to those of a sample of nearby large scale radio galaxies. Stellar population synthesis models are consistent with a burst of recent star formation occurring before the formation of the radio source. However, observations at other wavelengths and colors are needed to definitively establish the nature of the observed UV light. In the CSS sources 1443+77 and 1814-637 the near-UV light is aligned with and is co-spatial with the radio source. We suggest that in these sources the UV light is produced by star formation triggered and/or enhanced by the radio source.
Submitted 7 November, 2007; v1 submitted 22 January, 2007;
originally announced January 2007.
-
Properties of Ellipticity Correlation with Atmospheric Structure from Gemini South
Authors:
S. Asztalos,
W. H. de Vries,
L. J. Rosenberg,
T. Treadway,
D. Burke,
C. Claver,
A. Saha,
P. Puxley
Abstract:
Cosmic shear holds great promise for a precision independent measurement of $\Omega_{\rm m}$, the mass density of the universe relative to the critical density. The signal is expected to be weak, so a thorough understanding of systematic effects is crucial. An important systematic effect is the atmosphere: shear power introduced by the atmosphere is larger than the expected signal. Algorithms exist to extract the cosmic shear from the atmospheric component, though a measure of their success applied to a range of seeing conditions is lacking.
To gain insight into atmospheric shear, Gemini South imaging in conjunction with ground condition and satellite wind data were obtained. We find that under good seeing conditions Point-Spread-Function (PSF) correlations persist well beyond the separation typical of high-latitude stars. Under these conditions, ellipticity residuals based on a simple PSF interpolation can be reduced to within a factor of a few of the shot-noise induced ellipticity floor. We also find that the ellipticity residuals are highly correlated with wind direction. Finally, we correct stellar shapes using a more sophisticated procedure and generate shear statistics from stars. Under all seeing conditions in our data set the residual correlations lie everywhere below the target signal level. For good seeing we find that the systematic error attributable to atmospheric turbulence is comparable in magnitude to the statistical error (shape noise) over angular scales relevant to present lensing surveys.
Submitted 5 January, 2007;
originally announced January 2007.
-
Measurement of tensor analyzing powers in deuteron photodisintegration
Authors:
I. A. Rachek,
L. M. Barkov,
S. L. Belostotsky,
V. F. Dmitriev,
M. V. Dyug,
R. Gilman,
R. J. Holt,
B. A. Lazarenko,
S. I. Mishnev,
V. V. Nelyubin,
D. M. Nikolenko,
A. V. Osipov,
D. H. Potterveld,
R. Sh. Sadykov,
Yu. V. Shestakov,
V. N. Stibunov,
D. K. Toporkov,
H. de Vries,
S. A. Zevakov
Abstract:
A new accurate measurement of the tensor analyzing powers T20, T21 and T22 in deuteron photodisintegration has been performed. Wide-aperture non-magnetic detectors made it possible to cover broad kinematic ranges in a single setup: photon energy = 25 to 600 MeV, proton emission angle in CM = 24 to 48 deg. and 70 to 102 deg. The new data significantly improve on the few existing measurements. The angular dependency of the tensor asymmetries in deuteron photodisintegration is extracted for the first time.
Submitted 10 April, 2007; v1 submitted 16 November, 2006;
originally announced November 2006.
-
GPS radio sources: new optical observations and an updated master list
Authors:
A. Labiano,
P. D. Barthel,
C. P. O'Dea,
W. H. de Vries,
I. Pérez,
S. A. Baum
Abstract:
* Aims. Identify optical counterparts, address uncertain identifications and measure previously unknown redshifts of the host galaxies of candidate GPS radio sources, and study their stellar populations. * Methods. Long slit spectroscopy and deep optical imaging in the B, V and R bands, obtained with the Very Large Telescope. * Results. We obtain new redshifts for B0316+161, B0407-658, B0904+039, B1433-040, and identify the optical counterparts of B0008-421 and B0742+103. We confirm the previous identification for B0316+161, B0407-658, B0554-026, and B0904+039, and find that the previous identification for B0914+114 is incorrect. Using updated published radio spectral information we classify as non GPS the following sources: B0407-658, B0437-454, B1648+015. The optical colors of typical GPS sources are consistent with single instantaneous burst stellar population models but do not yield useful information on age or metallicity. A new master list of GPS sources is presented.
Submitted 19 November, 2006;
originally announced November 2006.
-
Star formation in hosts of young radio galaxies
Authors:
A. Labiano,
C. P. O'Dea,
P. D. Barthel,
W. H. de Vries,
S. A. Baum
Abstract:
We present near ultraviolet imaging with the Hubble Space Telescope Advanced Camera for Surveys, targeting young radio galaxies (Gigahertz Peaked Spectrum and Compact Steep Spectrum sources), in search of star formation regions in their hosts. We find near UV light which could be the product of recent star formation in eight of the nine observed sources. However, observations at other wavelengths and colors are needed to definitively establish the nature of the observed UV light. In the CSS sources 1443+77 and 1814-637 the near UV light is aligned with and is co-spatial with the radio source, and we suggest that in these sources the UV light is produced by star formation triggered and/or enhanced by the radio source.
Submitted 2 December, 2005;
originally announced December 2005.
-
Double Lobed Radio Quasars from the Sloan Digital Sky Survey
Authors:
W. H. de Vries,
R. H. Becker,
R. L. White
Abstract:
We have combined a sample of 44984 quasars, selected from the Sloan Digital Sky Survey (SDSS) Data Release 3, with the FIRST radio survey. Using a novel technique where the optical quasar position is matched to the complete radio environment within 450", we are able to characterize the radio morphological make-up of what is essentially an optically selected quasar sample, regardless of whether the quasar (nucleus) itself has been detected in the radio. About 10% of the quasar population have radio cores brighter than 0.75 mJy at 1.4 GHz, and 1.7% have double lobed FR2-like radio morphologies. About 75% of the FR2 sources have a radio core (> 0.75 mJy). A significant fraction (~40%) of the FR2 quasars are bent by more than 10 degrees, indicating interactions of the radio plasma with either the ICM or the IGM. We found no evidence for correlations with redshift among our FR2 quasars: radio lobe flux densities and radio source diameters of the quasars have similar distributions at low (mean 0.77) and high (mean 2.09) redshifts. Using a smaller high reliability FR2 sample of 422 quasars and two comparison samples of radio-quiet and non-FR2 radio-loud quasars, matched in their redshift distributions, we constructed composite optical spectra from the SDSS spectroscopic data. Based on these spectra we can conclude that the FR2 quasars have stronger high-ionization emission lines compared to both the radio-quiet and non-FR2 radio-loud sources. This is consistent with the notion that the emission lines are brightened by ongoing shock ionization of ambient gas in the quasar host as the radio source expands.
Submitted 26 October, 2005;
originally announced October 2005.
-
Atomic hydrogen in the one-sided "compact double" radio galaxy 2050+364
Authors:
R. C. Vermeulen,
A. Labiano,
P. D. Barthel,
S. A. Baum,
W. H. de Vries,
C. P. O'Dea
Abstract:
European VLBI Network spectral imaging of the "compact double" radio source 2050+364 in the UHF band at 1049 MHz has resolved the HI absorbing region, and has shown a faint continuum component to the North (N), in addition to the well-known East-West double (E, W). Re-examination of VLBI continuum images at multiple frequencies suggests that 2050+364 may well be a one-sided core-jet source, which appears as a double over a limited frequency range. One of the dominant features, W, would then be the innermost visible portion of the jet, and could be at or adjacent to the canonical radio core. The other, E, is probably related to shocks at a sudden bend of the jet, towards extended steep-spectrum region N. A remarkably deep and narrow HI absorption line component extends over the entire projected extent of 2050+364. It coincides in velocity with the [OIII] optical doublet lines to within 10 km/s. This HI absorption could arise in the atomic cores of NLR clouds, and the motion in the NLR is then remarkably coherent both along the line-of-sight and across a projected distance of > 300 pc on the plane of the sky. Broader, shallower HI absorption at lower velocities covers only the plausible core area W. This absorption could be due to gas which is either being entrained by the inner jet or is flowing out from the accretion region; it could be related to the BLR.
Submitted 14 October, 2005;
originally announced October 2005.
-
HST/STIS low dispersion spectroscopy of three Compact Steep Spectrum sources: Evidence for jet-cloud interaction
Authors:
A. Labiano,
C. P. O'Dea,
R. Gelderman,
W. H. de Vries,
D. J. Axon,
P. D. Barthel,
S. A. Baum,
A. Capetti,
R. Fanti,
A. M. Koekemoer,
R. Morganti,
C. N. Tadhunter
Abstract:
We present Hubble Space Telescope Imaging Spectrograph long-slit spectroscopy of the emission line nebulae in the compact steep spectrum radio sources 3C 67, 3C 277.1, and 3C 303.1. We derive BPT (Baldwin-Phillips-Terlevich; Baldwin et al. 1981) diagnostic emission line ratios for the nebulae which are consistent with a mix of shock excitation and photoionization in the extended gas. In addition, line ratios indicative of lower ionization gas are found to be associated with higher gas velocities. The results are consistent with a picture in which these galaxy scale radio sources interact with dense clouds in the interstellar medium of the host galaxies, shocking the clouds and thereby ionizing and accelerating them.
Submitted 14 April, 2005;
originally announced April 2005.