subscribe to arXiv mailings

4-LEGS: 4D Language Embedded Gaussian Splatting

Authors: Gal Fiebelman, Tamir Cohen, Ayellet Morgenstern, Peter Hedman, Hadar Averbuch-Elor

Abstract: The emergence of neural representations has revolutionized our means for digitally viewing a wide range of 3D scenes, enabling the synthesis of photorealistic images rendered from novel views. Recently, several techniques have been proposed for connecting these low-level representations with the high-level semantics understanding embodied within the scene. These methods elevate the rich semantic u… ▽ More The emergence of neural representations has revolutionized our means for digitally viewing a wide range of 3D scenes, enabling the synthesis of photorealistic images rendered from novel views. Recently, several techniques have been proposed for connecting these low-level representations with the high-level semantics understanding embodied within the scene. These methods elevate the rich semantic understanding from 2D imagery to 3D representations, distilling high-dimensional spatial features onto 3D space. In our work, we are interested in connecting language with a dynamic modeling of the world. We show how to lift spatio-temporal features to a 4D representation based on 3D Gaussian Splatting. This enables an interactive interface where the user can spatiotemporally localize events in the video from text prompts. We demonstrate our system on public 3D video datasets of people and animals performing various actions. △ Less

Submitted 15 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

Comments: Project webpage: https://tau-vailab.github.io/4-LEGS/

arXiv:2410.08105 [pdf, other]

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Authors: Kunhao Zheng, Juliette Decugis, Jonas Gehring, Taco Cohen, Benjamin Negrevergne, Gabriel Synnaeve

Abstract: Prompting techniques such as chain-of-thought have established themselves as a popular vehicle for improving the outputs of large language models (LLMs). For code generation, however, their exact mechanics and efficacy are under-explored. We thus investigate the effects of a wide range of prompting strategies with a focus on automatic re-prompting over multiple turns and computational requirements… ▽ More Prompting techniques such as chain-of-thought have established themselves as a popular vehicle for improving the outputs of large language models (LLMs). For code generation, however, their exact mechanics and efficacy are under-explored. We thus investigate the effects of a wide range of prompting strategies with a focus on automatic re-prompting over multiple turns and computational requirements. After systematically decomposing reasoning, instruction, and execution feedback prompts, we conduct an extensive grid search on the competitive programming benchmarks CodeContests and TACO for multiple LLM families and sizes (Llama 3.0 and 3.1, 8B, 70B, 405B, and GPT-4o). Our study reveals strategies that consistently improve performance across all models with small and large sampling budgets. We then show how finetuning with such an optimal configuration allows models to internalize the induced reasoning process and obtain improvements in performance and scalability for multi-turn code generation. △ Less

Submitted 12 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

arXiv:2410.05569 [pdf, other]

The Hypertriton Puzzle in Relativistic Heavy-Ion Collisions

Authors: Thomas Cohen, Maneesha Pradeep

Abstract: The yields of hadrons and light nuclei in relativistic collisions of heavy-nuclei at a center of mass energy of 2.6 TeV can be described remarkably well by a thermal distribution of an ideal gas of hadrons and light nuclei interacting only via the decay of resonances. Given the particularly small binding energy of hypertritons relative to the temperature describing the yields (about 156 MeV), one… ▽ More The yields of hadrons and light nuclei in relativistic collisions of heavy-nuclei at a center of mass energy of 2.6 TeV can be described remarkably well by a thermal distribution of an ideal gas of hadrons and light nuclei interacting only via the decay of resonances. Given the particularly small binding energy of hypertritons relative to the temperature describing the yields (about 156 MeV), one might naturally expect hypertrions to dissociate in medium, making the agreement of hypertriton yields with thermal predictions highly puzzling. The puzzle is compounded by the fact that small binding energy is associated with the large size of the hypertriton. This size is on a similar scale to the overall size of the fireball and much larger than the length scale over which temperatures in the fireball vary over phenomenologically relevant amounts. This paper quantifies the tension this effect causes and shows that it is sufficiently large to render the thermal model inconsistent: its natural assumptions are in conflict with its outputs. The possibility that hypertritons are formed at freeze out as compact objects, quark droplets, that subsequently evolve into hypertritons is considered as a way to resolve the puzzle. It is noted that beyond making the assumption that compact quark droplets form, additional detailed dynamical assumptions which have not been justified are needed to make the thermal model work. The issue of why, despite these issues, the hypertriton is well described by a simple statistical description at freeze out is unresolved. Resolving the hypertriton puzzle is important as it may clarify whether the phenomenological success of the simple thermal model for yields accurately reflects the simple picture of the underlying physics on which it is based. △ Less

Submitted 7 October, 2024; originally announced October 2024.

Comments: 21 pages

arXiv:2410.02089 [pdf, other]

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Authors: Jonas Gehring, Kunhao Zheng, Jade Copet, Vegard Mella, Taco Cohen, Gabriel Synnaeve

Abstract: Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their generations in any feedback obtained to reliably achieve desired outcomes. We propose an end-to-end reinforcement learning method for teaching models to leverage execution feedback in the realm of code sy… ▽ More Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their generations in any feedback obtained to reliably achieve desired outcomes. We propose an end-to-end reinforcement learning method for teaching models to leverage execution feedback in the realm of code synthesis, where state-of-the-art LLMs struggle to improve code iteratively compared to independent sampling. We benchmark on competitive programming tasks, where we achieve new start-of-the art results with both small (8B parameters) and large (70B) models while reducing the amount of samples required by an order of magnitude. Our analysis of inference-time behavior demonstrates that our method produces LLMs that effectively leverage automatic feedback over multiple steps. △ Less

Submitted 2 October, 2024; originally announced October 2024.

arXiv:2409.00112 [pdf, ps, other]

Toward Large Language Models as a Therapeutic Tool: Comparing Prompting Techniques to Improve GPT-Delivered Problem-Solving Therapy

Authors: Daniil Filienko, Yinzhou Wang, Caroline El Jazmi, Serena Xie, Trevor Cohen, Martine De Cock, Weichao Yuwen

Abstract: While Large Language Models (LLMs) are being quickly adapted to many domains, including healthcare, their strengths and pitfalls remain under-explored. In our study, we examine the effects of prompt engineering to guide Large Language Models (LLMs) in delivering parts of a Problem-Solving Therapy (PST) session via text, particularly during the symptom identification and assessment phase for person… ▽ More While Large Language Models (LLMs) are being quickly adapted to many domains, including healthcare, their strengths and pitfalls remain under-explored. In our study, we examine the effects of prompt engineering to guide Large Language Models (LLMs) in delivering parts of a Problem-Solving Therapy (PST) session via text, particularly during the symptom identification and assessment phase for personalized goal setting. We present evaluation results of the models' performances by automatic metrics and experienced medical professionals. We demonstrate that the models' capability to deliver protocolized therapy can be improved with the proper use of prompt engineering methods, albeit with limitations. To our knowledge, this study is among the first to assess the effects of various prompting techniques in enhancing a generalist model's ability to deliver psychotherapy, focusing on overall quality, consistency, and empathy. Exploring LLMs' potential in delivering psychotherapy holds promise with the current shortage of mental health professionals amid significant needs, enhancing the potential utility of AI-based and AI-enhanced care services. △ Less

Submitted 27 August, 2024; originally announced September 2024.

Comments: Accepted for AMIA 2024 proceedings

arXiv:2408.12920 [pdf, other]

Cylindrical Cavity Expansion: A Novel Method for Characterizing the Mechanical Properties of Soft Materials

Authors: Jian Li, Zihao Xie, Hannah Varner, Chockalingam Senthilnathan, Tal Cohen

Abstract: The low elastic modulus of soft materials, combined with geometric nonlinearity and rate dependence, presents significant challenges in the characterization of their mechanical response. We introduce a novel method for measuring the mechanical properties of soft materials under large deformations via cylindrical cavity expansion. In this method, a cylindrical cavity is fabricated in the material a… ▽ More The low elastic modulus of soft materials, combined with geometric nonlinearity and rate dependence, presents significant challenges in the characterization of their mechanical response. We introduce a novel method for measuring the mechanical properties of soft materials under large deformations via cylindrical cavity expansion. In this method, a cylindrical cavity is fabricated in the material and expanded by volume-controlled injection of an incompressible fluid with simultaneous measurement of the applied pressure at the cavity wall. The relationship between applied pressure and deformation at the cavity wall is then employed to characterize the nonlinear mechanical properties. We demonstrate the feasibility of the proposed method and validate it by measuring the mechanical properties of synthetic polydimethylsiloxane (PDMS) and comparing with reported values in the literature. Results indicate that the cylindrical cavitation method effectively captures the response of PDMS over a wide range of stiffness (shear modulus ranging from 5 kPa to 300 kPa) and exhibit high repeatability. The proposed method overcomes limitations in characterization of ultra-soft materials using traditional testing methods, such as challenges with fabrication and clamping in unaxial tension testing and friction and adhesion effects in compression and indentation testing, thus enabling accurate and precise characterization. It also offers improved accuracy and repeatability over other needle induced cavity expansion methods due to precise control over the initial cavity dimension and shape at the cost of increased invasiveness of testing. △ Less

Submitted 16 September, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

arXiv:2408.10166 [pdf, other]

The NANOGrav 15 yr Data Set: Running of the Spectral Index

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Jeremy George Baier, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Heling Deng, Lankeswar Dey, Timothy Dolch , et al. (80 additional authors not shown)

Abstract: The NANOGrav 15-year data provides compelling evidence for a stochastic gravitational-wave (GW) background at nanohertz frequencies. The simplest model-independent approach to characterizing the frequency spectrum of this signal consists in a simple power-law fit involving two parameters: an amplitude A and a spectral index γ. In this paper, we consider the next logical step beyond this minimal sp… ▽ More The NANOGrav 15-year data provides compelling evidence for a stochastic gravitational-wave (GW) background at nanohertz frequencies. The simplest model-independent approach to characterizing the frequency spectrum of this signal consists in a simple power-law fit involving two parameters: an amplitude A and a spectral index γ. In this paper, we consider the next logical step beyond this minimal spectral model, allowing for a running (i.e., logarithmic frequency dependence) of the spectral index, γ_run(f) = γ+ β\ln(f/f_ref). We fit this running-power-law (RPL) model to the NANOGrav 15-year data and perform a Bayesian model comparison with the minimal constant-power-law (CPL) model, which results in a 95% credible interval for the parameter βconsistent with no running, β\in [-0.80,2.96], and an inconclusive Bayes factor, B(RPL vs. CPL) = 0.69 +- 0.01. We thus conclude that, at present, the minimal CPL model still suffices to adequately describe the NANOGrav signal; however, future data sets may well lead to a measurement of nonzero β. Finally, we interpret the RPL model as a description of primordial GWs generated during cosmic inflation, which allows us to combine our results with upper limits from big-bang nucleosynthesis, the cosmic microwave background, and LIGO-Virgo-KAGRA. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: 17 pages, 4 figures, 2 tables

arXiv:2407.20510 [pdf, other]

The NANOGrav 15 yr data set: Posterior predictive checks for gravitational-wave detection with pulsar timing arrays

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Jeremy George Baier, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Heling Deng, Lankeswar Dey , et al. (77 additional authors not shown)

Abstract: Pulsar-timing-array experiments have reported evidence for a stochastic background of nanohertz gravitational waves consistent with the signal expected from a population of supermassive--black-hole binaries. Those analyses assume power-law spectra for intrinsic pulsar noise and for the background, as well as a Hellings--Downs cross-correlation pattern among the gravitational-wave--induced residual… ▽ More Pulsar-timing-array experiments have reported evidence for a stochastic background of nanohertz gravitational waves consistent with the signal expected from a population of supermassive--black-hole binaries. Those analyses assume power-law spectra for intrinsic pulsar noise and for the background, as well as a Hellings--Downs cross-correlation pattern among the gravitational-wave--induced residuals across pulsars. These assumptions are idealizations that may not be realized in actuality. We test them in the NANOGrav 15 yr data set using Bayesian posterior predictive checks: after fitting our fiducial model to real data, we generate a population of simulated data-set replications, and use them to assess whether the optimal-statistic significance, inter-pulsar correlations, and spectral coefficients assume extreme values for the real data when compared to the replications. We confirm that the NANOGrav 15 yr data set is consistent with power-law and Hellings--Downs assumptions. We also evaluate the evidence for the stochastic background using posterior-predictive versions of the frequentist optimal statistic and of Bayesian model comparison, and find comparable significance (3.2\ $σ$ and 3\ $σ$ respectively) to what was previously reported for the standard statistics. We conclude with novel visualizations of the reconstructed gravitational waveforms that enter the residuals for each pulsar. Our analysis strengthens confidence in the identification and characterization of the gravitational-wave background as reported by NANOGrav. △ Less

Submitted 29 July, 2024; originally announced July 2024.

Comments: 20 pages, 14 Figures

arXiv:2407.17477 [pdf]

Toward Automated Detection of Biased Social Signals from the Content of Clinical Conversations

Authors: Feng Chen, Manas Satish Bedmutha, Ray-Yuan Chung, Janice Sabin, Wanda Pratt, Brian R. Wood, Nadir Weibel, Andrea L. Hartzler, Trevor Cohen

Abstract: Implicit bias can impede patient-provider interactions and lead to inequities in care. Raising awareness is key to reducing such bias, but its manifestations in the social dynamics of patient-provider communication are difficult to detect. In this study, we used automated speech recognition (ASR) and natural language processing (NLP) to identify social signals in patient-provider interactions. We… ▽ More Implicit bias can impede patient-provider interactions and lead to inequities in care. Raising awareness is key to reducing such bias, but its manifestations in the social dynamics of patient-provider communication are difficult to detect. In this study, we used automated speech recognition (ASR) and natural language processing (NLP) to identify social signals in patient-provider interactions. We built an automated pipeline to predict social signals from audio recordings of 782 primary care visits that achieved 90.1% average accuracy across codes, and exhibited fairness in its predictions for white and non-white patients. Applying this pipeline, we identified statistically significant differences in provider communication behavior toward white versus non-white patients. In particular, providers expressed more patient-centered behaviors towards white patients including more warmth, engagement, and attentiveness. Our study underscores the potential of automated tools in identifying subtle communication signals that may be linked with bias and impact healthcare quality and equity. △ Less

Submitted 30 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

Comments: Accepted by AMIA 2024 Annual Symposium

arXiv:2407.13982 [pdf, other]

Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance

Authors: Changye Li, Trevor Cohen, Serguei Pakhomov

Abstract: Automatic speech recognition (ASR) models trained on large amounts of audio data are now widely used to convert speech to written text in a variety of applications from video captioning to automated assistants used in healthcare and other domains. As such, it is important that ASR models and their use is fair and equitable. Prior work examining the performance of commercial ASR systems on the Corp… ▽ More Automatic speech recognition (ASR) models trained on large amounts of audio data are now widely used to convert speech to written text in a variety of applications from video captioning to automated assistants used in healthcare and other domains. As such, it is important that ASR models and their use is fair and equitable. Prior work examining the performance of commercial ASR systems on the Corpus of Regional African American Language (CORAAL) demonstrated significantly worse ASR performance on African American English (AAE). The current study seeks to understand the factors underlying this disparity by examining the performance of the current state-of-the-art neural network based ASR system (Whisper, OpenAI) on the CORAAL dataset. Two key findings have been identified as a result of the current study. The first confirms prior findings of significant dialectal variation even across neighboring communities, and worse ASR performance on AAE that can be improved to some extent with fine-tuning of ASR models. The second is a novel finding not discussed in prior work on CORAAL: differences in audio recording practices within the dataset have a significant impact on ASR accuracy resulting in a ``confounding by provenance'' effect in which both language use and recording quality differ by study location. These findings highlight the need for further systematic investigation to disentangle the effects of recording quality and inherent linguistic diversity when examining the fairness and bias present in neural ASR models, as any bias in ASR accuracy may have negative downstream effects on disparities in various domains of life in which ASR technology is used. △ Less

Submitted 18 July, 2024; originally announced July 2024.

arXiv:2407.08581 [pdf, other]

Operator Origin of Anomalous Dimensions in de Sitter Space

Authors: Timothy Cohen, Daniel Green, Yiwen Huang

Abstract: The late time limit of the power spectrum for heavy (principal series) fields in de Sitter space yields a series of polynomial terms with complex scaling dimensions. Such scaling behavior is expected to result from an associated operator with a complex dimension. In a free theory, these complex dimensions are known to match the constraints imposed by unitarity on the space of states. Yet, perturba… ▽ More The late time limit of the power spectrum for heavy (principal series) fields in de Sitter space yields a series of polynomial terms with complex scaling dimensions. Such scaling behavior is expected to result from an associated operator with a complex dimension. In a free theory, these complex dimensions are known to match the constraints imposed by unitarity on the space of states. Yet, perturbative corrections to the scaling behavior of operators are naively inconsistent with unitary evolution of the quantum fields in dS. This paper demonstrates how to compute one-loop corrections to the scaling dimensions that appear in the two point function from the field theory description in terms of local operators. We first show how to evaluate these anomalous dimensions using Mellin space, which has the feature that it naturally accommodates a scaleless regulator. We then explore the consequences for the Soft de Sitter Effective Theory (SdSET) description that emerges in the long wavelength limit. Carefully matching between the UV and SdSET descriptions requires the introduction of novel non-dynamical "operators" in the effective theory. This is not only necessary to reproduce results extracted from the Källén-Lehmann representation (that use the space of unitary states directly), but it is also required by general arguments that invoke positivity. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 42 pages, 5 figures

arXiv:2406.18314 [pdf, other]

ContactNet: Geometric-Based Deep Learning Model for Predicting Protein-Protein Interactions

Authors: Matan Halfon, Tomer Cohen, Raanan Fattal, Dina Schneidman-Duhovny

Abstract: Deep learning approaches achieved significant progress in predicting protein structures. These methods are often applied to protein-protein interactions (PPIs) yet require Multiple Sequence Alignment (MSA) which is unavailable for various interactions, such as antibody-antigen. Computational docking methods are capable of sampling accurate complex models, but also produce thousands of invalid conf… ▽ More Deep learning approaches achieved significant progress in predicting protein structures. These methods are often applied to protein-protein interactions (PPIs) yet require Multiple Sequence Alignment (MSA) which is unavailable for various interactions, such as antibody-antigen. Computational docking methods are capable of sampling accurate complex models, but also produce thousands of invalid configurations. The design of scoring functions for identifying accurate models is a long-standing challenge. We develop a novel attention-based Graph Neural Network (GNN), ContactNet, for classifying PPI models obtained from docking algorithms into accurate and incorrect ones. When trained on docked antigen and modeled antibody structures, ContactNet doubles the accuracy of current state-of-the-art scoring functions, achieving accurate models among its Top-10 at 43% of the test cases. When applied to unbound antibodies, its Top-10 accuracy increases to 65%. This performance is achieved without MSA and the approach is applicable to other types of interactions, such as host-pathogens or general PPIs. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.03540 [pdf, other]

Recursion for Wilson-line Form Factors

Authors: Timothy Cohen, Marc Riembau

Abstract: Matrix elements of Wilson-line dressed operators play a central role in the factorization of soft and collinear modes in gauge theories. When expressed using spinor helicity variables, these so-called form factors admit a classification starting from a Maximally Helicity Violating configuration, in close analogy with gauge theory amplitudes. We show that a single-line complex momentum shift can be… ▽ More Matrix elements of Wilson-line dressed operators play a central role in the factorization of soft and collinear modes in gauge theories. When expressed using spinor helicity variables, these so-called form factors admit a classification starting from a Maximally Helicity Violating configuration, in close analogy with gauge theory amplitudes. We show that a single-line complex momentum shift can be used to derive recursion relations that efficiently compute these helicity form factors at tree-level: a combination of lower point form factors and on-shell amplitudes serve as the input building blocks. We obtain novel compact expressions for the $1\to 2$ and $1\to 3$ splitting functions in QCD, which also serves to validate our methods. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 28 pages + appendix

Report number: CERN-TH-2024-069

arXiv:2406.02830 [pdf, other]

Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies

Authors: Changye Li, Zhecheng Sheng, Trevor Cohen, Serguei Pakhomov

Abstract: As artificial neural networks grow in complexity, understanding their inner workings becomes increasingly challenging, which is particularly important in healthcare applications. The intrinsic evaluation metrics of autoregressive neural language models (NLMs), perplexity (PPL), can reflect how "surprised" an NLM model is at novel input. PPL has been widely used to understand the behavior of NLMs.… ▽ More As artificial neural networks grow in complexity, understanding their inner workings becomes increasingly challenging, which is particularly important in healthcare applications. The intrinsic evaluation metrics of autoregressive neural language models (NLMs), perplexity (PPL), can reflect how "surprised" an NLM model is at novel input. PPL has been widely used to understand the behavior of NLMs. Previous findings show that changes in PPL when masking attention layers in pre-trained transformer-based NLMs reflect linguistic anomalies associated with Alzheimer's disease dementia. Building upon this, we explore a novel bidirectional attention head ablation method that exhibits properties attributed to the concepts of cognitive and brain reserve in human brain studies, which postulate that people with more neurons in the brain and more efficient processing are more resilient to neurodegeneration. Our results show that larger GPT-2 models require a disproportionately larger share of attention heads to be masked/ablated to display degradation of similar magnitude to masking in smaller models. These results suggest that the attention mechanism in transformer models may present an analogue to the notions of cognitive and brain reserve and could potentially be used to model certain aspects of the progression of neurodegenerative disorders and aging. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: Accepted to ACL 2024 findings

arXiv:2405.10294 [pdf, ps, other]

Corrections to adiabatic behavior for long paths

Authors: Thomas D. Cohen, Hyunwoo Oh

Abstract: The cost and the error of the adiabatic theorem for preparing the final eigenstate are discussed in terms of path length. Previous studies in terms of the norm of the Hamiltonian and its derivatives with the spectral gap are limited in their ability to describe the cost of adiabatic state preparation for certain physically large systems. We argue that total time is not a good measure for determini… ▽ More The cost and the error of the adiabatic theorem for preparing the final eigenstate are discussed in terms of path length. Previous studies in terms of the norm of the Hamiltonian and its derivatives with the spectral gap are limited in their ability to describe the cost of adiabatic state preparation for certain physically large systems. We argue that total time is not a good measure for determining the computational difficulty of adiabatic quantum computation by developing a no-go theorem. From the result of time-periodic Hamiltonian cases, we suggest that there are proxies for computational cost which typically grow as path length increases when the error is kept fixed and small and consider possible conjectures on how general the behavior is. △ Less

Submitted 29 August, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: 11 pages, 0 figures

arXiv:2405.09462 [pdf, other]

Zeno Effect Suppression of Gauge Drift in Quantum Simulations

Authors: Carter Ball, Thomas D. Cohen

Abstract: Quantum simulation of lattice gauge theories is a promising tool for the study of many complicated problems including ones with real-time dynamics. For gauge theories, however, there is a major challenge in maintaining gauge invariance during time evolution. Such theories have a full Hilbert space that is larger than the physical space -- the set of states which are gauge invariant or equivalently… ▽ More Quantum simulation of lattice gauge theories is a promising tool for the study of many complicated problems including ones with real-time dynamics. For gauge theories, however, there is a major challenge in maintaining gauge invariance during time evolution. Such theories have a full Hilbert space that is larger than the physical space -- the set of states which are gauge invariant or equivalently respect the Gauss law. While an exact implementation of Hamiltonian dynamics starting in the physical Hilbert space will keep the system in the physical space, various types of errors will inevitably produce components outside of it. This work proposes a method of suppressing this gauge drift via the Zeno effect. As in the standard picture of the Zeno effect, our method relies on frequent projection onto the physical subspace. Additionally, a technique is discussed to reduce the speed of the gauge drift, which helps to reduce the required frequency of projections. We demonstrate our method on a $\mathbb{Z}_2$ gauge theory toy model. △ Less

Submitted 25 July, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.03865 [pdf, other]

Information-driven Affordance Discovery for Efficient Robotic Manipulation

Authors: Pietro Mazzaglia, Taco Cohen, Daniel Dijkman

Abstract: Robotic affordances, providing information about what actions can be taken in a given situation, can aid robotic manipulation. However, learning about affordances requires expensive large annotated datasets of interactions or demonstrations. In this work, we argue that well-directed interactions with the environment can mitigate this problem and propose an information-based measure to augment the… ▽ More Robotic affordances, providing information about what actions can be taken in a given situation, can aid robotic manipulation. However, learning about affordances requires expensive large annotated datasets of interactions or demonstrations. In this work, we argue that well-directed interactions with the environment can mitigate this problem and propose an information-based measure to augment the agent's objective and accelerate the affordance discovery process. We provide a theoretical justification of our approach and we empirically validate the approach both in simulation and real-world tasks. Our method, which we dub IDA, enables the efficient discovery of visual affordances for several action primitives, such as grasping, stacking objects, or opening drawers, strongly improving data efficiency in simulation, and it allows us to learn grasping affordances in a small number of interactions, on a real-world setup with a UFACTORY XArm 6 robot arm. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2308.14915

arXiv:2404.16960 [pdf, other]

doi 10.1039/D4SM00573B

Explaining the spread in measurement of PDMS elastic properties: influence of test method and curing protocol

Authors: Hannah Varner, Tal Cohen

Abstract: Accuracy in the measurement of mechanical properties is essential for precision engineering and for the interrogation of composition-property relationships. Conventional methods of mechanical testing, such as uniaxial tension, compression, and nanoindentation, provide highly repeatable and reliable results for stiff materials, for which they were originally developed. However, when applied to the… ▽ More Accuracy in the measurement of mechanical properties is essential for precision engineering and for the interrogation of composition-property relationships. Conventional methods of mechanical testing, such as uniaxial tension, compression, and nanoindentation, provide highly repeatable and reliable results for stiff materials, for which they were originally developed. However, when applied to the characterization of soft and biological materials, the same cannot be said, and the spread of reported properties of similar materials is vast. Polydimethylsiloxane (PDMS), commonly obtained from Dow as SYLGARD 184, is a ubiquitous such material, which has been integral to the rapid development of biocompatible microfluidic devices and flexible electronics in recent decades. However, reported shear moduli of this material range over 2 orders of magnitude for similar chemical compositions. Taking advantage of the increased mechanical scrutiny afforded to SYLGARD 184 in recent years, we combine both published and new experimental data obtained using 9 mechanical test methods. A statistical analysis then elucidates the significant bias induced by the test method itself, and distinguishes this bias from the influence of curing protocols on the mechanical properties. The goal of this work is thus two-fold: (i) it provides a quantitative understanding of the different factors that influence reported properties of this particular material, and (ii) it serves as a cautionary tale. As researchers in the field of mechanics strive to quantify the properties of increasingly complex soft and biological materials, converging on a standardized measurement of PDMS is a necessary first step. △ Less

Submitted 19 July, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.15036 [pdf, ps, other]

An Accessible Instrument for Measuring Soft Material Mechanical Properties

Authors: B. M. Unikewicz, A. M. Pincot, T. Cohen

Abstract: Soft material research has seen significant growth in recent years, with emerging applications in robotics, electronics, and healthcare diagnostics where understanding material mechanical response is crucial for precision design. Traditional methods for measuring nonlinear mechanical properties of soft materials require specially sized samples that are extracted from their natural environment to b… ▽ More Soft material research has seen significant growth in recent years, with emerging applications in robotics, electronics, and healthcare diagnostics where understanding material mechanical response is crucial for precision design. Traditional methods for measuring nonlinear mechanical properties of soft materials require specially sized samples that are extracted from their natural environment to be mounted on the testing instrument. This has been shown to compromise data accuracy and precision in various soft and biological materials. To overcome this, the Volume Controlled Cavity Expansion (VCCE) method was developed. This technique tests soft materials by controlling the formation rate of a liquid cavity inside the materials at the tip of an injection needle, and simultaneously measuring the resisting pressure which describes the material response. Despite VCCE's early successes, expansion of its application beyond academia has been hindered by cost, size, and expertise. In response to this, the first portable, bench-top instrument utilizing VCCE is presented here. This device, built with affordable, readily available components and open-source software, streamlines VCCE experimentation without sacrificing performance or precision. It is especially suitable for space-limited settings and designed for use by non-experts, promoting widespread adoption. The instrument's efficacy was demonstrated through testing Polydimethylsiloxane (PDMS) samples of varying stiffness. This study not only validates instrument performance, but also sets the stage for further advancements and broader applications in soft material testing. All data, along with acquisition, control, and post-processing scripts, are made available on GitHub. △ Less

Submitted 23 September, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.07020 [pdf, other]

The NANOGrav 15 yr Data Set: Looking for Signs of Discreteness in the Gravitational-wave Background

Authors: Gabriella Agazie, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Lucas Brown, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Paul B. Demorest, Heling Deng, Timothy Dolch, Elizabeth C. Ferrara, William Fiore, Emmanuel Fonseca, Gabriel E. Freedman, Nate Garver-Daniels , et al. (58 additional authors not shown)

Abstract: The cosmic merger history of supermassive black hole binaries (SMBHBs) is expected to produce a low-frequency gravitational wave background (GWB). Here we investigate how signs of the discrete nature of this GWB can manifest in pulsar timing arrays through excursions from, and breaks in, the expected $f_{\mathrm{GW}}^{-2/3}$ power-law of the GWB strain spectrum. To do this, we create a semi-analyt… ▽ More The cosmic merger history of supermassive black hole binaries (SMBHBs) is expected to produce a low-frequency gravitational wave background (GWB). Here we investigate how signs of the discrete nature of this GWB can manifest in pulsar timing arrays through excursions from, and breaks in, the expected $f_{\mathrm{GW}}^{-2/3}$ power-law of the GWB strain spectrum. To do this, we create a semi-analytic SMBHB population model, fit to NANOGrav's 15 yr GWB amplitude, and with 1,000 realizations we study the populations' characteristic strain and residual spectra. Comparing our models to the NANOGrav 15 yr spectrum, we find two interesting excursions from the power-law. The first, at $2 \; \mathrm{nHz}$, is below our GWB realizations with $p$-value significance $p = 0.05$ to $0.06$ ($\approx 1.8 σ- 1.9 σ$). The second, at $16 \; \mathrm{nHz}$, is above our GWB realizations with $p = 0.04$ to $0.15$ ($\approx 1.4 σ- 2.1 σ$). We explore the properties of a loud SMBHB which could cause such an excursion. Our simulations also show that the expected number of SMBHBs decreases by three orders of magnitude, from $\sim 10^6$ to $\sim 10^3$, between $2\; \mathrm{nHz}$ and $20 \; \mathrm{nHz}$. This causes a break in the strain spectrum as the stochasticity of the background breaks down at $26^{+28}_{-19} \; \mathrm{nHz}$, consistent with predictions pre-dating GWB measurements. The diminished GWB signal from SMBHBs at frequencies above the $26$~nHz break opens a window for PTAs to detect continuous GWs from individual SMBHBs or GWs from the early universe. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 10 pages, 8 figures, 1 appendix, submitted to ApJ

arXiv:2402.15867 [pdf, ps, other]

An Invitation to Analytic Group Theory

Authors: Tal Cohen, Tsachik Gelander

Abstract: This book is concerned with analytic approaches of studying groups and their actions. Much attention is devoted to the study of amenability and Kazhdan's property (T), which are perhaps the most important analytic properties of a group, but we also discuss other analytic notions. We tried to introduce tricks, ideas and lemmas that repeatedly turn out to be useful in various situations. Our main gu… ▽ More This book is concerned with analytic approaches of studying groups and their actions. Much attention is devoted to the study of amenability and Kazhdan's property (T), which are perhaps the most important analytic properties of a group, but we also discuss other analytic notions. We tried to introduce tricks, ideas and lemmas that repeatedly turn out to be useful in various situations. Our main guideline was to expose the beauty of the theory and to present many different aspects of it while keeping the text short, simple and accessible, sometimes at the expense of diving deep or providing thorough expositions. Hopefully this book could serve as a smooth entry to Analytic Group Theory. △ Less

Submitted 24 February, 2024; originally announced February 2024.

arXiv:2402.04858 [pdf, other]

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Authors: Natasha Butt, Blazej Manczak, Auke Wiggers, Corrado Rainone, David W. Zhang, Michaël Defferrard, Taco Cohen

Abstract: Large language models are increasingly solving tasks that are commonly believed to require human-level reasoning ability. However, these models still perform very poorly on benchmarks of general intelligence such as the Abstraction and Reasoning Corpus (ARC). In this paper, we approach ARC as a programming-by-examples problem, and introduce a novel and scalable method for language model self-impro… ▽ More Large language models are increasingly solving tasks that are commonly believed to require human-level reasoning ability. However, these models still perform very poorly on benchmarks of general intelligence such as the Abstraction and Reasoning Corpus (ARC). In this paper, we approach ARC as a programming-by-examples problem, and introduce a novel and scalable method for language model self-improvement called Code Iteration (CodeIt). Our method iterates between 1) program sampling and hindsight relabeling, and 2) learning from prioritized experience replay. By relabeling the goal of an episode (i.e., the target program output given input) to the realized output produced by the sampled program, our method effectively deals with the extreme sparsity of rewards in program synthesis. Applying CodeIt to the ARC dataset, we demonstrate that prioritized hindsight replay, along with pre-training and data-augmentation, leads to successful inter-task generalization. CodeIt is the first neuro-symbolic approach that scales to the full ARC evaluation dataset. Our method solves 15% of ARC evaluation tasks, achieving state-of-the-art performance and outperforming existing neural and symbolic baselines. Our code is available at https://github.com/Qualcomm-AI-research/codeit . △ Less

Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

Comments: ICML'24 camera-ready version

arXiv:2401.05551 [pdf, other]

Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification?

Authors: Changye Li, Weizhe Xu, Trevor Cohen, Serguei Pakhomov

Abstract: \textbf{Objectives}: We aimed to investigate how errors from automatic speech recognition (ASR) systems affect dementia classification accuracy, specifically in the ``Cookie Theft'' picture description task. We aimed to assess whether imperfect ASR-generated transcripts could provide valuable information for distinguishing between language samples from cognitively healthy individuals and those wit… ▽ More \textbf{Objectives}: We aimed to investigate how errors from automatic speech recognition (ASR) systems affect dementia classification accuracy, specifically in the ``Cookie Theft'' picture description task. We aimed to assess whether imperfect ASR-generated transcripts could provide valuable information for distinguishing between language samples from cognitively healthy individuals and those with Alzheimer's disease (AD). \textbf{Methods}: We conducted experiments using various ASR models, refining their transcripts with post-editing techniques. Both these imperfect ASR transcripts and manually transcribed ones were used as inputs for the downstream dementia classification. We conducted comprehensive error analysis to compare model performance and assess ASR-generated transcript effectiveness in dementia classification. \textbf{Results}: Imperfect ASR-generated transcripts surprisingly outperformed manual transcription for distinguishing between individuals with AD and those without in the ``Cookie Theft'' task. These ASR-based models surpassed the previous state-of-the-art approach, indicating that ASR errors may contain valuable cues related to dementia. The synergy between ASR and classification models improved overall accuracy in dementia classification. \textbf{Conclusion}: Imperfect ASR transcripts effectively capture linguistic anomalies linked to dementia, improving accuracy in classification tasks. This synergy between ASR and classification models underscores ASR's potential as a valuable tool in assessing cognitive impairment and related clinical applications. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: To appear on Journal of Biomedical Informatics

arXiv:2401.04194 [pdf, other]

On interpretation of fluctuations of conserved charges at high T

Authors: T. D. Cohen, L. Ya. Glozman

Abstract: Fluctuations of conserved charges calculated on the lattice which can be measured experimentally, are well reproduced by a hadron resonanse gas model at temperatures below T_{ch} ~ 155 MeV and radically deviate from the hadron resonance gas predictions above the chiral restoration crossover. This behavior is typically interpreted as an indication of deconfinement in the quark-gluon plasma regime.… ▽ More Fluctuations of conserved charges calculated on the lattice which can be measured experimentally, are well reproduced by a hadron resonanse gas model at temperatures below T_{ch} ~ 155 MeV and radically deviate from the hadron resonance gas predictions above the chiral restoration crossover. This behavior is typically interpreted as an indication of deconfinement in the quark-gluon plasma regime. We present an argument that this interpretation may be too simple. The argument is based on the scaling of quantities with the number of colors: demonstration of deconfinement and QGP requires observable that is sensitive to N_c^2 gluons while the conserved charges are sensitive only to quarks and above T_{ch} scale as N_c^1. The latter scaling is consistent with the existence of an intermediate regime characterized by restored chiral symmetry and by approximate chiral spin symmetry which is a symmetry of confining interaction. In this regime the energy density, pressure and entropy density scale as N_c^1. In the large N_c limit this regime might become a distinct phase separated from the hadron gas and from QGP by phase transitions. A natural observable that associates with deconfinement and is directly sensitive to deconfined N_c^2-1 gluons is the Polyakov loop; in the N_c=3 world it remains very close to 0 at temperatures well above chiral crossover, reaches the value 0.5 around 3T_{ch} and the value close to 1 at temperatures ~1 GeV. △ Less

Submitted 4 August, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 5 pages; final version accepted by EPJA

arXiv:2401.03631 [pdf]

Bridging the Skills Gap: Evaluating an AI-Assisted Provider Platform to Support Care Providers with Empathetic Delivery of Protocolized Therapy

Authors: William R. Kearns, Jessica Bertram, Myra Divina, Lauren Kemp, Yinzhou Wang, Alex Marin, Trevor Cohen, Weichao Yuwen

Abstract: Despite the high prevalence and burden of mental health conditions, there is a global shortage of mental health providers. Artificial Intelligence (AI) methods have been proposed as a way to address this shortage, by supporting providers with less extensive training as they deliver care. To this end, we developed the AI-Assisted Provider Platform (A2P2), a text-based virtual therapy interface that… ▽ More Despite the high prevalence and burden of mental health conditions, there is a global shortage of mental health providers. Artificial Intelligence (AI) methods have been proposed as a way to address this shortage, by supporting providers with less extensive training as they deliver care. To this end, we developed the AI-Assisted Provider Platform (A2P2), a text-based virtual therapy interface that includes a response suggestion feature, which supports providers in delivering protocolized therapies empathetically. We studied providers with and without expertise in mental health treatment delivering a therapy session using the platform with (intervention) and without (control) AI-assistance features. Upon evaluation, the AI-assisted system significantly decreased response times by 29.34% (p=0.002), tripled empathic response accuracy (p=0.0001), and increased goal recommendation accuracy by 66.67% (p=0.001) across both user groups compared to the control. Both groups rated the system as having excellent usability. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: Accepted: AMIA Annual Symposium 2023. To appear as: Kearns W, Bertram J, Divina M, Kemp L, Wang Y, Marin A, Cohen T, Yuwen W. Bridging the Skills Gap: Evaluating an AI-Assisted Provider Platform to Support Care Providers with Empathetic Delivery of Protocolized Therapy. AMIA Annual Symposium Proceedings 2023. American Medical Informatics Association

arXiv:2312.07511 [pdf, other]

A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems

Authors: Alexandre Duval, Simon V. Mathis, Chaitanya K. Joshi, Victor Schmidt, Santiago Miret, Fragkiskos D. Malliaros, Taco Cohen, Pietro Liò, Yoshua Bengio, Michael Bronstein

Abstract: Recent advances in computational modelling of atomic systems, spanning molecules, proteins, and materials, represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space. In these graphs, the geometric attributes transform according to the inherent physical symmetries of 3D atomic systems, including rotations and translations in Euclidean space, as well as node permutations.… ▽ More Recent advances in computational modelling of atomic systems, spanning molecules, proteins, and materials, represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space. In these graphs, the geometric attributes transform according to the inherent physical symmetries of 3D atomic systems, including rotations and translations in Euclidean space, as well as node permutations. In recent years, Geometric Graph Neural Networks have emerged as the preferred machine learning architecture powering applications ranging from protein structure prediction to molecular simulations and material generation. Their specificity lies in the inductive biases they leverage - such as physical symmetries and chemical properties - to learn informative representations of these geometric graphs. In this opinionated paper, we provide a comprehensive and self-contained overview of the field of Geometric GNNs for 3D atomic systems. We cover fundamental background material and introduce a pedagogical taxonomy of Geometric GNN architectures: (1) invariant networks, (2) equivariant networks in Cartesian basis, (3) equivariant networks in spherical basis, and (4) unconstrained networks. Additionally, we outline key datasets and application areas and suggest future research directions. The objective of this work is to present a structured perspective on the field, making it accessible to newcomers and aiding practitioners in gaining an intuition for its mathematical abstractions. △ Less

Submitted 13 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

arXiv:2312.06748 [pdf, other]

On Amplitudes and Field Redefinitions

Authors: Timothy Cohen, Xiaochuan Lu, Dave Sutherland

Abstract: We derive an off-shell recursion relation for correlators that holds at all loop orders. This allows us to prove how generalized amplitudes transform under generic field redefinitions, starting from an assumed behavior of the one-particle-irreducible effective action. The form of the recursion relation resembles the operation of raising the rank of a tensor by acting with a covariant derivative. T… ▽ More We derive an off-shell recursion relation for correlators that holds at all loop orders. This allows us to prove how generalized amplitudes transform under generic field redefinitions, starting from an assumed behavior of the one-particle-irreducible effective action. The form of the recursion relation resembles the operation of raising the rank of a tensor by acting with a covariant derivative. This inspires a geometric interpretation, whose features and flaws we investigate. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 50 pages

Report number: CERN-TH-2023-233

arXiv:2312.05435 [pdf, other]

Enhancing Robustness of Foundation Model Representations under Provenance-related Distribution Shifts

Authors: Xiruo Ding, Zhecheng Sheng, Brian Hur, Feng Chen, Serguei V. S. Pakhomov, Trevor Cohen

Abstract: Foundation models are a current focus of attention in both industry and academia. While they have shown their capabilities in a variety of tasks, in-depth research is required to determine their robustness to distribution shift when used as a basis for supervised machine learning. This is especially important in the context of clinical data, with particular limitations related to data accessibilit… ▽ More Foundation models are a current focus of attention in both industry and academia. While they have shown their capabilities in a variety of tasks, in-depth research is required to determine their robustness to distribution shift when used as a basis for supervised machine learning. This is especially important in the context of clinical data, with particular limitations related to data accessibility, lack of pretraining materials, and limited availability of high-quality annotations. In this work, we examine the stability of models based on representations from foundation models under distribution shift. We focus on confounding by provenance, a form of distribution shift that emerges in the context of multi-institutional datasets when there are differences in source-specific language use and class distributions. Using a sampling strategy that synthetically induces varying degrees of distribution shift, we evaluate the extent to which representations from foundation models result in predictions that are inherently robust to confounding by provenance. Additionally, we examine the effectiveness of a straightforward confounding adjustment method inspired by Pearl's conception of backdoor adjustment. Results indicate that while foundation models do show some out-of-the-box robustness to confounding-by-provenance related distribution shifts, this can be considerably improved through adjustment. These findings suggest a need for deliberate adjustment of predictive models using representations from foundation models in the context of source-specific distributional differences. △ Less

Submitted 8 December, 2023; originally announced December 2023.

Comments: Accepted in Workshop on Distribution Shifts, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2312.03881 [pdf, other]

FoMo Rewards: Can we cast foundation models as reward functions?

Authors: Ekdeep Singh Lubana, Johann Brehmer, Pim de Haan, Taco Cohen

Abstract: We explore the viability of casting foundation models as generic reward functions for reinforcement learning. To this end, we propose a simple pipeline that interfaces an off-the-shelf vision model with a large language model. Specifically, given a trajectory of observations, we infer the likelihood of an instruction describing the task that the user wants an agent to perform. We show that this ge… ▽ More We explore the viability of casting foundation models as generic reward functions for reinforcement learning. To this end, we propose a simple pipeline that interfaces an off-the-shelf vision model with a large language model. Specifically, given a trajectory of observations, we infer the likelihood of an instruction describing the task that the user wants an agent to perform. We show that this generic likelihood function exhibits the characteristics ideally expected from a reward function: it associates high values with the desired behaviour and lower values for several similar, but incorrect policies. Overall, our work opens the possibility of designing open-ended agents for interactive tasks via foundation models. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: Accepted to NeurIPS FMDM workshop

arXiv:2311.09481 [pdf, other]

Personalized Jargon Identification for Enhanced Interdisciplinary Communication

Authors: Yue Guo, Joseph Chee Chang, Maria Antoniak, Erin Bransom, Trevor Cohen, Lucy Lu Wang, Tal August

Abstract: Scientific jargon can impede researchers when they read materials from other domains. Current methods of jargon identification mainly use corpus-level familiarity indicators (e.g., Simple Wikipedia represents plain language). However, researchers' familiarity of a term can vary greatly based on their own background. We collect a dataset of over 10K term familiarity annotations from 11 computer sci… ▽ More Scientific jargon can impede researchers when they read materials from other domains. Current methods of jargon identification mainly use corpus-level familiarity indicators (e.g., Simple Wikipedia represents plain language). However, researchers' familiarity of a term can vary greatly based on their own background. We collect a dataset of over 10K term familiarity annotations from 11 computer science researchers for terms drawn from 100 paper abstracts. Analysis of this data reveals that jargon familiarity and information needs vary widely across annotators, even within the same sub-domain (e.g., NLP). We investigate features representing individual, sub-domain, and domain knowledge to predict individual jargon familiarity. We compare supervised and prompt-based approaches, finding that prompt-based methods including personal publications yields the highest accuracy, though zero-shot prompting provides a strong baseline. This research offers insight into features and methods to integrate personal data into scientific jargon identification. △ Less

Submitted 15 November, 2023; originally announced November 2023.

arXiv:2311.07333 [pdf, ps, other]

Large $N_c$ QCD phase diagram at $μ_B = 0$

Authors: T. D. Cohen, L. Ya. Glozman

Abstract: Lattice studies suggest that at zero baryon chemical potential and increasing temperature there are three characteristic regimes in QCD that are connected by smooth analytical crossovers: a hadron gas regime at T < T_ch ~ 155 MeV, an intermediate regime, called stringy fluid, at T_ch < T < ~ 3 T_ch, and a quark-gluon plasma regime at higher temperatures. These regimes have been interpreted to refl… ▽ More Lattice studies suggest that at zero baryon chemical potential and increasing temperature there are three characteristic regimes in QCD that are connected by smooth analytical crossovers: a hadron gas regime at T < T_ch ~ 155 MeV, an intermediate regime, called stringy fluid, at T_ch < T < ~ 3 T_ch, and a quark-gluon plasma regime at higher temperatures. These regimes have been interpreted to reflect different approximate symmetries and effective degrees of freedom. In the hadron gas the effective degrees of freedom are hadrons and the approximate chiral symmetry of QCD is spontaneously broken. The intermediate regime has been interpreted as lacking spontaneous chiral symmetry breaking along with the emergence of new approximate symmetry, chiral spin symmetry, that is not a symmetry of the Dirac Lagrangian, but is a symmetry of the confining part of the QCD Lagrangian. While the high temperature regime is the usual quark-gluon plasma which is often considered to reflect "deconfinement" in some way. This paper explores the behavior of these regimes of QCD as the number of colors in the theory, N_c, gets large. In the large N_c limit the theory is center-symmetric and notions of confinement and deconfinement are unambiguous. The energy density is O(N_c^0) in the meson gas, O(N_c^1) in the intermediate regime and O(N_c^2) in the quark-gluon plasma regime. In the large N_c limit these regimes may become distinct phases separated by first order phase transitions. The intermediate phase has the peculiar feature that glueballs should exist and have properties that are unchanged from what is seen in the vacuum (up to 1/N_c corrections), while the ordinary dilute gas of mesons with broken chiral symmetry disappears and approximate chiral spin symmetry should emerge. △ Less

Submitted 20 August, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: 10 pages. Final version accepted by EPJA

arXiv:2311.04744 [pdf, other]

Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers

Authors: Pim de Haan, Taco Cohen, Johann Brehmer

Abstract: The Geometric Algebra Transformer (GATr) is a versatile architecture for geometric deep learning based on projective geometric algebra. We generalize this architecture into a blueprint that allows one to construct a scalable transformer architecture given any geometric (or Clifford) algebra. We study versions of this architecture for Euclidean, projective, and conformal algebras, all of which are… ▽ More The Geometric Algebra Transformer (GATr) is a versatile architecture for geometric deep learning based on projective geometric algebra. We generalize this architecture into a blueprint that allows one to construct a scalable transformer architecture given any geometric (or Clifford) algebra. We study versions of this architecture for Euclidean, projective, and conformal algebras, all of which are suited to represent 3D data, and evaluate them in theory and practice. The simplest Euclidean architecture is computationally cheap, but has a smaller symmetry group and is not as sample-efficient, while the projective model is not sufficiently expressive. Both the conformal algebra and an improved version of the projective algebra define powerful, performant architectures. △ Less

Submitted 14 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: Accepted to AISTATS 2024

arXiv:2310.19229 [pdf, other]

Efficient vacuum state preparation for quantum simulation of strongly interacting local quantum field theories

Authors: Thomas D. Cohen, Hyunwoo Oh

Abstract: We present an efficient approach for preparing ground states in the context of strongly interacting local quantum field theories on quantum computers. The approach produces the vacuum state in a time proportional to the square-root of the volume, which is a square-root improvement in speed compared to traditional approaches. The approach exploits a novel method for traversing the path in parameter… ▽ More We present an efficient approach for preparing ground states in the context of strongly interacting local quantum field theories on quantum computers. The approach produces the vacuum state in a time proportional to the square-root of the volume, which is a square-root improvement in speed compared to traditional approaches. The approach exploits a novel method for traversing the path in parameter space in which the resources scale linearly with a path length suitably defined in parameter space. Errors due to practical limitations are controlled and do not exhibit secular growth along the path. The final accuracy can be arbitrarily improved with an additive cost, which is independent of the volume and grows slower than logarithmically with the overlap between the state produced and the exact ground state. We expect that the method could potentially hold practical value not only within the realm of quantum field theories but also in addressing other challenges involving long path lengths. △ Less

Submitted 1 March, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

Comments: 6 pages, 3 figures

arXiv:2310.13731 [pdf, other]

doi 10.1007/JHEP04(2024)070

Dark Sector Glueballs at the LHC

Authors: Austin Batz, Timothy Cohen, David Curtin, Caleb Gemmell, Graham D. Kribs

Abstract: We study confining dark sectors where the lightest hadrons are glueballs. Such models can provide viable dark matter candidates and appear in some neutral naturalness scenarios. In this work, we introduce a new phenomenological model of dark glueball hadronization inspired by the Lund string model. This enables us to make realistic predictions for dark glueball phenomenology at the LHC for the fir… ▽ More We study confining dark sectors where the lightest hadrons are glueballs. Such models can provide viable dark matter candidates and appear in some neutral naturalness scenarios. In this work, we introduce a new phenomenological model of dark glueball hadronization inspired by the Lund string model. This enables us to make realistic predictions for dark glueball phenomenology at the LHC for the first time. Our model reproduces the expected thermal distribution of hadron species as an emergent consequence of hadronization dynamics. The ability to predict the production of glueball states heavier than the lightest species significantly expands the reach of long-lived glueball searches in MATHUSLA compared to previous simplified estimates. We also characterize regions of parameter space where emerging and/or semivisible jets could arise from pure-glue dark sectors, thereby providing new benchmark models that motivate searches for these signatures. △ Less

Submitted 15 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: 27 pages + appendices + references, 11 + 4 figures; journal edits, including new section on total hadron multiplicities

Report number: CERN-TH-2023-194

Journal ref: JHEP04(2024)070

arXiv:2310.12138 [pdf, other]

The NANOGrav 15-year data set: Search for Transverse Polarization Modes in the Gravitational-Wave Background

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Jeremy Baier, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Dallas DeGan, Paul B. Demorest , et al. (74 additional authors not shown)

Abstract: Recently we found compelling evidence for a gravitational wave background with Hellings and Downs (HD) correlations in our 15-year data set. These correlations describe gravitational waves as predicted by general relativity, which has two transverse polarization modes. However, more general metric theories of gravity can have additional polarization modes which produce different interpulsar correl… ▽ More Recently we found compelling evidence for a gravitational wave background with Hellings and Downs (HD) correlations in our 15-year data set. These correlations describe gravitational waves as predicted by general relativity, which has two transverse polarization modes. However, more general metric theories of gravity can have additional polarization modes which produce different interpulsar correlations. In this work we search the NANOGrav 15-year data set for evidence of a gravitational wave background with quadrupolar Hellings and Downs (HD) and Scalar Transverse (ST) correlations. We find that HD correlations are the best fit to the data, and no significant evidence in favor of ST correlations. While Bayes factors show strong evidence for a correlated signal, the data does not strongly prefer either correlation signature, with Bayes factors $\sim 2$ when comparing HD to ST correlations, and $\sim 1$ for HD plus ST correlations to HD correlations alone. However, when modeled alongside HD correlations, the amplitude and spectral index posteriors for ST correlations are uninformative, with the HD process accounting for the vast majority of the total signal. Using the optimal statistic, a frequentist technique that focuses on the pulsar-pair cross-correlations, we find median signal-to-noise-ratios of 5.0 for HD and 4.6 for ST correlations when fit for separately, and median signal-to-noise-ratios of 3.5 for HD and 3.0 for ST correlations when fit for simultaneously. While the signal-to-noise-ratios for each of the correlations are comparable, the estimated amplitude and spectral index for HD are a significantly better fit to the total signal, in agreement with our Bayesian analysis. △ Less

Submitted 18 October, 2023; originally announced October 2023.

Comments: 11 pages, 5 figures

arXiv:2310.03232 [pdf, other]

Deep Representations of First-person Pronouns for Prediction of Depression Symptom Severity

Authors: Xinyang Ren, Hannah A Burkhardt, Patricia A Areán, Thomas D Hull, Trevor Cohen

Abstract: Prior work has shown that analyzing the use of first-person singular pronouns can provide insight into individuals' mental status, especially depression symptom severity. These findings were generated by counting frequencies of first-person singular pronouns in text data. However, counting doesn't capture how these pronouns are used. Recent advances in neural language modeling have leveraged metho… ▽ More Prior work has shown that analyzing the use of first-person singular pronouns can provide insight into individuals' mental status, especially depression symptom severity. These findings were generated by counting frequencies of first-person singular pronouns in text data. However, counting doesn't capture how these pronouns are used. Recent advances in neural language modeling have leveraged methods generating contextual embeddings. In this study, we sought to utilize the embeddings of first-person pronouns obtained from contextualized language representation models to capture ways these pronouns are used, to analyze mental status. De-identified text messages sent during online psychotherapy with weekly assessment of depression severity were used for evaluation. Results indicate the advantage of contextualized first-person pronoun embeddings over standard classification token embeddings and frequency-based pronoun analysis results in predicting depression symptom severity. This suggests contextual representations of first-person pronouns can enhance the predictive utility of language used by people with depression symptoms. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: Accepted: AMIA Annual Symposium 2023. To appear as: Ren X, Burkhardt H, Areán P, Hull T, Cohen T. Deep Representations of First-person Pronouns for Prediction of Depression Symptom Severity. AMIA Annual Symposium Proceedings 2023. American Medical Informatics Association

arXiv:2310.02451 [pdf, other]

Backdoor Adjustment of Confounding by Provenance for Robust Text Classification of Multi-institutional Clinical Notes

Authors: Xiruo Ding, Zhecheng Sheng, Meliha Yetişgen, Serguei Pakhomov, Trevor Cohen

Abstract: Natural Language Processing (NLP) methods have been broadly applied to clinical tasks. Machine learning and deep learning approaches have been used to improve the performance of clinical NLP. However, these approaches require sufficiently large datasets for training, and trained models have been shown to transfer poorly across sites. These issues have led to the promotion of data collection and in… ▽ More Natural Language Processing (NLP) methods have been broadly applied to clinical tasks. Machine learning and deep learning approaches have been used to improve the performance of clinical NLP. However, these approaches require sufficiently large datasets for training, and trained models have been shown to transfer poorly across sites. These issues have led to the promotion of data collection and integration across different institutions for accurate and portable models. However, this can introduce a form of bias called confounding by provenance. When source-specific data distributions differ at deployment, this may harm model performance. To address this issue, we evaluate the utility of backdoor adjustment for text classification in a multi-site dataset of clinical notes annotated for mentions of substance abuse. Using an evaluation framework devised to measure robustness to distributional shifts, we assess the utility of backdoor adjustment. Our results indicate that backdoor adjustment can effectively mitigate for confounding shift. △ Less

Submitted 3 October, 2023; originally announced October 2023.

Comments: Accepted in AMIA 2023 Annual Symposium

arXiv:2310.02411 [pdf, other]

doi 10.1093/mnras/stad3745

Pre-supernova outbursts by core magnetic activity

Authors: Tamar Cohen, Noam Soker

Abstract: We conduct one-dimensional stellar evolutionary numerical simulations under the assumption that an efficient dynamo operates in the core of massive stars years to months before core collapse and find that the magnetic activity enhances mass loss rate and might trigger binary interaction that leads to outbursts. We assume that the magnetic flux tubes that the dynamo forms in the inner core buoy out… ▽ More We conduct one-dimensional stellar evolutionary numerical simulations under the assumption that an efficient dynamo operates in the core of massive stars years to months before core collapse and find that the magnetic activity enhances mass loss rate and might trigger binary interaction that leads to outbursts. We assume that the magnetic flux tubes that the dynamo forms in the inner core buoy out to the outer core where there is a steep entropy rise and a molecular weight drop. There the magnetic fields turn to thermal energy, i.e., by reconnection. We simulate this energy deposition where the entropy steeply rises and find that for our simulated cases the envelope radius increases by a factor of 1.2-2 and luminosity by about an order of magnitude. These changes enhance the mass loss rate. The envelope expansion can trigger a binary interaction that powers an outburst. Because magnetic field amplification depends positively on the core rotation rate and operates in cycles, not in all cases the magnetic activity will be powerful enough to change envelope properties. Namely, only a fraction of core-collapse supernovae experiences pre-explosion outbursts. △ Less

Submitted 1 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

Comments: Accepted for publication in MNRAS

Journal ref: 2024MNRAS.52710025C

arXiv:2310.01559 [pdf, other]

Mechanical Forces Quench Frontal Polymerization: Experiments and Theory

Authors: Xuanhe Li, Tal Cohen

Abstract: Frontal polymerization is a promising energy-saving method for rapid fabrication of polymer components with good mechanical properties. In these systems, a small energy input is sufficient to convert monomers, from a liquid or soft solid state, into a stiff polymer component. Once the reaction is initiated, it propagates as a self-sustaining front that is driven by the heat released from the react… ▽ More Frontal polymerization is a promising energy-saving method for rapid fabrication of polymer components with good mechanical properties. In these systems, a small energy input is sufficient to convert monomers, from a liquid or soft solid state, into a stiff polymer component. Once the reaction is initiated, it propagates as a self-sustaining front that is driven by the heat released from the reaction itself. While several studies have been proposed to capture the coupling between thermodynamics and extreme chemical kinetics in these systems, and can explain experimentally observed thermo-chemical instabilities, only few have considered the potential influence of mechanical forces that develop in these systems during fabrication. Nonetheless, some experiments do indicate that local volume changes induced by the competing effects of thermal expansion and chemical shrinkage, can lead to significant deformation or even failure in the resulting component. In this work, we present a unique experimental approach to elucidate the effect of mechanics on the propagation. Our experiments reveal that residual stresses that arise in frontal polymerization are not only a potential cause of undesired deformations in polymer products, but can also quench the reaction front. This thermo-chemo-mechanically coupled effect is captured by our theoretical model, which explains the mechanical limitations on frontal polymerization and can guide future fabrication. Overall, the findings of this work suggest that mechanical coupling needs to be taken into consideration to enable industrial applications of frontal polymerization at large scales. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.17438 [pdf, other]

The NANOGrav 12.5-year data set: A computationally efficient eccentric binary search pipeline and constraints on an eccentric supermassive binary candidate in 3C 66B

Authors: Gabriella Agazie, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Harsha Blumer, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Belinda D. Cheeseboro, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Paul B. Demorest, Lankeswar Dey, Timothy Dolch, Justin A. Ellis, Robert D. Ferdman, Elizabeth C. Ferrara , et al. (63 additional authors not shown)

Abstract: The radio galaxy 3C 66B has been hypothesized to host a supermassive black hole binary (SMBHB) at its center based on electromagnetic observations. Its apparent 1.05-year period and low redshift ($\sim0.02$) make it an interesting testbed to search for low-frequency gravitational waves (GWs) using Pulsar Timing Array (PTA) experiments. This source has been subjected to multiple searches for contin… ▽ More The radio galaxy 3C 66B has been hypothesized to host a supermassive black hole binary (SMBHB) at its center based on electromagnetic observations. Its apparent 1.05-year period and low redshift ($\sim0.02$) make it an interesting testbed to search for low-frequency gravitational waves (GWs) using Pulsar Timing Array (PTA) experiments. This source has been subjected to multiple searches for continuous GWs from a circular SMBHB, resulting in progressively more stringent constraints on its GW amplitude and chirp mass. In this paper, we develop a pipeline for performing Bayesian targeted searches for eccentric SMBHBs in PTA data sets, and test its efficacy by applying it on simulated data sets with varying injected signal strengths. We also search for a realistic eccentric SMBHB source in 3C 66B using the NANOGrav 12.5-year data set employing PTA signal models containing Earth term-only as well as Earth+Pulsar term contributions using this pipeline. Due to limitations in our PTA signal model, we get meaningful results only when the initial eccentricity $e_0<0.5$ and the symmetric mass ratio $η>0.1$. We find no evidence for an eccentric SMBHB signal in our data, and therefore place 95% upper limits on the PTA signal amplitude of $88.1\pm3.7$ ns for the Earth term-only and $81.74\pm0.86$ ns for the Earth+Pulsar term searches for $e_0<0.5$ and $η>0.1$. Similar 95% upper limits on the chirp mass are $(1.98 \pm 0.05) \times 10^9\,M_{\odot}$ and $(1.81 \pm 0.01) \times 10^9\,M_{\odot}$. These upper limits, while less stringent than those calculated from a circular binary search in the NANOGrav 12.5-year data set, are consistent with the SMBHB model of 3C 66B developed from electromagnetic observations. △ Less

Submitted 15 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

Comments: 27 Pages, 10 Figures, 1 Table, Accepted for publication in ApJ

arXiv:2309.04443 [pdf, other]

doi 10.3847/1538-4357/ad09e4

How to Detect an Astrophysical Nanohertz Gravitational-Wave Background

Authors: Bence Bécsy, Neil J. Cornish, Patrick M. Meyers, Luke Zoltan Kelley, Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Tyler Cohen, James M. Cordes, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch , et al. (71 additional authors not shown)

Abstract: Analysis of pulsar timing data have provided evidence for a stochastic gravitational wave background in the nHz frequency band. The most plausible source of such a background is the superposition of signals from millions of supermassive black hole binaries. The standard statistical techniques used to search for such a background and assess its significance make several simplifying assumptions, nam… ▽ More Analysis of pulsar timing data have provided evidence for a stochastic gravitational wave background in the nHz frequency band. The most plausible source of such a background is the superposition of signals from millions of supermassive black hole binaries. The standard statistical techniques used to search for such a background and assess its significance make several simplifying assumptions, namely: i) Gaussianity; ii) isotropy; and most often iii) a power-law spectrum. However, a stochastic background from a finite collection of binaries does not exactly satisfy any of these assumptions. To understand the effect of these assumptions, we test standard analysis techniques on a large collection of realistic simulated datasets. The dataset length, observing schedule, and noise levels were chosen to emulate the NANOGrav 15-year dataset. Simulated signals from millions of binaries drawn from models based on the Illustris cosmological hydrodynamical simulation were added to the data. We find that the standard statistical methods perform remarkably well on these simulated datasets, despite their fundamental assumptions not being strictly met. They are able to achieve a confident detection of the background. However, even for a fixed set of astrophysical parameters, different realizations of the universe result in a large variance in the significance and recovered parameters of the background. We also find that the presence of loud individual binaries can bias the spectral recovery of the background if we do not account for them. △ Less

Submitted 1 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

Comments: 14 pages, 8 figures, version matching published paper

Journal ref: ApJ 959 9 (2023)

arXiv:2309.00693 [pdf, other]

Comparing recent PTA results on the nanohertz stochastic gravitational wave background

Authors: The International Pulsar Timing Array Collaboration, G. Agazie, J. Antoniadis, A. Anumarlapudi, A. M. Archibald, P. Arumugam, S. Arumugam, Z. Arzoumanian, J. Askew, S. Babak, M. Bagchi, M. Bailes, A. -S. Bak Nielsen, P. T. Baker, C. G. Bassa, A. Bathula, B. Bécsy, A. Berthereau, N. D. R. Bhat, L. Blecha, M. Bonetti, E. Bortolas, A. Brazier, P. R. Brook, M. Burgay , et al. (220 additional authors not shown)

Abstract: The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTA… ▽ More The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTAs that constitute the International Pulsar Timing Array (IPTA). We show that despite making different modeling choices, there is no significant difference in the GWB parameters that are measured by the different PTAs, agreeing within $1σ$. The pulsar noise parameters are also consistent between different PTAs for the majority of the pulsars included in these analyses. We bridge the differences in modeling choices by adopting a standardized noise model for all pulsars and PTAs, finding that under this model there is a reduction in the tension in the pulsar noise parameters. As part of this reanalysis, we "extended" each PTA's data set by adding extra pulsars that were not timed by that PTA. Under these extensions, we find better constraints on the GWB amplitude and a higher signal-to-noise ratio for the Hellings and Downs correlations. These extensions serve as a prelude to the benefits offered by a full combination of data across all pulsars in the IPTA, i.e., the IPTA's Data Release 3, which will involve not just adding in additional pulsars, but also including data from all three PTAs where any given pulsar is timed by more than as single PTA. △ Less

Submitted 1 September, 2023; originally announced September 2023.

Comments: 21 pages, 9 figures, submitted to ApJ

arXiv:2308.14915 [pdf, other]

Information-driven Affordance Discovery for Efficient Robotic Manipulation

Authors: Pietro Mazzaglia, Taco Cohen, Daniel Dijkman

Abstract: Robotic affordances, providing information about what actions can be taken in a given situation, can aid robotic manipulation. However, learning about affordances requires expensive large annotated datasets of interactions or demonstrations. In this work, we argue that well-directed interactions with the environment can mitigate this problem and propose an information-based measure to augment the… ▽ More Robotic affordances, providing information about what actions can be taken in a given situation, can aid robotic manipulation. However, learning about affordances requires expensive large annotated datasets of interactions or demonstrations. In this work, we argue that well-directed interactions with the environment can mitigate this problem and propose an information-based measure to augment the agent's objective and accelerate the affordance discovery process. We provide a theoretical justification of our approach and we empirically validate the approach both in simulation and real-world tasks. Our method, which we dub IDA, enables the efficient discovery of visual affordances for several action primitives, such as grasping, stacking objects, or opening drawers, strongly improving data efficiency in simulation, and it allows us to learn grasping affordances in a small number of interactions, on a real-world setup with a UFACTORY XArm 6 robot arm. △ Less

Submitted 5 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: restoring 2308.14915v1 which was accidentally replaced with a different paper

arXiv:2307.13797 [pdf, other]

The NANOGrav 12.5-year Data Set: Search for Gravitational Wave Memory

Authors: Gabriella Agazie, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Harsha Blumer, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Dallas DeGan, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Justin A. Ellis , et al. (65 additional authors not shown)

Abstract: We present the results of a Bayesian search for gravitational wave (GW) memory in the NANOGrav 12.5-yr data set. We find no convincing evidence for any gravitational wave memory signals in this data set (Bayes factor = 2.8). As such, we go on to place upper limits on the strain amplitude of GW memory events as a function of sky location and event epoch. These upper limits are computed using a sign… ▽ More We present the results of a Bayesian search for gravitational wave (GW) memory in the NANOGrav 12.5-yr data set. We find no convincing evidence for any gravitational wave memory signals in this data set (Bayes factor = 2.8). As such, we go on to place upper limits on the strain amplitude of GW memory events as a function of sky location and event epoch. These upper limits are computed using a signal model that assumes the existence of a common, spatially uncorrelated red noise in addition to a GW memory signal. The median strain upper limit as a function of sky position is approximately $3.3 \times 10^{-14}$. We also find that there are some differences in the upper limits as a function of sky position centered around PSR J0613$-$0200. This suggests that this pulsar has some excess noise which can be confounded with GW memory. Finally, the upper limits as a function of burst epoch continue to improve at later epochs. This improvement is attributable to the continued growth of the pulsar timing array. △ Less

Submitted 25 July, 2023; originally announced July 2023.

Comments: 29 pages, 5 figures

arXiv:2307.06927 [pdf, other]

doi 10.1016/j.jmps.2024.105627

A large deformation theory for coupled swelling and growth with application to growing tumors and bacterial biofilms

Authors: Chockalingam Senthilnathan, Tal Cohen

Abstract: There is significant interest in modelling the mechanics and physics of growth of soft biological systems such as tumors and bacterial biofilms. Solid tumors account for more than 85% of cancer mortality and bacterial biofilms account for a significant part of all human microbial infections.These growing biological systems are a mixture of fluid and solid components and increase their mass by inta… ▽ More There is significant interest in modelling the mechanics and physics of growth of soft biological systems such as tumors and bacterial biofilms. Solid tumors account for more than 85% of cancer mortality and bacterial biofilms account for a significant part of all human microbial infections.These growing biological systems are a mixture of fluid and solid components and increase their mass by intake of diffusing species such as fluids and nutrients (swelling) and subsequent conversion of some of the diffusing species into solid material (growth). Experiments indicate that these systems swell by large amounts and that the swelling and growth are intrinsically coupled. However, many existing theories for swelling coupled growth employ linear poroelasticity, which is limited to small swelling deformations, and employ phenomenological prescriptions for the dependence of growth rate on concentration of diffusing species and the stress-state in the system. In particular, the termination of growth is enforced through the prescription of a critical concentration of diffusing species and a homeostatic stress. In contrast, by developing a fully coupled swelling-growth theory that accounts for large swelling through nonlinear poroelasticity, we show that the emergent driving stress for growth automatically captures all the above phenomena. Further, we show that for the soft growing systems considered here, the effects of the homeostatic stress and critical concentration can be encapsulated under a single notion of a critical swelling ratio. The applicability of the theory is shown by its ability to capture experimental observations of growing tumors and biofilms under various mechanical and diffusion-consumption constraints. Additionally, compared to generalized mixture theories, our theory is amenable to relatively easy numerical implementation with a minimal physically motivated parameter space. △ Less

Submitted 25 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

arXiv:2307.02137 [pdf, ps, other]

Improved Approximation for Two-dimensional Vector Multiple Knapsack

Authors: Tomer Cohen, Ariel Kulik, Hadas Shachnai

Abstract: We study the uniform $2$-dimensional vector multiple knapsack (2VMK) problem, a natural variant of multiple knapsack arising in real-world applications such as virtual machine placement. The input for 2VMK is a set of items, each associated with a $2$-dimensional weight vector and a positive profit, along with $m$ $2$-dimensional bins of uniform (unit) capacity in each dimension. The goal is to fi… ▽ More We study the uniform $2$-dimensional vector multiple knapsack (2VMK) problem, a natural variant of multiple knapsack arising in real-world applications such as virtual machine placement. The input for 2VMK is a set of items, each associated with a $2$-dimensional weight vector and a positive profit, along with $m$ $2$-dimensional bins of uniform (unit) capacity in each dimension. The goal is to find an assignment of a subset of the items to the bins, such that the total weight of items assigned to a single bin is at most one in each dimension, and the total profit is maximized. Our main result is a $(1- \frac{\ln 2}{2} - \varepsilon)$-approximation algorithm for 2VMK, for every fixed $\varepsilon > 0$, thus improving the best known ratio of $(1 - \frac{1}{e}-\varepsilon)$ which follows as a special case from a result of [Fleischer at al., MOR 2011]. Our algorithm relies on an adaptation of the Round$\&$Approx framework of [Bansal et al., SICOMP 2010], originally designed for set covering problems, to maximization problems. The algorithm uses randomized rounding of a configuration-LP solution to assign items to $\approx m\cdot \ln 2 \approx 0.693\cdot m$ of the bins, followed by a reduction to the ($1$-dimensional) Multiple Knapsack problem for assigning items to the remaining bins. △ Less

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2306.16223 [pdf, other]

The NANOGrav 15-year Gravitational-Wave Background Analysis Pipeline

Authors: Aaron D. Johnson, Patrick M. Meyers, Paul T. Baker, Neil J. Cornish, Jeffrey S. Hazboun, Tyson B. Littenberg, Joseph D. Romano, Stephen R. Taylor, Michele Vallisneri, Sarah J. Vigeland, Ken D. Olum, Xavier Siemens, Justin A. Ellis, Rutger van Haasteren, Sophie Hourihane, Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Bence Bécsy, J. Andrew Casey-Clyde , et al. (71 additional authors not shown)

Abstract: This paper presents rigorous tests of pulsar timing array methods and software, examining their consistency across a wide range of injected parameters and signal strength. We discuss updates to the 15-year isotropic gravitational-wave background analyses and their corresponding code representations. Descriptions of the internal structure of the flagship algorithms \texttt{Enterprise} and \texttt{P… ▽ More This paper presents rigorous tests of pulsar timing array methods and software, examining their consistency across a wide range of injected parameters and signal strength. We discuss updates to the 15-year isotropic gravitational-wave background analyses and their corresponding code representations. Descriptions of the internal structure of the flagship algorithms \texttt{Enterprise} and \texttt{PTMCMCSampler} are given to facilitate understanding of the PTA likelihood structure, how models are built, and what methods are currently used in sampling the high-dimensional PTA parameter space. We introduce a novel version of the PTA likelihood that uses a two-step marginalization procedure that performs much faster when the white noise parameters remain fixed. We perform stringent tests of consistency and correctness of the Bayesian and frequentist analysis software. For the Bayesian analysis, we test prior recovery, injection recovery, and Bayes factors. For the frequentist analysis, we test that the cross-correlation-based optimal statistic, when modified to account for a non-negligible gravitational-wave background, accurately recovers the amplitude of the background. We also summarize recent advances and tests performed on the optimal statistic in the literature from both GWB detection and parameter estimation perspectives. The tests presented here validate current and future analyses of PTA data. △ Less

Submitted 7 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: 30 pages, 10 figures, 1 table; Companion paper to "The NANOGrav 15-year Data Set: Evidence for a Gravitational-Wave Background"; For questions or comments, please email comments@nanograv.org

arXiv:2306.16222 [pdf, other]

doi 10.3847/2041-8213/ace18a

The NANOGrav 15-year Data Set: Bayesian Limits on Gravitational Waves from Individual Supermassive Black Hole Binaries

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan DeCesar, Paul B. Demorest, Matthew C. Digman, Timothy Dolch, Brendan Drachler , et al. (74 additional authors not shown)

Abstract: Evidence for a low-frequency stochastic gravitational wave background has recently been reported based on analyses of pulsar timing array data. The most likely source of such a background is a population of supermassive black hole binaries, the loudest of which may be individually detected in these datasets. Here we present the search for individual supermassive black hole binaries in the NANOGrav… ▽ More Evidence for a low-frequency stochastic gravitational wave background has recently been reported based on analyses of pulsar timing array data. The most likely source of such a background is a population of supermassive black hole binaries, the loudest of which may be individually detected in these datasets. Here we present the search for individual supermassive black hole binaries in the NANOGrav 15-year dataset. We introduce several new techniques, which enhance the efficiency and modeling accuracy of the analysis. The search uncovered weak evidence for two candidate signals, one with a gravitational-wave frequency of $\sim$4 nHz, and another at $\sim$170 nHz. The significance of the low-frequency candidate was greatly diminished when Hellings-Downs correlations were included in the background model. The high-frequency candidate was discounted due to the lack of a plausible host galaxy, the unlikely astrophysical prior odds of finding such a source, and since most of its support comes from a single pulsar with a commensurate binary period. Finding no compelling evidence for signals from individual binary systems, we place upper limits on the strain amplitude of gravitational waves emitted by such systems. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 23 pages, 13 figures, 2 tables. Accepted for publication in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email comments@nanograv.org

arXiv:2306.16221 [pdf, other]

The NANOGrav 15-year Data Set: Search for Anisotropy in the Gravitational-Wave Background

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Elizabeth C. Ferrara, William Fiore , et al. (68 additional authors not shown)

Abstract: The North American Nanohertz Observatory for Gravitational Waves (NANOGrav) has reported evidence for the presence of an isotropic nanohertz gravitational wave background (GWB) in its 15 yr dataset. However, if the GWB is produced by a population of inspiraling supermassive black hole binary (SMBHB) systems, then the background is predicted to be anisotropic, depending on the distribution of these… ▽ More The North American Nanohertz Observatory for Gravitational Waves (NANOGrav) has reported evidence for the presence of an isotropic nanohertz gravitational wave background (GWB) in its 15 yr dataset. However, if the GWB is produced by a population of inspiraling supermassive black hole binary (SMBHB) systems, then the background is predicted to be anisotropic, depending on the distribution of these systems in the local Universe and the statistical properties of the SMBHB population. In this work, we search for anisotropy in the GWB using multiple methods and bases to describe the distribution of the GWB power on the sky. We do not find significant evidence of anisotropy, and place a Bayesian $95\%$ upper limit on the level of broadband anisotropy such that $(C_{l>0} / C_{l=0}) < 20\%$. We also derive conservative estimates on the anisotropy expected from a random distribution of SMBHB systems using astrophysical simulations conditioned on the isotropic GWB inferred in the 15-yr dataset, and show that this dataset has sufficient sensitivity to probe a large fraction of the predicted level of anisotropy. We end by highlighting the opportunities and challenges in searching for anisotropy in pulsar timing array data. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 19 pages, 11 figures; submitted to Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email comments@nanograv.org

arXiv:2306.16220 [pdf, other]

doi 10.3847/2041-8213/ace18b

The NANOGrav 15-year Data Set: Constraints on Supermassive Black Hole Binaries from the Gravitational Wave Background

Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Paul T. Baker, Bence Bécsy, Laura Blecha, Alexander Bonilla, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Belinda D. Cheeseboro, Siyuan Chen, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Curt J. Cutler , et al. (89 additional authors not shown)

Abstract: The NANOGrav 15-year data set shows evidence for the presence of a low-frequency gravitational-wave background (GWB). While many physical processes can source such low-frequency gravitational waves, here we analyze the signal as coming from a population of supermassive black hole (SMBH) binaries distributed throughout the Universe. We show that astrophysically motivated models of SMBH binary popul… ▽ More The NANOGrav 15-year data set shows evidence for the presence of a low-frequency gravitational-wave background (GWB). While many physical processes can source such low-frequency gravitational waves, here we analyze the signal as coming from a population of supermassive black hole (SMBH) binaries distributed throughout the Universe. We show that astrophysically motivated models of SMBH binary populations are able to reproduce both the amplitude and shape of the observed low-frequency gravitational-wave spectrum. While multiple model variations are able to reproduce the GWB spectrum at our current measurement precision, our results highlight the importance of accurately modeling binary evolution for producing realistic GWB spectra. Additionally, while reasonable parameters are able to reproduce the 15-year observations, the implied GWB amplitude necessitates either a large number of parameters to be at the edges of expected values, or a small number of parameters to be notably different from standard expectations. While we are not yet able to definitively establish the origin of the inferred GWB signal, the consistency of the signal with astrophysical expectations offers a tantalizing prospect for confirming that SMBH binaries are able to form, reach sub-parsec separations, and eventually coalesce. As the significance grows over time, higher-order features of the GWB spectrum will definitively determine the nature of the GWB and allow for novel constraints on SMBH populations. △ Less

Submitted 18 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Accepted by Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email comments@nanograv.org. Edited to fix two equation typos (Eq.13 & 21), and minor text typos

Showing 1–50 of 419 results for author: Cohen, T