A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, et al. (1758 additional authors not shown)

Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs.

Submitted 11 October, 2024; originally announced October 2024.

Comments: 15 pages of text including references, 4 figures, 5 tables

Report number: LIGO-P2400192

arXiv:2410.06511 [pdf, other]

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

Authors: Wanchao Liang, Tianyu Liu, Less Wright, Will Constable, Andrew Gu, Chien-Chin Huang, Iris Zhang, Wei Feng, Howard Huang, Junjie Wang, Sanket Purandare, Gokul Nadathur, Stratos Idreos

Abstract: The development of large language models (LLMs) has been instrumental in advancing state-of-the-art natural language processing applications. Training LLMs with billions of parameters and trillions of tokens requires sophisticated distributed systems that enable composing and comparing several state-of-the-art techniques in order to efficiently scale across thousands of accelerators. However, existing solutions are complex, scattered across multiple libraries/repositories, lack interoperability, and are cumbersome to maintain. Thus, curating and empirically comparing training recipes require non-trivial engineering effort. This paper introduces TorchTitan, an open-source, PyTorch-native distributed training system that unifies state-of-the-art techniques, streamlining integration and reducing overhead. TorchTitan enables 3D parallelism in a modular manner with elastic scaling, providing comprehensive logging, checkpointing, and debugging tools for production-ready training. It also incorporates hardware-software co-designed solutions, leveraging features like Float8 training and SymmetricMemory. As a flexible test bed, TorchTitan facilitates custom recipe curation and comparison, allowing us to develop optimized training recipes for Llama 3.1 and provide guidance on selecting techniques for maximum efficiency based on our experiences. We thoroughly assess TorchTitan on the Llama 3.1 family of LLMs, spanning 8 billion to 405 billion parameters, and showcase its exceptional performance, modular composability, and elastic scalability. By stacking training optimizations, we demonstrate accelerations of 65.08% with 1D parallelism at the 128-GPU scale (Llama 3.1 8B), an additional 12.59% with 2D parallelism at the 256-GPU scale (Llama 3.1 70B), and an additional 30% with 3D parallelism at the 512-GPU scale (Llama 3.1 405B) on NVIDIA H100 GPUs over optimized baselines.

Submitted 8 October, 2024; originally announced October 2024.

arXiv:2407.12867 [pdf, other]

Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri, et al. (1797 additional authors not shown)

Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum-likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW-BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers.

Submitted 13 July, 2024; originally announced July 2024.

Comments: 50 pages, 10 figures, 4 tables

arXiv:2406.03372 [pdf, other]

Training of Physical Neural Networks

Authors: Ali Momeni, Babak Rahmani, Benjamin Scellier, Logan G. Wright, Peter L. McMahon, Clara C. Wanjura, Yuhang Li, Anas Skalli, Natalia G. Berloff, Tatsuhiro Onodera, Ilker Oguz, Francesco Morichetti, Philipp del Hougne, Manuel Le Gallo, Abu Sebastian, Azalia Mirhoseini, Cheng Zhang, Danijela Marković, Daniel Brunner, Christophe Moser, Sylvain Gigan, Florian Marquardt, Aydogan Ozcan, Julie Grollier, Andrea J. Liu, et al. (3 additional authors not shown)

Abstract: Physical neural networks (PNNs) are a class of neural-like networks that leverage the properties of physical systems to perform computation. While PNNs are so far a niche research area with small-scale laboratory demonstrations, they are arguably one of the most underappreciated important opportunities in modern AI. Could we train AI models 1000x larger than current ones? Could we do this and also have them perform inference locally and privately on edge devices, such as smartphones or sensors? Research over the past few years has shown that the answer to all these questions is likely "yes, with enough research": PNNs could one day radically change what is possible and practical for AI systems. To do this will however require rethinking both how AI models work, and how they are trained - primarily by considering the problems through the constraints of the underlying hardware physics. To train PNNs at large scale, many methods including backpropagation-based and backpropagation-free approaches are now being explored. These methods have various trade-offs, and so far no method has been shown to scale to the same scale and performance as the backpropagation algorithm widely used in deep learning today. However, this is rapidly changing, and a diverse ecosystem of training techniques provides clues for how PNNs may one day be utilized to create both more efficient realizations of current-scale AI models, and to enable unprecedented-scale models.

Submitted 5 June, 2024; originally announced June 2024.

Comments: 29 pages, 4 figures

arXiv:2406.01399 [pdf, other]

doi 10.1145/3630106.3658998

Null Compliance: NYC Local Law 144 and the Challenges of Algorithm Accountability

Authors: Lucas Wright, Roxana Mike Muenster, Briana Vecchione, Tianyao Qu, Pika, Cai, COMM/INFO 2450 Student Investigators, Jacob Metcalf, J. Nathan Matias

Abstract: In July 2023, New York City became the first jurisdiction globally to mandate bias audits for commercial algorithmic systems, specifically for automated employment decisions systems (AEDTs) used in hiring and promotion. Local Law 144 (LL 144) requires AEDTs to be independently audited annually for race and gender bias, and the audit report must be publicly posted. Additionally, employers are obligated to post a transparency notice with the job listing. In this study, 155 student investigators recorded 391 employers' compliance with LL 144 and the user experience for prospective job applicants. Among these employers, 18 posted audit reports and 13 posted transparency notices. These rates could potentially be explained by a significant limitation in the accountability mechanisms enacted by LL 144. Since the law grants employers substantial discretion over whether their system is in scope of the law, a null result cannot be said to indicate non-compliance, a condition we call "null compliance." Employer discretion may also explain our finding that nearly all audits reported an impact factor over 0.8, a rule of thumb often used in employment discrimination cases. We also find that the benefit of LL 144 to ordinary job seekers is limited due to shortcomings in accessibility and usability. Our findings offer important lessons for policy-makers as they consider regulating algorithmic systems, particularly the degree of discretion to grant to regulated parties and the limitations of relying on transparency and end-user accountability.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2404.04248 [pdf, other]

doi 10.3847/2041-8213/ad5beb

Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, S. Akçay, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, et al. (1771 additional authors not shown)

Abstract: We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the source has a mass less than $5~M_\odot$ at 99% credibility. We cannot definitively determine from gravitational-wave data alone whether either component of the source is a neutron star or a black hole. However, given existing estimates of the maximum neutron star mass, we find the most probable interpretation of the source to be the coalescence of a neutron star with a black hole that has a mass between the most massive neutron stars and the least massive black holes observed in the Galaxy. We provisionally estimate a merger rate density of $55^{+127}_{-47}~\text{Gpc}^{-3}\,\text{yr}^{-1}$ for compact binary coalescences with properties similar to the source of GW230529_181500; assuming that the source is a neutron star-black hole merger, GW230529_181500-like sources constitute about 60% of the total merger rate inferred for neutron star-black hole coalescences. The discovery of this system implies an increase in the expected rate of neutron star-black hole mergers with electromagnetic counterparts and provides further evidence for compact objects existing within the purported lower mass gap.

Submitted 26 July, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: 45 pages (10 pages author list, 13 pages main text, 1 page acknowledgements, 13 pages appendices, 8 pages bibliography), 17 figures, 16 tables. Update to match version published in The Astrophysical Journal Letters. Data products available from https://zenodo.org/records/10845779

Report number: LIGO-P2300352

Journal ref: ApJL 970, L34 (2024)

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.19247 [pdf, other]

Noisy intermediate-scale quantum simulation of the one-dimensional wave equation

Authors: Lewis Wright, Conor Mc Keever, Jeremy T. First, Rory Johnston, Jeremy Tillay, Skylar Chaney, Matthias Rosenkranz, Michael Lubasch

Abstract: We design and implement quantum circuits for the simulation of the one-dimensional wave equation on the Quantinuum H1-1 quantum computer. The circuit depth of our approach scales as $O(n^{2})$ for $n$ qubits representing the solution on $2^n$ grid points, and leads to infidelities of $O(2^{-4n} t^{2})$ for simulation time $t$ assuming smooth initial conditions. By varying the qubit count we study the interplay between the algorithmic and physical gate errors to identify the optimal working point of minimum total error. Our approach to simulating the wave equation can readily be adapted to other quantum processors and serve as an application-oriented benchmark.

Submitted 29 February, 2024; originally announced February 2024.

Comments: 10 pages, 7 figures, 1 table

arXiv:2402.17750 [pdf, other]

Scaling on-chip photonic neural processors using arbitrarily programmable wave propagation

Authors: Tatsuhiro Onodera, Martin M. Stein, Benjamin A. Ash, Mandar M. Sohoni, Melissa Bosch, Ryotatsu Yanagimoto, Marc Jankowski, Timothy P. McKenna, Tianyu Wang, Gennady Shvets, Maxim R. Shcherbakov, Logan G. Wright, Peter L. McMahon

Abstract: On-chip photonic processors for neural networks have potential benefits in both speed and energy efficiency but have not yet reached the scale at which they can outperform electronic processors. The dominant paradigm for designing on-chip photonics is to make networks of relatively bulky discrete components connected by one-dimensional waveguides. A far more compact alternative is to avoid explicitly defining any components and instead sculpt the continuous substrate of the photonic processor to directly perform the computation using waves freely propagating in two dimensions. We propose and demonstrate a device whose refractive index as a function of space, $n(x,z)$, can be rapidly reprogrammed, allowing arbitrary control over the wave propagation in the device. Our device, a 2D-programmable waveguide, combines photoconductive gain with the electro-optic effect to achieve massively parallel modulation of the refractive index of a slab waveguide, with an index modulation depth of $10^{-3}$ and approximately $10^4$ programmable degrees of freedom. We used a prototype device with a functional area of $12\,\text{mm}^2$ to perform neural-network inference with up to 49-dimensional input vectors in a single pass, achieving 96% accuracy on vowel classification and 86% accuracy on $7 \times 7$-pixel MNIST handwritten-digit classification. This is a scale beyond that of previous photonic chips relying on discrete components, illustrating the benefit of the continuous-waves paradigm. In principle, with large enough chip area, the reprogrammability of the device's refractive index distribution enables the reconfigurable realization of any passive, linear photonic circuit or device. This promises the development of more compact and versatile photonic systems for a wide range of applications, including optical processing, smart sensing, spectroscopy, and optical communications.

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.16695 [pdf, other]

Indirect pumping of alkali-metal gases in a miniature silicon-wafer cell

Authors: J. D. Zipfel, P. Bevington, L. Wright, W. Chalupczak, G. Quick, B. Steele, J. Nicholson, V. Guarrera

Abstract: Atom spin sensors occupy a prominent position in the scenario of quantum technology, as they can combine precise measurements with appealing miniature packages which are crucial for many applications. In this work, we report on the design and realization of miniature silicon-wafer cells, with a double-chamber configuration and integrated heaters. The cells are tested by systematically studying the spin dynamics dependence on the main pump parameters, temperature, and bias magnetic field. The results are benchmarked against cm-sized paraffin-coated cells, which allows for optimisation of operating conditions of a radio-frequency driven atomic magnetometer. In particular, we observe that, when indirect optical pumping is performed on the two cells, an analogous line narrowing mechanism appears in otherwise very different cells' conditions. Competitive results are obtained, with magnetic resonance linewidths of roughly 100 Hz at the maximum signal-to-noise ratio, in a non-zero magnetic field setting, and in an atomic shot-noise limited regime.

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.00025 [pdf, other]

Accelerating a Triton Fused Kernel for W4A16 Quantized Inference with SplitK work decomposition

Authors: Adnan Hoque, Less Wright, Chih-Chieh Yang, Mudhakar Srivatsa, Raghu Ganti

Abstract: We propose an implementation of an efficient fused matrix multiplication kernel for W4A16 quantized inference, where we perform dequantization and GEMM in a fused kernel using a SplitK work decomposition. Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads. In particular, this paper surveys the type of matrix multiplication between a skinny activation matrix and a square weight matrix. Our results show an average of 65% speed improvement on A100, and an average of 124% speed improvement on H100 (with a peak of 295%) for a range of matrix dimensions including those found in a llama-style model, where m < n = k.

Submitted 22 February, 2024; v1 submitted 5 January, 2024; originally announced February 2024.

arXiv:2401.06119 [pdf, other]

Highly multimode visible squeezed light with programmable spectral correlations through broadband up-conversion

Authors: Federico Presutti, Logan G. Wright, Shi-Yuan Ma, Tianyu Wang, Benjamin K. Malia, Tatsuhiro Onodera, Peter L. McMahon

Abstract: Multimode squeezed states of light have been proposed as a resource for achieving quantum advantage in computing and sensing. Recent experiments that demonstrate multimode Gaussian states to this end have most commonly opted for spatial or temporal modes, whereas a complete system based on frequency modes has yet to be realized. Instead, we show how to use the frequency modes simultaneously squeezed in a conventional, single-spatial-mode, optical parametric amplifier when pumped by ultrashort pulses. Specifically, we show how adiabatic frequency conversion can be used not only to convert the quantum state from infrared to visible wavelengths, but to concurrently manipulate the joint spectrum. This near unity-efficiency quantum frequency conversion, over a bandwidth >45 THz and, to our knowledge, the broadest to date, allows us to measure the state with an electron-multiplying CCD (EMCCD) camera-based spectrometer, at non-cryogenic temperatures. We demonstrate the squeezing of >400 frequency modes, with a mean of approximately 700 visible photons per shot. Our work shows how many-mode quantum states of light can be generated, manipulated, and measured with efficient use of hardware resources -- in our case, using one pulsed laser, two nonlinear crystals, and one camera. This ability to produce, with modest hardware resources, large multimode squeezed states with partial programmability motivates the use of frequency encoding for photonics-based quantum information processing.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.16166 [pdf, other]

doi 10.1038/s41467-024-51161-8

Microwave signal processing using an analog quantum reservoir computer

Authors: Alen Senanian, Sridhar Prabhu, Vladimir Kremenetski, Saswata Roy, Yingkang Cao, Jeremy Kline, Tatsuhiro Onodera, Logan G. Wright, Xiaodi Wu, Valla Fatemi, Peter L. McMahon

Abstract: Quantum reservoir computing (QRC) has been proposed as a paradigm for performing machine learning with quantum processors where the training is efficient in the number of required runs of the quantum processor and takes place in the classical domain, avoiding the issue of barren plateaus in parameterized-circuit quantum neural networks. It is natural to consider using a quantum processor based on superconducting circuits to classify microwave signals that are analog -- continuous in time. However, while theoretical proposals of analog QRC exist, to date QRC has been implemented using circuit-model quantum systems -- imposing a discretization of the incoming signal in time, with each time point input by executing a gate operation. In this paper we show how a quantum superconducting circuit comprising an oscillator coupled to a qubit can be used as an analog quantum reservoir for a variety of classification tasks, achieving high accuracy on all of them. Our quantum system was operated without artificially discretizing the input data, directly taking in microwave signals. Our work does not attempt to address the question of whether QRCs could provide a quantum computational advantage in classifying pre-recorded classical signals. However, beyond illustrating that sophisticated tasks can be performed with a modest-size quantum system and inexpensive training, our work opens up the possibility of achieving a different kind of advantage than a purely computational advantage: superconducting circuits can act as extremely sensitive detectors of microwave photons; our work demonstrates processing of ultra-low-power microwave signals in our superconducting circuit, and by combining sensitive detection with QRC processing within the same system, one could achieve a quantum sensing-computational advantage, i.e., an advantage in the overall analysis of microwave signals comprising just a few photons.

Submitted 5 September, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

Journal ref: Nature Communications 15, 7490 (2024)

arXiv:2311.13775 [pdf, other]

doi 10.1364/OPTICA.514075

Mesoscopic ultrafast nonlinear optics -- The emergence of multimode quantum non-Gaussian physics

Authors: Ryotatsu Yanagimoto, Edwin Ng, Marc Jankowski, Rajveer Nehra, Timothy P. McKenna, Tatsuhiro Onodera, Logan G. Wright, Ryan Hamerly, Alireza Marandi, M. M. Fejer, Hideo Mabuchi

Abstract: Over the last few decades, nonlinear optics has become significantly more nonlinear, traversing nearly a billionfold improvement in energy efficiency, with ultrafast nonlinear nanophotonics in particular emerging as a frontier for combining both spatial and temporal engineering. At present, cutting-edge experiments in nonlinear nanophotonics place us just above the mesoscopic regime, where a few hundred photons suffice to trigger nonlinear saturation. In contrast to classical or deep-quantum optics, the mesoscale is characterized by dynamical interactions between mean-field, Gaussian, and non-Gaussian quantum features, all within a close hierarchy of scales. When combined with the inherent multimode complexity of optical fields, such hybrid quantum-classical dynamics present theoretical, experimental, and engineering challenges to the contemporary framework of quantum optics. In this review, we highlight the unique physics that emerges in multimode nonlinear optics at the mesoscale and outline key principles for exploiting both classical and quantum features to engineer novel functionalities. We briefly survey the experimental landscape and draw attention to outstanding technical challenges in materials, dispersion engineering, and device design for accessing mesoscopic operation. Finally, we speculate on how these capabilities might usher in some new paradigms in quantum photonics, from quantum-augmented information processing to nonclassical-light-driven dynamics and phenomena to all-optical non-Gaussian measurement and sensing. The physics unlocked at the mesoscale present significant challenges and opportunities in theory and experiment alike, and this review is intended to serve as a guidepost as we begin to navigate this new frontier in ultrafast quantum nonlinear optics.

Submitted 22 November, 2023; originally announced November 2023.

Comments: The first two authors contributed equally to this work; 26 pages, 7 figures

Journal ref: Optica 11, 896 (2024)

arXiv:2311.05394 [pdf, other]

Remarkably Compact Quiescent Candidates at $3<z<5$ in JWST-CEERS

Authors: Lillian Wright, Katherine E. Whitaker, John R. Weaver, Sam E. Cutler, Bingjie Wang, Adam Carnall, Katherine A. Suess, Rachel Bezanson, Erica Nelson, Tim B. Miller, Kei Ito, Francesco Valentino

Abstract: In this letter, we measure the rest-frame optical and near-infrared sizes of ten quiescent candidates at $3<z<5$, first reported by Carnall et al. (2023a). We use James Webb Space Telescope (JWST) Near-Infrared Camera (NIRCam) F277W and F444W imaging obtained through the public CEERS Early Release Science (ERS) program and imcascade, an astronomical fitting code that utilizes Multi-Gaussian Expansion, to carry out our size measurements. When compared to the extrapolation of rest-optical size-mass relations for quiescent galaxies at lower redshift, eight out of ten candidates in our sample (80%) are on average more compact by $\sim$40%. Seven out of ten candidates (70%) exhibit rest-frame infrared sizes $\sim$10% smaller than rest-frame optical sizes, indicative of negative color gradients. Two candidates (20%) have rest-frame infrared sizes $\sim$1.4$\times$ larger than rest-frame optical sizes; one of these candidates exhibits signs of ongoing or residual star formation, suggesting this galaxy may not be fully quenched. The remaining candidate is unresolved in both filters, which may indicate an Active Galactic Nucleus (AGN). Strikingly, we observe that three of the most massive galaxies in the sample (log(M$_{\star}$/M$_{\odot}$) = 10.74 - 10.95) are extremely compact, with effective radii ${\sim}$0.7 kpc. Our findings provide no indication that the size evolution relation flattens out, and may indicate that the size evolution of quiescent galaxies is steeper than previously anticipated beyond $z>3$.

Submitted 27 February, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: Accepted for publication in ApJL. 11 pages, 4 figures, 1 table

MSC Class: 85

arXiv:2310.18335 [pdf, other]

The hardware is the software

Authors: Jeremie Laydevant, Logan G. Wright, Tianyu Wang, Peter L. McMahon

Abstract: Human brains and bodies are not hardware running software: the hardware is the software. We reason that because the microscopic physics of artificial-intelligence hardware and of human biological "hardware" is distinct, neuromorphic engineers need to be cautious (and yet also creative) in how we take inspiration from biological intelligence. We should focus primarily on principles and design ideas that respect -- and embrace -- the underlying hardware physics of non-biological intelligent systems, rather than abstracting it away. We see a major role for neuroscience in neuromorphic computing as identifying the physics-agnostic principles of biological intelligence -- that is, the principles of biological intelligence that can be gainfully adapted and applied to any physical hardware.

Submitted 20 October, 2023; originally announced October 2023.

arXiv:2310.12918 [pdf, other]

The Near-Earth Object Surveyor Mission

Authors: A. K. Mainzer, Joseph R. Masiero, Paul A. Abell, J. M. Bauer, William Bottke, Bonnie J. Buratti, Sean J. Carey, D. Cotto-Figueroa, R. M. Cutri, D. Dahlen, Peter R. M. Eisenhardt, Y. R. Fernandez, Roberto Furfaro, Tommy Grav, T. L. Hoffman, Michael S. Kelley, Yoonyoung Kim, J. Davy Kirkpatrick, Christopher R. Lawler, Eva Lilly, X. Liu, Federico Marocco, K. A. Marsh, Frank J. Masci, Craig W. McMurtry, et al. (12 additional authors not shown)

Abstract: The Near-Earth Object (NEO) Surveyor mission is a NASA observatory designed to discover and characterize near-Earth asteroids and comets. The mission's primary objective is to find the majority of objects large enough to cause severe regional impact damage ($>$140 m in effective spherical diameter) within its five-year baseline survey. Operating at the Sun-Earth L1 Lagrange point, the mission will survey to within 45 degrees of the Sun in an effort to find the objects in the most Earth-like orbits. The survey cadence is optimized to provide observational arcs long enough to reliably distinguish near-Earth objects from more distant small bodies that cannot pose an impact hazard. Over the course of its survey, NEO Surveyor will discover $\sim$200,000 - 300,000 new NEOs down to sizes as small as $\sim$10 m and thousands of comets, significantly improving our understanding of the probability of an Earth impact over the next century.

Submitted 19 October, 2023; originally announced October 2023.

Comments: accepted to PSJ

arXiv:2309.13158 [pdf, other]

Size and Albedo Constraints for (152830) Dinkinesh Using WISE Data

Authors: Kiana D. McFadden, Amy K. Mainzer, Joseph R. Masiero, James M. Bauer, Roc M. Cutri, Dar Dahlen, Frank J. Masci, Jana Pittichová, Akash Satpathy, Edward L. Wright

Abstract: Probing small main-belt asteroids provides insight into their formation and evolution through multiple dynamical and collisional processes. These asteroids also overlap in size with the potentially hazardous near-Earth object population and supply the majority of these objects. The Lucy mission will provide an opportunity for study of a small main-belt asteroid, (152830) Dinkinesh. The spacecraft will perform a flyby of this object on November 1, 2023, in preparation for its mission to the Jupiter Trojan asteroids. We employed aperture photometry on stacked frames of Dinkinesh obtained by the Wide-field Infrared Survey Explorer and performed thermal modeling on a detection at 12 $μ$m to compute diameter and albedo values. Through this method, we determined Dinkinesh has an effective spherical diameter of $0.76^{+0.11}_{-0.21}$ km and a visual geometric albedo of $0.27^{+0.25}_{-0.06}$ at the 16th and 84th percentiles. This albedo is consistent with typical stony (S-type) asteroids.

Submitted 22 September, 2023; originally announced September 2023.

Comments: Submitted to Astrophysical Journal Letters

arXiv:2308.15265 [pdf, other]

A Multi-Perspective Learning to Rank Approach to Support Children's Information Seeking in the Classroom

Authors: Garrett Allen, Katherine Landau Wright, Jerry Alan Fails, Casey Kennington, Maria Soledad Pera

Abstract: We introduce a novel re-ranking model that aims to augment the functionality of standard search engines to support classroom search activities for children (ages 6 to 11). This model extends the known listwise learning-to-rank framework by balancing risk and reward. Doing so enables the model to prioritize Web resources of high educational alignment, appropriateness, and adequate readability by analyzing the URLs, snippets, and page titles of Web resources retrieved by a given mainstream search engine. Experimental results, including an ablation study and comparisons with existing baselines, showcase the correctness of the proposed model. The outcomes of this work demonstrate the value of considering multiple perspectives inherent to the classroom setting, e.g., educational alignment, readability, and objectionability, when applied to the design of algorithms that can better support children's information discovery.

Submitted 29 August, 2023; originally announced August 2023.

Comments: Extended version of the manuscript to appear in proceedings of the 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology

arXiv:2307.15712 [pdf, other]

Quantum-noise-limited optical neural networks operating at a few quanta per activation

Authors: Shi-Yuan Ma, Tianyu Wang, Jérémie Laydevant, Logan G. Wright, Peter L. McMahon

Abstract: Analog physical neural networks, which hold promise for improved energy efficiency and speed compared to digital electronic neural networks, are nevertheless typically operated in a relatively high-power regime so that the signal-to-noise ratio (SNR) is large (>10). What happens if an analog system is instead operated in an ultra-low-power regime, in which the behavior of the system becomes highly stochastic and the noise is no longer a small perturbation on the signal? In this paper, we study this question in the setting of optical neural networks operated in the limit where some layers use only a single photon to cause a neuron activation. Neuron activations in this limit are dominated by quantum noise from the fundamentally probabilistic nature of single-photon detection of weak optical signals. We show that it is possible to train stochastic optical neural networks to perform deterministic image-classification tasks with high accuracy in spite of the extremely high noise (SNR ~ 1) by using a training procedure that directly models the stochastic behavior of photodetection. We experimentally demonstrated MNIST classification with a test accuracy of 98% using an optical neural network with a hidden layer operating in the single-photon regime; the optical energy used to perform the classification corresponds to 0.008 photons per multiply-accumulate (MAC) operation, which is equivalent to 0.003 attojoules of optical energy per MAC. Our experiment used >40x fewer photons per inference than previous state-of-the-art low-optical-energy demonstrations, to achieve the same accuracy of >90%. Our work shows that some extremely stochastic analog systems, including those operating in the limit where quantum noise dominates, can nevertheless be used as layers in neural networks that deterministically perform classification tasks with high accuracy if they are appropriately trained.

Submitted 28 July, 2023; originally announced July 2023.

Comments: 55 pages, 27 figures

arXiv:2307.06994 [pdf, other]

Size - Stellar Mass Relation and Morphology of Quiescent Galaxies at $z\geq3$ in Public $JWST$ Fields

Authors: Kei Ito, Francesco Valentino, Gabriel Brammer, Andreas L. Faisst, Steven Gillman, Carlos Gomez-Guijarro, Katriona M. L. Gould, Kasper E. Heintz, Olivier Ilbert, Christian Kragh Jespersen, Vasily Kokorev, Mariko Kubo, Georgios E. Magdis, Conor McPartland, Masato Onodera, Francesca Rizzo, Masayuki Tanaka, Sune Toft, Aswin P. Vijayan, John R. Weaver, Katherine E. Whitaker, Lillian Wright

Abstract: We present the results of a systematic study of the rest-frame optical morphology of quiescent galaxies at $z \geq 3$ using the Near-Infrared Camera (NIRCam) onboard $JWST$. Based on a sample selected by $UVJ$ color or $NUVUVJ$ color, we focus on 26 quiescent galaxies with $9.8<\log{(M_\star/M_\odot)}<11.4$ at $2.8<z_{\rm phot}<4.6$ with publicly available $JWST$ data. Their sizes are constrained by fitting the Sérsic profile to all available NIRCam images. We see a negative correlation between the observed wavelength and the size in our sample and derive their size at the rest-frame $0.5\, {\rm μm}$ taking into account this trend. Our quiescent galaxies show a significant correlation between the rest-frame $0.5\, {\rm μm}$ size and the stellar mass at $z\geq3$. The analytical fit for them at $\log{(M_\star/M_\odot)}>10.3$ implies that our size - stellar mass relations are below those at lower redshifts, with the amplitude of $\sim0.6\, {\rm kpc}$ at $M_\star = 5\times 10^{10}\, M_\odot$. This value agrees with the extrapolation from the size evolution of quiescent galaxies at $z<3$ in the literature, implying that the size of quiescent galaxies increases monotonically from $z\sim3-5$. Our sample is mainly composed of galaxies with bulge-like structures according to their median Sérsic index and axis ratio of $n\sim3-4$ and $q\sim0.6-0.8$, respectively. On the other hand, there is a trend of increasing fraction of galaxies with low Sérsic index, suggesting $3<z<5$ might be the epoch of onset of morphological transformation with a fraction of very notable disky quenched galaxies.

Submitted 6 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

Comments: 25 pages, 17 figures, 3 tables; accepted for publication in ApJ

arXiv:2306.13533 [pdf]

doi 10.1038/s41467-024-45924-6

Continuous Ultraviolet to Blue-Green Astrocomb

Authors: Yuk Shan Cheng, Kamalesh Dadi, Toby Mitchell, Samantha Thompson, Nikolai Piskunov, Lewis D. Wright, Corin B. E. Gawith, Richard A. McCracken, Derryck T. Reid

Abstract: The characterization of Earth-like exoplanets and precision tests of cosmological models using next-generation telescopes such as the ELT will demand precise calibration of astrophysical spectrographs in the visible region, where stellar absorption lines are most abundant. Astrocombs--lasers providing a broadband sequence of ultra-narrow, drift-free, regularly spaced optical frequencies on a multi-GHz grid--promise an atomically-traceable, versatile calibration scale, but their realization is challenging because of the need for ultra-broadband frequency conversion of mode-locked infrared lasers into the blue-green region. Here, we introduce a new concept achieving a broad, continuous spectrum by combining second-harmonic generation and sum-frequency-mixing in an aperiodically-poled MgO:PPLN waveguide to generate gap-free 390-520 nm light from a 1 GHz Ti:sapphire laser frequency comb. We lock a low-dispersion Fabry-Perot etalon to extract a sub-comb of bandwidth from 392-472 nm with a spacing of 30 GHz, visualizing the thousands of resulting comb modes on a high resolution cross-dispersion spectrograph. Complementary experimental data and simulations demonstrate the effectiveness of the approach for eliminating the spectral gaps present in second-harmonic-only conversion, in which weaker fundamental frequencies are suppressed by the quadratic $\chi^{(2)}$ nonlinearity. Requiring only ~100 pJ pulse energies, our concept establishes a practical new route to broadband UV-visible generation at GHz repetition rates.

Submitted 23 June, 2023; originally announced June 2023.

Comments: 14 pages; 4 figures

Journal ref: NATURE COMMUNICATIONS 15:1466 (2024)

arXiv:2306.11945 [pdf, other]

doi 10.3847/1538-3881/acda93

IRAS 00450+7401 and the mid-infrared fade/burst cycle of R Coronae Borealis-type stars

Authors: William A. Burris, Carl Melis, Allen W. Shafter, Georgia V. Panopoulou, Edward L. Wright, John Della Costa

Abstract: We present optical and infrared imaging and spectroscopy of the R Coronae Borealis-type (R Cor Bor) star IRAS 00450+7401. Optical spectra further confirm its classification as a cool R Cor Bor system, having a hydrogen-deficient carbon star spectral sub-class of HdC5 or later. Mid-infrared spectroscopy reveals the typical ~8 um ``hump'' seen in other R Cor Bor stars and no other features. A modern-epoch spectral energy distribution shows bright emission from hot dust having Tdust>600 K. Historical infrared data reveal generally cooler dust color temperatures combined with long-term fading trends, but provide no discernible correlation between flux level and temperature. Investigating the most mid-infrared variable R Cor Bor stars found in IRAS, AKARI, and WISE data reveals similar fading trends, bursts that can show a factor of up to 10 change in flux density between epochs, and blackbody-fit dust color temperatures that span 400-1300 K. While some R Cor Bor stars such as IRAS 00450+7401 appear to undergo fade/burst cycles in the mid-infrared, significant gaps in temporal coverage prevent conclusively identifying any preferred timescale for their mid-infrared variability and circumstellar dust temperature changes.

Submitted 20 June, 2023; originally announced June 2023.

Comments: AJ accepted, 15 pages, 6 figures, 5 tables, and an appendix

arXiv:2304.11277 [pdf, other]

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Authors: Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li

Abstract: It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine learning systems research, which has enabled the development and exploration of large models, such abilities remain confined to a small group of advanced users and industry leaders, resulting in an implicit technical barrier for the wider community to access and leverage these technologies. In this paper, we introduce PyTorch Fully Sharded Data Parallel (FSDP) as an industry-grade solution for large model training. FSDP has been closely co-designed with several key PyTorch core components including Tensor implementation, dispatcher system, and CUDA memory caching allocator, to provide non-intrusive user experiences and high training efficiency. Additionally, FSDP natively incorporates a range of techniques and settings to optimize resource utilization across a variety of hardware configurations. The experimental results demonstrate that FSDP is capable of achieving comparable performance to Distributed Data Parallel while providing support for significantly larger models with near-linear scalability in terms of TFLOPS.

Submitted 12 September, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

arXiv:2302.12043 [pdf, ps, other]

Conversational Agents and Children: Let Children Learn

Authors: Casey Kennington, Jerry Alan Fails, Katherine Landau Wright, Maria Soledad Pera

Abstract: Using online information discovery as a case study, in this position paper we discuss the need to design, develop, and deploy (conversational) agents that can -- non-intrusively -- guide children in their quest for online resources rather than simply finding resources for them. We argue that agents should "let children learn" and should be built to take on a teacher-facilitator function, allowing children to develop their technical and critical thinking abilities as they interact with varied technology in a broad range of use cases.

Submitted 23 February, 2023; originally announced February 2023.

Comments: 6 pages

arXiv:2302.10360 [pdf, other]

Optical Transformers

Authors: Maxwell G. Anderson, Shi-Yuan Ma, Tianyu Wang, Logan G. Wright, Peter L. McMahon

Abstract: The rapidly increasing size of deep-learning models has caused renewed and growing interest in alternatives to digital computers to dramatically reduce the energy cost of running state-of-the-art neural networks. Optical matrix-vector multipliers are best suited to performing computations with very large operands, which suggests that large Transformer models could be a good target for optical computing. To test this idea, we performed small-scale optical experiments with a prototype accelerator to demonstrate that Transformer operations can run on optical hardware despite noise and errors. Using simulations, validated by our experiments, we then explored the energy efficiency of optical implementations of Transformers and identified scaling laws for model performance with respect to optical energy usage. We found that the optical energy per multiply-accumulate (MAC) scales as $\frac{1}{d}$ where $d$ is the Transformer width, an asymptotic advantage over digital systems. We conclude that with well-engineered, large-scale optical hardware, it may be possible to achieve a $100 \times$ energy-efficiency advantage for running some of the largest current Transformer models, and that if both the models and the optical hardware are scaled to the quadrillion-parameter regime, optical computers could have a $>8,000\times$ energy-efficiency advantage over state-of-the-art digital-electronic processors that achieve 300 fJ/MAC. We analyzed how these results motivate and inform the construction of future optical accelerators along with optics-amenable deep-learning approaches. With assumptions about future improvements to electronics and Transformer quantization techniques (5$\times$ cheaper memory access, double the digital--analog conversion efficiency, and 4-bit precision), we estimated that optical computers' advantage against current 300-fJ/MAC digital processors could grow to $>100,000\times$.

Submitted 20 February, 2023; originally announced February 2023.

Comments: 27 pages, 13 figures

Journal ref: Transactions on Machine Learning Research, 03/2024, https://openreview.net/forum?id=Xxw0edFFQC

arXiv:2210.11273 [pdf]

doi 10.1088/2040-8986/ace4dc

Roadmap on spatiotemporal light fields

Authors: Yijie Shen, Qiwen Zhan, Logan G. Wright, Demetrios N. Christodoulides, Frank W. Wise, Alan E. Willner, Zhe Zhao, Kai-heng Zou, Chen-Ting Liao, Carlos Hernández-García, Margaret Murnane, Miguel A. Porras, Andy Chong, Chenhao Wan, Konstantin Y. Bliokh, Murat Yessenov, Ayman F. Abouraddy, Liang Jie Wong, Michael Go, Suraj Kumar, Cheng Guo, Shanhui Fan, Nikitas Papasimakis, Nikolay I. Zheludev, Lu Chen, et al. (20 additional authors not shown)

Abstract: Spatiotemporal sculpturing of light pulses with ultimately sophisticated structures represents the holy grail of humanity's everlasting pursuit of ultrafast information transmission and processing as well as ultra-intense energy concentration and extraction. It also holds the key to unlocking new extraordinary fundamental physical effects. Traditionally, spatiotemporal light pulses are always treated as spatiotemporally separable wave packets as solutions of Maxwell's equations. In the past decade, however, more generalized forms of spatiotemporally nonseparable solutions have started to emerge with growing importance for their striking physical effects. This roadmap intends to highlight the recent advances in the creation and control of increasingly complex spatiotemporally sculptured pulses, from spatiotemporally separable to complex nonseparable states, with diverse geometric and topological structures, presenting a bird's eye viewpoint on the zoology of spatiotemporal light fields and the outlook of future trends and open challenges.

Submitted 20 October, 2022; originally announced October 2022.

Comments: This is the version of the article before peer review or editing, as submitted by an author to Journal of Optics. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it

arXiv:2208.05088 [pdf, other]

doi 10.1038/s41567-023-02075-7

Programmable large-scale simulation of bosonic transport in optical synthetic frequency lattices

Authors: Alen Senanian, Logan G. Wright, Peter F. Wade, Hannah K. Doyle, Peter L. McMahon

Abstract: Photonic simulators using synthetic frequency dimensions have enabled flexible experimental analogues of condensed-matter systems, realizing phenomena that are impractical to observe in real-space systems. However, to date such photonic simulators have been limited to small systems suffering from finite-size effects. Here, we present an analog simulator capable of simulating large 2D and 3D lattices, as well as lattices with non-planar connectivity, including a tree lattice that serves as a toy model in quantum gravity. Our demonstration is enabled by the broad bandwidth achievable in photonics, allowing our simulator to realize lattices with over 100,000 lattice sites. We explore these large lattices in a wide range of previously inaccessible regimes by using a novel method to excite arbitrary states. Our work establishes the scalability and flexibility of programmable simulators based on synthetic frequency dimensions in the optical domain. We anticipate that future extensions of this platform will leverage advances in high-bandwidth optoelectronics to support simulations of dynamic, non-equilibrium phases at the scale of millions of lattice sites, and Kerr-frequency-comb technology to simulate models with higher-order interactions, ultimately in regimes and at scales inaccessible to both digital computers and realizable materials.

Submitted 9 August, 2022; originally announced August 2022.

arXiv:2207.14293 [pdf, other]

doi 10.1038/s41566-023-01170-8

Image sensing with multilayer, nonlinear optical neural networks

Authors: Tianyu Wang, Mandar M. Sohoni, Logan G. Wright, Martin M. Stein, Shi-Yuan Ma, Tatsuhiro Onodera, Maxwell G. Anderson, Peter L. McMahon

Abstract: Optical imaging is commonly used for both scientific and technological applications across industry and academia. In image sensing, a measurement, such as of an object's position, is performed by computational analysis of a digitized image. An emerging image-sensing paradigm breaks this delineation between data collection and analysis by designing optical components to perform not imaging, but encoding. By optically encoding images into a compressed, low-dimensional latent space suitable for efficient post-analysis, these image sensors can operate with fewer pixels and fewer photons, allowing higher-throughput, lower-latency operation. Optical neural networks (ONNs) offer a platform for processing data in the analog, optical domain. ONN-based sensors have however been limited to linear processing, but nonlinearity is a prerequisite for depth, and multilayer NNs significantly outperform shallow NNs on many tasks. Here, we realize a multilayer ONN pre-processor for image sensing, using a commercial image intensifier as a parallel optoelectronic, optical-to-optical nonlinear activation function. We demonstrate that the nonlinear ONN pre-processor can achieve compression ratios of up to 800:1 while still enabling high accuracy across several representative computer-vision tasks, including machine-vision benchmarks, flow-cytometry image classification, and identification of objects in real scenes. In all cases we find that the ONN's nonlinearity and depth allowed it to outperform a purely linear ONN encoder. Although our experiments are specialized to ONN sensors for incoherent-light images, alternative ONN platforms should facilitate a range of ONN sensors. These ONN sensors may surpass conventional sensors by pre-processing optical information in spatial, temporal, and/or spectral dimensions, potentially with coherent and quantum qualities, all natively in the optical domain.

Submitted 27 July, 2022; originally announced July 2022.

Journal ref: Nat. Photon. 18, 1-8 (2023)

arXiv:2205.09768 [pdf, other]

Deterministic Tensor Network Classifiers

Authors: L. Wright, F. Barratt, J. Dborin, V. Wimalaweera, B. Coyle, A. G. Green

Abstract: We present tensor networks for feature extraction and refinement of classifier performance. These networks can be initialised deterministically and have the potential for implementation on near-term intermediate-scale quantum (NISQ) devices. Feature extraction proceeds through a direct combination and compression of images amplitude-encoded over just $\log N_{\text{pixels}}$ qubits. Performance is refined using `Quantum Stacking', a deterministic method that can be applied to the predictions of any classifier regardless of structure, and implemented on NISQ devices using data re-uploading. These procedures are applied to a tensor network encoding of data, and benchmarked against the 10 class MNIST and fashion MNIST datasets. Good training and test accuracy are achieved without any variational training.

Submitted 19 May, 2022; originally announced May 2022.

arXiv:2204.05412 [pdf, other]

NEOWISE Observations Of The Potentially Hazardous Asteroid (99942) Apophis

Authors: Akash Satpathy, Amy Mainzer, Joseph R. Masiero, Tyler Linder, Roc M. Cutri, Edward L. Wright, Jana Pittichova, Tommy Grav, Emily Kramer

Abstract: Large potentially hazardous asteroids (PHAs) are capable of causing a global catastrophe in the event of a planetary collision. Thus, rapid assessment of such an object's physical characteristics is crucial for determining its potential risk scale. We treated the near-Earth asteroid (99942) Apophis as a newly discovered object during its 2020-2021 close-approach as part of a mock planetary defense exercise. The object was detected by the Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE), and data collected by the two active bands (3.4 $μ$m and 4.6 $μ$m) were analyzed using thermal and thermophysical modeling. Our results indicate that Apophis is an elongated object with an effective spherical diameter D$_{eff}$ = 340 $\pm$ 70 m, a geometric visual albedo p$_{V}$ = 0.31 $\pm$ 0.09, and a thermal inertia $Γ$ $\sim$ 150 - 2850 Jm$^{-2}$s$^{-0.5}$K$^{-1}$ with a best-fit value of 550 Jm$^{-2}$s$^{-0.5}$K$^{-1}$. NEOWISE "discovery" observations reveal that (99942) Apophis is a potentially hazardous asteroid that would likely cause damage at a regional level and not a global one.

Submitted 11 April, 2022; originally announced April 2022.

Comments: 19 Pages, 6 Figures, Accepted for publication in PSJ

arXiv:2203.04979 [pdf, other]

doi 10.3847/1538-4357/aca677

REQUIEM-2D: A diversity of formation pathways in a sample of spatially-resolved massive quiescent galaxies at z~2

Authors: Mohammad Akhshik, Katherine E. Whitaker, Joel Leja, Johan Richard, Justin S. Spilker, Mimi Song, Gabriel Brammer, Rachel Bezanson, Harald Ebeling, Anna R. Gallazzi, Guillaume Mahler, Lamiya A. Mowla, Erica J. Nelson, Camilla Pacifici, Keren Sharon, Sune Toft, Christina C. Williams, Lillian Wright, Johannes Zabl

Abstract: REQUIEM-2D (REsolving QUIEscent Magnified galaxies with 2D grism spectroscopy) is comprised of a sample of 8 massive ($\log M_*/M_\odot > 10.6$) strongly lensed quiescent galaxies at $z\sim2$. REQUIEM-2D combines the natural magnification from strong gravitational lensing with the high spatial-resolution grism spectroscopy of \emph{Hubble Space Telescope} through a spectrophotometric fit to study spatially resolved stellar populations. We show that quiescent galaxies in the REQUIEM-2D survey have diverse formation histories manifesting as a gradient in stellar ages, including examples of (1) a younger central region supporting outside-in formation, (2) flat age gradients that show evidence for both spatially-uniform early formation or inside-out quenching, and (3) regions at a fixed radial distance having different ages (such asymmetries cannot be recovered when averaging stellar population measurements azimuthally). The typical dust attenuation curve for the REQUIEM-2D galaxies is constrained to be steeper than Calzetti's law in the UV and generally consistent with $A_V<1$. Combined together and accounting for the different physical radial distances and formation time-scales, we find that the REQUIEM-2D galaxies that formed earlier in the universe exhibit slow and uniform growth in their inner core, whereas the galaxies that formed later have rapid inner growth in their inner core with younger ages relative to the outskirts.
These results challenge the currently accepted paradigm of how massive quiescent galaxies form, where the earliest galaxies are thought to form most rapidly. Significantly larger samples close to the epoch of formation with similar data quality and higher spectral resolution are required to validate this finding.

Submitted 9 March, 2022; originally announced March 2022.

Comments: Submitted to ApJ

arXiv:2203.03366 [pdf, other]

Improvements to Gradient Descent Methods for Quantum Tensor Network Machine Learning

Authors: Fergus Barratt, James Dborin, Lewis Wright

Abstract: Tensor networks have demonstrated significant value for machine learning in a myriad of different applications. However, optimizing tensor networks using standard gradient descent has proven to be difficult in practice. Tensor networks suffer from initialization problems resulting in exploding or vanishing gradients and require extensive hyperparameter tuning. Efforts to overcome these problems usually depend on specific network architectures, or ad hoc prescriptions. In this paper we address the problems of initialization and hyperparameter tuning, making it possible to train tensor networks using established machine learning techniques. We introduce a `copy node' method that successfully initializes arbitrary tensor networks, in addition to a gradient based regularization technique for bond dimensions. We present numerical results that show that the combination of techniques presented here produces quantum inspired tensor network models with far fewer parameters, while improving generalization performance.

Submitted 3 March, 2022; originally announced March 2022.

Journal ref: Second Workshop on Quantum Tensor Networks in Machine Learning, 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

arXiv:2111.13799 [pdf, other]

doi 10.1364/OPTICA.447782

Onset of non-Gaussian quantum physics in pulsed squeezing with mesoscopic fields

Authors: Ryotatsu Yanagimoto, Edwin Ng, Atsushi Yamamura, Tatsuhiro Onodera, Logan G. Wright, Marc Jankowski, M. M. Fejer, Peter L. McMahon, Hideo Mabuchi

Abstract: We study the emergence of non-Gaussian quantum features in pulsed squeezed light generation with a mesoscopic number (i.e., dozens to hundreds) of pump photons. Due to the strong optical nonlinearities necessarily involved in this regime, squeezing occurs alongside significant pump depletion, compromising the predictions made by conventional semiclassical models for squeezing. Furthermore, nonlinear interactions among multiple frequency modes render the system dynamics exponentially intractable in naïve quantum models, requiring a more sophisticated modeling framework. To this end, we construct a nonlinear Gaussian approximation to the squeezing dynamics, defining a "Gaussian interaction frame" (GIF) in which non-Gaussian quantum dynamics can be isolated and concisely described using a few dominant (i.e., principal) supermodes. Numerical simulations of our model reveal non-Gaussian distortions of squeezing in the mesoscopic regime, largely associated with signal-pump entanglement. We argue that the state of the art in nonlinear nanophotonics is quickly approaching this regime, providing an all-optical platform for experimental studies of the semiclassical-to-quantum transition in a rich paradigm of coherent, multimode nonlinear dynamics. Mesoscopic pulsed squeezing thus provides an intriguing case study of the rapid rise in dynamic complexity associated with semiclassical-to-quantum crossover, which we view as a correlate of the emergence of new information-processing capacities in the quantum regime.

Submitted 26 November, 2021; originally announced November 2021.

Comments: The first two authors contributed equally to this work; 16 pages, 7 figures

Journal ref: Optica 9, 379 (2022)

arXiv:2107.14055 [pdf]

Profit and loss manipulations by online trading brokers

Authors: Golnaz Shahtahmassebi, Lascelles Wright

Abstract: Online trading has attracted millions of people around the world. In March 2021, it was reported there were 18 million accounts from just one broker. Historically, manipulation in financial markets is considered to be fraudulently influencing share, currency-pair, or other index prices. This article introduces the idea that online trading platform technical issues can be considered as broker manipulation to control traders' profit and loss. More importantly, it shows these technical issues are contributing factors to the 82% risk of retail traders losing money. We identify trading platform technical issues of one of the world's leading online trading providers and calculate retail traders' losses caused by these issues. To do this, we independently record each trade's details using the REST API response provided by the broker. We show that traders' log activity files are the only way to assess any suspected profit or loss manipulation by the broker. Therefore, it is essential for any retail trader to have access to their log files. We compare our findings with the broker's Trustpilot customer reviews. We illustrate how traders' profit and loss can be negatively affected by the broker's platform technical issues such as not being able to close profitable trades, closing trades with delays, disappearance of trades, disappearance of profit from clients' statements, profit and loss discrepancies, stop loss not being triggered, and stop loss or limit order triggered too early.
Although regulatory bodies try to ensure that consumers get a fair deal, these attempts are hugely insufficient in protecting retail traders. Therefore, regulatory bodies such as the FCA should take these technical issues seriously and not rely on brokers' internal investigations, because under any other circumstances, these platform manipulations would be considered as crimes and connivingly misappropriating funds.

Submitted 29 July, 2021; originally announced July 2021.

Comments: 20 pages, 7 tables, 5 figures

arXiv:2107.07481 [pdf, other]

Asteroid Diameters and Albedos from NEOWISE Reactivation Mission Years Six and Seven

Authors: Joseph R. Masiero, A. K. Mainzer, J. M. Bauer, R. M. Cutri, T. Grav, E. Kramer, J. Pittichová, E. L. Wright

Abstract: We present diameters and albedos computed for the near-Earth and Main Belt asteroids observed by the Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE) spacecraft during the sixth and seventh years of its Reactivation mission. These diameters and albedos are calculated from fitting thermal models to NEOWISE observations of $199$ NEOs and $5851$ MBAs detected during the sixth year of the survey, and $175$ NEOs and $5861$ MBAs from the seventh year. Comparisons of the near-Earth object diameters derived from Reactivation data with those derived from the WISE cryogenic mission data show a $\sim30\%$ relative uncertainty. This larger uncertainty compared to data from the cryogenic mission is due to the need to assume a beaming parameter for the fits to the shorter wavelength data that the Reactivation mission is limited to. We also present an analysis of the orbital parameters of the Main Belt asteroids that have been discovered by NEOWISE during Reactivation, finding that these objects tend to be on orbits that result in their perihelia being far from the ecliptic, and thus missed by other surveys. To date, the NEOWISE Reactivation survey has provided thermal fits of $1415$ unique NEOs. Including the mission phases before spacecraft hibernation increases the count of unique NEOs characterized to $1845$ from WISE's launch to the present.

Submitted 15 July, 2021; originally announced July 2021.

Comments: Accepted to PSJ

arXiv:2107.00506 [pdf, other]

doi 10.3847/1538-4357/ac12cb

An Improved Near-Infrared Spectrum of the Archetype Y Dwarf WISEP J182831.08+265037.8

Authors: Michael C. Cushing, Adam C. Schneider, J. Davy Kirkpatrick, Caroline V. Morley, Mark S. Marley, Christopher R. Gelino, Gregory N. Mace, Edward L. Wright, Peter R. Eisenhardt, Michael F. Skrutskie, Kenneth A. Marsh

Abstract: We present a Hubble Space Telescope/Wide-Field Camera 3 near infrared spectrum of the archetype Y dwarf WISEP 182831.08+265037.8. The spectrum covers the 0.9-1.7 um wavelength range at a resolving power of lambda/Delta lambda ~180 and is a significant improvement over the previously published spectrum because it covers a broader wavelength range and is uncontaminated by light from a background star. The spectrum is unique for a cool brown dwarf in that the flux peaks in the Y, J, and H band are of near equal intensity in units of f_lambda. We fail to detect any absorption bands of NH_3 in the spectrum, in contrast to the predictions of chemical equilibrium models, but tentatively identify CH_4 as the carrier of an unknown absorption feature centered at 1.015 um. Using previously published ground- and space-based photometry, and using a Rayleigh Jeans tail to account for flux emerging longward of 4.5 um, we compute a bolometric luminosity of log (L_bol/L_sun)=-6.50+-0.02 which is significantly lower than previously published results. Finally, we compare the spectrum and photometry to two sets of atmospheric models and find that the best overall match to the observed properties of WISEP 182831.08+265037.8 is a ~1 Gyr old binary composed of two T_eff~325 K, ~5 M_Jup brown dwarfs with subsolar [C/O] ratios.

Submitted 1 July, 2021; originally announced July 2021.

Comments: Accepted for publication in the Astrophysical Journal

arXiv:2106.13731 [pdf, other]

Ranger21: a synergistic deep learning optimizer

Authors: Less Wright, Nestor Demeure

Abstract: As optimizers are critical to the performances of neural networks, every year a large number of papers innovating on the subject are published. However, while most of these publications provide incremental improvements to existing algorithms, they tend to be presented as new optimizers rather than composable algorithms. Thus, many worthwhile improvements are rarely seen out of their initial publication. Taking advantage of this untapped potential, we introduce Ranger21, a new optimizer which combines AdamW with eight components, carefully selected after reviewing and testing ideas from the literature. We found that the resulting optimizer provides significantly improved validation accuracy and training speed, smoother training curves, and is even able to train a ResNet50 on ImageNet2012 without Batch Normalization layers, a problem on which AdamW stays systematically stuck in a bad initial state.

Submitted 6 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

Comments: for associated code, see https://github.com/lessw2020/Ranger21

ACM Class: I.2.6

arXiv:2106.13408 [pdf, other]

doi 10.3847/2041-8213/ac0437

The Enigmatic Brown Dwarf WISEA J153429.75-104303.3 (aka "The Accident")

Authors: J. Davy Kirkpatrick, Federico Marocco, Dan Caselden, Aaron M. Meisner, Jacqueline K. Faherty, Adam C. Schneider, Marc J. Kuchner, S. L. Casewell, Christopher R. Gelino, Michael C. Cushing, Peter R. Eisenhardt, Edward L. Wright, Steven D. Schurr

Abstract: Continued follow-up of WISEA J153429.75-104303.3, announced in Meisner et al (2020), has proven it to have an unusual set of properties. New imaging data from Keck/MOSFIRE and HST/WFC3 show that this object is one of the few faint proper motion sources known with J-ch2 > 8 mag, indicating a very cold temperature consistent with the latest known Y dwarfs. Despite this, it has W1-W2 and ch1-ch2 colors ~1.6 mag bluer than a typical Y dwarf. A new trigonometric parallax measurement from a combination of WISE, Spitzer, and HST astrometry confirms a nearby distance of $16.3^{+1.4}_{-1.2}$ pc and a large transverse velocity of $207.4{\pm}15.9$ km/s. The absolute J, W2, and ch2 magnitudes are in line with the coldest known Y dwarfs, despite the highly discrepant W1-W2 and ch1-ch2 colors. We explore possible reasons for the unique traits of this object and conclude that it is most likely an old, metal-poor brown dwarf and possibly the first Y subdwarf. Given that the object has an HST F110W magnitude of 24.7 mag, broad-band spectroscopy and photometry from JWST are the best options for testing this hypothesis.

Submitted 24 June, 2021; originally announced June 2021.

Comments: 8 pages, 4 figures, accepted for publication in The Astrophysical Journal Letters

arXiv:2106.05742 [pdf, other]

Matrix Product State Pre-Training for Quantum Machine Learning

Authors: James Dborin, Fergus Barratt, Vinul Wimalaweera, Lewis Wright, Andrew G. Green

Abstract: Hybrid Quantum-Classical algorithms are a promising candidate for developing uses for NISQ devices. In particular, Parametrised Quantum Circuits (PQCs) paired with classical optimizers have been used as a basis for quantum chemistry and quantum optimization problems. Training PQCs relies on methods to overcome the fact that the gradients of PQCs vanish exponentially in the size of the circuits used. Tensor network methods are being increasingly used as a classical machine learning tool, as well as a tool for studying quantum systems. We introduce a circuit pre-training method based on matrix product state machine learning methods, and demonstrate that it accelerates training of PQCs for supervised learning, energy minimization, and combinatorial optimization.

Submitted 14 July, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

Comments: v2: Added short comparison to entanglement devised barren plateau mitigation - relevant paper missed in first submission

arXiv:2105.03976 [pdf, other]

Tailoring interfacial effect in multilayers with Dzyaloshinskii-Moriya interaction by helium ion irradiation

Authors: A. Sud, S. Tacchi, D. Sagkovits, C. Barton, M. Sall, L. H. Diez, E. Stylianidis, N. Smith, L. Wright, S. Zhang, X. Zhang, D. Ravelosona, G. Carlotti, H. Kurebayashi, O. Kazakova, M. Cubukcu

Abstract: We show a method to control magnetic interfacial effects in multilayers with Dzyaloshinskii-Moriya interaction (DMI) using helium (He$^{+}$) ion irradiation. We report results from SQUID magnetometry, ferromagnetic resonance, and Brillouin light scattering on multilayers with DMI as a function of irradiation fluence to study the effect of irradiation on the magnetic properties of the multilayers. Our results show clear evidence of the He$^{+}$ irradiation effects on the magnetic properties, which is consistent with interface modification due to the effects of the He$^{+}$ irradiation. This external degree of freedom offers promising perspectives to further improve the control of magnetic skyrmions in multilayers, which could push them towards integration in future technologies.

Submitted 18 September, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

Comments: 10 pages, 6 figures

arXiv:2105.03456 [pdf, other]

CASTing a Net: Supporting Teachers with Search Technology

Authors: Garrett Allen, Katherine Landau Wright, Jerry Alan Fails, Casey Kennington, Maria Soledad Pera

Abstract: Past and current research has typically focused on ensuring that search technology for the classroom serves children. In this paper, we argue for the need to broaden the research focus to include teachers and how search technology can aid them. In particular, we share how furnishing a behind-the-scenes portal for teachers can empower them by providing a window into the spelling, writing, and concept connection skills of their students.

Submitted 7 May, 2021; originally announced May 2021.

Comments: KidRec '21: 5th International and Interdisciplinary Perspectives on Children & Recommender and Information Retrieval Systems (KidRec) Search and Recommendation Technology through the Lens of a Teacher- Co-located with ACM IDC 2021

arXiv:2104.13467 [pdf, other]

doi 10.1038/s41467-021-27774-8

An optical neural network using less than 1 photon per multiplication

Authors: Tianyu Wang, Shi-Yuan Ma, Logan G. Wright, Tatsuhiro Onodera, Brian Richard, Peter L. McMahon

Abstract: Deep learning has rapidly become a widespread tool in both scientific and commercial endeavors. Milestones of deep learning exceeding human performance have been achieved for a growing number of tasks over the past several years, across areas as diverse as game-playing, natural-language translation, and medical-image analysis. However, continued progress is increasingly hampered by the high energy costs associated with training and running deep neural networks on electronic processors. Optical neural networks have attracted attention as an alternative physical platform for deep learning, as it has been theoretically predicted that they can fundamentally achieve higher energy efficiency than neural networks deployed on conventional digital computers. Here, we experimentally demonstrate an optical neural network achieving 99% accuracy on handwritten-digit classification using ~3.2 detected photons per weight multiplication and ~90% accuracy using ~0.64 photons (~$2.4 \times 10^{-19}$ J of optical energy) per weight multiplication. This performance was achieved using a custom free-space optical processor that executes matrix-vector multiplications in a massively parallel fashion, with up to ~0.5 million scalar (weight) multiplications performed at the same time. Using commercially available optical components and standard neural-network training methods, we demonstrated that optical neural networks can operate near the standard quantum limit with extremely low optical powers and still achieve high accuracy.
Our results provide a proof-of-principle for low-optical-power operation, and with careful system design including the surrounding electronics used for data storage and control, open up a path to realizing optical processors that require only $10^{-16}$ J total energy per scalar multiplication -- which is orders of magnitude more efficient than current digital processors.

Submitted 27 April, 2021; originally announced April 2021.

Comments: 42 pages, 21 figures

Journal ref: Nature Communications 13, 123 (2022)

arXiv:2104.13386 [pdf, other]

doi 10.1038/s41586-021-04223-6

Deep physical neural networks enabled by a backpropagation algorithm for arbitrary physical systems

Authors: Logan G. Wright, Tatsuhiro Onodera, Martin M. Stein, Tianyu Wang, Darren T. Schachter, Zoey Hu, Peter L. McMahon

Abstract: Deep neural networks have become a pervasive tool in science and engineering. However, modern deep neural networks' growing energy requirements now increasingly limit their scaling and broader use. We propose a radical alternative for implementing deep neural network models: Physical Neural Networks. We introduce a hybrid physical-digital algorithm called Physics-Aware Training to efficiently train sequences of controllable physical systems to act as deep neural networks. This method automatically trains the functionality of any sequence of real physical systems, directly, using backpropagation, the same technique used for modern deep neural networks. To illustrate their generality, we demonstrate physical neural networks with three diverse physical systems: optical, mechanical, and electrical. Physical neural networks may facilitate unconventional machine learning hardware that is orders of magnitude faster and more energy efficient than conventional electronic processors.

Submitted 27 April, 2021; originally announced April 2021.

Journal ref: Nature 601, 549-555 (2022)

arXiv:2102.05902 [pdf, other]

doi 10.1364/OPTICA.423044

Efficient simulation of ultrafast quantum nonlinear optics with matrix product states

Authors: Ryotatsu Yanagimoto, Edwin Ng, Logan G. Wright, Tatsuhiro Onodera, Hideo Mabuchi

Abstract: Ultra-short pulses propagating in nonlinear nanophotonic waveguides can simultaneously leverage both temporal and spatial field confinement, promising a route towards single-photon nonlinearities in an all-photonic platform. In this multimode quantum regime, however, faithful numerical simulations of pulse dynamics naïvely require a representation of the state in an exponentially large Hilbert space. Here, we employ a time-domain, matrix product state (MPS) representation to enable efficient simulations by exploiting the entanglement structure of the system. In order to extract physical insight from these simulations, we develop an algorithm to unravel the MPS quantum state into constituent temporal supermodes, enabling, e.g., access to the phase-space portraits of arbitrary pulse waveforms. As a demonstration, we perform exact numerical simulations of a Kerr soliton in the quantum regime. We observe the development of non-classical Wigner-function negativity in the solitonic mode as well as quantum corrections to the semiclassical dynamics of the pulse. A similar analysis of $χ^{(2)}$ simultons reveals a unique entanglement structure between the fundamental and second harmonic. Our approach is also readily compatible with quantum trajectory theory, allowing full quantum treatment of propagation loss and decoherence. We expect this work to establish the MPS technique as part of a unified engineering framework for the emerging field of broadband quantum photonics.

Submitted 11 February, 2021; originally announced February 2021.

Comments: 12 pages, 7 figures

Journal ref: Optica 8, 1306 (2021)

arXiv:2102.00645 [pdf, other]

An End-to-End Food Image Analysis System

Authors: Jiangpeng He, Runyu Mao, Zeman Shao, Janine L. Wright, Deborah A. Kerr, Carol J. Boushey, Fengqing Zhu

Abstract: Modern deep learning techniques have enabled advances in image-based dietary assessment such as food recognition and food portion size estimation. Valuable information on the types of foods and the amount consumed is crucial for prevention of many chronic diseases. However, existing methods for automated image-based food analysis are neither end-to-end nor capable of processing multiple tasks (e.g., recognition and portion estimation) together, making them difficult to apply to real-life applications. In this paper, we propose an image-based food analysis framework that integrates food localization, classification and portion size estimation. Our proposed framework is end-to-end, i.e., the input can be an arbitrary food image containing multiple food items and our system can localize each single food item with its corresponding predicted food type and portion size. We also improve the single food portion estimation by consolidating localization results with a food energy distribution map obtained by conditional GAN to generate a four-channel RGB-Distribution image. Our end-to-end framework is evaluated on a real-life food image dataset collected from a nutrition feeding study.

Submitted 1 February, 2021; originally announced February 2021.

arXiv:2101.02728 [pdf, other]

Uncertainties on Asteroid Albedos Determined by Thermal Modeling

Authors: Joseph R. Masiero, E. L. Wright, A. K. Mainzer

Abstract: We present an analysis of the accuracy of geometric albedos determined for asteroids through the modeling of observed thermal infrared radiation. We show that albedo uncertainty is dominated by the uncertainty on the measured $H_V$ absolute magnitude, and that any analysis using albedos in a statistical application will also be dominated by this source of uncertainty. For all but the small fraction of asteroids with a large amount of characterization data, improved knowledge of the $H_V$ magnitude will be fundamentally limited by incomplete phase curve coverage, incomplete light curve knowledge, and the necessary conversion from the observed band to the $V$ band. Switching the absolute magnitude standard to a different band such as $r'$ would mitigate the uncertainty due to band conversion for many surveys, but this only represents a small component of the total uncertainty. Therefore, techniques making use of these albedos must ensure that their uncertainties are being properly accounted for.

Submitted 7 January, 2021; originally announced January 2021.

Comments: 10 pages, 1 figure. Accepted to the Planetary Science Journal

arXiv:2012.13084 [pdf, other]

doi 10.3847/1538-4365/abd805

The CatWISE2020 Catalog

Authors: Federico Marocco, Peter R. M. Eisenhardt, John W. Fowler, J. Davy Kirkpatrick, Aaron M. Meisner, Edward F. Schlafly, S. Adam Stanford, Nelson Garcia, Dan Caselden, Michael C. Cushing, Roc M. Cutri, Jacqueline K. Faherty, Christopher R. Gelino, Anthony H. Gonzalez, Thomas H. Jarrett, Renata Koontz, Amanda Mainzer, Elijah J. Marchese, Bahram Mobasher, David J. Schlegel, Daniel Stern, Harry I. Teplitz, Edward L. Wright

Abstract: The CatWISE2020 Catalog consists of 1,890,715,640 sources over the entire sky selected from WISE and NEOWISE survey data at 3.4 and 4.6 $μ$m (W1 and W2) collected from 2010 Jan. 7 to 2018 Dec. 13. This dataset adds two years to that used for the CatWISE Preliminary Catalog (Eisenhardt et al., 2020), bringing the total to six times as many exposures spanning over sixteen times as large a time baseline as the AllWISE catalog. The other major change from the CatWISE Preliminary Catalog is that the detection list for the CatWISE2020 Catalog was generated using ${\it crowdsource}$ (Schlafly et al. 2019), while the CatWISE Preliminary Catalog used the detection software used for AllWISE. These two factors result in roughly twice as many sources in the CatWISE2020 Catalog. The scatter with respect to ${\it Spitzer}$ photometry at faint magnitudes in the COSMOS field, which is out of the Galactic plane and at low ecliptic latitude (corresponding to lower WISE coverage depth) is similar to that for the CatWISE Preliminary Catalog. The 90% completeness depth for the CatWISE2020 Catalog is at W1=17.7 mag and W2=17.5 mag, 1.7 mag deeper than in the CatWISE Preliminary Catalog. From comparison to ${\it Gaia}$, CatWISE2020 motions are accurate at the 20 mas yr$^{-1}$ level for W1$\sim$15 mag sources, and at the $\sim100$ mas yr$^{-1}$ level for W1$\sim$17 mag sources. This level of precision represents a 12$\times$ improvement over AllWISE.
The CatWISE catalogs are available in the WISE/NEOWISE Enhanced and Contributed Products area of the NASA/IPAC Infrared Science Archive.

Submitted 23 December, 2020; originally announced December 2020.

Comments: 27 pages, 24 figures, 2 tables. Accepted for publication in ApJS. arXiv admin note: text overlap with arXiv:1908.08902

arXiv:2012.12110 [pdf]

Direct Observation of Thermalization to a Rayleigh-Jeans Distribution in Multimode Optical Fibers

Authors: Hamed Pourbeyram, Pavel Sidorenko, Fan Wu, Nicholas Bender, Logan Wright, Demetrios Christodoulides, Frank Wise

Abstract: Recent years have witnessed a resurgence of interest in nonlinear multimode optical systems where a host of intriguing effects have been observed that are impossible in single-mode settings. While nonlinearity can provide a rich environment where the chaotic power exchange among thousands of modes can lead to novel behaviors, at the same time, it poses a major challenge in terms of understanding and harnessing these processes to advantage. Over the years, statistical models have been developed to macroscopically describe the response of these complex systems. One of the cornerstones of these theoretical formalisms is the prediction of a photon-photon mediated thermalization process that leads to a Rayleigh-Jeans distribution of mode occupations. Here we report the use of mode-resolved measurement techniques to make the first direct observations of thermalization to a Rayleigh-Jeans power distribution in a multimode optical fiber. We experimentally demonstrate that the underlying system Hamiltonian remains invariant during propagation while power equipartition takes place among degenerate groups of modes - all in full accord with theoretical predictions. Our results may pave the way toward a new generation of high-power optical sources whose brightness and modal content can be controlled using principles from thermodynamics and statistical mechanics.

Submitted 14 February, 2022; v1 submitted 22 December, 2020; originally announced December 2020.

arXiv:2011.11616 [pdf, other]

doi 10.3847/1538-4365/abd107

The Field Substellar Mass Function Based on the Full-sky 20-pc Census of 525 L, T, and Y Dwarfs

Authors: J. Davy Kirkpatrick, Christopher R. Gelino, Jacqueline K. Faherty, Aaron M. Meisner, Dan Caselden, Adam C. Schneider, Federico Marocco, Alfred J. Cayago, R. L. Smart, Peter R. Eisenhardt, Marc J. Kuchner, Edward L. Wright, Michael C. Cushing, Katelyn N. Allers, Daniella C. Bardalez Gagliuffi, Adam J. Burgasser, Jonathan Gagne, Sarah E. Logsdon, Emily C. Martin, James G. Ingalls, Patrick J. Lowrance, Ellianna S. Abrahams, Christian Aganze, Roman Gerasimov, Eileen C. Gonzales , et al. (27 additional authors not shown)

Abstract: We present final Spitzer trigonometric parallaxes for 361 L, T, and Y dwarfs. We combine these with prior studies to build a list of 525 known L, T, and Y dwarfs within 20 pc of the Sun, 38 of which are presented here for the first time. Using published photometry and spectroscopy as well as our own follow-up, we present an array of color-magnitude and color-color diagrams to further characterize census members, and we provide polynomial fits to the bulk trends. Using these characterizations, we assign each object a $T_{\rm eff}$ value and judge sample completeness over bins of $T_{\rm eff}$ and spectral type. Except for types $\ge$ T8 and $T_{\rm eff} <$ 600K, our census is statistically complete to the 20-pc limit. We compare our measured space densities to simulated density distributions and find that the best fit is a power law ($dN/dM \propto M^{-α}$) with $α= 0.6{\pm}0.1$. We find that the evolutionary models of Saumon & Marley correctly predict the observed magnitude of the space density spike seen at 1200K $< T_{\rm eff} <$ 1350K, believed to be caused by an increase in the cooling timescale across the L/T transition. Defining the low-mass terminus using this sample requires a more statistically robust and complete sample of dwarfs $\ge$Y0.5 and with $T_{\rm eff} <$ 400K. We conclude that such frigid objects must exist in substantial numbers, despite the fact that few have so far been identified, and we discuss possible reasons why they have largely eluded detection.

Submitted 23 November, 2020; originally announced November 2020.

Comments: 101 pages, 31 figures, accepted for publication in the Astrophysical Journal Supplement Series

Showing 1–50 of 391 results for author: Wright, L