subscribe to arXiv mailings

AI Foundation Model for Heliophysics: Applications, Design, and Implementation

Authors: Sujit Roy, Talwinder Singh, Marcus Freitag, Johannes Schmude, Rohit Lal, Dinesha Hegde, Soumya Ranjan, Amy Lin, Vishal Gaur, Etienne Eben Vos, Rinki Ghosal, Badri Narayana Patro, Berkay Aydin, Nikolai Pogorelov, Juan Bernabe Moreno, Manil Maskey, Rahul Ramachandran

Abstract: Deep learning-based methods have been widely researched in the areas of language and vision, demonstrating their capacity to understand long sequences of data and their usefulness in numerous helio-physics applications. Foundation models (FMs), which are pre-trained on a large-scale datasets, form the basis for a variety of downstream tasks. These models, especially those based on transformers in… ▽ More Deep learning-based methods have been widely researched in the areas of language and vision, demonstrating their capacity to understand long sequences of data and their usefulness in numerous helio-physics applications. Foundation models (FMs), which are pre-trained on a large-scale datasets, form the basis for a variety of downstream tasks. These models, especially those based on transformers in vision and language, show exceptional potential for adapting to a wide range of downstream applications. In this paper, we provide our perspective on the criteria for designing an FM for heliophysics and associated challenges and applications using the Solar Dynamics Observatory (SDO) dataset. We believe that this is the first study to design an FM in the domain of heliophysics. △ Less

Submitted 30 September, 2024; originally announced October 2024.

Comments: 31 Pages, 12 figures

arXiv:2410.09250 [pdf, other]

Quantum-Trained Convolutional Neural Network for Deepfake Audio Detection

Authors: Chu-Hsuan Abraham Lin, Chen-Yu Liu, Samuel Yen-Chi Chen, Kuan-Cheng Chen

Abstract: The rise of deepfake technologies has posed significant challenges to privacy, security, and information integrity, particularly in audio and multimedia content. This paper introduces a Quantum-Trained Convolutional Neural Network (QT-CNN) framework designed to enhance the detection of deepfake audio, leveraging the computational power of quantum machine learning (QML). The QT-CNN employs a hybrid… ▽ More The rise of deepfake technologies has posed significant challenges to privacy, security, and information integrity, particularly in audio and multimedia content. This paper introduces a Quantum-Trained Convolutional Neural Network (QT-CNN) framework designed to enhance the detection of deepfake audio, leveraging the computational power of quantum machine learning (QML). The QT-CNN employs a hybrid quantum-classical approach, integrating Quantum Neural Networks (QNNs) with classical neural architectures to optimize training efficiency while reducing the number of trainable parameters. Our method incorporates a novel quantum-to-classical parameter mapping that effectively utilizes quantum states to enhance the expressive power of the model, achieving up to 70% parameter reduction compared to classical models without compromising accuracy. Data pre-processing involved extracting essential audio features, label encoding, feature scaling, and constructing sequential datasets for robust model evaluation. Experimental results demonstrate that the QT-CNN achieves comparable performance to traditional CNNs, maintaining high accuracy during training and testing phases across varying configurations of QNN blocks. The QT framework's ability to reduce computational overhead while maintaining performance underscores its potential for real-world applications in deepfake detection and other resource-constrained scenarios. This work highlights the practical benefits of integrating quantum computing into artificial intelligence, offering a scalable and efficient approach to advancing deepfake detection technologies. △ Less

Submitted 11 October, 2024; originally announced October 2024.

arXiv:2410.09239 [pdf, other]

Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure

Authors: Jihao Andreas Lin, Sebastian Ament, Maximilian Balandat, Eytan Bakshy

Abstract: A key task in AutoML is to model learning curves of machine learning models jointly as a function of model hyper-parameters and training progression. While Gaussian processes (GPs) are suitable for this task, naïve GPs require $\mathcal{O}(n^3m^3)$ time and $\mathcal{O}(n^2 m^2)$ space for $n$ hyper-parameter configurations and $\mathcal{O}(m)$ learning curve observations per hyper-parameter. Effi… ▽ More A key task in AutoML is to model learning curves of machine learning models jointly as a function of model hyper-parameters and training progression. While Gaussian processes (GPs) are suitable for this task, naïve GPs require $\mathcal{O}(n^3m^3)$ time and $\mathcal{O}(n^2 m^2)$ space for $n$ hyper-parameter configurations and $\mathcal{O}(m)$ learning curve observations per hyper-parameter. Efficient inference via Kronecker structure is typically incompatible with early-stopping due to missing learning curve values. We impose $\textit{latent Kronecker structure}$ to leverage efficient product kernels while handling missing values. In particular, we interpret the joint covariance matrix of observed values as the projection of a latent Kronecker product. Combined with iterative linear solvers and structured matrix-vector multiplication, our method only requires $\mathcal{O}(n^3 + m^3)$ time and $\mathcal{O}(n^2 + m^2)$ space. We show that our GP model can match the performance of a Transformer on a learning curve prediction task. △ Less

Submitted 11 October, 2024; originally announced October 2024.

Comments: Bayesian Decision-making and Uncertainty Workshop at NeurIPS 2024

arXiv:2410.06387 [pdf, ps, other]

Shafarevich's conjecture for families of hypersurfaces over function fields

Authors: Philip Engel, Alice Lin, Salim Tayou

Abstract: Given a smooth quasi-projective complex algebraic variety $\mathcal{S}$, we prove that there are only finitely many Hodge-generic non-isotrivial families of smooth projective hypersurfaces over $\mathcal{S}$ of degree $d$ in $\mathbb{P}_{\mathbb C}^{n+1}$. We prove that the finiteness is uniform in $\mathcal{S}$ and we give examples where the result is sharp. We also prove similar results for cert… ▽ More Given a smooth quasi-projective complex algebraic variety $\mathcal{S}$, we prove that there are only finitely many Hodge-generic non-isotrivial families of smooth projective hypersurfaces over $\mathcal{S}$ of degree $d$ in $\mathbb{P}_{\mathbb C}^{n+1}$. We prove that the finiteness is uniform in $\mathcal{S}$ and we give examples where the result is sharp. We also prove similar results for certain complete intersections in $\mathbb{P}_{\mathbb C}^{n+1}$ of higher codimension and more generally for algebraic varieties whose moduli space admits a period map that satisfies the infinitesimal Torelli theorem. △ Less

Submitted 8 October, 2024; originally announced October 2024.

arXiv:2410.01625 [pdf, other]

A Fourth Planet in the Kepler-51 System Revealed by Transit Timing Variations

Authors: Kento Masuda, Jessica E. Libby-Roberts, John H. Livingston, Kevin B. Stevenson, Peter Gao, Shreyas Vissapragada, Guangwei Fu, Te Han, Michael Greklek-McKeon, Suvrath Mahadevan, Eric Agol, Aaron Bello-Arufe, Zachory Berta-Thompson, Caleb I. Canas, Yayaati Chachan, Leslie Hebb, Renyu Hu, Yui Kawashima, Heather A. Knutson, Caroline V. Morley, Catriona A. Murray, Kazumasa Ohno, Armen Tokadjian, Xi Zhang, Luis Welbanks , et al. (27 additional authors not shown)

Abstract: Kepler-51 is a $\lesssim 1\,\mathrm{Gyr}$-old Sun-like star hosting three transiting planets with radii $\approx 6$-$9\,R_\oplus$ and orbital periods $\approx 45$-$130\,\mathrm{days}$. Transit timing variations (TTVs) measured with past Kepler and Hubble Space Telescope (HST) observations have been successfully modeled by considering gravitational interactions between the three transiting planets,… ▽ More Kepler-51 is a $\lesssim 1\,\mathrm{Gyr}$-old Sun-like star hosting three transiting planets with radii $\approx 6$-$9\,R_\oplus$ and orbital periods $\approx 45$-$130\,\mathrm{days}$. Transit timing variations (TTVs) measured with past Kepler and Hubble Space Telescope (HST) observations have been successfully modeled by considering gravitational interactions between the three transiting planets, yielding low masses and low mean densities ($\lesssim 0.1\,\mathrm{g/cm^3}$) for all three planets. However, the transit time of the outermost transiting planet Kepler-51d recently measured by the James Webb Space Telescope (JWST) 10 years after the Kepler observations is significantly discrepant from the prediction made by the three-planet TTV model, which we confirmed with ground-based and follow-up HST observations. We show that the departure from the three-planet model is explained by including a fourth outer planet, Kepler-51e, in the TTV model. A wide range of masses ($\lesssim M_\mathrm{Jup}$) and orbital periods ($\lesssim 10\,\mathrm{yr}$) are possible for Kepler-51e. Nevertheless, all the coplanar solutions found from our brute-force search imply masses $\lesssim 10\,M_\oplus$ for the inner transiting planets. Thus their densities remain low, though with larger uncertainties than previously estimated. Unlike other possible solutions, the one in which Kepler-51e is around the $2:1$ mean motion resonance with Kepler-51d implies low orbital eccentricities ($\lesssim 0.05$) and comparable masses ($\sim 5\,M_\oplus$) for all four planets, as is seen in other compact multi-planet systems. This work demonstrates the importance of long-term follow-up of TTV systems for probing longer period planets in a system. △ Less

Submitted 4 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

Comments: 48 pages, 26 figures, accepted for publication in AJ

arXiv:2409.16889 [pdf, other]

Searching for GEMS: TOI-6383Ab, a giant planet transiting an M3-dwarf star in a binary system

Authors: Lia Marta Bernabò, Shubham Kanodia, Caleb I. Canas, William D. Cochran, Szilárd Csizmadia, Suvrath Mahadevan, Gudhmundur Stefánsson, Arvind F. Gupta, Andrew Monson, Henry A. Kobulnicky, Alexander K. Larsen, Ethan G. Cotter, Alexina Birkholz, Tera N. Swaby, Gregory Zeimann, Chad F. Bender, Scott A. Diddams, Jessica E. Libby-Roberts, Andrea S. J. Lin, Joe P. Ninan, Heike Rauer, Varghese Reji, Paul Robertson, Arpita Roy, Christian Schwab

Abstract: We report on the discovery of a transiting giant planet around the 3500 K M3-dwarf star TOI-6383A located 172 pc from Earth. It was detected by the Transiting Exoplanet Survey Satellite (TESS) and confirmed by a combination of ground-based follow-up photometry and precise radial velocity measurements. This planet has an orbital period of $\sim$1.791 days, mass of 1.040$\pm$0.094 $M_J$ and a radius… ▽ More We report on the discovery of a transiting giant planet around the 3500 K M3-dwarf star TOI-6383A located 172 pc from Earth. It was detected by the Transiting Exoplanet Survey Satellite (TESS) and confirmed by a combination of ground-based follow-up photometry and precise radial velocity measurements. This planet has an orbital period of $\sim$1.791 days, mass of 1.040$\pm$0.094 $M_J$ and a radius of 1d.008$^{+0.036}_{-0.033} ~R_J$, resulting in a mean bulk density of 1.26$^{+0.18}_{-0.17}$ g cm$^{-3}$. TOI-6383A has an M-dwarf companion star, TOI-6383B, which has a stellar effective temperature $T_{eff}$ $\sim$ 3100 K and a projected orbital separation of 3100 AU. TOI-6383A is a low-mass dwarf star hosting a giant planet and is an intriguing object for planetary evolution studies due to its high planet-to-star mass ratio. This discovery is part of the \textit{Searching for Giant Exoplanets around M-dwarf Stars (GEMS)} Survey, intending to provide robust and accurate estimates of the occurrence of GEMS and the statistics on their physical and orbital parameters. This paper presents an interesting addition to the small number of confirmed GEMS, particularly notable since its formation necessitates massive, ust-rich protoplanetary discs and high accretion efficiency ($>$ 10\%). △ Less

Submitted 25 September, 2024; originally announced September 2024.

Comments: 20 pages, 8 figures

arXiv:2409.13598 [pdf, other]

Prithvi WxC: Foundation Model for Weather and Climate

Authors: Johannes Schmude, Sujit Roy, Will Trojak, Johannes Jakubik, Daniel Salles Civitarese, Shraddha Singh, Julian Kuehnert, Kumar Ankur, Aman Gupta, Christopher E Phillips, Romeo Kienzler, Daniela Szwarcman, Vishal Gaur, Rajat Shinde, Rohit Lal, Arlindo Da Silva, Jorge Luis Guevara Diaz, Anne Jones, Simon Pfreundschuh, Amy Lin, Aditi Sheshadri, Udaysankar Nair, Valentine Anantharaj, Hendrik Hamann, Campbell Watson , et al. (4 additional authors not shown)

Abstract: Triggered by the realization that AI emulators can rival the performance of traditional numerical weather prediction models running on HPC systems, there is now an increasing number of large AI models that address use cases such as forecasting, downscaling, or nowcasting. While the parallel developments in the AI literature focus on foundation models -- models that can be effectively tuned to addr… ▽ More Triggered by the realization that AI emulators can rival the performance of traditional numerical weather prediction models running on HPC systems, there is now an increasing number of large AI models that address use cases such as forecasting, downscaling, or nowcasting. While the parallel developments in the AI literature focus on foundation models -- models that can be effectively tuned to address multiple, different use cases -- the developments on the weather and climate side largely focus on single-use cases with particular emphasis on mid-range forecasting. We close this gap by introducing Prithvi WxC, a 2.3 billion parameter foundation model developed using 160 variables from the Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). Prithvi WxC employs an encoder-decoder-based architecture, incorporating concepts from various recent transformer models to effectively capture both regional and global dependencies in the input data. The model has been designed to accommodate large token counts to model weather phenomena in different topologies at fine resolutions. Furthermore, it is trained with a mixed objective that combines the paradigms of masked reconstruction with forecasting. We test the model on a set of challenging downstream tasks namely: Autoregressive rollout forecasting, Downscaling, Gravity wave flux parameterization, and Extreme events estimation. The pretrained model with 2.3 billion parameters, along with the associated fine-tuning workflows, has been publicly released as an open-source contribution via Hugging Face. △ Less

Submitted 20 September, 2024; originally announced September 2024.

arXiv:2409.12315 [pdf, other]

The NEID Earth Twin Survey. I. Confirmation of a 31-day planet orbiting HD 86728

Authors: Arvind F. Gupta, Jacob K. Luhn, Jason T. Wright, Suvrath Mahadevan, Paul Robertson, Daniel M. Krolikowski, Eric B. Ford, Caleb I. Cañas, Samuel Halverson, Andrea S. J. Lin, Shubham Kanodia, Evan Fitzmaurice, Christian Gilbertson, Chad F. Bender, Cullen H. Blake, Jiayin Dong, Mark R. Giovinazzi, Sarah E. Logsdon, Andrew Monson, Joe P. Ninan, Jayadev Rajagopal, Arpita Roy, Christian Schwab, Guðmundur Stefánsson

Abstract: With close to three years of observations in hand, the NEID Earth Twin Survey (NETS) is starting to unearth new astrophysical signals for a curated sample of bright, radial velocity (RV)-quiet stars. We present the discovery of the first NETS exoplanet, HD 86728 b, a $m_p\sin i = 9.16^{+0.55}_{-0.56}\ \rm{M}_\oplus$ planet on a circular, $P=31.1503^{+0.0062}_{-0.0066}$ d orbit, thereby confirming… ▽ More With close to three years of observations in hand, the NEID Earth Twin Survey (NETS) is starting to unearth new astrophysical signals for a curated sample of bright, radial velocity (RV)-quiet stars. We present the discovery of the first NETS exoplanet, HD 86728 b, a $m_p\sin i = 9.16^{+0.55}_{-0.56}\ \rm{M}_\oplus$ planet on a circular, $P=31.1503^{+0.0062}_{-0.0066}$ d orbit, thereby confirming a candidate signal identified by Hirsch et al. (2021). We confirm the planetary origin of the detected signal, which has a semi-amplitude of just $K=1.91^{+0.11}_{-0.12}$ m s$^{-1}$, via careful analysis of the NEID RVs and spectral activity indicators, and we constrain the mass and orbit via fits to NEID and archival RV measurements. The host star is intrinsically quiet at the $\sim1$ m s$^{-1}$ level, with the majority of this variability likely stemming from short-timescale granulation. HD 86728 b is among the small fraction of exoplanets with similar masses and periods that have no known planetary siblings. △ Less

Submitted 18 September, 2024; originally announced September 2024.

Comments: Submitted to AAS Journals. 18 pages, 10 figures, 3 tables, 1 appendix

arXiv:2409.06992 [pdf, other]

Quantum-Train with Tensor Network Mapping Model and Distributed Circuit Ansatz

Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Kuan-Cheng Chen

Abstract: In the Quantum-Train (QT) framework, mapping quantum state measurements to classical neural network weights is a critical challenge that affects the scalability and efficiency of hybrid quantum-classical models. The traditional QT framework employs a multi-layer perceptron (MLP) for this task, but it struggles with scalability and interpretability. To address these issues, we propose replacing the… ▽ More In the Quantum-Train (QT) framework, mapping quantum state measurements to classical neural network weights is a critical challenge that affects the scalability and efficiency of hybrid quantum-classical models. The traditional QT framework employs a multi-layer perceptron (MLP) for this task, but it struggles with scalability and interpretability. To address these issues, we propose replacing the MLP with a tensor network-based model and introducing a distributed circuit ansatz designed for large-scale quantum machine learning with multiple small quantum processing unit nodes. This approach enhances scalability, efficiently represents high-dimensional data, and maintains a compact model structure. Our enhanced QT framework retains the benefits of reduced parameter count and independence from quantum resources during inference. Experimental results on benchmark datasets demonstrate that the tensor network-based QT framework achieves competitive performance with improved efficiency and generalization, offering a practical solution for scalable hybrid quantum-classical machine learning. △ Less

Submitted 10 September, 2024; originally announced September 2024.

Comments: 4 pages, 3 figures

arXiv:2409.02552 [pdf, ps, other]

Cointegration test in time series analysis by global optimisation

Authors: Alvey Qianli Lin, Zhiwen Zhang

Abstract: In this paper, we provide an optimisation approach motivated by the Blind Source Separation, or also known as Independent Component Analysis, for cointegration between financial time series. Two methods for cointegration tests are introduced, namely decorrelation for the bivariate case and maximisation of nongaussianity for higher-dimensions. The advantages of our methods, especially the better pe… ▽ More In this paper, we provide an optimisation approach motivated by the Blind Source Separation, or also known as Independent Component Analysis, for cointegration between financial time series. Two methods for cointegration tests are introduced, namely decorrelation for the bivariate case and maximisation of nongaussianity for higher-dimensions. The advantages of our methods, especially the better performances in limited sample size, enable a wider range of application and accessibility for researchers and practitioners to identify cointegrating relationships. △ Less

Submitted 4 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

MSC Class: 65K05

arXiv:2409.01371 [pdf, other]

Searching for GEMS: TOI-5688 A b, a low-density giant orbiting a high-metallicity early M-dwarf

Authors: Varghese Reji, Shubham Kanodia, Joe Ninan, Caleb I. Cañas, Jessica Libby-Roberts, Andrea S. J. Lin, Arvind F Gupta, Tera N. Sewaby, Alexander Larsen, Henry A. Kobulnicky, Philip I. Choi, Nez Evans, Sage Santomenna, Isabelle Winnick, Larry Yu, Jaime A. Alvarado-Montes, Chad Bender, Lia Marta Bernabò, Cullen H. Blake, William D. Cochran, Scott A. Diddams, Samuel Halverson, Te Han, Fred Hearty, Sarah E. Logsdon , et al. (9 additional authors not shown)

Abstract: We present the discovery of a low-density planet transiting TOI-5688 A b, a high-metallicity M2V star. This planet was discovered as part of the search for transiting giant planets ($R \gtrsim8$ M$_\oplus$) through the Searching for GEMS (Giant Exoplanets around M-dwarf Stars) survey. The planet TOI-5688 A b was discovered with the Transiting Exoplanet Survey Satellite (TESS), and characterized wi… ▽ More We present the discovery of a low-density planet transiting TOI-5688 A b, a high-metallicity M2V star. This planet was discovered as part of the search for transiting giant planets ($R \gtrsim8$ M$_\oplus$) through the Searching for GEMS (Giant Exoplanets around M-dwarf Stars) survey. The planet TOI-5688 A b was discovered with the Transiting Exoplanet Survey Satellite (TESS), and characterized with ground-based transits from Red Buttes Observatory (RBO), the Table Mountain Observatory of Pomona College, and radial velocity (RV) measurements with the Habitable-Zone Planet Finder (HPF) on the 10 m Hobby Eberly Telescope (HET) and NEID on the WIYN 3.5 m telescope. From the joint fit of transit and RV data, the mass of the planet is $124\pm24$ M$_\oplus$ and the radius is $10.4\pm0.7$ R$_\oplus$. This planet has a density of $0.61^{+0.20}_{-0.15}$ g/cm${}^3$, and is on a $\sim2.95$ day orbit around its host star. The spectroscopic and photometric analysis of the host star TOI-5688 A shows that it is a high metallicity ([Fe/H] $ = 0.47\pm0.16$ dex) M2V star, favoring the core-accretion formation pathway as the likely formation scenario for this planet. In this paper, we analyze potential mechanisms of planet formation in the context of the formation of TOI-5688 A b. Additionally, observations with Gaia suggest the presence of a wide-separation binary companion, TOI-5688 B, which has a projected separation of $\sim5"$ (1110 AU) and is an M4V. This makes TOI-5688 A b part of a growing number of GEMS in wide-separation binary systems. △ Less

Submitted 4 September, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

Comments: 20 pages, 7 figures, Submitted to AJ, Comments are welcome

arXiv:2408.14694 [pdf, other]

Searching for GEMS: Characterizing Six Giant Planets around Cool Dwarfs

Authors: Shubham Kanodia, Arvind F. Gupta, Caleb I. Canas, Lia Marta Bernabo, Varghese Reji, Te Han, Madison Brady, Andreas Seifahrt, William D. Cochran, Nidia Morrell, Ritvik Basant, Jacob Bean, Chad F. Bender, Zoe L. de Beurs, Allyson Bieryla, Alexina Birkholz, Nina Brown, Franklin Chapman, David R. Ciardi, Catherine A. Clark, Ethan G. Cotter, Scott A. Diddams, Samuel Halverson, Suzanne Hawley, Leslie Hebb , et al. (20 additional authors not shown)

Abstract: Transiting giant exoplanets around M-dwarf stars (GEMS) are rare, owing to the low-mass host stars. However, the all-sky coverage of TESS has enabled the detection of an increasingly large number of them to enable statistical surveys like the \textit{Searching for GEMS} survey. As part of this endeavour, we describe the observations of six transiting giant planets, which includes precise mass meas… ▽ More Transiting giant exoplanets around M-dwarf stars (GEMS) are rare, owing to the low-mass host stars. However, the all-sky coverage of TESS has enabled the detection of an increasingly large number of them to enable statistical surveys like the \textit{Searching for GEMS} survey. As part of this endeavour, we describe the observations of six transiting giant planets, which includes precise mass measurements for two GEMS (K2-419Ab, TOI-6034b) and statistical validation for four systems, which includes validation and mass upper limits for three of them (TOI-5218b, TOI-5616b, TOI-5634Ab), while the fourth one -- TOI-5414b is classified as a `likely planet'. Our observations include radial velocities from the Habitable-zone Planet Finder on the Hobby-Eberly Telescope, and MAROON-X on Gemini-North, along with photometry and high-contrast imaging from multiple ground-based facilities. In addition to TESS photometry, K2-419Ab was also observed and statistically validated as part of the K2 mission in Campaigns 5 and 18, which provides precise orbital and planetary constraints despite the faint host star and long orbital period of $\sim 20.4$ days. With an equilibrium temperature of only 380 K, K2-419Ab is one of the coolest known well-characterized transiting planets. TOI-6034 has a late F-type companion about 40\arcsec~away, making it the first GEMS host star to have an earlier main-sequence binary companion. These confirmations add to the existing small sample of confirmed transiting GEMS. △ Less

Submitted 27 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

Comments: Accepted in AJ

arXiv:2408.13318 [pdf, ps, other]

Earths within Reach: Evaluation of Strategies for Mitigating Solar Variability using 3.5 years of NEID Sun-as-a-Star Observations

Authors: Eric B. Ford, Chad F. Bender, Cullen H. Blake, Arvind F. Gupta, Shubham Kanodia, Andrea S. J. Lin, Sarah E. Logsdon, Jacob K. Luhn, Suvrath Mahadevan, Michael L. Palumbo III, Ryan C. Terrien, Jason T. Wright, Jinglin Zhao, Samuel Halverson, Emily Hunting, Paul Robertson, Arpita Roy, Gudmundur Stefansson

Abstract: We present the results of Sun-as-a-star observations by the NEID Solar Telescope at WIYN Observatory, spanning January 1, 2021 through June 30, 2024. We identify 117,060 observations which are unlikely to be significantly affected by weather, hardware or major calibration issues. We describe several high-level data products being made available to the community to aid in the interpretation and int… ▽ More We present the results of Sun-as-a-star observations by the NEID Solar Telescope at WIYN Observatory, spanning January 1, 2021 through June 30, 2024. We identify 117,060 observations which are unlikely to be significantly affected by weather, hardware or major calibration issues. We describe several high-level data products being made available to the community to aid in the interpretation and inter comparisons of NEID solar observations. Solar observations demonstrate excellent performance of NEID, including radial velocity (RV) accuracy and long-term stability of better than $\simeq 0.37$ m s$^{-1}$ over $\simeq 3.5$ years, even though NEID was not originally designed or optimized for daytime observations of the Sun. Currently, intrinsic stellar variability is the primary barrier to detecting Earth-analog planets for most nearby, Sun-like stars. We present a comparison of the effectiveness of several methods proposed to mitigate the effects of solar variability on the Sun's estimated RV. We find that the Scalpels algorithm performs particularly well and substantially reduces the RMS RV of solar spectra from over 2 m s$^{-1}$ to 0.277 m s$^{-1}$. Even when training on a subset of days with NEID solar observations and testing on a held-out sample, the RMS of cleaned RV is 0.34-0.42 m s$^{-1}$. This is significantly better than previous attempts at removing solar variability and suggests that the current generation of EPRV instruments are technically capable of detecting Earth-mass planets orbiting a solar twin if provided with sufficient observing time allocations ($\sim 10^3$ nights of observations). △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: 25 pages, 14 figures. Submitted to AAS Journals. Data release archived at https://zenodo.org/doi/10.5281/zenodo.13363761

arXiv:2408.02873 [pdf, other]

Utilizing Photometry from Multiple Sources to Mitigate Stellar Variability in Precise Radial Velocities: A Case Study of Kepler-21

Authors: Corey Beard, Paul Robertson, Mark R. Giovinazzi, Joseph M. Akana Murphy, Eric B. Ford, Samuel Halverson, Te Han, Rae Holcomb, Jack Lubin, Rafael Luque, Pranav Premnath, Chad F. Bender, Cullen H. Blake, Qian Gong, Howard Isaacson, Shubham Kanodia, Dan Li, Andrea S. J. Lin, 5 Sarah E. Logsdon, Emily Lubar, Michael W. McElwain, Andrew Monson, Joe P. Ninan, Jayadev Rajagopal, Arpita Roy , et al. (4 additional authors not shown)

Abstract: We present a new analysis of Kepler-21, the brightest (V = 8.5) Kepler system with a known transiting exoplanet, Kepler-21 b. Kepler-21 b is a radius valley planet ($R = 1.6\pm 0.2 R_{\oplus}$) with an Earth-like composition (8.38$\pm$1.62 g/cc), though its mass and radius fall in the regime of possible "water worlds." We utilize new Keck/HIRES and WIYN/NEID radial velocity (RV) data in conjunctio… ▽ More We present a new analysis of Kepler-21, the brightest (V = 8.5) Kepler system with a known transiting exoplanet, Kepler-21 b. Kepler-21 b is a radius valley planet ($R = 1.6\pm 0.2 R_{\oplus}$) with an Earth-like composition (8.38$\pm$1.62 g/cc), though its mass and radius fall in the regime of possible "water worlds." We utilize new Keck/HIRES and WIYN/NEID radial velocity (RV) data in conjunction with Kepler and TESS photometry to perform a detailed study of activity mitigation between photometry and RVs. We additionally refine the system parameters, and we utilize Gaia astrometry to place constraints on a long-term RV trend. Our activity analysis affirms the quality of Kepler photometry for removing correlated noise from RVs, despite its temporal distance, though we reveal some cases where TESS may be superior. Using refined orbital parameters and updated composition curves, we rule out a ``water world" scenario for Kepler-21 b, and we identify a long period super-Jupiter planetary candidate, Kepler-21 (c). △ Less

Submitted 5 August, 2024; originally announced August 2024.

arXiv:2407.21075 [pdf, other]

Apple Intelligence Foundation Language Models

Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek , et al. (130 additional authors not shown)

Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used… ▽ More We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development. △ Less

Submitted 29 July, 2024; originally announced July 2024.

arXiv:2407.17565 [pdf, other]

Periodicity significance testing with null-signal templates: reassessment of PTF's SMBH binary candidates

Authors: Jakob Robnik, Adrian E. Bayer, Maria Charisi, Zoltán Haiman, Allison Lin, Uroš Seljak

Abstract: Periodograms are widely employed for identifying periodicity in time series data, yet they often struggle to accurately quantify the statistical significance of detected periodic signals when the data complexity precludes reliable simulations. We develop a data-driven approach to address this challenge by introducing a null-signal template (NST). The NST is created by carefully randomizing the per… ▽ More Periodograms are widely employed for identifying periodicity in time series data, yet they often struggle to accurately quantify the statistical significance of detected periodic signals when the data complexity precludes reliable simulations. We develop a data-driven approach to address this challenge by introducing a null-signal template (NST). The NST is created by carefully randomizing the period of each cycle in the periodogram template, rendering it non-periodic. It has the same frequentist properties as a periodic signal template regardless of the noise probability distribution, and we show with simulations that the distribution of false positives is the same as with the original periodic template, regardless of the underlying data. Thus, performing a periodicity search with the NST acts as an effective simulation of the null (no-signal) hypothesis, without having to simulate the noise properties of the data. We apply the NST method to the supermassive black hole binaries (SMBHB) search in the Palomar Transient Factory (PTF), where Charisi et al. had previously proposed 33 high signal to (white) noise candidates utilizing simulations to quantify their significance. Our approach reveals that these simulations do not capture the complexity of the real data. There are no statistically significant periodic signal detections above the non-periodic background. To improve the search sensitivity we introduce a Gaussian quadrature based algorithm for the Bayes Factor with correlated noise as a test statistic, in contrast to the standard signal to white noise. We show with simulations that this improves sensitivity to true signals by more than an order of magnitude. However, using the Bayes Factor approach also results in no statistically significant detections in the PTF data. △ Less

Submitted 24 July, 2024; originally announced July 2024.

Comments: 13 pages, 12 figures

arXiv:2407.08617 [pdf, other]

Quantum-Train Long Short-Term Memory: Application on Flood Prediction Problem

Authors: Chu-Hsuan Abraham Lin, Chen-Yu Liu, Kuan-Cheng Chen

Abstract: Flood prediction is a critical challenge in the context of climate change, with significant implications for ecosystem preservation, human safety, and infrastructure protection. In this study, we tackle this problem by applying the Quantum-Train (QT) technique to a forecasting Long Short-Term Memory (LSTM) model trained by Quantum Machine Learning (QML) with significant parameter reduction. The QT… ▽ More Flood prediction is a critical challenge in the context of climate change, with significant implications for ecosystem preservation, human safety, and infrastructure protection. In this study, we tackle this problem by applying the Quantum-Train (QT) technique to a forecasting Long Short-Term Memory (LSTM) model trained by Quantum Machine Learning (QML) with significant parameter reduction. The QT technique, originally successful in the A Matter of Taste challenge at QHack 2024, leverages QML to reduce the number of trainable parameters to a polylogarithmic function of the number of parameters in a classical neural network (NN). This innovative framework maps classical NN weights to a Hilbert space, altering quantum state probability distributions to adjust NN parameters. Our approach directly processes classical data without the need for quantum embedding and operates independently of quantum computing resources post-training, making it highly practical and accessible for real-world flood prediction applications. This model aims to improve the efficiency of flood forecasts, ultimately contributing to better disaster preparedness and response. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 6 pages, 4 figures

arXiv:2407.06766 [pdf, other]

Relational Perspective on Graph Query Languages

Authors: Diego Figueira, Anthony W. Lin, Liat Peterfreund

Abstract: We study a relational perspective of graph database querying. Such a perspective underlies various graph database systems but very few theoretical investigations have been conducted on it. This perspective offers a powerful and unified framework to study graph database querying, by which algorithms and complexity follow from classical results. We provide two concrete applications. The first is q… ▽ More We study a relational perspective of graph database querying. Such a perspective underlies various graph database systems but very few theoretical investigations have been conducted on it. This perspective offers a powerful and unified framework to study graph database querying, by which algorithms and complexity follow from classical results. We provide two concrete applications. The first is querying property graphs. The property graph data model supersedes previously proposed graph models and underlies the new standard GQL for graph query languages. We show that this standard can be, by and large, expressed by extensions of relational calculus with transitive closure operators (FO[TC]) and existential second-order quantifiers (ESO). With this, we obtain optimal data complexity bounds, along with extensions including schema validation. The second application is incorporating data from concrete domains (e.g., numbers) in graph database querying. We use embedded finite model theory and, by exploiting a generic Restricted Quantifier Collapse (RQC) result for FO[TC] and ESO, we obtain optimal data complexity bounds for GQL with arithmetics and comparisons. Moreover, we show that Regular Data Path Querying with operations on data (i.e. using register automata formalisms) can be captured in FO[TC] over embedded finite graphs while preserving nondeterministic logspace data complexity. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.06103 [pdf, other]

QTRL: Toward Practical Quantum Reinforcement Learning via Quantum-Train

Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Chao-Han Huck Yang, Kuan-Cheng Chen, Min-Hsiu Hsieh

Abstract: Quantum reinforcement learning utilizes quantum layers to process information within a machine learning model. However, both pure and hybrid quantum reinforcement learning face challenges such as data encoding and the use of quantum computers during the inference stage. We apply the Quantum-Train method to reinforcement learning tasks, called QTRL, training the classical policy network model using… ▽ More Quantum reinforcement learning utilizes quantum layers to process information within a machine learning model. However, both pure and hybrid quantum reinforcement learning face challenges such as data encoding and the use of quantum computers during the inference stage. We apply the Quantum-Train method to reinforcement learning tasks, called QTRL, training the classical policy network model using a quantum machine learning model with polylogarithmic parameter reduction. This QTRL approach eliminates the data encoding issues of conventional quantum machine learning and reduces the training parameters of the corresponding classical policy network. Most importantly, the training result of the QTRL is a classical model, meaning the inference stage only requires classical computer. This is extremely practical and cost-efficient for reinforcement learning tasks, where low-latency feedback from the policy model is essential. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 6 pages, 1 figure

arXiv:2406.17871 [pdf, other]

Revisiting the Expressiveness Landscape of Data Graph Queries

Authors: Michael Benedikt, Anthony Widjaja Lin, Di-De Yen

Abstract: The study of graph queries in database theory has spanned more than three decades, resulting in a multitude of proposals for graph query languages. These languages differ in the mechanisms. We can identify three main families of languages, with the canonical representatives being: (1) regular path queries, (2) walk logic, and (3) first-order logic with transitive closure operators. This paper prov… ▽ More The study of graph queries in database theory has spanned more than three decades, resulting in a multitude of proposals for graph query languages. These languages differ in the mechanisms. We can identify three main families of languages, with the canonical representatives being: (1) regular path queries, (2) walk logic, and (3) first-order logic with transitive closure operators. This paper provides a complete picture of the expressive power of these languages in the context of data graphs. Specifically, we consider a graph data model that supports querying over both data and topology. For example, "Does there exist a path between two different persons in a social network with the same last name?". We also show that an extension of (1), augmented with transitive closure operators, can unify the expressivity of (1)--(3) without increasing the query evaluation complexity. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.16942 [pdf, other]

Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE… ▽ More Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RETFound and UIOS, and got further improvement with thresholding strategy to 98.44%. In the external test sets obtained from other OCT devices, FMUE achieved an accuracy of 88.75% and 92.73% before and after thresholding. Our model is superior to two ophthalmologists with a higher F1 score (95.17% vs. 61.93% &71.72%). Besides, our model correctly predicts high uncertainty scores for samples with ambiguous features, of non-target-category diseases, or with low-quality to prompt manual checks and prevent misdiagnosis. FMUE provides a trustworthy method for automatic retinal anomalies detection in the real-world clinical open set environment. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

arXiv:2406.09317 [pdf, other]

Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources, encompassing a diverse range of diseases across multiple ethnicities and countries. RetiZero exhibits superior performance in several downstream tasks, including zero-shot disease recognition, image-to-image retrieval, and internal- and cross-domain disease identification. In zero-shot scenarios, RetiZero achieves Top5 accuracy scores of 0.8430 for 15 fundus diseases and 0.7561 for 52 fundus diseases. For image retrieval, it achieves Top5 scores of 0.9500 and 0.8860 for the same disease sets, respectively. Clinical evaluations show that RetiZero's Top3 zero-shot performance surpasses the average of 19 ophthalmologists from Singapore, China and the United States. Furthermore, RetiZero significantly enhances clinicians' accuracy in diagnosing fundus disease. These findings underscore the value of integrating the RetiZero foundation model into clinical settings, where a variety of fundus diseases are encountered. △ Less

Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.06038 [pdf, other]

Navigation and 3D Surface Reconstruction from Passive Whisker Sensing

Authors: Michael A. Lin, Hao Li, Chengyi Xing, Mark R. Cutkosky

Abstract: Whiskers provide a way to sense surfaces in the immediate environment without disturbing it. In this paper we present a method for using highly flexible, curved, passive whiskers mounted along a robot arm to gather sensory data as they brush past objects during normal robot motion. The information is useful both for guiding the robot in cluttered spaces and for reconstructing the exposed faces of… ▽ More Whiskers provide a way to sense surfaces in the immediate environment without disturbing it. In this paper we present a method for using highly flexible, curved, passive whiskers mounted along a robot arm to gather sensory data as they brush past objects during normal robot motion. The information is useful both for guiding the robot in cluttered spaces and for reconstructing the exposed faces of objects. Surface reconstruction depends on accurate localization of contact points along each whisker. We present an algorithm based on Bayesian filtering that rapidly converges to within 1\,mm of the actual contact locations. The piecewise-continuous history of contact locations from each whisker allows for accurate reconstruction of curves on object surfaces. Employing multiple whiskers and traces, we are able to produce an occupancy map of proximal objects. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: arXiv admin note: text overlap with arXiv:2210.12387

arXiv:2406.02778 [pdf, other]

MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence… ▽ More Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence between the embedding space and the input feature space which aids in deriving feature importance of the original features. We theoretically justify our approach and demonstrate that, in Paley-Wiener spaces on combinatorial graphs, the spectral graph wavelets operator offers greater flexibility and better control over smoothness properties compared to the Laplacian operator. We validate the effectiveness of our proposed graph embedding on a variety of public datasets through a range of downstream tasks, including clustering and unsupervised feature importance. △ Less

Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

arXiv:2405.18530 [pdf]

doi 10.18429/JACoW-IPAC2024-THYN1

First results of AUP Nb3Sn quadrupole horizontal tests

Authors: M. Baldini, G. Ambrosio, G. Apollinari, J. Blowers, R. Bossert, R. Carcagno, G. Chlachidze, J. DiMarco, S. Feher, S. Krave, V. Lombardo, L. Martin, C. Narug, T. H. Nicol, V. Nikolic, A. Nobrega, V. Marinozzi, C. Orozco, T. Page, S. Stoynev, T. Strauss, M. Turenne, D. Turrioni, A. Vouris, M. Yu , et al. (26 additional authors not shown)

Abstract: The Large Hadron Collider will soon undergo an upgrade to increase its luminosity by a factor of ~10 [1]. A crucial part of this upgrade will be replacement of the NbTi focusing magnets with Nb3Sn magnets that achieve a ~50% increase in the field strength. This will be the first ever large-scale implementation of Nb3Sn magnets in a particle accelerator. The High-Luminosity LHC Upgrade, HL-LHC is a… ▽ More The Large Hadron Collider will soon undergo an upgrade to increase its luminosity by a factor of ~10 [1]. A crucial part of this upgrade will be replacement of the NbTi focusing magnets with Nb3Sn magnets that achieve a ~50% increase in the field strength. This will be the first ever large-scale implementation of Nb3Sn magnets in a particle accelerator. The High-Luminosity LHC Upgrade, HL-LHC is a CERN project with a world-wide collaboration. It is under construction and utilizes Nb3Sn Magnets (named MQXF) as key ingredients to increase tenfold the integrated luminosity delivered to the CMS and ATLAS experiments in the next decade. The HL-LHC AUP is the US effort to contribute approximately 50% of the low-beta focusing magnets and crab cavities for the HL-LHC. This paper will present the program to fabricate the Nb3Sn superconducting magnets. We are reporting the status of the HL-LHC AUP project present the results from horizontal tests of the first fully assembled cryo-assembly. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: IPAC'24 - 15th International Particle Accelerator Conference

Report number: FERMILAB-CONF-24-0273-TD

Journal ref: JACoW IPAC2024 (2024) THYN1

arXiv:2405.18457 [pdf, other]

Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across so… ▽ More Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across solvers: (i) a pathwise gradient estimator, which reduces the required number of solver iterations and amortises the computational cost of making predictions, (ii) warm starting linear system solvers with the solution from the previous step, which leads to faster solver convergence at the cost of negligible bias, (iii) early stopping linear system solvers after a limited computational budget, which synergises with warm starting, allowing solver progress to accumulate over multiple marginal likelihood steps. These techniques provide speed-ups of up to $72\times$ when solving to tolerance, and decrease the average residual norm by up to $7\times$ when stopping early. △ Less

Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

arXiv:2405.18328 [pdf, other]

Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between co… ▽ More Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between compute time and accuracy of a solution. We introduce a three-level hierarchy of marginal likelihood optimisation for iterative Gaussian processes, and identify that the computational costs are dominated by solving sequential batches of large positive-definite systems of linear equations. We then propose to amortise computations by reusing solutions of linear system solvers as initialisations in the next step, providing a $\textit{warm start}$. Finally, we discuss the necessary conditions and quantify the consequences of warm starts and demonstrate their effectiveness on regression tasks, where warm starts achieve the same results as the conventional procedure while providing up to a $16 \times$ average speed-up among datasets. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: Advances in Approximate Bayesian Inference 2024

arXiv:2405.16166 [pdf, other]

The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

Authors: Pascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche

Abstract: Formal language theory has recently been successfully employed to unravel the power of transformer encoders. This setting is primarily applicable in Natural Languange Processing (NLP), as a token embedding function (where a bounded number of tokens is admitted) is first applied before feeding the input to the transformer. On certain kinds of data (e.g. time series), we want our transformers to be… ▽ More Formal language theory has recently been successfully employed to unravel the power of transformer encoders. This setting is primarily applicable in Natural Languange Processing (NLP), as a token embedding function (where a bounded number of tokens is admitted) is first applied before feeding the input to the transformer. On certain kinds of data (e.g. time series), we want our transformers to be able to handle \emph{arbitrary} input sequences of numbers (or tuples thereof) without \emph{a priori} limiting the values of these numbers. In this paper, we initiate the study of the expressive power of transformer encoders on sequences of data (i.e. tuples of numbers). Our results indicate an increase in expressive power of hard attention transformers over data sequences, in stark contrast to the case of strings. In particular, we prove that Unique Hard Attention Transformers (UHAT) over inputs as data sequences no longer lie within the circuit complexity class $AC^0$ (even without positional encodings), unlike the case of string inputs, but are still within the complexity class $TC^0$ (even with positional encodings). Over strings, UHAT without positional encodings capture only regular languages. In contrast, we show that over data sequences UHAT can capture non-regular properties. Finally, we show that UHAT capture languages definable in an extension of linear temporal logic with unary numeric predicates and arithmetics. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.11304 [pdf, other]

Quantum-Train: Rethinking Hybrid Quantum-Classical Machine Learning in the Model Compression Perspective

Authors: Chen-Yu Liu, En-Jui Kuo, Chu-Hsuan Abraham Lin, Jason Gemsun Young, Yeong-Jar Chang, Min-Hsiu Hsieh, Hsi-Sheng Goan

Abstract: We introduces the Quantum-Train(QT) framework, a novel approach that integrates quantum computing with classical machine learning algorithms to address significant challenges in data encoding, model compression, and inference hardware requirements. Even with a slight decrease in accuracy, QT achieves remarkable results by employing a quantum neural network alongside a classical mapping model, whic… ▽ More We introduces the Quantum-Train(QT) framework, a novel approach that integrates quantum computing with classical machine learning algorithms to address significant challenges in data encoding, model compression, and inference hardware requirements. Even with a slight decrease in accuracy, QT achieves remarkable results by employing a quantum neural network alongside a classical mapping model, which significantly reduces the parameter count from $M$ to $O(\text{polylog} (M))$ during training. Our experiments demonstrate QT's effectiveness in classification tasks, offering insights into its potential to revolutionize machine learning by leveraging quantum computational advantages. This approach not only improves model efficiency but also reduces generalization errors, showcasing QT's potential across various machine learning applications. △ Less

Submitted 10 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

Comments: 12 pages, 6 figures

arXiv:2405.06945 [pdf, other]

Direct Learning of Mesh and Appearance via 3D Gaussian Splatting

Authors: Ancheng Lin, Jun Li

Abstract: Accurately reconstructing a 3D scene including explicit geometry information is both attractive and challenging. Geometry reconstruction can benefit from incorporating differentiable appearance models, such as Neural Radiance Fields and 3D Gaussian Splatting (3DGS). However, existing methods encounter efficiency issues due to indirect geometry learning and the paradigm of separately modeling geome… ▽ More Accurately reconstructing a 3D scene including explicit geometry information is both attractive and challenging. Geometry reconstruction can benefit from incorporating differentiable appearance models, such as Neural Radiance Fields and 3D Gaussian Splatting (3DGS). However, existing methods encounter efficiency issues due to indirect geometry learning and the paradigm of separately modeling geometry and surface appearance. In this work, we propose a learnable scene model that incorporates 3DGS with an explicit geometry representation, namely a mesh. Our model learns the mesh and appearance in an end-to-end manner, where we bind 3D Gaussians to the mesh faces and perform differentiable rendering of 3DGS to obtain photometric supervision. The model creates an effective information pathway to supervise the learning of both 3DGS and mesh. Experimental results demonstrate that the learned scene model not only achieves state-of-the-art efficiency and rendering quality but also supports manipulation using the explicit mesh. In addition, our model has a unique advantage in adapting to scene updates, thanks to the end-to-end learning of both mesh and appearance. △ Less

Submitted 26 September, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

arXiv:2404.08887 [pdf, other]

doi 10.1007/978-3-031-56069-9_6

Countering Mainstream Bias via End-to-End Adaptive Local Learning

Authors: Jinhao Pan, Ziwei Zhu, Jianling Wang, Allen Lin, James Caverlee

Abstract: Collaborative filtering (CF) based recommendations suffer from mainstream bias -- where mainstream users are favored over niche users, leading to poor recommendation quality for many long-tail users. In this paper, we identify two root causes of this mainstream bias: (i) discrepancy modeling, whereby CF algorithms focus on modeling mainstream users while neglecting niche users with unique preferen… ▽ More Collaborative filtering (CF) based recommendations suffer from mainstream bias -- where mainstream users are favored over niche users, leading to poor recommendation quality for many long-tail users. In this paper, we identify two root causes of this mainstream bias: (i) discrepancy modeling, whereby CF algorithms focus on modeling mainstream users while neglecting niche users with unique preferences; and (ii) unsynchronized learning, where niche users require more training epochs than mainstream users to reach peak performance. Targeting these causes, we propose a novel end-To-end Adaptive Local Learning (TALL) framework to provide high-quality recommendations to both mainstream and niche users. TALL uses a loss-driven Mixture-of-Experts module to adaptively ensemble experts to provide customized local models for different users. Further, it contains an adaptive weight module to synchronize the learning paces of different users by dynamically adjusting weights in the loss. Extensive experiments demonstrate the state-of-the-art performance of the proposed model. Code and data are provided at \url{https://github.com/JP-25/end-To-end-Adaptive-Local-Leanring-TALL-} △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: ECIR 2024

Journal ref: In European Conference on Information Retrieval 2024, vol 14612 (pp. 75-89)

arXiv:2403.19594 [pdf]

Reproducibility Made Easy: A Tool for Methodological Transparency and Efficient Standardized Reporting based on the proposed MRSinMRS Consensus

Authors: Antonia Susnjar, Antonia Kaiser, Dunja Simicic, Gianna Nossa, Alexander Lin, Georg Oeltzschner, Aaron Gudmundson

Abstract: A recent expert consensus found that non-standard reporting in MRS studies led to poor reproducibility. In order to address this, MRSinMRS guidelines were introduced; however, because of the disparate nomenclature and data formats, adoption has been slow. To get around this problem, REMY, a toolbox that supports major vendor formats, was created. By efficiently filling in important fields in the M… ▽ More A recent expert consensus found that non-standard reporting in MRS studies led to poor reproducibility. In order to address this, MRSinMRS guidelines were introduced; however, because of the disparate nomenclature and data formats, adoption has been slow. To get around this problem, REMY, a toolbox that supports major vendor formats, was created. By efficiently filling in important fields in the MRSinMRS table, it improves reproducibility. Even with certain hardware-related restrictions, REMY makes a substantial contribution to the completion of acquisition parameters, which facilitates reporting. Its compatibility and user-friendly interface should promote widespread adoption of MRSinMRS, raising the caliber of MRS research. △ Less

Submitted 6 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.19039 [pdf, other]

Expanding Density-Correlation Machine Learning Representations for Anisotropic Coarse-Grained Particles

Authors: Arthur Y. Lin, Kevin K. Huguenin-Dumittan, Yong-Cheol Cho, Jigyasa Nigam, Rose K. Cersonsky

Abstract: Physics-based, atom-centered machine learning (ML) representations have been instrumental to the effective integration of ML within the atomistic simulation community. Many of these representations build off the idea of atoms as having spherical, or isotropic, interactions. In many communities, there is often a need to represent groups of atoms, either to increase the computational efficiency of s… ▽ More Physics-based, atom-centered machine learning (ML) representations have been instrumental to the effective integration of ML within the atomistic simulation community. Many of these representations build off the idea of atoms as having spherical, or isotropic, interactions. In many communities, there is often a need to represent groups of atoms, either to increase the computational efficiency of simulation via coarse-graining or to understand molecular influences on system behavior. In such cases, atom-centered representations will have limited utility, as groups of atoms may not be well-approximated as spheres. In this work, we extend the popular Smooth Overlap of Atomic Positions (SOAP) ML representation for systems consisting of non-spherical anisotropic particles or clusters of atoms. We show the power of this anisotropic extension of SOAP, which we deem \AniSOAP, in accurately characterizing liquid crystal systems and predicting the energetics of Gay-Berne ellipsoids and coarse-grained benzene crystals. With our study of these prototypical anisotropic systems, we derive fundamental insights into how molecular shape influences mesoscale behavior and explain how to reincorporate important atom-atom interactions typically not captured by coarse-grained models. Moving forward, we propose \AniSOAP as a flexible, unified framework for coarse-graining in complex, multiscale simulation. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: The following article has been submitted to the Journal of Chemical Physics. After it is published, the updated version can be found through their website

arXiv:2402.16465 [pdf, other]

Training Classical Neural Networks by Quantum Machine Learning

Authors: Chen-Yu Liu, En-Jui Kuo, Chu-Hsuan Abraham Lin, Sean Chen, Jason Gemsun Young, Yeong-Jar Chang, Min-Hsiu Hsieh

Abstract: In recent years, advanced deep neural networks have required a large number of parameters for training. Therefore, finding a method to reduce the number of parameters has become crucial for achieving efficient training. This work proposes a training scheme for classical neural networks (NNs) that utilizes the exponentially large Hilbert space of a quantum system. By mapping a classical NN with… ▽ More In recent years, advanced deep neural networks have required a large number of parameters for training. Therefore, finding a method to reduce the number of parameters has become crucial for achieving efficient training. This work proposes a training scheme for classical neural networks (NNs) that utilizes the exponentially large Hilbert space of a quantum system. By mapping a classical NN with $M$ parameters to a quantum neural network (QNN) with $O(\text{polylog} (M))$ rotational gate angles, we can significantly reduce the number of parameters. These gate angles can be updated to train the classical NN. Unlike existing quantum machine learning (QML) methods, the results obtained from quantum computers using our approach can be directly used on classical computers. Numerical results on the MNIST and Iris datasets are presented to demonstrate the effectiveness of our approach. Additionally, we investigate the effects of deeper QNNs and the number of measurement shots for the QNN, followed by the theoretical perspective of the proposed method. This work opens a new branch of QML and offers a practical tool that can greatly enhance the influence of QML, as the trained QML results can benefit classical computing in our daily lives. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: 7 pages, 3 figures

arXiv:2402.14817 [pdf, other]

Cameras as Rays: Pose Estimation via Ray Diffusion

Authors: Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani

Abstract: Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera extrinsics, we propose a distributed representation of camera pose that treats a camera as a bundle of rays. This representation allows for a tight coupling with spatia… ▽ More Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera extrinsics, we propose a distributed representation of camera pose that treats a camera as a bundle of rays. This representation allows for a tight coupling with spatial image features improving pose precision. We observe that this representation is naturally suited for set-level transformers and develop a regression-based approach that maps image patches to corresponding rays. To capture the inherent uncertainties in sparse-view pose inference, we adapt this approach to learn a denoising diffusion model which allows us to sample plausible modes while improving performance. Our proposed methods, both regression- and diffusion-based, demonstrate state-of-the-art performance on camera pose estimation on CO3D while generalizing to unseen object categories and in-the-wild captures. △ Less

Submitted 4 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: In ICLR 2024 (oral). v2-3: updated references. Project webpage: https://jasonyzhang.com/RayDiffusion

arXiv:2402.09430 [pdf, other]

WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

Authors: Shuokang Huang, Kaihan Li, Di You, Yichong Chen, Arvin Lin, Siying Liu, Xiaohui Li, Julie A. McCann

Abstract: WiFi-based human sensing has exhibited remarkable potential to analyze user behaviors in a non-intrusive and device-free manner, benefiting applications as diverse as smart homes and healthcare. However, most previous works focus on single-user sensing, which has limited practicability in scenarios involving multiple users. Although recent studies have begun to investigate WiFi-based multi-user se… ▽ More WiFi-based human sensing has exhibited remarkable potential to analyze user behaviors in a non-intrusive and device-free manner, benefiting applications as diverse as smart homes and healthcare. However, most previous works focus on single-user sensing, which has limited practicability in scenarios involving multiple users. Although recent studies have begun to investigate WiFi-based multi-user sensing, there remains a lack of benchmark datasets to facilitate reproducible and comparable research. To bridge this gap, we present WiMANS, to our knowledge, the first dataset for multi-user sensing based on WiFi. WiMANS contains over 9.4 hours of dual-band WiFi Channel State Information (CSI), as well as synchronized videos, monitoring simultaneous activities of multiple users. We exploit WiMANS to benchmark the performance of state-of-the-art WiFi-based human sensing models and video-based models, posing new challenges and opportunities for future work. We believe WiMANS can push the boundaries of current studies and catalyze the research on WiFi-based multi-user sensing. △ Less

Submitted 12 March, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

Comments: We present WiMANS, to our knowledge, the first dataset for multi-user activity sensing based on WiFi

arXiv:2402.04946 [pdf, other]

Searching for Giant Exoplanets around M-dwarf Stars (GEMS) I: Survey Motivation

Authors: Shubham Kanodia, Caleb I. Cañas, Suvrath Mahadevan, Eric B. Ford, Ravit Helled, Dana E. Anderson, Alan Boss, William D. Cochran, Megan Delamer, Te Han, Jessica E. Libby-Roberts, Andrea S. J. Lin, Simon Müller, Paul Robertson, Guðmundur Stefánsson, Johanna Teske

Abstract: Recent discoveries of transiting giant exoplanets around M-dwarf stars (GEMS), aided by the all-sky coverage of TESS, are starting to stretch theories of planet formation through the core-accretion scenario. Recent upper limits on their occurrence suggest that they decrease with lower stellar masses, with fewer GEMS around lower-mass stars compared to solar-type. In this paper, we discuss existing… ▽ More Recent discoveries of transiting giant exoplanets around M-dwarf stars (GEMS), aided by the all-sky coverage of TESS, are starting to stretch theories of planet formation through the core-accretion scenario. Recent upper limits on their occurrence suggest that they decrease with lower stellar masses, with fewer GEMS around lower-mass stars compared to solar-type. In this paper, we discuss existing GEMS both through confirmed planets, as well as protoplanetary disk observations, and a combination of tests to reconcile these with theoretical predictions. We then introduce the \textit{Searching for GEMS} survey, where we utilize multi-dimensional nonparameteric statistics to simulate hypothetical survey scenarios to predict the required sample size of transiting GEMS with mass measurements to robustly compare their bulk-density with canonical hot-Jupiters orbiting FGK stars. Our Monte-Carlo simulations predict that a robust comparison requires about 40 transiting GEMS (compared to the existing sample of $\sim$ 15) with 5-$σ$ mass measurements. Furthermore, we discuss the limitations of existing occurrence estimates for GEMS, and provide a brief description of our planned systematic search to improve the occurrence rate estimates for GEMS. △ Less

Submitted 7 February, 2024; originally announced February 2024.

Comments: 16 pages + references, including 7 figures. Accepted in AAS Journals

arXiv:2402.03665 [pdf, ps, other]

Multi-color Wavefront Sensor using Talbot effect for High-order Harmonic Generation

Authors: Yang Du, Kui Li, Jin Niu, Angyi Lin, Jie Li, Zhongwei Fan, Guorong Wu, Xiaoshi Zhang, Fucai Zhang

Abstract: We present a novel method for multi-color wavefront measurement of high-order harmonic generation beams using the Talbot effect, validated both theoretically and experimentally for the first time. Each harmonic maintains a unique wavefront and produces an independent set of self-images along the optical axis.We achieved the wavefronts reconstruction of three harmonics in a single measurement scan,… ▽ More We present a novel method for multi-color wavefront measurement of high-order harmonic generation beams using the Talbot effect, validated both theoretically and experimentally for the first time. Each harmonic maintains a unique wavefront and produces an independent set of self-images along the optical axis.We achieved the wavefronts reconstruction of three harmonics in a single measurement scan, expanding the spectrally-resolved capability of the conventional Talbot effect wavefront sensor. This breakthrough introduces a novel tool for studying the multi-color wavefront in high-order harmonic generation, unlocking the potential to investigate spatiotemporal ultrafast nonlinear dynamics in attosecond pulse formation on a shot-by-shot basis. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.01695 [pdf, other]

Language-Guided World Models: A Model-Based Approach to AI Control

Authors: Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

Abstract: This paper introduces the concept of Language-Guided World Models (LWMs) -- probabilistic models that can simulate environments by reading texts. Agents equipped with these models provide humans with more extensive and efficient control, allowing them to simultaneously alter agent behaviors in multiple tasks via natural verbal communication. In this work, we take initial steps in developing robust… ▽ More This paper introduces the concept of Language-Guided World Models (LWMs) -- probabilistic models that can simulate environments by reading texts. Agents equipped with these models provide humans with more extensive and efficient control, allowing them to simultaneously alter agent behaviors in multiple tasks via natural verbal communication. In this work, we take initial steps in developing robust LWMs that can generalize to compositionally novel language descriptions. We design a challenging world modeling benchmark based on the game of MESSENGER (Hanjie et al., 2021), featuring evaluation settings that require varying degrees of compositional generalization. Our experiments reveal the lack of generalizability of the state-of-the-art Transformer model, as it offers marginal improvements in simulation quality over a no-text baseline. We devise a more robust model by fusing the Transformer with the EMMA attention mechanism (Hanjie et al., 2021). Our model substantially outperforms the Transformer and approaches the performance of a model with an oracle semantic parsing and grounding capability. To demonstrate the practicality of this model in improving AI safety and transparency, we simulate a scenario in which the model enables an agent to present plans to a human before execution, and to revise plans based on their language feedback. △ Less

Submitted 4 September, 2024; v1 submitted 23 January, 2024; originally announced February 2024.

Comments: SpLU-RoboNLP workshop at ACL 2024

arXiv:2401.02618 [pdf, ps, other]

doi 10.1145/3632864

Regular Abstractions for Array Systems

Authors: Chih-Duo Hong, Anthony W. Lin

Abstract: Verifying safety and liveness over array systems is a highly challenging problem. Array systems naturally capture parameterized systems such as distributed protocols with an unbounded number of processes. Such distributed protocols often exploit process IDs during their computation, resulting in array systems whose element values range over an infinite domain. In this paper, we develop a novel fra… ▽ More Verifying safety and liveness over array systems is a highly challenging problem. Array systems naturally capture parameterized systems such as distributed protocols with an unbounded number of processes. Such distributed protocols often exploit process IDs during their computation, resulting in array systems whose element values range over an infinite domain. In this paper, we develop a novel framework for proving safety and liveness over array systems. The crux of the framework is to overapproximate an array system as a string rewriting system (i.e. over a finite alphabet) by means of a new predicate abstraction that exploits the so-called indexed predicates. This allows us to tap into powerful verification methods for string rewriting systems that have been heavily developed in the last few decades (e.g. regular model checking). We demonstrate how our method yields simple, automatically verifiable proofs of safety and liveness properties for challenging examples, including Dijkstra's self-stabilizing protocol and the Chang-Roberts leader election protocol. △ Less

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2312.10074 [pdf]

STAGER checklist: Standardized Testing and Assessment Guidelines for Evaluating Generative AI Reliability

Authors: Jinghong Chen, Lingxuan Zhu, Weiming Mou, Zaoqu Liu, Quan Cheng, Anqi Lin, Jian Zhang, Peng Luo

Abstract: Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological… ▽ More Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological design, standardized guidelines for their evaluation are also currently lacking. In response, our objective is to devise standardized assessment guidelines tailored for evaluating the performance of generative AI systems in medical contexts. To this end, we conducted a thorough literature review using the PubMed and Google Scholar databases, focusing on research that tests generative AI capabilities in medicine. Our multidisciplinary team, comprising experts in life sciences, clinical medicine, medical engineering, and generative AI users, conducted several discussion sessions and developed a checklist of 23 items. The checklist is designed to encompass the critical evaluation aspects of generative AI in medical applications comprehensively. This checklist, and the broader assessment framework it anchors, address several key dimensions, including question collection, querying methodologies, and assessment techniques. We aim to provide a holistic evaluation of AI systems. The checklist delineates a clear pathway from question gathering to result assessment, offering researchers guidance through potential challenges and pitfalls. Our framework furnishes a standardized, systematic approach for research involving the testing of generative AI's applicability in medicine. It enhances the quality of research reporting and aids in the evolution of generative AI in medicine and life sciences. △ Less

Submitted 7 December, 2023; originally announced December 2023.

Comments: 11 pages, 0 figure, 2 tables

arXiv:2312.08604 [pdf, other]

Verification of Neural Reachable Tubes via Scenario Optimization and Conformal Prediction

Authors: Albert Lin, Somil Bansal

Abstract: Learning-based approaches for controlling safety-critical systems are rapidly growing in popularity; thus, it is important to assure their performance and safety. Hamilton-Jacobi (HJ) reachability analysis is a popular formal verification tool for providing such guarantees, since it can handle general nonlinear system dynamics, bounded adversarial system disturbances, and state and input constrain… ▽ More Learning-based approaches for controlling safety-critical systems are rapidly growing in popularity; thus, it is important to assure their performance and safety. Hamilton-Jacobi (HJ) reachability analysis is a popular formal verification tool for providing such guarantees, since it can handle general nonlinear system dynamics, bounded adversarial system disturbances, and state and input constraints. However, its computational and memory complexity scales exponentially with the state dimension, making it intractable for large-scale systems. To overcome this challenge, neural approaches, such as DeepReach, have been used to synthesize reachable tubes and safety controllers for high-dimensional systems. However, verifying these neural reachable tubes remains challenging. In this work, we propose two verification methods, based on robust scenario optimization and conformal prediction, to provide probabilistic safety guarantees for neural reachable tubes. Our methods allow a direct trade-off between resilience to outlier errors in the neural tube, which are inevitable in a learning-based approach, and the strength of the probabilistic safety guarantee. Furthermore, we show that split conformal prediction, a widely used method in the machine learning community for uncertainty quantification, reduces to a scenario-based approach, making the two methods equivalent not only for verification of neural reachable tubes but also more generally. To our knowledge, our proof is the first in the literature to show a strong relationship between conformal prediction and scenario optimization. Finally, we propose an outlier-adjusted verification approach that uses the error distribution in neural reachable tubes to recover greater safe volumes. We demonstrate the efficacy of the proposed approaches for the high-dimensional problems of multi-vehicle collision avoidance and rocket landing with no-go zones. △ Less

Submitted 9 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: Accepted to 6th Annual Learning for Dynamics & Control Conference. arXiv admin note: text overlap with arXiv:2209.12336

arXiv:2311.17037 [pdf, other]

Concurrent Stochastic Lossy Channel Games

Authors: Daniel Stan, Muhammad Najib, Anthony Widjaja Lin, Parosh Aziz Abdulla

Abstract: Concurrent stochastic games are an important formalism for the rational verification of probabilistic multi-agent systems, which involves verifying whether a temporal logic property is satisfied in some or all game-theoretic equilibria of such systems. In this work, we study the rational verification of probabilistic multi-agent systems where agents can cooperate by communicating over unbounded lo… ▽ More Concurrent stochastic games are an important formalism for the rational verification of probabilistic multi-agent systems, which involves verifying whether a temporal logic property is satisfied in some or all game-theoretic equilibria of such systems. In this work, we study the rational verification of probabilistic multi-agent systems where agents can cooperate by communicating over unbounded lossy channels. To model such systems, we present concurrent stochastic lossy channel games (CSLCG) and employ an equilibrium concept from cooperative game theory known as the core, which is the most fundamental and widely studied cooperative equilibrium concept. Our main contribution is twofold. First, we show that the rational verification problem is undecidable for systems whose agents have almost-sure LTL objectives. Second, we provide a decidable fragment of such a class of objectives that subsumes almost-sure reachability and safety. Our techniques involve reductions to solving infinite-state zero-sum games with conjunctions of qualitative objectives. To the best of our knowledge, our result represents the first decidability result on the rational verification of stochastic multi-agent systems on infinite arenas. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: To appear at CSL 2024. Extended version

arXiv:2311.16811 [pdf, other]

Four errors students make with inverse-square law vectors

Authors: Colin S. Wallace, Liam Jones, Alex Lin

Abstract: In this paper, we discuss four errors introductory physics students make when attempting to add two inverse-square law vectors. We observe multiple instances in which students 1) add vectors as if they were scalars, 2) project the $r$ (or $r^2$) in the denominator, instead of the entire vector, when attempting to find the vector's components, 3) incorrectly apply the Pythagorean theorem when attem… ▽ More In this paper, we discuss four errors introductory physics students make when attempting to add two inverse-square law vectors. We observe multiple instances in which students 1) add vectors as if they were scalars, 2) project the $r$ (or $r^2$) in the denominator, instead of the entire vector, when attempting to find the vector's components, 3) incorrectly apply the Pythagorean theorem when attempting to calculate the magnitude of the resultant vector, and 4) incorrectly relate the signs of the components of an electric field (or force) to the signs of the electric charges. While these are not the only errors students make, they are the most frequently occurring based on our analysis of 678 exams taken by students in either introductory mechanics or electricity and magnetism (E&M). We then show how these errors can be encoded into a new type of activity or assessment question which we call a ``student error task." Introductory physics instructors can use the student error task in this paper as a way to engage or assess their students' understandings of how to add two inverse-square law vectors. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 22 pages, 7 figures, submitted to the European Journal of Physics

arXiv:2311.16237 [pdf, other]

TOI-1670 c, a 40-day Orbital Period Warm Jupiter in a Compact System, is Well-aligned

Authors: Jack Lubin, Xian-Yu Wang, Malena Rice, Jiayin Dong, Songhu Wang, Brandon T. Radzom, Paul Robertson, Gudmundur Stefansson, Jaime A. Alvarado-Montes, Corey Beard, Chad F. Bender, Arvind F. Gupta, Samuel Halverson, Shubham Kanodia, Dan Li, Andrea S. J. Lin, Sarah E. Logsdon, Emily Lubar, Suvrath Mahadevan, Joe P. Ninan, Jayadev Rajagopal, Aripta Roy, Christian Schwab, Jason T. Wright

Abstract: We report the measurement of the sky-projected obliquity angle $λ$ of the Warm Jovian exoplanet TOI-1670 c via the Rossiter-McLaughlin effect as part of the Stellar Obliquities in Long-period Exoplanet Systems (SOLES) project. We observed the transit window during UT 20 April 2023 for 7 continuous hours with NEID on the 3.5 m WIYN Telescope at Kitt Peak National Observatory. TOI-1670 hosts a sub-N… ▽ More We report the measurement of the sky-projected obliquity angle $λ$ of the Warm Jovian exoplanet TOI-1670 c via the Rossiter-McLaughlin effect as part of the Stellar Obliquities in Long-period Exoplanet Systems (SOLES) project. We observed the transit window during UT 20 April 2023 for 7 continuous hours with NEID on the 3.5 m WIYN Telescope at Kitt Peak National Observatory. TOI-1670 hosts a sub-Neptune (P ~11 days; planet b) interior to the Warm Jovian (P ~40 days; planet c), which presents an opportunity to investigate the dynamics of a Warm Jupiter with an inner companion. Additionally, TOI-1670 c is now among the longest-period planets to date to have its sky-projected obliquity angle measured. We find planet c is well-aligned to the host star, with $λ$ = -0.3 +/- 2.2 degrees. TOI-1670 c joins a growing census of aligned Warm Jupiters around single stars and aligned planets in multi-planet systems. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: 11 pages, 2 figures, 1 table. Accepted to ApJ Letters

arXiv:2311.15883 [pdf, other]

Characterising and Verifying the Core in Concurrent Multi-Player Mean-Payoff Games (Full Version)

Authors: Julian Gutierrez, Anthony W. Lin, Muhammad Najib, Thomas Steeples, Michael Wooldridge

Abstract: Concurrent multi-player mean-payoff games are important models for systems of agents with individual, non-dichotomous preferences. Whilst these games have been extensively studied in terms of their equilibria in non-cooperative settings, this paper explores an alternative solution concept: the core from cooperative game theory. This concept is particularly relevant for cooperative AI systems, as i… ▽ More Concurrent multi-player mean-payoff games are important models for systems of agents with individual, non-dichotomous preferences. Whilst these games have been extensively studied in terms of their equilibria in non-cooperative settings, this paper explores an alternative solution concept: the core from cooperative game theory. This concept is particularly relevant for cooperative AI systems, as it enables the modelling of cooperation among agents, even when their goals are not fully aligned. Our contribution is twofold. First, we provide a characterisation of the core using discrete geometry techniques and establish a necessary and sufficient condition for its non-emptiness. We then use the characterisation to prove the existence of polynomial witnesses in the core. Second, we use the existence of such witnesses to solve key decision problems in rational verification and provide tight complexity bounds for the problem of checking whether some/every equilibrium in a game satisfies a given LTL or GR(1) specification. Our approach is general and can be adapted to handle other specifications expressed in various fragments of LTL without incurring additional computational costs. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: This is the full version of the paper with the same title that appears in the CSL'24 proceedings

arXiv:2311.04690 [pdf, other]

Learning Quantum Phase Estimation by Variational Quantum Circuits

Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Kuan-Cheng Chen

Abstract: Quantum Phase Estimation (QPE) stands as a pivotal quantum computing subroutine that necessitates an inverse Quantum Fourier Transform (QFT). However, it is imperative to recognize that enhancing the precision of the estimation inevitably results in a significantly deeper circuit. We developed a variational quantum circuit (VQC) approximation to reduce the depth of the QPE circuit, yielding enhanc… ▽ More Quantum Phase Estimation (QPE) stands as a pivotal quantum computing subroutine that necessitates an inverse Quantum Fourier Transform (QFT). However, it is imperative to recognize that enhancing the precision of the estimation inevitably results in a significantly deeper circuit. We developed a variational quantum circuit (VQC) approximation to reduce the depth of the QPE circuit, yielding enhanced performance in noisy simulations and real hardware. Our experiments demonstrated that the VQC outperformed both Noisy QPE and standard QPE on real hardware by reducing circuit noise. This VQC integration into quantum compilers as an intermediate step between input and transpiled circuits holds significant promise for quantum algorithms with deep circuits. Future research will explore its potential applicability across various quantum computing hardware architectures. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: 6 pages, 7 figures

arXiv:2311.04031 [pdf, other]

Ramsey Quantifiers in Linear Arithmetics

Authors: Pascal Bergsträßer, Moses Ganardi, Anthony W. Lin, Georg Zetzsche

Abstract: We study Satisfiability Modulo Theories (SMT) enriched with the so-called Ramsey quantifiers, which assert the existence of cliques (complete graphs) in the graph induced by some formulas. The extended framework is known to have applications in proving program termination (in particular, whether a transitive binary predicate is well-founded), and monadic decomposability of SMT formulas. Our main r… ▽ More We study Satisfiability Modulo Theories (SMT) enriched with the so-called Ramsey quantifiers, which assert the existence of cliques (complete graphs) in the graph induced by some formulas. The extended framework is known to have applications in proving program termination (in particular, whether a transitive binary predicate is well-founded), and monadic decomposability of SMT formulas. Our main result is a new algorithm for eliminating Ramsey quantifiers from three common SMT theories: Linear Integer Arithmetic (LIA), Linear Real Arithmetic (LRA), and Linear Integer Real Arithmetic (LIRA). In particular, if we work only with existentially quantified formulas, then our algorithm runs in polynomial time and produces a formula of linear size. One immediate consequence is that checking well-foundedness of a given formula in the aforementioned theory defining a transitive predicate can be straightforwardly handled by highly optimized SMT-solvers. We show also how this provides a uniform semi-algorithm for verifying termination and liveness with completeness guarantee (in fact, with an optimal computational complexity) for several well-known classes of infinite-state systems, which include succinct timed systems, one-counter systems, and monotonic counter systems. Another immediate consequence is a solution to an open problem on checking monadic decomposability of a given relation in quantifier-free fragments of LRA and LIRA, which is an important problem in automated reasoning and constraint databases. Our result immediately implies decidability of this problem with an optimal complexity (coNP-complete) and enables exploitation of SMT-solvers. It also provides a termination guarantee for the generic monadic decomposition algorithm of Veanes et al. for LIA, LRA, and LIRA. We report encouraging experimental results on a prototype implementation of our algorithms on micro-benchmarks. △ Less

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2311.03901 [pdf, ps, other]

Parikh's Theorem Made Symbolic

Authors: Matthew Hague, Artur Jeż, Anthony W. Lin

Abstract: Parikh's Theorem is a fundamental result in automata theory with numerous applications in computer science: software verification (e.g. infinite-state verification, string constraints, and theory of arrays), verification of cryptographic protocols (e.g. using Horn clauses modulo equational theories) and database querying (e.g. evaluating path-queries in graph databases). Parikh's Theorem states th… ▽ More Parikh's Theorem is a fundamental result in automata theory with numerous applications in computer science: software verification (e.g. infinite-state verification, string constraints, and theory of arrays), verification of cryptographic protocols (e.g. using Horn clauses modulo equational theories) and database querying (e.g. evaluating path-queries in graph databases). Parikh's Theorem states that the letter-counting abstraction of a language recognized by finite automata or context-free grammars is definable in Presburger Arithmetic. Unfortunately, real-world applications typically require large alphabets - which are well-known to be not amenable to explicit treatment of the alphabets. Symbolic automata have proven in the last decade to be an effective algorithmic framework for handling large finite or even infinite alphabets. A symbolic automaton employs an effective boolean algebra, which offers a symbolic representation of character sets and often lends itself to an exponentially more succinct representation of a language. Instead of letter-counting, Parikh's Theorem for symbolic automata amounts to counting the number of times different predicates are satisfied by an input sequence. Unfortunately, naively applying Parikh's Theorem from classical automata theory to symbolic automata yields existential Presburger formulas of exponential size. We provide a new construction for Parikh's Theorem for symbolic automata and grammars, which avoids this exponential blowup: our algorithm computes an existential formula in polynomial-time over (quantifier-free) Presburger and the base theory. In fact, our algorithm extends to the model of parametric symbolic grammars, which are one of the most expressive models of languages over infinite alphabets. We have implemented our algorithm and show it can be used to solve string constraints that are difficult to solve by existing solvers. △ Less

Submitted 31 July, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted tp POPL '24

arXiv:2310.20634 [pdf, other]

doi 10.3847/1538-3881/ad09c2

TOI-5344 b: A Saturn-like planet orbiting a super-Solar metallicity M0 dwarf

Authors: Te Han, Paul Robertson, Shubham Kanodia, Caleb Cañas, Andrea S. J. Lin, Guðmundur Stefánsson, Jessica E. Libby-Roberts, Alexander Larsen, Henry A. Kobulnicky, Suvrath Mahadevan, Chad F. Bender, William D. Cochran, Michael Endl, Mark E. Everett, Arvind F. Gupta, Samuel Halverson, Fred Hearty, Andrew Monson, Joe P. Ninan, Arpita Roy, Christian Schwab, Ryan C. Terrien

Abstract: We confirm the planetary nature of TOI-5344 b as a transiting giant exoplanet around an M0 dwarf star. TOI-5344 b was discovered with the Transiting Exoplanet Survey Satellite photometry and confirmed with ground-based photometry (the Red Buttes Observatory 0.6m telescope), radial velocity (the Habitable-zone Planet Finder), and speckle imaging (the NN-Explore Exoplanet Stellar Speckle Imager). TO… ▽ More We confirm the planetary nature of TOI-5344 b as a transiting giant exoplanet around an M0 dwarf star. TOI-5344 b was discovered with the Transiting Exoplanet Survey Satellite photometry and confirmed with ground-based photometry (the Red Buttes Observatory 0.6m telescope), radial velocity (the Habitable-zone Planet Finder), and speckle imaging (the NN-Explore Exoplanet Stellar Speckle Imager). TOI-5344 b is a Saturn-like giant planet ($ρ= 0.80^{+0.17}_{-0.15}\ \text{g cm}^{-3}$) with a planetary radius of $9.7 \pm \ 0.5 \ \text{R}_{\oplus}$ ($0.87 \pm \ 0.04 \ \text{R}_{\text{Jup}}$) and a planetary mass of $135^{+17}_{-18} \text{M}_{\oplus}$ ($0.42^{+0.05}_{-0.06} \ \text{M}_{\text{Jup}}$). It has an orbital period of $3.792622 \pm 0.000010$ days and an orbital eccentricity of $0.06^{+0.07}_{-0.04}$. We measure a high metallicity for TOI-5344 of [Fe/H] = $0.48 \pm 0.12$, where the high metallicity is consistent with expectations from formation through core accretion. We compare the metallicity of the M-dwarf hosts of giant exoplanets to that of M-dwarf hosts of non-giants ($\lesssim 8\ \text{R}_{\oplus}$). While the two populations appear to show different metallicity distributions, quantitative tests are prohibited by various sample caveats. △ Less

Submitted 7 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

Comments: 19 pages, 10 figures, 4 tables, AJ accepted. Added references

Journal ref: AJ 167 4 (2024)

Showing 1–50 of 248 results for author: Lin, A