-
OpenCUBE: Building an Open Source Cloud Blueprint with EPI Systems
Authors:
Ivy Peng,
Martin Schulz,
Utz-Uwe Haus,
Craig Prunty,
Pedro Marcuello,
Emanuele Danovaro,
Gabin Schieffer,
Jacob Wahlgren,
Daniel Medeiros,
Philipp Friese,
Stefano Markidis
Abstract:
OpenCUBE aims to develop an open-source full software stack for a Cloud computing blueprint deployed on EPI hardware, adaptable to emerging workloads across the computing continuum. OpenCUBE prioritizes energy awareness and utilizes open APIs, Open Source components, advanced SiPearl Rhea processors, and a RISC-V accelerator. The project leverages representative workloads, such as cloud-native workloads and workflows of weather forecast data management, molecular docking, and space weather, for evaluation and validation.
Submitted 14 October, 2024;
originally announced October 2024.
-
Free boundary minimal Möbius bands in toroids
Authors:
Mario B. Schulz
Abstract:
We prove that strictly mean convex toroids contain infinitely many (geometrically distinct) embedded free boundary minimal Möbius bands as well as infinitely many embedded free boundary minimal annuli. The surfaces in both families are constructed by means of equivariant variational methods and their areas grow linearly with the order of their symmetry groups.
Submitted 8 October, 2024;
originally announced October 2024.
-
Comparison of Atom Detection Algorithms for Neutral Atom Quantum Computing
Authors:
Jonas Winklmann,
Andrea Alberti,
Martin Schulz
Abstract:
In neutral atom quantum computers, readout and preparation of the atomic qubits are usually based on fluorescence imaging and subsequent analysis of the acquired image. For each atom site, the brightness or some comparable metric is estimated and used to predict the presence or absence of an atom. Across different setups, we can see a vast number of different approaches used to analyze these images. Often, the choice of detection algorithm is either not mentioned at all or it is not justified. We investigate several different algorithms and compare their performance in terms of both precision and execution run time. To do so, we rely on a set of synthetic images across different simulated exposure times with known occupancy states. Since the use of simulation provides us with the ground truth of atom site occupancy, we can easily state precise error rates and variances of the reconstructed property. To also rule out the possibility of better algorithms existing, we calculated the Cramér-Rao bound in order to establish an upper limit that even a perfect estimator cannot outperform. As the metric of choice, we used the number of photoelectrons that can be attributed to a specific atom site. Since the bound depends on the occupancy of neighboring sites, we provide the best and worst cases, as well as a half-filled one. Our comparison shows that, of our tested algorithms, a global non-linear least-squares solver that uses the optical system's PSF to return each site's number of photoelectrons performed the best, on average crossing the worst-case bound for longer exposure times. Its main drawback is its huge computational complexity and, thus, required calculation time. We manage to somewhat reduce this problem, suggesting that its use may be viable. However, our study also shows that for cases where utmost speed is required, simple algorithms may be preferable.
Submitted 25 September, 2024;
originally announced September 2024.
-
Genus one critical catenoid
Authors:
Giada Franz,
Daniel Ketover,
Mario B. Schulz
Abstract:
We use variational methods to construct a free boundary minimal surface in the three-dimensional unit ball with genus one, two boundary components and prismatic symmetry. Key ingredients are an extension of the equivariant min-max theory to include orientation-reversing isometries and the discovery of a nontrivial two-parameter sweepout.
Submitted 19 September, 2024;
originally announced September 2024.
-
Tensor Network Python (TeNPy) version 1
Authors:
Johannes Hauschild,
Jakob Unfried,
Sajant Anand,
Bartholomew Andrews,
Marcus Bintz,
Umberto Borla,
Stefan Divic,
Markus Drescher,
Jan Geiger,
Martin Hefel,
Kévin Hémery,
Wilhelm Kadow,
Jack Kemp,
Nico Kirchner,
Vincent S. Liu,
Gunnar Möller,
Daniel Parker,
Michael Rader,
Anton Romen,
Samuel Scalet,
Leon Schoonderwoerd,
Maximilian Schulz,
Tomohiro Soejima,
Philipp Thoma,
Yantao Wu
, et al. (5 additional authors not shown)
Abstract:
TeNPy (short for 'Tensor Network Python') is a Python library for the simulation of strongly correlated quantum systems with tensor networks. The philosophy of this library is to achieve a balance of readability and usability for newcomers, while at the same time providing powerful algorithms for experts. The focus is on MPS algorithms for 1D and 2D lattices, such as DMRG ground state search, as well as dynamics using TEBD, TDVP, or MPO evolution. This article is a companion to the recent version 1.0 release of TeNPy and gives a brief overview of the package.
Submitted 30 August, 2024; v1 submitted 4 August, 2024;
originally announced August 2024.
-
Integration of Quantum Accelerators into HPC: Toward a Unified Quantum Platform
Authors:
Amr Elsharkawy,
Xiaorang Guo,
Martin Schulz
Abstract:
To harness the power of quantum computing (QC) in the near future, tight and efficient integration of QC with high performance computing (HPC) infrastructure, on both the software (SW) and the hardware (HW) level, is crucial. This paper addresses the development of a unified quantum platform (UQP) and how it is being integrated into the HPC ecosystem. It builds on the concepts of hybrid high performance computing - quantum computing (HPCQC) workflows and a unified HPCQC toolchain, introduced in our previous work, and makes the next needed step: it unifies the low-level interface between the existing classical HPC systems and the emerging quantum hardware technologies, including but not limited to machines based on superconducting qubits, neutral atoms or trapped ions. The UQP consists of three core components: a runtime library, an instruction set architecture (ISA) and a quantum control processor (QCP) micro-architecture. In particular, this work contributes a unified HPCQC runtime library that bridges the gap between programming systems built on the quantum intermediate representation (QIR) standard and a novel, unified hybrid ISA. It then introduces the initial extension of an ISA and QCP micro-architecture to be platform and technology agnostic and enables it as an efficient execution platform. The UQP has been verified to ensure correctness. Further, our performance analysis shows that the execution time and memory requirements of the runtime library scale super-linearly with the number of qubits, which is critical to support scalability efforts in QC hardware.
Submitted 26 July, 2024;
originally announced July 2024.
-
Dataset Distillation by Automatic Training Trajectories
Authors:
Dai Liu,
Jindong Gu,
Hu Cao,
Carsten Trinitis,
Martin Schulz
Abstract:
Dataset Distillation is used to create a concise, yet informative, synthetic dataset that can replace the original dataset for training purposes. Some leading methods in this domain prioritize long-range matching, involving the unrolling of training trajectories with a fixed number of steps (NS) on the synthetic dataset to align with various expert training trajectories. However, traditional long-range matching methods suffer from an overfitting-like problem: the fixed step size NS forces the synthetic dataset to conform distortedly to seen expert training trajectories, resulting in a loss of generality, especially with respect to trajectories from unencountered architectures. We refer to this as the Accumulated Mismatching Problem (AMP), and propose a new approach, Automatic Training Trajectories (ATT), which dynamically and adaptively adjusts the trajectory length NS to address the AMP. Our method outperforms existing methods, particularly in tests involving cross-architectures. Moreover, owing to its adaptive nature, it exhibits enhanced stability in the face of parameter variations.
Submitted 19 July, 2024;
originally announced July 2024.
-
Index growth not imputable to topology
Authors:
Alessandro Carlotto,
Mario B. Schulz,
David Wiygul
Abstract:
We employ partitioning methods, in the spirit of Montiel--Ros but here recast for general actions of compact Lie groups, to prove effective lower bounds on the Morse index of certain families of closed minimal hypersurfaces in the round four-dimensional sphere, and of free boundary minimal hypersurfaces in the Euclidean four-dimensional ball. Our analysis reveals, in particular, phenomena of linear index growth for sequences of minimal hypersurfaces of fixed topological type, in strong contrast to the three-dimensional scenario.
Submitted 15 July, 2024;
originally announced July 2024.
-
Equivariant free boundary minimal discs and annuli in ellipsoids
Authors:
Mario B. Schulz
Abstract:
We employ equivariant variational methods to construct new examples of nonplanar free boundary minimal discs in ellipsoids. We also prove that every ellipsoid contains at least three distinct embedded free boundary minimal annuli with dihedral symmetry.
Submitted 3 September, 2024; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Fatigue and mental underload further pronounced in L3 conditionally automated driving: Results from an EEG experiment on a test track
Authors:
Nikol Figalová,
Hans Joachim Bieg,
Michael Schulz,
Jürgen Pichen,
Martin Baumann,
Lewis Chuang,
Olga Pollatos
Abstract:
Drivers' role changes with increasing automation from the primary driver to a system supervisor. This study investigates how supervising an SAE L2 and L3 automated vehicle (AV) affects drivers' mental workload and sleepiness compared to manual driving. Using an AV prototype on a test track, the oscillatory brain activity of 23 adult participants was recorded during L2, L3, and manual driving. Results showed decreased mental workload and increased sleepiness in L3 drives compared to L2 and manual drives, indicated by self-report scales and changes in the frontal alpha and theta power spectral density. These findings suggest that fatigue and mental underload are significant issues in L3 driving and should be considered when designing future AV interfaces.
Submitted 28 May, 2024;
originally announced May 2024.
-
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach
Authors:
Urvij Saroliya,
Eishi Arima,
Dai Liu,
Martin Schulz
Abstract:
GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to their architectural simplicity specialized for data-level parallelism, GPUs can offer much higher computational throughput and memory bandwidth than CPUs in the same generation do. However, as the available resources in GPUs have increased exponentially over the past decades, it has become increasingly difficult for a single program to fully utilize them. As a consequence, the industry has started supporting several resource partitioning features in order to improve the resource utilization by co-scheduling multiple programs on the same GPU die at the same time. Driven by the technological trend, this paper focuses on hierarchical resource partitioning on modern GPUs, and as an example, we utilize a combination of two different features available on recent NVIDIA GPUs in a hierarchical manner: MPS (Multi-Process Service), a finer-grained logical partitioning; and MIG (Multi-Instance GPU), a coarse-grained physical partitioning. We propose a method for comprehensively co-optimizing the setup of hierarchical partitioning and the selection of co-scheduling groups from a given set of jobs, based on reinforcement learning using their profiles. Our thorough experimental results demonstrate that our approach can successfully set up job concurrency, partitioning, and co-scheduling group selections simultaneously. This results in a maximum throughput improvement by a factor of 1.87 compared to the time-sharing scheduling.
Submitted 14 May, 2024;
originally announced May 2024.
-
On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems
Authors:
Eishi Arima,
Isaías A. Comprés,
Martin Schulz
Abstract:
Recent High-Performance Computing (HPC) systems are facing important challenges, such as massive power consumption, while at the same time significantly under-utilized system resources. Given the power consumption trends, future systems will be deployed in an over-provisioned manner where more resources are installed than they can afford to power simultaneously. In such a scenario, maximizing resource utilization and energy efficiency, while keeping a given power constraint, is pivotal. Driven by this observation, in this position paper we first highlight the recent trends of resource management techniques, with a particular focus on malleability support (i.e., dynamically scaling resource allocations/requirements for a job), co-scheduling (i.e., co-locating multiple jobs within a node), and power management. Second, we consider putting them together, assess their relationships/synergies, and discuss the functionality requirements in each software component for future over-provisioned and power-constrained HPC systems. Third, we briefly introduce our ongoing efforts on the integration of software tools, which will ultimately lead to the convergence of malleability and power management, as it is designed in the HPC PowerStack initiative.
Submitted 6 May, 2024;
originally announced May 2024.
-
Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps
Authors:
Eishi Arima,
Minjoon Kang,
Issa Saba,
Josef Weidendorfer,
Carsten Trinitis,
Martin Schulz
Abstract:
CPU-GPU heterogeneous systems are now commonly used in HPC (High-Performance Computing). However, improving the utilization and energy efficiency of such systems is still one of the most critical issues. As a single program typically cannot fully utilize all resources within a node/chip, co-scheduling (or co-locating) multiple programs with complementary resource requirements is a promising solution. Meanwhile, as power consumption has become a first-class design constraint for HPC systems, such co-scheduling techniques should be well-tailored for power-constrained environments. To this end, the industry recently started supporting hardware-level resource partitioning features on modern GPUs for realizing efficient co-scheduling, which can operate with existing power capping features. For example, NVIDIA's MIG (Multi-Instance GPU) partitions a single GPU into multiple instances at the granularity of a GPC (Graphics Processing Cluster). In this paper, we explicitly target the combination of hardware-level GPU partitioning features and power capping for power-constrained HPC systems. We provide a systematic methodology to optimize the combination of chip partitioning, job allocations, and power capping based on our scalability/interference modeling while taking a variety of aspects into account, such as compute/memory intensity and utilization in heterogeneous computational resources (e.g., Tensor Cores). The experimental results indicate that our approach is successful in selecting a near-optimal combination across multiple different workloads.
Submitted 6 May, 2024;
originally announced May 2024.
-
Orchestrated Co-scheduling, Resource Partitioning, and Power Capping on CPU-GPU Heterogeneous Systems via Machine Learning
Authors:
Issa Saba,
Eishi Arima,
Dai Liu,
Martin Schulz
Abstract:
CPU-GPU heterogeneous architectures are now commonly used in a wide variety of computing systems from mobile devices to supercomputers. Maximizing the throughput for multi-programmed workloads on such systems is indispensable as a single program typically cannot fully exploit all available resources. At the same time, power consumption is a key issue and often requires optimizing power allocations to the CPU and GPU while enforcing a total power constraint, in particular when the power/thermal requirements are strict. The result is a system-wide optimization problem with several knobs. In particular, we focus on (1) co-scheduling decisions, i.e., selecting programs to co-locate in a space-sharing manner; (2) resource partitioning on both CPUs and GPUs; and (3) power capping on both CPUs and GPUs. We solve this problem with machine-learning-based predictive performance modeling in order to optimize the above knob setups in a coordinated manner. Our experimental results using a real system show that our approach achieves a speedup of up to 67% compared to time-sharing-based scheduling with naive power capping that evenly distributes power budgets across components.
Submitted 6 May, 2024;
originally announced May 2024.
-
Analysis Facilities White Paper
Authors:
D. Ciangottini,
A. Forti,
L. Heinrich,
N. Skidmore,
C. Alpigiani,
M. Aly,
D. Benjamin,
B. Bockelman,
L. Bryant,
J. Catmore,
M. D'Alfonso,
A. Delgado Peris,
C. Doglioni,
G. Duckeck,
P. Elmer,
J. Eschle,
M. Feickert,
J. Frost,
R. Gardner,
V. Garonne,
M. Giffels,
J. Gooding,
E. Gramstad,
L. Gray,
B. Hegner
, et al. (41 additional authors not shown)
Abstract:
This white paper presents the current status of the R&D for Analysis Facilities (AFs) and attempts to summarize the views on the future direction of these facilities. These views have been collected through the High Energy Physics (HEP) Software Foundation's (HSF) Analysis Facilities forum, established in March 2022, the Analysis Ecosystems II workshop, that took place in May 2022, and the WLCG/HSF pre-CHEP workshop, that took place in May 2023. The paper attempts to cover all the aspects of an analysis facility.
Submitted 15 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Design Principles of Dynamic Resource Management for High-Performance Parallel Programming Models
Authors:
Dominik Huber,
Martin Schreiber,
Martin Schulz,
Howard Pritchard,
Daniel Holmes
Abstract:
With Dynamic Resource Management (DRM) the resources assigned to a job can be changed dynamically during its execution. From the system's perspective, DRM opens a new level of flexibility in resource allocation and job scheduling and therefore has the potential to improve system efficiency metrics such as the utilization rate, job throughput, energy efficiency, and responsiveness. From the application perspective, users can tailor the resources they request to their needs, offering potential optimizations in queuing time or charged costs. Despite these obvious advantages and many attempts over the last decade to establish DRM in HPC, it remains a concept discussed in academia rather than being successfully deployed on production systems. This stems from the fact that support for DRM requires changes in all the layers of the HPC system software stack including applications, programming models, process managers, and resource management software, as well as an extensive and holistic co-design process to establish new techniques and policies for scheduling and resource optimization. In this work, we therefore start with the assumption that resources are accessible by processes executed either on them (e.g., on CPU) or controlling them (e.g., GPU-offloading). Then, the overall DRM problem can be decomposed into dynamic process management (DPM) and dynamic resource mapping or allocation (DRA). The former determines which processes (or which change in processes) must be managed and the latter identifies the resources where they will be executed. The interfaces for such DPM/DRA in these layers need to be standardized, which requires a careful design to be interoperable while providing high flexibility. Based on a survey of existing approaches, we propose design principles that form the basis of a holistic approach to DRM in HPC and provide a prototype implementation using MPI.
Submitted 25 March, 2024;
originally announced March 2024.
-
The Morawetz Problem for Supersonic Flow with Cavitation
Authors:
Gui-Qiang G. Chen,
Tristan P. Giron,
Simon M. Schulz
Abstract:
We are concerned with the existence and compactness of entropy solutions of the compressible Euler system for two-dimensional steady potential flow around an obstacle for a polytropic gas with supersonic far-field velocity. The existence problem, initially posed by Morawetz \cite{morawetz85} in 1985, has remained open since then. In this paper, we establish the first complete existence theorem for the Morawetz problem by developing a new entropy analysis, coupled with a vanishing viscosity method and compensated compactness ideas. The main challenge arises when the flow approaches cavitation, leading to a loss of strict hyperbolicity of the system and a singularity of the entropy equation, particularly for the case of adiabatic exponent $γ=3$. Our analysis provides a complete description of the entropy and entropy-flux pairs via the Loewner--Morawetz relations, which, in turn, leads to the establishment of a compensated compactness framework. As direct applications of our entropy analysis and the compensated compactness framework, we obtain the compactness of entropy solutions and the weak continuity of the compressible Euler system in the supersonic regime.
Submitted 30 January, 2024;
originally announced January 2024.
-
Excitations of N$_{2}$ and O$_{2}$ molecules due to helium ion impact and a polarization effect
Authors:
M. Gochitashvili,
R. Lomsadze,
R. Ya. Kezerashvili,
I. Noselidze,
M. Schulz
Abstract:
We present an experimental study of the dissociative excitation in the collision of helium ions with nitrogen and oxygen molecules for collision energy of $0.7-10$ keV. Absolute emission cross sections are measured and reported for most nitrogen and oxygen atomic and ionic lines in wide, vacuum ultraviolet ($80-130$ nm) and visible ($380-800$ nm), spectral regions. Remarkable similarities of the processes realized in He$^{+}+$N$_{2}$ and He$^{+}+$O$_{2}$ collision systems are observed. We present polarization measurements for He$^{+}+$N$_{2}$ collision system.
The emission of excited dissociative products was detected using an improved high-resolution optical spectroscopy method. This method incorporates the retarding potential method and a high resolution electrostatic energy analyzer to precisely measure the energy of incident particles and the energy of dispersion. The improvement in the optics resolution allows us to measure the cross section on the order of 10$^{-19}$ cm$^{2}$ or lower.
Submitted 26 January, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
Quantum Task Offloading with the OpenMP API
Authors:
Joseph K. L. Lee,
Oliver T. Brown,
Mark Bull,
Martin Ruefenacht,
Johannes Doerfert,
Michael Klemm,
Martin Schulz
Abstract:
Most of the widely used quantum programming languages and libraries are not designed for the tightly coupled nature of hybrid quantum-classical algorithms, which run on quantum resources that are integrated on-premise with classical HPC infrastructure. We propose a programming model using the API provided by OpenMP to target quantum devices, which provides an easy-to-use and efficient interface for HPC applications to utilize quantum compute resources. We have implemented a variational quantum eigensolver using the programming model, which has been tested using a classical simulator. We are in the process of testing on the quantum resources hosted at the Leibniz Supercomputing Centre (LRZ).
Submitted 6 November, 2023;
originally announced November 2023.
-
GreenCourier: Carbon-Aware Scheduling for Serverless Functions
Authors:
Mohak Chadha,
Thandayuthapani Subramanian,
Eishi Arima,
Michael Gerndt,
Martin Schulz,
Osama Abboud
Abstract:
This paper presents GreenCourier, a novel scheduling framework that enables the runtime scheduling of serverless functions across geographically distributed regions based on their carbon efficiencies. Our framework incorporates an intelligent scheduling strategy for Kubernetes and supports Knative as the serverless platform. To obtain real-time carbon information for different geographical regions, our framework supports multiple marginal carbon emissions sources such as WattTime and the Carbon-aware SDK. We comprehensively evaluate the performance of our framework using the Google Kubernetes Engine and production serverless function traces for scheduling functions across Spain, France, Belgium, and the Netherlands. Results from our experiments show that compared to other approaches, GreenCourier reduces carbon emissions per function invocation by an average of 13.25%.
Submitted 31 October, 2023;
originally announced October 2023.
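The core selection step described in the GreenCourier abstract, choosing a region by carbon efficiency, can be illustrated with a minimal sketch. This is not GreenCourier's implementation: the function name `pick_greenest_region` and the shape of its input mapping are hypothetical. A real deployment would obtain the intensities from a source such as WattTime or the Carbon-aware SDK and would apply the decision inside a Kubernetes scheduler.

```python
from typing import Mapping


def pick_greenest_region(intensity_g_per_kwh: Mapping[str, float]) -> str:
    """Return the region with the lowest marginal carbon intensity.

    `intensity_g_per_kwh` maps a region name to its current marginal
    carbon intensity (hypothetical unit: gCO2 per kWh). Fetching these
    values is outside the scope of this sketch.
    """
    if not intensity_g_per_kwh:
        raise ValueError("no candidate regions")
    # Lowest intensity wins; ties are broken by region name so the
    # result is deterministic.
    return min(intensity_g_per_kwh, key=lambda r: (intensity_g_per_kwh[r], r))
```

Calling the function with a mapping from region names to intensities returns the name of the region where the next invocation would be placed under this simplified policy.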
-
Realistic Neutral Atom Image Simulation
Authors:
Jonas Winklmann,
Dimitrios Tsevas,
Martin Schulz
Abstract:
Neutral atom quantum computers require accurate single atom detection for the preparation and readout of their qubits. This is usually done using fluorescence imaging. The occupancy of an atom site in these images is often somewhat ambiguous due to the stochastic nature of the imaging process. Further, the lack of ground truth makes it difficult to rate the accuracy of reconstruction algorithms. We introduce a bottom-up simulator that is capable of generating sample images of neutral atom experiments from a description of the actual state in the simulated system. Possible use cases include the creation of exemplary images for demonstration purposes, fast training iterations for deconvolution algorithms, and generation of labeled data for machine-learning-based atom detection approaches. The implementation is available through our GitHub as a C library or wrapped Python package. We show the modeled effects and implementation of the simulations at different stages of the imaging process. Not all real-world phenomena can be reproduced perfectly. The main discrepancies are that the simulator allows for only one characterization of optical aberrations across the whole image, supports only discrete atom locations, and does not model all effects of CMOS cameras perfectly. Nevertheless, our experiments show that the generated images closely match real-world pictures to the point that they are practically indistinguishable and can be used as labeled data for training the next generation of detection algorithms.
Submitted 4 October, 2023;
originally announced October 2023.
-
On the Recursive Behaviour of the Number of Irreducible Polynomials with Certain Properties over Finite Fields
Authors:
Max Schulz
Abstract:
Let $\mathbb{F}_q$ be the field with $q$ elements and of characteristic $p$. For $a\in\mathbb{F}_p$ consider the set \begin{equation*} S_a(n)=\{f\in\mathbb{F}_q[x]\mid\operatorname{deg}(f)=n,~f\text{ irreducible, monic and }\operatorname{Tr}(f)=a\}. \end{equation*} In a recent paper, Robert Granger proved for $q=2$ and $n\ge 2$ that $|S_1(n)|-|S_0(n)|= 0$ if $2\nmid n$ and $|S_1(n)|-|S_0(n)|=|S_1(n/2)|$ if $2\mid n$.
We will prove a generalization of this result for all finite fields. This is possible due to an observation about the size of certain subsets of monic irreducible polynomials arising in the context of a group action of subgroups of $\operatorname{PGL}_2(\mathbb{F}_q)$ on monic polynomials. Additionally, it enables us to apply these methods to prove two further results that are very similar in nature.
Submitted 27 December, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
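The $q=2$ recursion quoted in the abstract above can be checked by brute force for small $n$. The sketch below is only a numerical sanity check, not the paper's method. It assumes that $\operatorname{Tr}(f)$ denotes the coefficient of $x^{n-1}$ of a monic polynomial $f$ of degree $n$ (over $\mathbb{F}_2$ the sign does not matter); the abstract does not spell out this definition. Polynomials over $\mathbb{F}_2$ are encoded as integer bitmasks, and all function names are illustrative.

```python
def poly_mod(a: int, b: int) -> int:
    """Remainder of a divided by b over F_2 (bit i = coefficient of x^i)."""
    db = b.bit_length()
    while a.bit_length() >= db:
        # Cancel the leading term of a with a shifted copy of b.
        a ^= b << (a.bit_length() - db)
    return a


def is_irreducible(f: int) -> bool:
    """Trial division by every polynomial of degree 1..deg(f)//2."""
    n = f.bit_length() - 1  # degree of f
    if n < 1:
        return False
    for g in range(2, 1 << (n // 2 + 1)):
        if poly_mod(f, g) == 0:
            return False
    return True


def trace_count(n: int, a: int) -> int:
    """|S_a(n)| for q = 2, with Tr(f) taken as the x^(n-1) coefficient."""
    return sum(
        1
        for f in range(1 << n, 1 << (n + 1))  # all monic polynomials of degree n
        if (f >> (n - 1)) & 1 == a and is_irreducible(f)
    )


def recursion_holds(n: int) -> bool:
    """Check |S_1(n)| - |S_0(n)| against the stated right-hand side."""
    diff = trace_count(n, 1) - trace_count(n, 0)
    expected = 0 if n % 2 else trace_count(n // 2, 1)
    return diff == expected
```

Evaluating `all(recursion_holds(n) for n in range(2, 11))` exercises the identity for degrees 2 through 10. Trial division is exponential in $n$, so this approach is only practical for small degrees.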
-
DeepRepViz: Identifying Confounders in Deep Learning Model Predictions
Authors:
Roshan Prakash Rane,
JiHoon Kim,
Arjun Umesha,
Didem Stark,
Marc-André Schulz,
Kerstin Ritter
Abstract:
Deep Learning (DL) models have gained popularity in neuroimaging studies for predicting psychological behaviors, cognitive traits, and brain pathologies. However, these models can be biased by confounders such as age, sex, or imaging artifacts from the acquisition process. To address this, we introduce 'DeepRepViz', a two-part framework designed to identify confounders in DL model predictions. The first component is a visualization tool that can be used to qualitatively examine the final latent representation of the DL model. The second component is a metric called 'Con-score' that quantifies the confounder risk associated with a variable, using the final latent representation of the DL model. We demonstrate the effectiveness of the Con-score using a simple simulated setup by iteratively altering the strength of a simulated confounder and observing the corresponding change in the Con-score. Next, we validate the DeepRepViz framework on a large-scale neuroimaging dataset (n=12000) by performing three MRI-phenotype prediction tasks that include (a) predicting chronic alcohol users, (b) classifying participant sex, and (c) predicting performance speed on a cognitive task called 'trail making'. DeepRepViz identifies sex as a significant confounder in the DL model predicting chronic alcohol users (Con-score=0.35) and age as a confounder in the model predicting cognitive task performance (Con-score=0.3). In conclusion, the DeepRepViz framework provides a systematic approach to test for potential confounders such as age, sex, and imaging artifacts and improves the transparency of DL models for neuroimaging studies.
Submitted 15 March, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Sustainability in HPC: Vision and Opportunities
Authors:
Mohak Chadha,
Eishi Arima,
Amir Raoofy,
Michael Gerndt,
Martin Schulz
Abstract:
Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into the exascale era, marked by an increasing demand and scale of HPC resources, the HPC community must embrace the challenge of reducing carbon emissions from designing and operating modern HPC systems. In this position paper, we describe challenges and highlight different opportunities that can aid HPC sites in reducing the carbon footprint of modern HPC systems.
Submitted 23 September, 2023;
originally announced September 2023.
-
Integration of Quantum Accelerators with High Performance Computing -- A Review of Quantum Programming Tools
Authors:
Amr Elsharkawy,
Xiao-Ting Michelle To,
Philipp Seitz,
Yanbin Chen,
Yannick Stade,
Manuel Geiger,
Qunsheng Huang,
Xiaorang Guo,
Muhammad Arslan Ansari,
Christian B. Mendl,
Dieter Kranzlmüller,
Martin Schulz
Abstract:
Quantum computing (QC) introduces a novel mode of computation with the possibility of greater computational power that remains to be exploited - presenting exciting opportunities for high performance computing (HPC) applications. However, recent advancements in the field have made clear that QC does not supplant conventional HPC, but can rather be incorporated into current heterogeneous HPC infrastructures as an additional accelerator, thereby enabling the optimal utilization of both paradigms. The desire for such integration significantly affects the development of software for quantum computers, which in turn influences the necessary software infrastructure. To date, previous review papers have investigated various quantum programming tools (QPTs) (such as languages, libraries, frameworks) in their ability to program, compile, and execute quantum circuits. However, the integration effort with classical HPC frameworks or systems has not been addressed. This study aims to characterize existing QPTs from an HPC perspective, investigating if existing QPTs have the potential to be efficiently integrated with classical computing models and determining where work is still required. This work structures a set of criteria into an analysis blueprint that enables HPC scientists to assess whether a QPT is suitable for the quantum-accelerated classical application at hand.
Submitted 18 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Toward a Unified Hybrid HPCQC Toolchain
Authors:
Philipp Seitz,
Amr Elsharkawy,
Xiao-Ting Michelle To,
Martin Schulz
Abstract:
In the expanding field of Quantum Computing (QC), efficient and seamless integration of QC and high performance computing (HPC) elements (e.g., quantum hardware, classical hardware, and software infrastructure on both sides) plays a crucial role. This paper addresses the development of a unified toolchain designed for hybrid quantum-classical systems. Our work proposes a design for a unified hybrid high performance computing - quantum computing (HPCQC) toolchain that tackles pressing issues such as scalability, cross-technology execution, and ahead-of-time (AOT) optimization.
Submitted 7 September, 2023; v1 submitted 4 September, 2023;
originally announced September 2023.
-
HiSEP-Q: A Highly Scalable and Efficient Quantum Control Processor for Superconducting Qubits
Authors:
Xiaorang Guo,
Kun Qin,
Martin Schulz
Abstract:
Quantum computing promises an effective way to solve targeted problems that are classically intractable. Among them, quantum computers built with superconducting qubits are considered one of the most advanced technologies, but they suffer from short coherence times. This can be exacerbated when they are controlled directly by general-purpose host machines, which leads to the loss of quantum information. To mitigate this, we need quantum control processors (QCPs) positioned between quantum processing units and host machines to reduce latencies. However, existing QCPs are built on top of designs with no or inefficient scalability, requiring a large number of instructions when scaling to more qubits. In addition, interactions between current QCPs and host machines require frequent data transmissions and offline computations to obtain final results, which limits the performance of quantum computers.
In this paper, we propose a QCP called HiSEP-Q featuring a novel quantum instruction set architecture (QISA) and its microarchitecture implementation. For efficient control, we utilize mixed-type addressing modes and mixed-length instructions in HiSEP-Q, which provides an efficient way to concurrently address more than 100 qubits. Further, for efficient read-out and analysis, we develop a novel onboard accumulation and sorting unit, which eliminates the data transmission of raw data between the QCPs and host machines and enables real-time result processing. Compared to the state-of-the-art, our proposed QISA achieves at least 62% and 28% improvements in encoding efficiency with real and synthetic quantum circuits, respectively. We also validate the microarchitecture on a field-programmable gate array, which exhibits low power and resource consumption. Both hardware and ISA evaluations demonstrate that HiSEP-Q features high scalability and efficiency toward the number of controlled qubits.
Submitted 31 August, 2023;
originally announced August 2023.
-
Disc stackings and their Morse index
Authors:
Alessandro Carlotto,
Mario B. Schulz,
David Wiygul
Abstract:
We construct free boundary minimal disc stackings, with any number of strata, in the three-dimensional Euclidean unit ball, and prove uniform, linear lower and upper bounds on the Morse index of all such surfaces. Among other things, our work implies for any positive integer $k$ the existence of $k$-tuples of distinct, pairwise non-congruent, embedded free boundary minimal surfaces all having the same topological type. In addition, since we prove that the equivariant Morse index of any such free boundary minimal stacking, with respect to its maximal symmetry group, is bounded from below by (the integer part of) half the number of layers and from above strictly by twice the same number, it follows that any possible realization of such surfaces via an equivariant min-max method would need to employ sweepouts with an arbitrarily large number of parameters. This also shows that it is only for $N=2$ and $N=3$ layers that free boundary minimal disc stackings are achievable by means of one-dimensional mountain pass schemes.
Submitted 14 August, 2023;
originally announced August 2023.
-
Topological control for min-max free boundary minimal surfaces
Authors:
Giada Franz,
Mario B. Schulz
Abstract:
We establish general bounds on the topology of free boundary minimal surfaces obtained via min-max methods in compact, three-dimensional ambient manifolds with mean convex boundary. We prove that the first Betti number is lower semicontinuous along min-max sequences converging in the sense of varifolds to free boundary minimal surfaces. In the orientable case, we obtain an even stronger result which implies that if the number of boundary components increases in the varifold limit, then the genus decreases at least as much. We also present several compelling applications, such as the variational construction of a free boundary minimal trinoid in the Euclidean unit ball.
Submitted 14 August, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Rational Transformations and Invariant Polynomials
Authors:
Max Schulz
Abstract:
Rational transformations of polynomials are extensively studied in the context of finite fields, especially for the construction of irreducible polynomials. In this paper, we consider the factorization of rational transformations with (normalized) generators of the field $K(x)^G$ of $G$-invariant rational functions for $G$ a finite subgroup of $\operatorname{PGL}_2(K)$, where $K$ is an arbitrary field. Our main theorem shows that the factorization is related to a well-known group action of $G$ on a subset of monic polynomials. With this, we are able to extend a result by Lucas Reis for $G$-invariant irreducible polynomials. Additionally, some new results about the number of irreducible factors of rational transformations for $Q$ a generator of $\mathbb{F}_q(x)^G$ are given when $G$ is non-cyclic.
Submitted 5 September, 2023; v1 submitted 23 June, 2023;
originally announced June 2023.
-
Workflows Community Summit 2022: A Roadmap Revolution
Authors:
Rafael Ferreira da Silva,
Rosa M. Badia,
Venkat Bala,
Debbie Bard,
Peer-Timo Bremer,
Ian Buckley,
Silvina Caino-Lores,
Kyle Chard,
Carole Goble,
Shantenu Jha,
Daniel S. Katz,
Daniel Laney,
Manish Parashar,
Frederic Suter,
Nick Tyler,
Thomas Uram,
Ilkay Altintas,
Stefan Andersson,
William Arndt,
Juan Aznar,
Jonathan Bader,
Bartosz Balis,
Chris Blanton,
Kelly Rosa Braghetto,
Aharon Brodutch
, et al. (80 additional authors not shown)
Abstract:
Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and the evolving needs of emerging scientific applications, it is paramount that the development of novel scientific workflows and system functionalities seek to increase the efficiency, resilience, and pervasiveness of existing systems and applications. Specifically, the proliferation of machine learning/artificial intelligence (ML/AI) workflows, need for processing large scale datasets produced by instruments at the edge, intensification of near real-time data processing, support for long-term experiment campaigns, and emergence of quantum computing as an adjunct to HPC, have significantly changed the functional and operational requirements of workflow systems. Workflow systems now need to, for example, support data streams from the edge-to-cloud-to-HPC, enable the management of many small-sized files, allow data reduction while ensuring high accuracy, and orchestrate distributed services (workflows, instruments, data movement, provenance, publication, etc.) across computing and user facilities, among others. Further, to accelerate science, it is also necessary that these systems implement specifications/standards and APIs for seamless (horizontal and vertical) integration between systems and applications, as well as enabling the publication of workflows and their associated products according to the FAIR principles. This document reports on discussions and findings from the 2022 international edition of the Workflows Community Summit that took place on November 29 and 30, 2022.
Submitted 31 March, 2023;
originally announced April 2023.
-
RoboCup 2022 AdultSize Winner NimbRo: Upgraded Perception, Capture Steps Gait and Phase-based In-walk Kicks
Authors:
Dmytro Pavlichenko,
Grzegorz Ficht,
Arash Amini,
Mojtaba Hosseini,
Raphael Memmesheimer,
Angel Villar-Corrales,
Stefan M. Schulz,
Marcell Missura,
Maren Bennewitz,
Sven Behnke
Abstract:
Beating the human world champions by 2050 is an ambitious goal of the Humanoid League that provides a strong incentive for RoboCup teams to further improve and develop their systems. In this paper, we present upgrades of our system which enabled our team NimbRo to win the Soccer Tournament, the Drop-in Games, and the Technical Challenges in the Humanoid AdultSize League of RoboCup 2022. Strong performance in these competitions resulted in the Best Humanoid award in the Humanoid League. The mentioned upgrades include: hardware upgrade of the vision module, balanced walking with Capture Steps, and the introduction of phase-based in-walk kicks.
Submitted 7 February, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Promises and pitfalls of deep neural networks in neuroimaging-based psychiatric research
Authors:
Fabian Eitel,
Marc-André Schulz,
Moritz Seiler,
Henrik Walter,
Kerstin Ritter
Abstract:
By promising more accurate diagnostics and individual treatment recommendations, deep neural networks and in particular convolutional neural networks have advanced to a powerful tool in medical imaging. Here, we first give an introduction into methodological key concepts and resulting methodological promises including representation and transfer learning, as well as modelling domain-specific priors. After reviewing recent applications within neuroimaging-based psychiatric research, such as the diagnosis of psychiatric diseases, delineation of disease subtypes, normative modeling, and the development of neuroimaging biomarkers, we discuss current challenges. This includes for example the difficulty of training models on small, heterogeneous and biased data sets, the lack of validity of clinical labels, algorithmic bias, and the influence of confounding variables.
Submitted 20 January, 2023;
originally announced January 2023.
-
Spectral estimates for free boundary minimal surfaces via Montiel-Ros partitioning methods
Authors:
Alessandro Carlotto,
Mario B. Schulz,
David Wiygul
Abstract:
We adapt and extend the Montiel-Ros methodology to compact manifolds with boundary, allowing for mixed (including oblique) boundary conditions and also accounting for the action of a finite group $G$ together with an additional twisting homomorphism $\sigma\colon G\to\operatorname O(1)$. We then apply this machinery in order to obtain quantitative lower and upper bounds on the growth rate of the Morse index of free boundary minimal surfaces with respect to the topological data (i.e., the genus and the number of boundary components) of the surfaces in question. In particular, we compute the exact values of the equivariant Morse index and nullity for two infinite families of examples, with respect to their maximal symmetry groups, and thereby derive explicit two-sided linear bounds when the equivariance constraint is lifted.
Submitted 21 July, 2024; v1 submitted 8 January, 2023;
originally announced January 2023.
-
Second Analysis Ecosystem Workshop Report
Authors:
Mohamed Aly,
Jackson Burzynski,
Bryan Cardwell,
Daniel C. Craik,
Tal van Daalen,
Tomas Dado,
Ayanabha Das,
Antonio Delgado Peris,
Caterina Doglioni,
Peter Elmer,
Engin Eren,
Martin B. Eriksen,
Jonas Eschle,
Giulio Eulisse,
Conor Fitzpatrick,
José Flix Molina,
Alessandra Forti,
Ben Galewsky,
Sean Gasiorowski,
Aman Goel,
Loukas Gouskos,
Enrico Guiraud,
Kanhaiya Gupta,
Stephan Hageboeck,
Allison Reinsvold Hall
, et al. (44 additional authors not shown)
Abstract:
The second workshop on the HEP Analysis Ecosystem took place 23-25 May 2022 at IJCLab in Orsay, to look at progress and continuing challenges in scaling up HEP analysis to meet the needs of HL-LHC and DUNE, as well as the very pressing needs of LHC Run 3 analysis.
The workshop was themed around six particular topics, which were felt to capture key questions, opportunities and challenges. Each topic arranged a plenary session introduction, often with speakers summarising the state of the art and the next steps for analysis. This was then followed by parallel sessions, which were much more discussion focused, and where attendees could grapple with the challenges and propose solutions that could be tried. Where there was significant overlap between topics, a joint discussion between them was arranged.
In the weeks following the workshop the session conveners wrote this document, which is a summary of the main discussions, the key points raised and the conclusions and outcomes. The document was circulated amongst the participants for comments before being finalised here.
Submitted 9 December, 2022;
originally announced December 2022.
-
Gravitational-wave cosmology with dark sirens: state of the art and perspectives for 3G detectors
Authors:
Michele Mancarella,
Nicola Borghi,
Stefano Foffa,
Edwin Genoud-Prachex,
Francesco Iacovelli,
Michele Maggiore,
Michele Moresco,
Matteo Schulz
Abstract:
A joint fit of the mass and redshift distributions of the population of Binary Black Holes detected with Gravitational-Wave observations can be used to obtain constraints on the Hubble parameter and on deviations from General Relativity in the propagation of Gravitational Waves. We first present applications of this technique to the latest catalog of Gravitational-Wave events, focusing on the comparison of different parametrizations for the source-frame mass distribution of Black Hole Binaries. We find that models with more than one feature are favoured by the data, as suggested by population studies, even when varying the cosmology. Then, we discuss perspectives for the use of this technique with third generation Gravitational-Wave detectors, exploiting the recently developed Fisher information matrix Python code GWFAST.
Submitted 28 November, 2022;
originally announced November 2022.
-
Neutron Depolarization due to Ferromagnetism and Spin Freezing in CePd$_{1-x}$Rh$_x$
Authors:
M. Seifert,
P. Schmakat,
M. Schulz,
P. Jorba,
V. Hutanu,
C. Geibel,
M. Deppe,
C. Pfleiderer
Abstract:
We report neutron depolarization measurements of the suppression of long-range ferromagnetism and the emergence of magnetic irreversibilities and spin freezing in CePd$_{1-x}$Rh$_x$ around $x^*\approx0.6$. Tracking the temperature versus field history of the neutron depolarization, we find clear signatures of long-range Ising ferromagnetism below a Curie temperature $T_{\rm C}$ for $x=0.4$ and a spin freezing of tiny ferromagnetic clusters below a freezing temperature $T_{\rm F1}$ for $x>x^*$. Under zero-field-cooling/field-heating and for $x>x^*$ a reentrant temperature dependence of the neutron depolarization between $T_{\rm F2}<T_{\rm F1}$ and $T_{\rm F1}$ is microscopically consistent with a thermally activated growth of the cluster size. The evolution of the depolarization as well as the reentrant temperature dependence as a function of Rh content are consistent with the formation of a Kondo-cluster glass below $T_{\rm F1}$ adjacent to a ferromagnetic quantum phase transition at $x^*$.
Submitted 22 September, 2022;
originally announced September 2022.
-
Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments
Authors:
Rui Song,
Dai Liu,
Dave Zhenyu Chen,
Andreas Festag,
Carsten Trinitis,
Martin Schulz,
Alois Knoll
Abstract:
In federated learning, all networked clients contribute to the model training cooperatively. However, with model sizes increasing, even sharing the trained partial models often leads to severe communication bottlenecks in underlying networks, especially when communicated iteratively. In this paper, we introduce a federated learning framework FedD3 requiring only one-shot communication by integrating dataset distillation instances. Instead of sharing model updates as in other federated learning approaches, FedD3 allows the connected clients to distill the local datasets independently, and then aggregates those decentralized distilled datasets (e.g. a few unrecognizable images) from networks for model training. Our experimental results show that FedD3 significantly outperforms other federated learning frameworks in terms of needed communication volumes, while it provides the additional benefit of being able to balance the trade-off between accuracy and communication cost, depending on usage scenario or target dataset. For instance, for training an AlexNet model on CIFAR-10 with 10 clients under non-independent and identically distributed (Non-IID) setting, FedD3 can either increase the accuracy by over 71% with a similar communication volume, or save 98% of communication volume, while reaching the same accuracy, compared to other one-shot federated learning approaches.
Submitted 19 May, 2023; v1 submitted 24 August, 2022;
originally announced August 2022.
-
A comparison of partial information decompositions using data from real and simulated layer 5b pyramidal cells
Authors:
Jim W. Kay,
Jan M. Schulz,
W. A. Phillips
Abstract:
Partial information decomposition allows the joint mutual information between an output and a set of inputs to be divided into components that are synergistic or shared or unique to each input. We consider five different decompositions and compare their results on data from layer 5b pyramidal cells in two different studies. The first study was of the amplification of somatic action potential output by apical dendritic input and its regulation by dendritic inhibition. We find that two of the decompositions produce much larger estimates of synergy and shared information than the others, as well as large levels of unique misinformation. When within-neuron differences in the components are examined, the five methods produce more similar results for all but the shared information component, for which two methods produce a different statistical conclusion from the others. There are some differences in the expression of unique information asymmetry among the methods. This asymmetry is significantly larger, on average, under dendritic inhibition. Three of the methods support a previous conclusion that apical amplification is reduced by dendritic inhibition. The second study used a detailed compartmental model to produce action potentials for many combinations of the numbers of basal and apical synaptic inputs. Two analyses of decompositions are conducted on subsets of the data. In the first, the decompositions reveal a bifurcation in unique information asymmetry. For three of the methods this suggests that apical drive switches to basal drive as the strength of the basal input increases, while the other two show changing mixtures of information and misinformation. Decompositions produced using the second set of subsets show that all five decompositions provide support, to varying extents, for properties of cooperative context-sensitivity.
Submitted 13 June, 2022;
originally announced June 2022.
-
Infinitely many pairs of free boundary minimal surfaces with the same topology and symmetry group
Authors:
Alessandro Carlotto,
Mario B. Schulz,
David Wiygul
Abstract:
The topology and symmetry group of a free boundary minimal surface in the three-dimensional Euclidean unit ball do not determine the surface uniquely. We provide pairs of non-isometric free boundary minimal surfaces having any sufficiently large genus $g$, three boundary components and antiprismatic symmetry group of order $4(g+1)$.
Submitted 8 October, 2023; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Phase Separation in Systems of Interacting Active Brownian Particles
Authors:
M. Bruna,
M. Burger,
A. Esposito,
S. M. Schulz
Abstract:
The aim of this paper is to discuss the mathematical modeling of Brownian active particle systems, a recently popular paradigmatic system for self-propelled particles. We present four microscopic models with different types of repulsive interactions between particles and their associated macroscopic models, which are formally obtained using different coarse-graining methods. The macroscopic limits are integro-differential equations for the density in phase space (positions and orientations) of the particles and may include nonlinearities in both the diffusive and advective components. In contrast to passive particles, systems of active particles can undergo phase separation without any attractive interactions, a mechanism known as motility-induced phase separation (MIPS). We explore the onset of such a transition for each model in the parameter space of occupied volume fraction and Péclet number via a linear stability analysis and numerical simulations at both the microscopic and macroscopic levels. We establish that one of the models, namely the mean-field model which assumes long-range repulsive interactions, cannot explain the emergence of MIPS. In contrast, MIPS is observed for the remaining three models that assume short-range interactions that localize the interaction terms in space.
Submitted 27 May, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Label scarcity in biomedicine: Data-rich latent factor discovery enhances phenotype prediction
Authors:
Marc-Andre Schulz,
Bertrand Thirion,
Alexandre Gramfort,
Gaël Varoquaux,
Danilo Bzdok
Abstract:
High-quality data accumulation is now becoming ubiquitous in the health domain. There is increasing opportunity to exploit rich data from normal subjects to improve supervised estimators in specific diseases with notorious data scarcity. We demonstrate that low-dimensional embedding spaces can be derived from the UK Biobank population dataset and used to enhance data-scarce prediction of health indicators, lifestyle and demographic characteristics. Phenotype predictions facilitated by Variational Autoencoder manifolds typically scaled better with increasing unlabeled data than dimensionality reduction by PCA or Isomap. Performance gains from semi-supervised approaches will probably become an important ingredient for various medical data science applications.
Submitted 12 October, 2021;
originally announced October 2021.
-
Noncompact self-shrinkers for mean curvature flow with arbitrary genus
Authors:
Reto Buzano,
Huy The Nguyen,
Mario B. Schulz
Abstract:
In his lecture notes on mean curvature flow, Ilmanen conjectured the existence of noncompact self-shrinkers with arbitrary genus. Here, we employ min-max techniques to give a rigorous existence proof for these surfaces. Conjecturally, the self-shrinkers that we obtain have precisely one (asymptotically conical) end. We confirm this for large genus via a precise analysis of the limiting object of sequences of such self-shrinkers for which the genus tends to infinity. Finally, we provide numerical evidence for a further family of noncompact self-shrinkers with odd genus and two asymptotically conical ends.
Submitted 5 September, 2024; v1 submitted 12 October, 2021;
originally announced October 2021.
-
Minimal hypertori in the four-dimensional sphere
Authors:
Alessandro Carlotto,
Mario B. Schulz
Abstract:
We prove that the four-dimensional round sphere contains a minimally embedded hypertorus, as well as infinitely many, pairwise non-isometric, immersed ones. Our analysis also yields infinitely many, pairwise non-isometric, minimally embedded hyperspheres and thus provides a self-contained solution to Chern's spherical Bernstein conjecture in dimensions four and six.
Submitted 25 September, 2023; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Operational Data Analytics in Practice: Experiences from Design to Deployment in Production HPC Environments
Authors:
Alessio Netti,
Michael Ott,
Carla Guillen,
Daniele Tafani,
Martin Schulz
Abstract:
As HPC systems grow in complexity, efficient and manageable operation is increasingly critical. Many centers are thus starting to explore the use of Operational Data Analytics (ODA) techniques, which extract knowledge from massive amounts of monitoring data and use it for control and visualization purposes. As ODA is a multi-faceted problem, much effort has gone into researching its separate aspects; however, accounts of production ODA experiences are still hard to come by.
In this work we aim to bridge the gap between ODA research and production use by presenting our experiences with ODA in production, involving in particular the control of cooling infrastructures and visualization of job data on two HPC systems. We cover the entire development process, from design to deployment, highlighting our insights in an effort to drive the community forward. We rely on open-source tools, which make for a generic ODA framework suitable for most scenarios.
Submitted 28 June, 2021;
originally announced June 2021.
-
Analytical Solution of a Gas Release Problem Considering Permeation with Time-Dependent Boundary Conditions
Authors:
Marvin R. Schulz,
Kaori Nagatou,
Axel von der Weth,
Frederik Arbeiter,
Volker Pasler
Abstract:
In this paper the determination of material properties such as Sieverts' constant (solubility) and diffusivity (transport rate) via so-called gas release experiments is discussed. In order to simulate the time-dependent hydrogen fluxes and concentration profiles efficiently, we make use of an analytical method, namely we provide an analytical solution for the corresponding diffusion equations on a cylindrical specimen and a cylindrical container for three boundary conditions. These conditions occur in three phases: loading phase, evacuation phase and gas release phase. In the loading phase the specimen is charged with hydrogen while a constant partial pressure of hydrogen is maintained. In the second phase the gas is quickly removed by a vacuum pump, and finally, in the third phase, the hydrogen is released from the specimen to the gaseous phase, where the pressure increase is measured by equipment attached to the cylindrical container. The investigated diffusion equation in each phase is a simple homogeneous equation, but due to the complex time-dependent boundary conditions, which include the Sieverts' constant and the pressure, we transform the homogeneous equations to non-homogeneous ones with a zero Dirichlet boundary condition. Compared with time-consuming numerical methods, our analytical approach has the advantage that the flux of desorbed hydrogen can be given explicitly and can therefore be evaluated efficiently. Our analytical solution also ensures that the time-dependent boundary conditions are exactly satisfied and that the interaction between specimen and container is correctly taken into account.
Submitted 14 April, 2021;
originally announced April 2021.
-
Quantum Algorithms for Solving Ordinary Differential Equations via Classical Integration Methods
Authors:
Benjamin Zanger,
Christian B. Mendl,
Martin Schulz,
Martin Schreiber
Abstract:
Identifying computational tasks suitable for (future) quantum computers is an active field of research. Here we explore utilizing quantum computers for the purpose of solving differential equations. We consider two approaches: (i) basis encoding and fixed-point arithmetic on a digital quantum computer, and (ii) representing and solving high-order Runge-Kutta methods as optimization problems on quantum annealers. As realizations applied to two-dimensional linear ordinary differential equations, we devise and simulate corresponding digital quantum circuits, and implement and run a 6$^{\mathrm{th}}$ order Gauss-Legendre collocation method on a D-Wave 2000Q system, showing good agreement with the reference solution. We find that the quantum annealing approach exhibits the largest potential for high-order implicit integration methods. As a promising future scenario, the digital arithmetic method could be employed as an "oracle" within quantum search algorithms for inverse problems.
Submitted 12 July, 2021; v1 submitted 17 December, 2020;
originally announced December 2020.
-
Electron-impact ionization and ionic fragmentation of O$_{2}$ from threshold to 120 eV energy range
Authors:
R. A. Lomsadze,
M. R. Gochitashvili,
R. Ya. Kezerashvili,
M. Schulz
Abstract:
We study the electron-impact induced ionization of O$_{2}$ from threshold to 120 eV using the electron spectroscopy method. Our approach is simple in concept and combines an ion source with a collision chamber and a mass spectrometer with a quadrupole filter as a selector for the product ions. The combination of these two devices makes it possible to unequivocally collect all energetic fragment ions formed in ionization and dissociative processes and to detect them with known efficiency. The ion source allows the electron-impact ionization energy and the target-gas pressure to be varied and tuned. We demonstrate that for obtaining reliable results of cross sections for inelastic processes and determining mechanisms for the formation of O$^{+}$($^{4}S,^2{D},^2{P}$) ions, it is crucial to control the electron-impact energy for ion production and the pressure in the ion source. A comparison of our results with other experimental and theoretical data shows good agreement and supports the validity of our approach.
Submitted 23 November, 2020;
originally announced November 2020.
-
Resolving the outer ring of HD 38206 using ALMA and constraining limits on planets in the system
Authors:
Mark Booth,
Michael Schulz,
Alexander V. Krivov,
Sebastián Marino,
Tim D. Pearce,
Ralf Launhardt
Abstract:
HD 38206 is an A0V star in the Columba association, hosting a debris disc first discovered by IRAS. Further observations by Spitzer and Herschel showed that the disc has two components, likely analogous to the asteroid and Kuiper belts of the Solar System. The young age of this star makes it a prime target for direct imaging planet searches. Possible planets in the system can be constrained using the debris disc. Here we present the first ALMA observations of the system's Kuiper belt and fit them using a forward modelling MCMC approach. We detect an extended disc of dust peaking at around 180 au with a width of 140 au. The disc is close to edge on and shows tentative signs of an asymmetry best fit by an eccentricity of $0.25^{+0.10}_{-0.09}$. We use the fitted parameters to determine limits on the masses of planets interior to the cold belt. We determine that a minimum of four planets are required, each with a minimum mass of 0.64 M$_J$, in order to clear the gap between the asteroid and Kuiper belts of the system. If we make the assumption that the outermost planet is responsible for the stirring of the disc, the location of its inner edge and the eccentricity of the disc, then we can more tightly predict its eccentricity, mass and semimajor axis to be $e_{\rm{p}}=0.34^{+0.20}_{-0.13}$, $m_{\rm{p}}=0.7^{+0.5}_{-0.3}\,\rm{M}_{\rm{J}}$ and $a_{\rm{p}}=76^{+12}_{-13}\,\rm{au}$.
Submitted 27 October, 2020;
originally announced October 2020.
-
Resiliency in Numerical Algorithm Design for Extreme Scale Simulations
Authors:
Emmanuel Agullo,
Mirco Altenbernd,
Hartwig Anzt,
Leonardo Bautista-Gomez,
Tommaso Benacchio,
Luca Bonaventura,
Hans-Joachim Bungartz,
Sanjay Chatterjee,
Florina M. Ciorba,
Nathan DeBardeleben,
Daniel Drzisga,
Sebastian Eibl,
Christian Engelmann,
Wilfried N. Gansterer,
Luc Giraud,
Dominik Goeddeke,
Marco Heisig,
Fabienne Jezequel,
Nils Kohl,
Xiaoye Sherry Li,
Romain Lion,
Miriam Mehl,
Paul Mycek,
Michael Obersteiner,
Enrique S. Quintana-Orti
, et al. (11 additional authors not shown)
Abstract:
This work is based on the seminar titled ``Resiliency in Numerical Algorithm Design for Extreme Scale Simulations'' held March 1-6, 2020 at Schloss Dagstuhl, that was attended by all the authors.
Naive versions of conventional resilience techniques will not scale to the exascale regime: with a main memory footprint of tens of Petabytes, synchronously writing checkpoint data all the way to background storage at frequent intervals will create intolerable overheads in runtime and energy consumption. Forecasts show that the mean time between failures could be lower than the time to recover from such a checkpoint, so that large calculations at scale might not make any progress if robust alternatives are not investigated.
More advanced resilience techniques must be devised. The key may lie in exploiting both advanced system features as well as specific application knowledge. Research will face two essential questions: (1) what are the reliability requirements for a particular computation and (2) how do we best design the algorithms and software to meet these requirements? One avenue would be to refine and improve on system- or application-level checkpointing and rollback strategies in the case an error is detected. Developers might use fault notification interfaces and flexible runtime systems to respond to node failures in an application-dependent fashion. Novel numerical algorithms or more stochastic computational approaches may be required to meet accuracy requirements in the face of undetectable soft errors.
The goal of this Dagstuhl Seminar was to bring together a diverse group of scientists with expertise in exascale computing to discuss novel ways to make applications resilient against detected and undetected faults. In particular, participants explored the role that algorithms and applications play in the holistic approach needed to tackle this challenge.
Submitted 26 October, 2020;
originally announced October 2020.