-
On a question of Kaplansky concerning his density theorem
Authors:
George A. Elliott,
Charles J. K. Griffin
Abstract:
A new proof following a suggestion of Kaplansky to use a result of Dixmier, and in this way avoid unbounded nets of operators, is given of the Kaplansky density theorem.
A new proof following a suggestion of Kaplansky to use a result of Dixmier, and in this way avoid unbounded nets of operators, is given of the Kaplansky density theorem.
△ Less
Submitted 12 September, 2024;
originally announced October 2024.
-
Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols
Authors:
Charlie Griffin,
Louis Thomson,
Buck Shlegeris,
Alessandro Abate
Abstract:
To evaluate the safety and usefulness of deployment protocols for untrusted AIs, AI Control uses a red-teaming exercise played between a protocol designer and an adversary. This paper introduces AI-Control Games, a formal decision-making model of the red-teaming exercise as a multi-objective, partially observable, stochastic game. We also introduce methods for finding optimal protocols in AI-Contr…
▽ More
To evaluate the safety and usefulness of deployment protocols for untrusted AIs, AI Control uses a red-teaming exercise played between a protocol designer and an adversary. This paper introduces AI-Control Games, a formal decision-making model of the red-teaming exercise as a multi-objective, partially observable, stochastic game. We also introduce methods for finding optimal protocols in AI-Control Games, by reducing them to a set of zero-sum partially observable stochastic games. We apply our formalism to model, evaluate and synthesise protocols for deploying untrusted language models as programming assistants, focusing on Trusted Monitoring protocols, which use weaker language models and limited human assistance. Finally, we demonstrate the utility of our formalism by showcasing improvements over empirical studies in existing settings, evaluating protocols in new settings, and analysing how modelling assumptions affect the safety and usefulness of protocols.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
Integrability of Generalised Skew-Symmetric Replicator Equations via Graph Embeddings
Authors:
Matthew Visomirski,
Christopher Griffin
Abstract:
It is known that there is a one-to-one mapping between oriented directed graphs and zero-sum replicator dynamics (Lotka-Volterra equations) and that furthermore these dynamics are Hamiltonian in an appropriately defined nonlinear Poisson bracket. In this paper, we investigate the problem of determining whether these dynamics are Liouville-Arnold integrable, building on prior work graph in graph de…
▽ More
It is known that there is a one-to-one mapping between oriented directed graphs and zero-sum replicator dynamics (Lotka-Volterra equations) and that furthermore these dynamics are Hamiltonian in an appropriately defined nonlinear Poisson bracket. In this paper, we investigate the problem of determining whether these dynamics are Liouville-Arnold integrable, building on prior work graph in graph decloning by Evripidou et al. [J. Phys. A., 55:325201, 2022] and graph embedding by Paik and Griffin [Phys. Rev. E. 107(5): L052202, 2024]. Using the embedding procedure from Paik and Griffin, we show (with certain caveats) that when a graph producing integrable dynamics is embedded in another graph producing integrable dynamics, the resulting graph structure also produces integrable dynamics. We also construct a new family of graph structures that produces integrable dynamics that does not arise either from embeddings or decloning. We use these results to classify the dynamics generated by almost all oriented directed graphs on six vertices, with three hold-out graphs that generate integrable dynamics and are not part of a natural taxonomy arising from known families and graph operations. These hold-out graphs suggest more structure is available to be found. Moreover, the work suggests that oriented directed graphs leading to integrable dynamics may be classifiable in an analogous way to the classification of finite simple groups, creating the possibility that there is a deep connection between integrable dynamics and combinatorial structures in graphs.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
First simultaneous measurement of the gamma-ray and neutron emission probabilities in inverse kinematics at a heavy-ion storage ring
Authors:
M. Sguazzin,
B. Jurado,
J. Pibernat,
J. A. Swartz,
M. Grieser,
J. Glorius,
Yu. A. Litvinov,
C. Berthelot,
B. Włoch,
J. Adamczewski-Musch,
P. Alfaurt,
P. Ascher,
L. Audouin,
B. Blank,
K. Blaum,
B. Brückner,
S. Dellmann,
I. Dillmann,
C. Domingo-Pardo,
M. Dupuis,
P. Erbacher,
M. Flayol,
O. Forstner,
D. Freire-Fernández,
M. Gerbaux
, et al. (28 additional authors not shown)
Abstract:
The probabilities for gamma-ray and particle emission as a function of the excitation energy of a decaying nucleus are valuable observables for constraining the ingredients of the models that describe the de-excitation of nuclei near the particle emission threshold. These models are essential in nuclear astrophysics and applications. In this work, we have for the first time simultaneously measured…
▽ More
The probabilities for gamma-ray and particle emission as a function of the excitation energy of a decaying nucleus are valuable observables for constraining the ingredients of the models that describe the de-excitation of nuclei near the particle emission threshold. These models are essential in nuclear astrophysics and applications. In this work, we have for the first time simultaneously measured the gamma-ray and neutron emission probabilities of 208Pb. The measurement was performed in inverse kinematics at the Experimental Storage Ring (ESR) of the GSI/FAIR facility, where a 208Pb beam interacted through the 208Pb(p,p') reaction with a hydrogen gas jet target. Instead of detecting the gamma-rays and neutrons emitted by 208Pb, we detected the heavy beam-like residues produced after gamma and neutron emission. These heavy residues were fully separated by a dipole magnet of the ESR and were detected with outstanding efficiencies. The comparison of the measured probabilities with model calculations has allowed us to test different descriptions of the gamma-ray strength function and the nuclear level density available in the literature.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Dynamics of An Information Theoretic Analog of Two Masses on a Spring
Authors:
Geoff Goehle,
Christopher Griffin
Abstract:
In this short communication we investigate an information theoretic analogue of the classic two masses on spring system, arising from a physical interpretation of Friston's free energy principle in the theory of learning in a system of agents. Using methods from classical mechanics on manifolds, we define a kinetic energy term using the Fisher metric on distributions and a potential energy functio…
▽ More
In this short communication we investigate an information theoretic analogue of the classic two masses on spring system, arising from a physical interpretation of Friston's free energy principle in the theory of learning in a system of agents. Using methods from classical mechanics on manifolds, we define a kinetic energy term using the Fisher metric on distributions and a potential energy function defined in terms of stress on the agents' beliefs. The resulting Lagrangian (Hamiltonian) produces a variation of the classic DeGroot dynamics. In the two agent case, the potential function is defined using the Jeffrey's divergence and the resulting dynamics are characterized by a non-linear spring. These dynamics produce trajectories that resemble flows on tori but are shown numerically to produce chaos near the boundary of the space. We then investigate persuasion as an information theoretic control problem where analysis indicates that manipulating peer pressure with a fixed target is a more stable approach to altering an agent's belief than providing a slowly changing belief state that approaches the target.
△ Less
Submitted 2 September, 2024; v1 submitted 3 July, 2024;
originally announced July 2024.
-
The Ethics of Advanced AI Assistants
Authors:
Iason Gabriel,
Arianna Manzini,
Geoff Keeling,
Lisa Anne Hendricks,
Verena Rieser,
Hasan Iqbal,
Nenad Tomašev,
Ira Ktena,
Zachary Kenton,
Mikel Rodriguez,
Seliem El-Sayed,
Sasha Brown,
Canfer Akbulut,
Andrew Trask,
Edward Hughes,
A. Stevie Bergman,
Renee Shelby,
Nahema Marchal,
Conor Griffin,
Juan Mateos-Garcia,
Laura Weidinger,
Winnie Street,
Benjamin Lange,
Alex Ingerman,
Alison Lentz
, et al. (32 additional authors not shown)
Abstract:
This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro…
▽ More
This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, providing an overview of AI assistants, their technical foundations and potential range of applications. It then explores questions around AI value alignment, well-being, safety and malicious uses. Extending the circle of inquiry further, we next consider the relationship between advanced AI assistants and individual users in more detail, exploring topics such as manipulation and persuasion, anthropomorphism, appropriate relationships, trust and privacy. With this analysis in place, we consider the deployment of advanced assistants at a societal scale, focusing on cooperation, equity and access, misinformation, economic impact, the environment and how best to evaluate advanced AI assistants. Finally, we conclude by providing a range of recommendations for researchers, developers, policymakers and public stakeholders.
△ Less
Submitted 28 April, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Simulations of Classical Three-Body Thermalization in One Dimension
Authors:
M. Eltohfa,
Xinghan Wang,
Colton M. Griffin,
F. Robicheaux
Abstract:
One-dimensional systems, such as nanowires or electrons moving along strong magnetic field lines, have peculiar thermalization physics. The binary collision of point-like particles, typically the dominant process for reaching thermal equilibrium in higher dimensional systems, cannot thermalize a 1D system. We study how dilute classical 1D gases thermalize through three-body collisions. We consider…
▽ More
One-dimensional systems, such as nanowires or electrons moving along strong magnetic field lines, have peculiar thermalization physics. The binary collision of point-like particles, typically the dominant process for reaching thermal equilibrium in higher dimensional systems, cannot thermalize a 1D system. We study how dilute classical 1D gases thermalize through three-body collisions. We consider a system of identical classical point particles with pairwise repulsive inverse power-law potential $V_{ij} \propto 1/|x_i-x_j|^n$ or the pairwise Lennard-Jones potential. Using Monte Carlo methods, we compute a collision kernel and use it in the Boltzmann equation to evolve a perturbed thermal state with temperature $T$ toward equilibrium. We explain the shape of the kernel and its dependence on the system parameters. Additionally, we implement molecular dynamics simulations of a many-body gas and show agreement with the Boltzmann evolution in the low density limit. For the inverse power-law potential, the rate of thermalization is proportional to $ρ^2 T^{\frac{1}{2}-\frac{1}{n}}$ where $ρ$ is the number density. The corresponding proportionality constant decreases with increasing $n$.
△ Less
Submitted 9 July, 2024; v1 submitted 29 February, 2024;
originally announced March 2024.
-
First Exploration of Monopole-Driven Shell Evolution above the N = 126 shell closure: new Millisecond Isomers in 213Tl and 215Tl
Authors:
T. T. Yeung,
A. I. Morales,
J. Wu,
M. Liu,
C. Yuan,
S. Nishimura,
V. H. Phong,
N. Fukuda,
J. L. Tain,
T. Davinson,
K. P. Rykaczewski,
R. Yokoyama,
T. Isobe,
M. Niikura,
Zs. Podolyak,
G. Alcala,
A. Algora,
J. Agramunt,
C. Appleton,
H. Baba,
R. Caballero-Folch,
P. Calvino,
M. P. Carpenter,
I. Dillmann,
A. Estrade
, et al. (30 additional authors not shown)
Abstract:
Isomer spectroscopy of heavy neutron-rich nuclei beyond the N=126 closed shell has been performed for the first time at the Radioactive Isotope Beam Factory of the RIKEN Nishina Center. New millisecond isomers have been identified at low excitation energies, 985.3(19) keV in 213Tl and 874(5) keV in 215Tl. The measured half-lives of 1.34(5) ms in 213Tl and 3.0(3) ms in 215Tl suggest spins and parit…
▽ More
Isomer spectroscopy of heavy neutron-rich nuclei beyond the N=126 closed shell has been performed for the first time at the Radioactive Isotope Beam Factory of the RIKEN Nishina Center. New millisecond isomers have been identified at low excitation energies, 985.3(19) keV in 213Tl and 874(5) keV in 215Tl. The measured half-lives of 1.34(5) ms in 213Tl and 3.0(3) ms in 215Tl suggest spins and parities 11/2- with the single proton-hole configuration h11/2 as leading component. They are populated via E1 transitions by the decay of higher-lying isomeric states with proposed spin and parity 17/2+, interpreted as arising from a single s1/2 proton hole coupled to the 8+ seniority isomer in the (A+1)Pb cores. The lowering of the 11/2- states is ascribed to an increase of the h11/2 proton effective single-particle energy as the second g9/2 orbital is filled by neutrons, owing to a significant reduction of the proton-neutron monopole interaction between the h11/2 and g9/2 orbitals. The new ms-isomers provide the first experimental observation of shell evolution in the almost unexplored N>126 nuclear region below doubly-magic 208Pb.
△ Less
Submitted 25 April, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
Reverse Projection: Real-Time Local Space Texture Mapping
Authors:
Adrian Xuan Wei Lim,
Lynnette Hui Xian Ng,
Conor Griffin,
Nicholas Kyger,
Faraz Baghernezhad
Abstract:
We present Reverse Projection, a novel projective texture mapping technique for painting a decal directly to the texture of a 3D object. Designed to be used in games, this technique works in real-time. By using projection techniques that are computed in local space textures and outward-looking, users using low-end android devices to high-end gaming desktops are able to enjoy the personalization of…
▽ More
We present Reverse Projection, a novel projective texture mapping technique for painting a decal directly to the texture of a 3D object. Designed to be used in games, this technique works in real-time. By using projection techniques that are computed in local space textures and outward-looking, users using low-end android devices to high-end gaming desktops are able to enjoy the personalization of their assets. We believe our proposed pipeline is a step in improving the speed and versatility of model painting.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Persistence and disappearance of negative eigenvalues in dimension two
Authors:
T. J. Christiansen,
K. Datchev,
C. Griffin
Abstract:
We compute asymptotics of eigenvalues approaching the bottom of the continuous spectrum, and associated resonances, for Schrödinger operators in dimension two. We distinguish persistent eigenvalues, which have associated resonances, from disappearing ones, which do not. We illustrate the significance of this distinction by computing corresponding scattering phase asymptotics and numerical Breit--W…
▽ More
We compute asymptotics of eigenvalues approaching the bottom of the continuous spectrum, and associated resonances, for Schrödinger operators in dimension two. We distinguish persistent eigenvalues, which have associated resonances, from disappearing ones, which do not. We illustrate the significance of this distinction by computing corresponding scattering phase asymptotics and numerical Breit--Wigner peaks. We prove all of our results for circular wells, and extend some of them to more general problems using recent resolvent techniques.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Cross-shell excited configurations in the structure of 34Si
Authors:
R. S. Lubna,
A. B. Garnsworthy,
Vandana Tripathi,
G. C. Ball,
C. R. Natzke,
M. Rocchini,
C. Andreoiu,
S. S. Bhattacharjee,
I. Dillmann,
F. H. Garcia,
S. A. Gillespie,
G. Hackman,
C. J. Griffin,
G. Leckenby,
T. Miyagi,
B. Olaizola,
C. Porzio,
M. M. Rajabali,
Y. Saito,
P. Spagnoletti,
S. L. Tabor,
R. Umashankar,
V. Vedia,
A. Volya,
J. Williams
, et al. (1 additional authors not shown)
Abstract:
The cross-shell excited states of $^{34}$Si have been investigated via $β$-decays of the $4^-$ ground state and the $1^+$ isomeric state of $^{34}$Al. Since the valence protons and valence neutrons occupy different major shells in the ground state as well as the intruder $1^+$ isomeric state of $^{34}$Al, intruder levels of $^{34}$Si are populated via allowed $β$ decays. Spin assignments to such i…
▽ More
The cross-shell excited states of $^{34}$Si have been investigated via $β$-decays of the $4^-$ ground state and the $1^+$ isomeric state of $^{34}$Al. Since the valence protons and valence neutrons occupy different major shells in the ground state as well as the intruder $1^+$ isomeric state of $^{34}$Al, intruder levels of $^{34}$Si are populated via allowed $β$ decays. Spin assignments to such intruder levels of $^{34}$Si were established through $γ$-$γ$ angular correlation analysis for the negative parity states with dominant configurations $(νd_{3/2})^{-1} \otimes (νf_{7/2})^{1}$ as well as the positive parity states with dominant configurations $(νsd)^{-2} \otimes (νf_{7/2}p_{3/2})^2$. The configurations of such intruder states play crucial roles in our understanding of the $N=20$ shell gap evolution. A configuration interaction model derived from the FSU Hamiltonian was utilized in order to interpret the intruder states in $^{34}$Si. Shell model interaction derived from a more fundamental theory with the Valence Space In Medium Similarity Renormalization Group (VS-IMSRG) method was also employed to interpret the structure of $^{34}$Si.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
Spatial Dynamics of Higher Order Rock-Paper-Scissors and Generalisations
Authors:
Christopher Griffin,
Li Feng,
Rongling Wu
Abstract:
We introduce and study the spatial replicator equation with higher order interactions and both infinite (spatially homogeneous) populations and finite (spatially inhomogeneous) populations. We show that in the special case of three strategies (rock-paper-scissors) higher order interaction terms allow travelling waves to emerge in non-declining finite populations. We show that these travelling wave…
▽ More
We introduce and study the spatial replicator equation with higher order interactions and both infinite (spatially homogeneous) populations and finite (spatially inhomogeneous) populations. We show that in the special case of three strategies (rock-paper-scissors) higher order interaction terms allow travelling waves to emerge in non-declining finite populations. We show that these travelling waves arise from diffusion stabilisation of an unstable interior equilibrium point that is present in the aspatial dynamics. Based on these observations and prior results, we offer two conjectures whose proofs would fully generalise our results to all odd cyclic games, both with and without higher order interactions, assuming a spatial replicator dynamic. Intriguingly, these generalisations for $N \geq 5$ strategies seem to require declining populations, as we show in our discussion.
△ Less
Submitted 27 December, 2023;
originally announced December 2023.
-
First measurement of the neutron-emission probability with a surrogate reaction in inverse kinematics at a heavy-ion storage ring
Authors:
M. Sguazzin,
B. Jurado,
J. Pibernat,
J. A. Swartz,
M. Grieser,
J. Glorius,
Yu. A. Litvinov,
J. Adamczewski-Musch,
P. Alfaurt,
P. Ascher,
L. Audouin,
C. Berthelot,
B. Blank,
K. Blaum,
B. Brückner,
S. Dellmann,
I. Dillmann,
C. Domingo-Pardo,
M. Dupuis,
P. Erbacher,
M. Flayol,
O. Forstner,
D. Freire-Fernández,
M. Gerbaux,
J. Giovinazzo
, et al. (28 additional authors not shown)
Abstract:
Neutron-induced reaction cross sections of short-lived nuclei are imperative to understand the origin of heavy elements in stellar nucleosynthesis and for societal applications, but their measurement is extremely complicated due to the radioactivity of the targets involved. One way of overcoming this issue is to combine surrogate reactions with the unique possibilities offered by heavy-ion storage…
▽ More
Neutron-induced reaction cross sections of short-lived nuclei are imperative to understand the origin of heavy elements in stellar nucleosynthesis and for societal applications, but their measurement is extremely complicated due to the radioactivity of the targets involved. One way of overcoming this issue is to combine surrogate reactions with the unique possibilities offered by heavy-ion storage rings. In this work, we describe the first surrogate-reaction experiment in inverse kinematics, which we successfully conducted at the Experimental Storage Ring (ESR) of the GSI/FAIR facility, using the 208Pb(p,p') reaction as a surrogate for neutron capture on 207Pb. Thanks to the outstanding detection efficiencies possible at the ESR, we were able to measure for the first time the neutron-emission probability as a function of the excitation energy of 208Pb. We have used this probability to select different descriptions of the gamma-ray strength function and nuclear level density, and provide reliable results for the neutron-induced radiative capture cross section of 207Pb at energies for which no experimental data exist.
△ Less
Submitted 22 July, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Measurement of the Isolated Nuclear Two-Photon Decay in $^{72}\mathrm{Ge}$
Authors:
D. Freire-Fernández,
W. Korten,
R. J. Chen,
S. Litvinov,
Yu. A. Litvinov,
M. S. Sanjari,
H. Weick,
F. C. Akinci,
H. M. Albers,
M. Armstrong,
A. Banerjee,
K. Blaum,
C. Brandau,
B. A. Brown,
C. G. Bruno,
J. J. Carroll,
X. Chen,
Ch. J. Chiara,
M. L. Cortes,
S. F. Dellmann,
I. Dillmann,
D. Dmytriiev,
O. Forstner,
H. Geissel,
J. Glorius
, et al. (35 additional authors not shown)
Abstract:
The nuclear two-photon or double-gamma ($2γ$) decay is a second-order electromagnetic process whereby a nucleus in an excited state emits two gamma rays simultaneously. To be able to directly measure the $2γ$ decay rate in the low-energy regime below the electron-positron pair-creation threshold, we combined the isochronous mode of a storage ring with Schottky resonant cavities. The newly develope…
▽ More
The nuclear two-photon or double-gamma ($2γ$) decay is a second-order electromagnetic process whereby a nucleus in an excited state emits two gamma rays simultaneously. To be able to directly measure the $2γ$ decay rate in the low-energy regime below the electron-positron pair-creation threshold, we combined the isochronous mode of a storage ring with Schottky resonant cavities. The newly developed technique can be applied to isomers with excitation energies down to $\sim100$\,keV and half-lives as short as $\sim10$\,ms. The half-life for the $2γ$ decay of the first-excited $0^+$ state in bare $^{72}\mathrm{Ge}$ ions was determined to be $23.9\left(6\right)$\,ms, which strongly deviates from expectations.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Use of statistically leinert sets to calculate return probabilities of random walks in F_s1 x F_s2
Authors:
Colton Griffin,
Sanchita Chakraborty
Abstract:
Hastings first presented bounds on the second largest eigenvalue for matrices in a Hermitian complete positive map in 2007. In this work we extend his work to tighten these bounds. To do this, we introduce the idea of Statistically Leinert Sets to modify the generating functions presented in Woess in 1986 and recompute the radii of convergence in his paper in 1986. We primarily use techniques from…
▽ More
Hastings first presented bounds on the second largest eigenvalue for matrices in a Hermitian complete positive map in 2007. In this work we extend his work to tighten these bounds. To do this, we introduce the idea of Statistically Leinert Sets to modify the generating functions presented in Woess in 1986 and recompute the radii of convergence in his paper in 1986. We primarily use techniques from combinatorics and calculate norms using the ideas presented by Akemann and Ostrang in their paper in 1976.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
A characterization of piecewise $\mathcal{F}$-syndetic sets
Authors:
Conner Griffin
Abstract:
Some filter relative notions of size, $\left( \mathcal{F},\mathcal{G}\right) $-syndeticity and piecewise $\mathcal{F} $-syndeticity, were defined and applied with clarity and focus by Shuungula, Zelenyuk and Zelenyuk in their paper ``The closure of the smallest ideal of an ultrafilter semigroup.'' These notions are generalizations of the well studied notions of syndeticity and piecewise syndeticit…
▽ More
Some filter relative notions of size, $\left( \mathcal{F},\mathcal{G}\right) $-syndeticity and piecewise $\mathcal{F} $-syndeticity, were defined and applied with clarity and focus by Shuungula, Zelenyuk and Zelenyuk in their paper ``The closure of the smallest ideal of an ultrafilter semigroup.'' These notions are generalizations of the well studied notions of syndeticity and piecewise syndeticity. Since then, there has been an effort to develop the theory around the algebraic structure of the Stone-Čech compactification so that it encompasses these new generalizations. In this direction, we prove a characterization of piecewise $\mathcal{F}$-syndetic sets. This fully answers a conjecture of Christopherson and Johnson. arXiv:2105.09723
△ Less
Submitted 17 August, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Reinforcement Learning Fine-tuning of Language Models is Biased Towards More Extractable Features
Authors:
Diogo Cruz,
Edoardo Pona,
Alex Holness-Tofts,
Elias Schmied,
Víctor Abia Alonso,
Charlie Griffin,
Bogdan-Ionut Cirstea
Abstract:
Many capable large language models (LLMs) are developed via self-supervised pre-training followed by a reinforcement-learning fine-tuning phase, often based on human or AI feedback. During this stage, models may be guided by their inductive biases to rely on simpler features which may be easier to extract, at a cost to robustness and generalisation. We investigate whether principles governing indu…
▽ More
Many capable large language models (LLMs) are developed via self-supervised pre-training followed by a reinforcement-learning fine-tuning phase, often based on human or AI feedback. During this stage, models may be guided by their inductive biases to rely on simpler features which may be easier to extract, at a cost to robustness and generalisation. We investigate whether principles governing inductive biases in the supervised fine-tuning of LLMs also apply when the fine-tuning process uses reinforcement learning. Following Lovering et al (2021), we test two hypotheses: that features more $\textit{extractable}$ after pre-training are more likely to be utilised by the final policy, and that the evidence for/against a feature predicts whether it will be utilised. Through controlled experiments on synthetic and natural language tasks, we find statistically significant correlations which constitute strong evidence for these hypotheses.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Alexandrov's estimate revisited
Authors:
Charles Griffin,
Kennedy Obinna Idu,
Robert L. Jerrard
Abstract:
Alexandrov's estimate states that if $Ω$ is a bounded open convex domain in ${\mathbb R}^n$ and $u:\bar Ω\to {\mathbb R}$ is a convex solution of the Monge-Ampere equation $\det D^2 u = f$ that vanishes on $\partial Ω$, then \[ |u(x) - u(y)| \le ω(|x-y|)(\int_Ωf)^{1/n} \qquad \mbox{for }ω(δ) = C_n\,\mbox{diam}(Ω)^{\frac{n-1}n} δ^{1/n}. \] We establish a variety of improvements of this, depending o…
▽ More
Alexandrov's estimate states that if $Ω$ is a bounded open convex domain in ${\mathbb R}^n$ and $u:\bar Ω\to {\mathbb R}$ is a convex solution of the Monge-Ampere equation $\det D^2 u = f$ that vanishes on $\partial Ω$, then \[ |u(x) - u(y)| \le ω(|x-y|)(\int_Ωf)^{1/n} \qquad \mbox{for }ω(δ) = C_n\,\mbox{diam}(Ω)^{\frac{n-1}n} δ^{1/n}. \] We establish a variety of improvements of this, depending on the geometry of $\partial Ω$. For example, we show that if the curvature is bounded away from $0$, then the estimate remains valid if $ω(δ)$ is replaced by $C_Ωδ^{\frac 12 + \frac 1{2n}}$. We determine the sharp constant $C_Ω$ when $n=2$, and when $n\ge 3$ and $\partial Ω$ is $C^2$, we determine the sharp asymptotics of the optimal modulus of continuity $ω_Ω(δ)$ as $δ\to 0$. For arbitrary convex domains, we characterize the scaling of the optimal modulus $ω_Ω$. Under very mild nondegeneracy conditions, our results yield the improved Holder estimate, $ω_Ω(δ) \le C δ^α$ for some $α>1/n$.
△ Less
Submitted 9 March, 2024; v1 submitted 31 October, 2023;
originally announced October 2023.
-
Sociotechnical Safety Evaluation of Generative AI Systems
Authors:
Laura Weidinger,
Maribeth Rauh,
Nahema Marchal,
Arianna Manzini,
Lisa Anne Hendricks,
Juan Mateos-Garcia,
Stevie Bergman,
Jackie Kay,
Conor Griffin,
Ben Bariach,
Iason Gabriel,
Verena Rieser,
William Isaac
Abstract:
Generative AI systems produce a range of risks. To ensure the safety of generative AI systems, these risks must be evaluated. In this paper, we make two main contributions toward establishing such evaluations. First, we propose a three-layered framework that takes a structured, sociotechnical approach to evaluating these risks. This framework encompasses capability evaluations, which are the main…
▽ More
Generative AI systems produce a range of risks. To ensure the safety of generative AI systems, these risks must be evaluated. In this paper, we make two main contributions toward establishing such evaluations. First, we propose a three-layered framework that takes a structured, sociotechnical approach to evaluating these risks. This framework encompasses capability evaluations, which are the main current approach to safety evaluation. It then reaches further by building on system safety principles, particularly the insight that context determines whether a given capability may cause harm. To account for relevant context, our framework adds human interaction and systemic impacts as additional layers of evaluation. Second, we survey the current state of safety evaluation of generative AI systems and create a repository of existing evaluations. Three salient evaluation gaps emerge from this analysis. We propose ways forward to closing these gaps, outlining practical steps as well as roles and responsibilities for different actors. Sociotechnical safety evaluation is a tractable approach to the robust and comprehensive safety evaluation of generative AI systems.
△ Less
Submitted 31 October, 2023; v1 submitted 18 October, 2023;
originally announced October 2023.
-
On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Authors:
Rohan Subramani,
Marcus Williams,
Max Heitmann,
Halfdan Holm,
Charlie Griffin,
Joar Skalse
Abstract:
Most algorithms in reinforcement learning (RL) require that the objective is formalised with a Markovian reward function. However, it is well-known that certain tasks cannot be expressed by means of an objective in the Markov rewards formalism, motivating the study of alternative objective-specification formalisms in RL such as Linear Temporal Logic and Multi-Objective Reinforcement Learning. To d…
▽ More
Most algorithms in reinforcement learning (RL) require that the objective is formalised with a Markovian reward function. However, it is well-known that certain tasks cannot be expressed by means of an objective in the Markov rewards formalism, motivating the study of alternative objective-specification formalisms in RL such as Linear Temporal Logic and Multi-Objective Reinforcement Learning. To date, there has not yet been any thorough analysis of how these formalisms relate to each other in terms of their expressivity. We fill this gap in the existing literature by providing a comprehensive comparison of 17 salient objective-specification formalisms. We place these formalisms in a preorder based on their expressive power, and present this preorder as a Hasse diagram. We find a variety of limitations for the different formalisms, and argue that no formalism is both dominantly expressive and straightforward to optimise with current techniques. For example, we prove that each of Regularised RL, (Outer) Nonlinear Markov Rewards, Reward Machines, Linear Temporal Logic, and Limit Average Rewards can express a task that the others cannot. The significance of our results is twofold. First, we identify important expressivity limitations to consider when specifying objectives for policy optimization. Second, our results highlight the need for future research which adapts reward learning to work with a greater variety of formalisms, since many existing reward learning methods assume that the desired objective takes a Markovian form. Our work contributes towards a more cohesive understanding of the costs and benefits of different RL objective-specification formalisms.
△ Less
Submitted 17 February, 2024; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Goodhart's Law in Reinforcement Learning
Authors:
Jacek Karwowski,
Oliver Hayman,
Xingjian Bai,
Klaus Kiendlhofer,
Charlie Griffin,
Joar Skalse
Abstract:
Implementing a reward function that perfectly captures a complex task in the real world is impractical. As a result, it is often appropriate to think of the reward function as a proxy for the true objective rather than as its definition. We study this phenomenon through the lens of Goodhart's law, which predicts that increasing optimisation of an imperfect proxy beyond some critical point decrease…
▽ More
Implementing a reward function that perfectly captures a complex task in the real world is impractical. As a result, it is often appropriate to think of the reward function as a proxy for the true objective rather than as its definition. We study this phenomenon through the lens of Goodhart's law, which predicts that increasing optimisation of an imperfect proxy beyond some critical point decreases performance on the true objective. First, we propose a way to quantify the magnitude of this effect and show empirically that optimising an imperfect proxy reward often leads to the behaviour predicted by Goodhart's law for a wide range of environments and reward functions. We then provide a geometric explanation for why Goodhart's law occurs in Markov decision processes. We use these theoretical insights to propose an optimal early stopping method that provably avoids the aforementioned pitfall and derive theoretical regret bounds for this method. Moreover, we derive a training method that maximises worst-case reward, for the setting where there is uncertainty about the true reward function. Finally, we evaluate our early stopping method experimentally. Our results support a foundation for a theoretically-principled study of reinforcement learning under reward misspecification.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Free Entropy Minimizing Persuasion in a Predictor-Corrector Dynamic
Authors:
Geoff Goehle,
Christopher Griffin
Abstract:
Persuasion is the process of changing an agent's belief distribution from a given (or estimated) prior to a desired posterior. A common assumption in the acceptance of information or misinformation as fact is that the (mis)information must be consistent with or familiar to the individual who accepts it. We model the process as a control problem in which the state is given by a (time-varying) belie…
▽ More
Persuasion is the process of changing an agent's belief distribution from a given (or estimated) prior to a desired posterior. A common assumption in the acceptance of information or misinformation as fact is that the (mis)information must be consistent with or familiar to the individual who accepts it. We model the process as a control problem in which the state is given by a (time-varying) belief distribution following a predictor-corrector dynamic. Persuasion is modeled as the corrector control signal with the performance index defined using the Fisher-Rao information metric, reflecting a fundamental cost associated to altering the agent's belief distribution. To compensate for the fact that information production arises naturally from the predictor dynamic (i.e., expected beliefs change) we modify the Fisher-Rao metric to account just for information generated by the control signal. The resulting optimal control problem produces non-geodesic paths through distribution space that are compared to the geodesic paths found using the standard free entropy minimizing Fisher metric in several example belief models: a Kalman Filter, a Boltzmann distribution and a joint Kalman/Boltzmann belief system.
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
A Multidimensional Fourier Approximation of Optimal Control Surfaces
Authors:
Gabriel Nicolosi,
Terry Friesz,
Christopher Griffin
Abstract:
This work considers the problem of approximating initial condition and time-dependent optimal control and trajectory surfaces using multivariable Fourier series. A modified Augmented Lagrangian algorithm for translating the optimal control problem into an unconstrained optimization one is proposed and two problems are solved: a quadratic control problem in the context of Newtonian mechanics, and a…
▽ More
This work considers the problem of approximating initial condition and time-dependent optimal control and trajectory surfaces using multivariable Fourier series. A modified Augmented Lagrangian algorithm for translating the optimal control problem into an unconstrained optimization one is proposed and two problems are solved: a quadratic control problem in the context of Newtonian mechanics, and a control problem arising from an odd-circulant game ruled by the replicator dynamics. Various computational results are presented. Use of automatic differentiation is explored to circumvent the elaborated gradient computation in the first-order optimization procedure. Furthermore, mean square error bounds are derived for the case of one and two-dimensional Fourier series approximations, inducing a general bound for problems of $n$ dimensions.
△ Less
Submitted 12 December, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
Consensus in Complex Networks with Noisy Agents and Peer Pressure
Authors:
Christopher Griffin,
Anna Squicciarini,
Feiran Jia
Abstract:
In this paper we study a discrete time consensus model on a connected graph with monotonically increasing peer-pressure and noise perturbed outputs masking a hidden state. We assume that each agent maintains a constant hidden state and a presents a dynamic output that is perturbed by random noise drawn from a mean-zero distribution. We show consensus is ensured in the limit as time goes to infinit…
▽ More
In this paper we study a discrete time consensus model on a connected graph with monotonically increasing peer-pressure and noise perturbed outputs masking a hidden state. We assume that each agent maintains a constant hidden state and a presents a dynamic output that is perturbed by random noise drawn from a mean-zero distribution. We show consensus is ensured in the limit as time goes to infinity under certain assumptions on the increasing peer-pressure term and also show that the hidden state cannot be exactly recovered even when model dynamics and outputs are known. The exact nature of the distribution is computed for a simple two vertex graph and results found are shown to generalize (empirically) to more complex graph structures.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Storage, Accumulation and Deceleration of Secondary Beams for Nuclear Astrophysics
Authors:
J. Glorius,
Yu. A. Litvinov,
M. Aliotta,
F. Amjad,
B. Brückner,
C. G. Bruno,
R. Chen,
T. Davinson,
S. F. Dellmann,
T. Dickel,
I. Dillmann,
P. Erbacher,
O. Forstner,
H. Geissel,
C. J. Griffin,
R. Grisenti,
A. Gumberidze,
E. Haettner,
R. Hess,
P. -M. Hillenbrand,
C. Hornung,
R. Joseph,
B. Jurado,
E. Kazanseva,
R. Knöbel
, et al. (39 additional authors not shown)
Abstract:
Low-energy investigations on rare ion beams are often limited by the available intensity and purity of the ion species in focus. Here, we present the first application of a technique that combines in-flight production at relativistic energies with subsequent secondary beam storage, accumulation and finally deceleration to the energy of interest. Using the FRS and ESR facilities at GSI, this scheme…
▽ More
Low-energy investigations on rare ion beams are often limited by the available intensity and purity of the ion species in focus. Here, we present the first application of a technique that combines in-flight production at relativistic energies with subsequent secondary beam storage, accumulation and finally deceleration to the energy of interest. Using the FRS and ESR facilities at GSI, this scheme was pioneered to provide a secondary beam of $^{118}$Te$^{52+}$ for the measurement of nuclear proton-capture at energies of 6 and 7 MeV/u. The technique provided stored beam intensities of about $10^6$ ions at high purity and brilliance, representing a major step towards low-energy nuclear physics studies using rare ion beams.
△ Less
Submitted 30 May, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Approximation of Optimal Control Surfaces for the Bass Model with Stochastic Dynamics
Authors:
Gabriel Nicolosi,
Christopher Griffin
Abstract:
The Bass diffusion equation is a well-known and established modeling approach for describing new product adoption in a competitive market. This model also describes diffusion phenomena in various contexts: infectious disease spread modeling and estimation, rumor spread on social networks, prediction of renewable energy technology markets, among others. Most of these models, however, consider a det…
▽ More
The Bass diffusion equation is a well-known and established modeling approach for describing new product adoption in a competitive market. This model also describes diffusion phenomena in various contexts: infectious disease spread modeling and estimation, rumor spread on social networks, prediction of renewable energy technology markets, among others. Most of these models, however, consider a deterministic trajectory of the associated state variable (e.g., market-share). In reality, the diffusion process is subject to noise, and a stochastic component must be added to the state dynamics. The stochastic Bass model has also been studied in many areas, such as energy markets and marketing. Exploring the stochastic version of the Bass diffusion model, we propose in this work an approximation of (stochastic) optimal control surfaces for a continuous-time problem arising from a $2\times2$ skew symmetric evolutionary game, providing the stochastic counter-part of the Fourier-based optimal control approximation already existent in the literature.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
A prototype hybrid prediction market for estimating replicability of published work
Authors:
Tatiana Chakravorti,
Robert Fraleigh,
Timothy Fritton,
Michael McLaughlin,
Vaibhav Singh,
Christopher Griffin,
Anthony Kwasnica,
David Pennock,
C. Lee Giles,
Sarah Rajtmajer
Abstract:
We present a prototype hybrid prediction market and demonstrate the avenue it represents for meaningful human-AI collaboration. We build on prior work proposing artificial prediction markets as a novel machine-learning algorithm. In an artificial prediction market, trained AI agents buy and sell outcomes of future events. Classification decisions can be framed as outcomes of future events, and acc…
▽ More
We present a prototype hybrid prediction market and demonstrate the avenue it represents for meaningful human-AI collaboration. We build on prior work proposing artificial prediction markets as a novel machine-learning algorithm. In an artificial prediction market, trained AI agents buy and sell outcomes of future events. Classification decisions can be framed as outcomes of future events, and accordingly, the price of an asset corresponding to a given classification outcome can be taken as a proxy for the confidence of the system in that decision. By embedding human participants in these markets alongside bot traders, we can bring together insights from both. In this paper, we detail pilot studies with prototype hybrid markets for the prediction of replication study outcomes. We highlight challenges and opportunities, share insights from semi-structured interviews with hybrid market participants, and outline a vision for ongoing and future work.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating
Authors:
Akshat Harlalka,
Andrew Belmonte,
Christopher Griffin
Abstract:
We introduce the idea of a dining club to the Kolkata Paise Restaurant Problem. In this problem, $N$ agents choose (randomly) among $N$ restaurants, but if multiple agents choose the same restaurant, only one will eat. Agents in the dining club will coordinate their restaurant choice to avoid choice collision and increase their probability of eating. We model the problem of deciding whether to joi…
▽ More
We introduce the idea of a dining club to the Kolkata Paise Restaurant Problem. In this problem, $N$ agents choose (randomly) among $N$ restaurants, but if multiple agents choose the same restaurant, only one will eat. Agents in the dining club will coordinate their restaurant choice to avoid choice collision and increase their probability of eating. We model the problem of deciding whether to join the dining club as an evolutionary game and show that the strategy of joining the dining club is evolutionarily stable. We then introduce an optimized member tax to those individuals in the dining club, which is used to provide a safety net for those group members who don't eat because of collision with a non-dining club member. When non-dining club members are allowed to cheat and share communal food within the dining club, we show that a new unstable fixed point emerges in the dynamics. A bifurcation analysis is performed in this case. To conclude our theoretical study, we then introduce evolutionary dynamics for the cheater population and study these dynamics. Numerical experiments illustrate the behaviour of the system with more than one dining club and show several potential areas for future research.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
First Evidence of Axial Shape Asymmetry and Configuration Coexistence in $^{74}$Zn: Suggestion for a Northern Extension of the $N=40$ Island of Inversion
Authors:
M. Rocchini,
P. E. Garrett,
M. Zielinska,
S. M. Lenzi,
D. D. Dao,
F. Nowacki,
V. Bildstein,
A. D. MacLean,
B. Olaizola,
Z. T. Ahmed,
C. Andreoiu,
A. Babu,
G. C. Ball,
S. S. Bhattacharjee,
H. Bidaman,
C. Cheng,
R. Coleman,
I. Dillmann,
A. B. Garnsworthy,
S. Gillespie,
C. J. Griffin,
G. F. Grinyer,
G. Hackman,
M. Hanley,
A. Illana
, et al. (19 additional authors not shown)
Abstract:
The excited states of $N=44$ $^{74}$Zn were investigated via $γ$-ray spectroscopy following $^{74}$Cu $β$ decay. By exploiting $γ$-$γ$ angular correlation analysis, the $2_2^+$, $3_1^+$, $0_2^+$ and $2_3^+$ states in $^{74}$Zn were firmly established. The $γ$-ray branching and $E2/M1$ mixing ratios for transitions de-exciting the $2_2^+$, $3_1^+$ and $2_3^+$ states were measured, allowing for the…
▽ More
The excited states of $N=44$ $^{74}$Zn were investigated via $γ$-ray spectroscopy following $^{74}$Cu $β$ decay. By exploiting $γ$-$γ$ angular correlation analysis, the $2_2^+$, $3_1^+$, $0_2^+$ and $2_3^+$ states in $^{74}$Zn were firmly established. The $γ$-ray branching and $E2/M1$ mixing ratios for transitions de-exciting the $2_2^+$, $3_1^+$ and $2_3^+$ states were measured, allowing for the extraction of relative $B(E2)$ values. In particular, the $2_3^+ \to 0_2^+$ and $2_3^+ \to 4_1^+$ transitions were observed for the first time. The results show excellent agreement with new microscopic large-scale shell-model calculations, and are discussed in terms of underlying shapes, as well as the role of neutron excitations across the $N=40$ gap. Enhanced axial shape asymmetry (triaxiality) is suggested to characterize $^{74}$Zn in its ground state. Furthermore, an excited $K=0$ band with a significantly larger softness in its shape is identified. A shore of the $N=40$ ``island of inversion'' appears to manifest above $Z=26$, previously thought as its northern limit in the chart of the nuclides.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Topological Learning in Multi-Class Data Sets
Authors:
Christopher Griffin,
Trevor Karn,
Benjamin Apple
Abstract:
We specialize techniques from topological data analysis to the problem of characterizing the topological complexity (as defined in the body of the paper) of a multi-class data set. As a by-product, a topological classifier is defined that uses an open sub-covering of the data set. This sub-covering can be used to construct a simplicial complex whose topological features (e.g., Betti numbers) provi…
▽ More
We specialize techniques from topological data analysis to the problem of characterizing the topological complexity (as defined in the body of the paper) of a multi-class data set. As a by-product, a topological classifier is defined that uses an open sub-covering of the data set. This sub-covering can be used to construct a simplicial complex whose topological features (e.g., Betti numbers) provide information about the classification problem. We use these topological constructs to study the impact of topological complexity on learning in feedforward deep neural networks (DNNs). We hypothesize that topological complexity is negatively correlated with the ability of a fully connected feedforward deep neural network to learn to classify data correctly. We evaluate our topological classification algorithm on multiple constructed and open source data sets. We also validate our hypothesis regarding the relationship between topological complexity and learning in DNN's on multiple data sets.
△ Less
Submitted 8 February, 2024; v1 submitted 23 January, 2023;
originally announced January 2023.
-
On a Finite Population Variation of the Fisher-KPP Equation
Authors:
Christopher Griffin
Abstract:
In this paper, we formulate a finite population variation of the Fisher-KPP equation using the fact that the reaction term can be generated from the replicator dynamic using a two-player two-strategy skew-symmetric game. We use prior results from Ablowitz and Zeppetella to show that the resulting system of partial differential equations admits a travelling wave solution, and that there are closed…
▽ More
In this paper, we formulate a finite population variation of the Fisher-KPP equation using the fact that the reaction term can be generated from the replicator dynamic using a two-player two-strategy skew-symmetric game. We use prior results from Ablowitz and Zeppetella to show that the resulting system of partial differential equations admits a travelling wave solution, and that there are closed form solutions for this travelling wave. Interestingly, the closed form solution is constructed from a sign-reversal of the known closed form solution of the classic Fisher equation. We also construct a closed form solution approximation for the corresponding equilibrium problem on a finite interval with Dirichlet and Neumann boundary conditions. Two conjectures on these corresponding equilibrium problems are presented and analysed numerically.
△ Less
Submitted 30 January, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Higher Order Dynamics in the Replicator Equation Produce a Limit Cycle in Rock-Paper-Scissors
Authors:
Christopher Griffin,
Rongling Wu
Abstract:
Recent work has shown that pairwise interactions may not be sufficient to fully model ecological dynamics in the wild. In this letter, we consider a replicator dynamic that takes both pairwise and triadic interactions into consideration using a rank-three tensor. We study {these} new nonlinear dynamics using a generalized rock-paper-scissors game whose dynamics are well understood in the {standard…
▽ More
Recent work has shown that pairwise interactions may not be sufficient to fully model ecological dynamics in the wild. In this letter, we consider a replicator dynamic that takes both pairwise and triadic interactions into consideration using a rank-three tensor. We study {these} new nonlinear dynamics using a generalized rock-paper-scissors game whose dynamics are well understood in the {standard} replicator sense. We show that the addition of higher-order dynamics leads to the creation of a subcritical Hopf bifurcation and consequently an unstable limit cycle. It is known that this kind of behaviour cannot occur in the pairwise replicator in any three strategy games, showing the effect higher-order interactions can have on the resulting dynamics of the system. We numerically characterize parameter regimes in which limit cycles exist and discuss possible ways to generalize this approach to studying higher-order interactions.
△ Less
Submitted 26 April, 2023; v1 submitted 6 January, 2023;
originally announced January 2023.
-
Lexicographic Multi-Objective Reinforcement Learning
Authors:
Joar Skalse,
Lewis Hammond,
Charlie Griffin,
Alessandro Abate
Abstract:
In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy that maximises the first reward signal, and subject to this constraint also maximises the second reward signal, and so on. We present a family of both action-value and policy gradient algorit…
▽ More
In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy that maximises the first reward signal, and subject to this constraint also maximises the second reward signal, and so on. We present a family of both action-value and policy gradient algorithms that can be used to solve such problems, and prove that they converge to policies that are lexicographically optimal. We evaluate the scalability and performance of these algorithms empirically, demonstrating their practical applicability. As a more specific application, we show how our algorithms can be used to impose safety constraints on the behaviour of an agent, and compare their performance in this context with that of other constrained reinforcement learning algorithms.
△ Less
Submitted 28 December, 2022;
originally announced December 2022.
-
Artificial prediction markets present a novel opportunity for human-AI collaboration
Authors:
Tatiana Chakravorti,
Vaibhav Singh,
Sarah Rajtmajer,
Michael McLaughlin,
Robert Fraleigh,
Christopher Griffin,
Anthony Kwasnica,
David Pennock,
C. Lee Giles
Abstract:
Despite high-profile successes in the field of Artificial Intelligence, machine-driven technologies still suffer important limitations, particularly for complex tasks where creativity, planning, common sense, intuition, or learning from limited data is required. These limitations motivate effective methods for human-machine collaboration. Our work makes two primary contributions. We thoroughly exp…
▽ More
Despite high-profile successes in the field of Artificial Intelligence, machine-driven technologies still suffer important limitations, particularly for complex tasks where creativity, planning, common sense, intuition, or learning from limited data is required. These limitations motivate effective methods for human-machine collaboration. Our work makes two primary contributions. We thoroughly experiment with an artificial prediction market model to understand the effects of market parameters on model performance for benchmark classification tasks. We then demonstrate, through simulation, the impact of exogenous agents in the market, where these exogenous agents represent primitive human behaviors. This work lays the foundation for a novel set of hybrid human-AI machine learning algorithms.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Completely Integrable Replicator Dynamics Associated to Competitive Networks
Authors:
Josh Paik,
Christopher Griffin
Abstract:
The replicator equations are a family of ordinary differential equations that arise in evolutionary game theory, and are closely related to Lotka-Volterra. We produce an infinite family of replicator equations which are Liouville-Arnold integrable. We show this by explicitly providing conserved quantities and a Poisson structure. As a corollary, we classify all tournament replicators up to dimensi…
▽ More
The replicator equations are a family of ordinary differential equations that arise in evolutionary game theory, and are closely related to Lotka-Volterra. We produce an infinite family of replicator equations which are Liouville-Arnold integrable. We show this by explicitly providing conserved quantities and a Poisson structure. As a corollary, we classify all tournament replicators up to dimension 6 and most of dimension 7. As an application, we show that Fig. 1 of ``A competitive network theory of species diversity" by Allesina and Levine (Proc. Natl. Acad. Sci., 2011), produces quasiperiodic dynamics.
△ Less
Submitted 6 May, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Approximation of Optimal Control Surfaces for $2\times 2$ Skew-Symmetric Evolutionary Game Dynamics
Authors:
Gabriel Nicolosi,
Terry Friesz,
Christopher Griffin
Abstract:
In this paper we study the problem of approximating the general solution to an optimal control problem whose dynamics arise from a $2\times 2$ skew-symmetric evolutionary game with arbitrary initial condition. Our approach uses a Fourier approximation method and generalizes prior work in the use of orthogonal function approximation for optimal control. At the same time we cast the fitting problem…
▽ More
In this paper we study the problem of approximating the general solution to an optimal control problem whose dynamics arise from a $2\times 2$ skew-symmetric evolutionary game with arbitrary initial condition. Our approach uses a Fourier approximation method and generalizes prior work in the use of orthogonal function approximation for optimal control. At the same time we cast the fitting problem in the context of a non-standard feedforward neural network and derive the back-propagation operator in this context. An example of the efficacy of this approach is provided and generalizations are discussed.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Community Formation in Wealth-Mediated Thermodynamic Strategy Evolution
Authors:
Connor Olson,
Andrew Belmonte,
Christopher Griffin
Abstract:
We study a dynamical system defined by a repeated game on a 1D lattice, in which the players keep track of their gross payoffs over time in a bank. Strategy updates are governed by a Boltzmann distribution which depends on the neighborhood bank values associated with each strategy, relative to a temperature scale which defines the random fluctuations. Players with higher bank values are thus less…
▽ More
We study a dynamical system defined by a repeated game on a 1D lattice, in which the players keep track of their gross payoffs over time in a bank. Strategy updates are governed by a Boltzmann distribution which depends on the neighborhood bank values associated with each strategy, relative to a temperature scale which defines the random fluctuations. Players with higher bank values are thus less likely to change strategy than players with lower bank value. For a parameterized rock-paper-scissors game, we derive a condition under which communities of a given strategy form with either fixed or drifting boundaries. We show the effect of temperature increase on the underlying system, and identify surprising properties of this model through numerical simulations.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Dynamics of a Binary Option Market with Exogenous Information and Price Sensitivity
Authors:
Hannah Gampe,
Christopher Griffin
Abstract:
In this paper, we derive and analyze a continuous of a binary option market with exogenous information. The resulting non-linear system has a discontinuous right hand side, which can be analyzed using zero-dimensional Filippov surfaces. Under general assumptions on purchasing rules, we show that when exogenous information is constant in the binary asset market, the price always converges. We then…
▽ More
In this paper, we derive and analyze a continuous of a binary option market with exogenous information. The resulting non-linear system has a discontinuous right hand side, which can be analyzed using zero-dimensional Filippov surfaces. Under general assumptions on purchasing rules, we show that when exogenous information is constant in the binary asset market, the price always converges. We then investigate market prices in the case of changing information, showing empirically that price sensitivity has a strong effect on price lag vs. information. We conclude with open questions on general $n$-ary option markets. As a by-product of the analysis, we show that these markets are equivalent to a simple recurrent neural network, helping to explain some of the predictive power associated with prediction markets, which are usually designed as $n$-ary option markets.
△ Less
Submitted 18 May, 2022;
originally announced June 2022.
-
"If it didn't happen, why would I change my decision?": How Judges Respond to Counterfactual Explanations for the Public Safety Assessment
Authors:
Yaniv Yacoby,
Ben Green,
Christopher L. Griffin Jr.,
Finale Doshi Velez
Abstract:
Many researchers and policymakers have expressed excitement about algorithmic explanations enabling more fair and responsible decision-making. However, recent experimental studies have found that explanations do not always improve human use of algorithmic advice. In this study, we shed light on how people interpret and respond to counterfactual explanations (CFEs) -- explanations that show how a m…
▽ More
Many researchers and policymakers have expressed excitement about algorithmic explanations enabling more fair and responsible decision-making. However, recent experimental studies have found that explanations do not always improve human use of algorithmic advice. In this study, we shed light on how people interpret and respond to counterfactual explanations (CFEs) -- explanations that show how a model's output would change with marginal changes to its input(s) -- in the context of pretrial risk assessment instruments (PRAIs). We ran think-aloud trials with eight sitting U.S. state court judges, providing them with recommendations from a PRAI that includes CFEs. We found that the CFEs did not alter the judges' decisions. At first, judges misinterpreted the counterfactuals as real -- rather than hypothetical -- changes to defendants. Once judges understood what the counterfactuals meant, they ignored them, stating their role is only to make decisions regarding the actual defendant in question. The judges also expressed a mix of reasons for ignoring or following the advice of the PRAI without CFEs. These results add to the literature detailing the unexpected ways in which people respond to algorithms and explanations. They also highlight new challenges associated with improving human-algorithm collaborations through explanations.
△ Less
Submitted 28 August, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Tile Based Modeling of DNA Self-Assembly for Two Graph Families with Appended Paths
Authors:
D. Chloe Griffin,
Jessica Sorrells
Abstract:
Branched molecules of deoxyribonucleic acid (DNA) can self-assemble into nanostructures through complementary cohesive strand base pairing. The production of DNA nanostructures is valuable in targeted drug delivery and biomolecular computing. With theoretical efficiency of laboratory processes in mind, we use a flexible tile model for DNA assembly. We aim to minimize the number of different types…
▽ More
Branched molecules of deoxyribonucleic acid (DNA) can self-assemble into nanostructures through complementary cohesive strand base pairing. The production of DNA nanostructures is valuable in targeted drug delivery and biomolecular computing. With theoretical efficiency of laboratory processes in mind, we use a flexible tile model for DNA assembly. We aim to minimize the number of different types of branched junction molecules necessary to assemble certain target structures. We represent target structures as discrete graphs and branched DNA molecules as vertices with half-edges. We present the minimum numbers of required branched molecule and cohesive-end types under three levels of restrictive conditions for the tadpole and lollipop graph families. These families represent cycle and complete graphs with a path appended via a single cut-vertex. We include three general lemmas regarding such vertex-induced path subgraphs. Through proofs and examples, we demonstrate the challenges that can arise in determining optimal construction strategies.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
An Index for Inclusions of Operator Systems
Authors:
Roy Araiza,
Colton Griffin,
Thomas Sinclair
Abstract:
Inspired by a well-known characterization of the index of an inclusion of II$_1$ factors due to Pimsner and Popa, we define an index-type invariant for inclusions of operator systems. We compute examples of this invariant, show that it is multiplicative under minimal tensor products, and explain how it generalizes the quantum Lovász theta invariant for a matricial system defined by Duan, Severini,…
▽ More
Inspired by a well-known characterization of the index of an inclusion of II$_1$ factors due to Pimsner and Popa, we define an index-type invariant for inclusions of operator systems. We compute examples of this invariant, show that it is multiplicative under minimal tensor products, and explain how it generalizes the quantum Lovász theta invariant for a matricial system defined by Duan, Severini, and Winter.
△ Less
Submitted 27 April, 2022; v1 submitted 10 March, 2022;
originally announced March 2022.
-
Approximating projections by quantum operations
Authors:
Roy Araiza,
Colton Griffin,
Aneesh Khilnani,
Thomas Sinclair
Abstract:
Using techniques from semidefinite programming, we study the problem of finding a closest quantum channel to the projection onto a matricial subsystem. We derive two invariants of matricial subsystems which are related to the quantum Lovász theta function of Duan, Severini, and Winter.
Using techniques from semidefinite programming, we study the problem of finding a closest quantum channel to the projection onto a matricial subsystem. We derive two invariants of matricial subsystems which are related to the quantum Lovász theta function of Duan, Severini, and Winter.
△ Less
Submitted 24 February, 2023; v1 submitted 4 March, 2022;
originally announced March 2022.
-
A Synthetic Prediction Market for Estimating Confidence in Published Work
Authors:
Sarah Rajtmajer,
Christopher Griffin,
Jian Wu,
Robert Fraleigh,
Laxmaan Balaji,
Anna Squicciarini,
Anthony Kwasnica,
David Pennock,
Michael McLaughlin,
Timothy Fritton,
Nishanth Nakshatri,
Arjun Menon,
Sai Ajay Modukuri,
Rajal Nivargi,
Xin Wei,
C. Lee Giles
Abstract:
Explainably estimating confidence in published scholarly work offers opportunity for faster and more robust scientific progress. We develop a synthetic prediction market to assess the credibility of published claims in the social and behavioral sciences literature. We demonstrate our system and detail our findings using a collection of known replication projects. We suggest that this work lays the…
▽ More
Explainably estimating confidence in published scholarly work offers opportunity for faster and more robust scientific progress. We develop a synthetic prediction market to assess the credibility of published claims in the social and behavioral sciences literature. We demonstrate our system and detail our findings using a collection of known replication projects. We suggest that this work lays the foundation for a research agenda that creatively uses AI for peer review.
△ Less
Submitted 23 December, 2021;
originally announced January 2022.
-
Ethical and social risks of harm from Language Models
Authors:
Laura Weidinger,
John Mellor,
Maribeth Rauh,
Conor Griffin,
Jonathan Uesato,
Po-Sen Huang,
Myra Cheng,
Mia Glaese,
Borja Balle,
Atoosa Kasirzadeh,
Zac Kenton,
Sasha Brown,
Will Hawkins,
Tom Stepleton,
Courtney Biles,
Abeba Birhane,
Julia Haas,
Laura Rimell,
Lisa Anne Hendricks,
William Isaac,
Sean Legassick,
Geoffrey Irving,
Iason Gabriel
Abstract:
This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguist…
▽ More
This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguistics, and social sciences.
We outline six specific risk areas: I. Discrimination, Exclusion and Toxicity, II. Information Hazards, III. Misinformation Harms, V. Malicious Uses, V. Human-Computer Interaction Harms, VI. Automation, Access, and Environmental Harms. The first area concerns the perpetuation of stereotypes, unfair discrimination, exclusionary norms, toxic language, and lower performance by social group for LMs. The second focuses on risks from private data leaks or LMs correctly inferring sensitive information. The third addresses risks arising from poor, false or misleading information including in sensitive domains, and knock-on risks such as the erosion of trust in shared information. The fourth considers risks from actors who try to use LMs to cause harm. The fifth focuses on risks specific to LLMs used to underpin conversational agents that interact with human users, including unsafe use, manipulation or deception. The sixth discusses the risk of environmental harm, job automation, and other challenges that may have a disparate effect on different social groups or communities.
In total, we review 21 risks in-depth. We discuss the points of origin of different risks and point to potential mitigation approaches. Lastly, we discuss organisational responsibilities in implementing mitigations, and the role of collaboration and participation. We highlight directions for further research, particularly on expanding the toolkit for assessing and evaluating the outlined risks in LMs.
△ Less
Submitted 8 December, 2021;
originally announced December 2021.
-
Constructing Approximately Diagonal Quantum Gates
Authors:
Colton Griffin,
Shawn X. Cui
Abstract:
We study a method of producing approximately diagonal 1-qubit gates. For each positive integer, the method provides a sequence of gates that are defined iteratively from a fixed diagonal gate and an arbitrary gate. These sequences are conjectured to converge to diagonal gates doubly exponentially fast and are verified for small integers. We systemically study this conjecture and prove several impo…
▽ More
We study a method of producing approximately diagonal 1-qubit gates. For each positive integer, the method provides a sequence of gates that are defined iteratively from a fixed diagonal gate and an arbitrary gate. These sequences are conjectured to converge to diagonal gates doubly exponentially fast and are verified for small integers. We systemically study this conjecture and prove several important partial results. Some techniques are developed to pave the way for a final resolution of the conjecture. The sequences provided here have applications in quantum search algorithms, quantum circuit compilation, generation of leakage-free entangled gates in topological quantum computing, etc.
△ Less
Submitted 17 November, 2022; v1 submitted 10 September, 2021;
originally announced September 2021.
-
The Replicator Dynamics of Zero-Sum Games Arise from a Novel Poisson Algebra
Authors:
Christopher Griffin
Abstract:
We show that the replicator dynamics for zero-sum games arises as a result of a non-canonical bracket that is a hybrid between a Poisson Bracket and a Nambu Bracket. The resulting non-canonical bracket is parameterized both the by the skew-symmetric payoff matrix and a mediating function. The mediating function is only sometimes a conserved quantity, but plays a critical role in the determination…
▽ More
We show that the replicator dynamics for zero-sum games arises as a result of a non-canonical bracket that is a hybrid between a Poisson Bracket and a Nambu Bracket. The resulting non-canonical bracket is parameterized both the by the skew-symmetric payoff matrix and a mediating function. The mediating function is only sometimes a conserved quantity, but plays a critical role in the determination of the dynamics. As a by-product, we show that for the replicator dynamics this function arises in the definition of a natural metric on which phase flow-volume is preserved. Additionally, we show that the non-canonical bracket satisfies all the same identities as the Poisson bracket except for the Jacobi identity (JI), which is satisfied for special cases of the mediating function. In particular, the mediating function that gives rise to the replicator dynamics yields a bracket that satisfies JI. This neatly explains why the mediating function allows us to derive a metric on which phase flow is conserved and suggests a natural geometry for zero-sum games that extends the Symplectic geometry of the Poisson bracket and potentially an alternate approach to quantizing evolutionary games.
△ Less
Submitted 2 September, 2021;
originally announced September 2021.
-
Isomeric states in neutron-rich nuclei around $N = 40$
Authors:
K. Wimmer,
F. Recchia,
S. M. Lenzi,
S. Riccetto,
T. Davinson,
A. Estrade,
C. J. Griffin,
S. Nishimura,
V. Phong,
P. -A. Söderström,
O. Aktas,
M. Al-Aqeel,
T. Ando,
H. Baba,
S. Bae,
S. Choi,
P. Doornenbal,
J. Ha,
L. Harkness-Brennan,
T. Isobe,
P. R. John,
D. Kahl,
G. Kiss,
I. Kojouharov,
N. Kurz
, et al. (15 additional authors not shown)
Abstract:
Neutron-rich nuclei in the vicinity of the $N=40$ island of inversion are characterized by shell evolution and exhibit deformed ground states. In several nuclei isomeric states have been observed and attributed to excitations to the intruder neutron $1g_{9/2}$ orbital. In the present study we searched for isomeric states in nuclei around $N=40$, $Z=22$ produced by projectile fragmentation at RIBF.…
▽ More
Neutron-rich nuclei in the vicinity of the $N=40$ island of inversion are characterized by shell evolution and exhibit deformed ground states. In several nuclei isomeric states have been observed and attributed to excitations to the intruder neutron $1g_{9/2}$ orbital. In the present study we searched for isomeric states in nuclei around $N=40$, $Z=22$ produced by projectile fragmentation at RIBF. Delayed $γ$ rays were detected by the EURICA germanium detector array. High statistics data allowed for an updated decay scheme of $^{60}$V. The lifetime of an isomeric state in $^{64}$V was measured for the first time in the present experiment. A previously unobserved isomeric state was discovered in $^{58}$Sc. The measured lifetime suggests a parity changing transition, originating from an odd number of neutrons in the $1g_{9/2}$ orbital. The nature of the isomeric state in $^{58}$Sc is thus different from isomers in the less exotic V and Sc nuclei.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
Privacy-preserving Object Detection
Authors:
Peiyang He,
Charlie Griffin,
Krzysztof Kacprzyk,
Artjom Joosen,
Michael Collyer,
Aleksandar Shtedritski,
Yuki M. Asano
Abstract:
Privacy considerations and bias in datasets are quickly becoming high-priority issues that the computer vision community needs to face. So far, little attention has been given to practical solutions that do not involve collection of new datasets. In this work, we show that for object detection on COCO, both anonymizing the dataset by blurring faces, as well as swapping faces in a balanced manner a…
▽ More
Privacy considerations and bias in datasets are quickly becoming high-priority issues that the computer vision community needs to face. So far, little attention has been given to practical solutions that do not involve collection of new datasets. In this work, we show that for object detection on COCO, both anonymizing the dataset by blurring faces, as well as swapping faces in a balanced manner along the gender and skin tone dimension, can retain object detection performances while preserving privacy and partially balancing bias.
△ Less
Submitted 11 March, 2021;
originally announced March 2021.
-
Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets
Authors:
Nishanth Nakshatri,
Arjun Menon,
C. Lee Giles,
Sarah Rajtmajer,
Christopher Griffin
Abstract:
We present a synthetic prediction market whose agent purchase logic is defined using a sigmoid transformation of a convex semi-algebraic set defined in feature space. Asset prices are determined by a logarithmic scoring market rule. Time varying asset prices affect the structure of the semi-algebraic sets leading to time-varying agent purchase rules. We show that under certain assumptions on the u…
▽ More
We present a synthetic prediction market whose agent purchase logic is defined using a sigmoid transformation of a convex semi-algebraic set defined in feature space. Asset prices are determined by a logarithmic scoring market rule. Time varying asset prices affect the structure of the semi-algebraic sets leading to time-varying agent purchase rules. We show that under certain assumptions on the underlying geometry, the resulting synthetic prediction market can be used to arbitrarily closely approximate a binary function defined on a set of input data. We also provide sufficient conditions for market convergence and show that under certain instances markets can exhibit limit cycles in asset spot price. We provide an evolutionary algorithm for training agent parameters to allow a market to model the distribution of a given data set and illustrate the market approximation using two open source data sets. Results are compared to standard machine learning methods.
△ Less
Submitted 5 January, 2021;
originally announced January 2021.
-
A Comment on and Correction to: Opinion dynamics in the presence of increasing agreement pressure
Authors:
Christopher Griffin
Abstract:
We identify a counter-example to the consensus result given in [J. Semonsen et al. Opinion dynamics in the presence of increasing agreement pressure. \textit{IEEE Trans. Cyber.}, 49(4): 1270-1278, 2018]. We resolve the counter-example by replacing Lemma 5 in the given reference with a novel variation of the Banach Fixed Point theorem which explains both the numerical results in the reference and t…
▽ More
We identify a counter-example to the consensus result given in [J. Semonsen et al. Opinion dynamics in the presence of increasing agreement pressure. \textit{IEEE Trans. Cyber.}, 49(4): 1270-1278, 2018]. We resolve the counter-example by replacing Lemma 5 in the given reference with a novel variation of the Banach Fixed Point theorem which explains both the numerical results in the reference and the counter-example(s) in this note, and provides a sufficient condition for consensus in systems with increasing peer-pressure. This work is relevant for other papers that have used the proof technique from Semonsen et al. and establishes the veracity of their claims assuming the new sufficient condition.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.