-
On-chip pulse shaping of entangled photons
Authors:
Kaiyi Wu,
Lucas M. Cohen,
Karthik V. Myilswamy,
Navin B. Lingaraju,
Hsuan-Hao Lu,
Joseph M. Lukens,
Andrew M. Weiner
Abstract:
We demonstrate spectral shaping of entangled photons with a six-channel microring-resonator-based silicon photonic pulse shaper. Through precise calibration of thermal phase shifters in a microresonator-based pulse shaper, we demonstrate line-by-line phase control on a 3~GHz grid for two frequency-bin-entangled qudits, corresponding to Hilbert spaces of up to $6\times 6$ ($3\times 3$) dimensions f…
▽ More
We demonstrate spectral shaping of entangled photons with a six-channel microring-resonator-based silicon photonic pulse shaper. Through precise calibration of thermal phase shifters in a microresonator-based pulse shaper, we demonstrate line-by-line phase control on a 3~GHz grid for two frequency-bin-entangled qudits, corresponding to Hilbert spaces of up to $6\times 6$ ($3\times 3$) dimensions for shared (independent) signal-idler filters. The pulse shaper's fine spectral resolution enables control of nanosecond-scale temporal features, which are observed by direct coincidence detection of biphoton correlation functions that show excellent agreement with theory. This work marks, to our knowledge, the first demonstration of biphoton pulse shaping using an integrated spectral shaper and holds significant promise for applications in quantum information processing.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
Pairing interaction from Demons in Sr$_2$RuO$_4$
Authors:
Young Woo Choi,
Jisoon Ihm,
Marvin L. Cohen
Abstract:
We investigate the properties of the recently observed "demon" mode, a 3D acoustic plasmon, in Sr$_2$RuO$_4$ with an emphasis on evaluating its role for the pairing interactions in this superconductor. The demon mode is a low-energy electronic excitation, and it has been suggested that it could contribute to a reduced Coulomb repulsion and even a possible attractive interaction between electrons.…
▽ More
We investigate the properties of the recently observed "demon" mode, a 3D acoustic plasmon, in Sr$_2$RuO$_4$ with an emphasis on evaluating its role for the pairing interactions in this superconductor. The demon mode is a low-energy electronic excitation, and it has been suggested that it could contribute to a reduced Coulomb repulsion and even a possible attractive interaction between electrons. In this study, we explicitly calculate the dynamically screened Coulomb interaction for Sr$_2$RuO$_4$ by using a renormalized tight-binding band structure and the random phase approximation for the dielectric function. Although the focus here is on Sr$_2$RuO$_4$, this material is considered mainly as a prototype system, having an observed demon mode, and our results should be considered as a guide for application to other systems. Our calculations show that there are regions in ($\mathbf{q}$, $ω$) space where the Coulomb interaction becomes attractive. We find that, although the demon mode is not capable of producing a total attractive electron pairing interaction in Sr$_2$RuO$_4$, it does contribute to a significant reduction in the Coulomb repulsion at the relevant pairing energy scale.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Human-Robot Mutual Learning through Affective-Linguistic Interaction and Differential Outcomes Training [Pre-Print]
Authors:
Emilia Heikkinen,
Elsa Silvennoinen,
Imran Khan,
Zakaria Lemhaouri,
Laura Cohen,
Lola Cañamero,
Robert Lowe
Abstract:
Owing to the recent success of Large Language Models, Modern A.I has been much focused on linguistic interactions with humans but less focused on non-linguistic forms of communication between man and machine. In the present paper, we test how affective-linguistic communication, in combination with differential outcomes training, affects mutual learning in a human-robot context. Taking inspiration…
▽ More
Owing to the recent success of Large Language Models, Modern A.I has been much focused on linguistic interactions with humans but less focused on non-linguistic forms of communication between man and machine. In the present paper, we test how affective-linguistic communication, in combination with differential outcomes training, affects mutual learning in a human-robot context. Taking inspiration from child-caregiver dynamics, our human-robot interaction setup consists of a (simulated) robot attempting to learn how best to communicate internal, homeostatically-controlled needs; while a human "caregiver" attempts to learn the correct object to satisfy the robot's present communicated need. We studied the effects of i) human training type, and ii) robot reinforcement learning type, to assess mutual learning terminal accuracy and rate of learning (as measured by the average reward achieved by the robot). Our results find mutual learning between a human and a robot is significantly improved with Differential Outcomes Training (DOT) compared to Non-DOT (control) conditions. We find further improvements when the robot uses an exploration-exploitation policy selection, compared to purely exploitation policy selection. These findings have implications for utilizing socially assistive robots (SAR) in therapeutic contexts, e.g. for cognitive interventions, and educational applications.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation
Authors:
Lin Zhang,
Chenggang Lu,
Xin-yang Shi,
Caifeng Shan,
Jiong Zhang,
Da Chen,
Laurent D. Cohen
Abstract:
Atherosclerosis is a chronic, progressive disease that primarily affects the arterial walls. It is one of the major causes of cardiovascular disease. Magnetic Resonance (MR) black-blood vessel wall imaging (BB-VWI) offers crucial insights into vascular disease diagnosis by clearly visualizing vascular structures. However, the complex anatomy of the neck poses challenges in distinguishing the carot…
▽ More
Atherosclerosis is a chronic, progressive disease that primarily affects the arterial walls. It is one of the major causes of cardiovascular disease. Magnetic Resonance (MR) black-blood vessel wall imaging (BB-VWI) offers crucial insights into vascular disease diagnosis by clearly visualizing vascular structures. However, the complex anatomy of the neck poses challenges in distinguishing the carotid artery (CA) from surrounding structures, especially with changes like atherosclerosis. In order to address these issues, we propose GAPNet, which is a consisting of a novel geometric prior deduced from.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Realisation of de Gennes$'$ Absolute Superconducting Switch with a Heavy Metal Interface
Authors:
Hisakazu Matsuki,
Alberto Hijano,
Grzegorz P. Mazur,
Stefan Ilic,
Binbin Wang,
Yuliya Alekhina,
Kohei Ohnishi,
Sachio Komori,
Yang Li,
Nadia Stelmashenko,
Niladri Banerjee,
Lesley F. Cohen,
David W. McComb,
F. Sebastian Bergeret,
Guang Yang,
Jason W. A. Robinson
Abstract:
In 1966, Pierre-Gilles de Gennes proposed a non-volatile mechanism for switching superconductivity on and off in a magnetic device. This involved a superconductor (S) sandwiched between ferromagnetic (F) insulators in which the net magnetic exchange field could be controlled through the magnetisation-orientation of the F layers. Because superconducting switches are attractive for a range of applic…
▽ More
In 1966, Pierre-Gilles de Gennes proposed a non-volatile mechanism for switching superconductivity on and off in a magnetic device. This involved a superconductor (S) sandwiched between ferromagnetic (F) insulators in which the net magnetic exchange field could be controlled through the magnetisation-orientation of the F layers. Because superconducting switches are attractive for a range of applications, extensive studies have been carried out on $F/S/F$ structures. Although these have demonstrated a sensitivity of the superconducting critical temperature ($T_{c}$) to parallel (P) and antiparallel (AP) magnetisation-orientations of the F layers, corresponding shifts in $T_c$ (i.e., $ΔT_c = T_{c,AP} - T_{c,P}$) are lower than predicted with $ΔT_c$ only a small fraction of $T_{c,AP}$, precluding the development of applications. Here, we report $EuS/Au/Nb/EuS$ structures where EuS is an insulating ferromagnet, Nb is a superconductor and Au is a heavy metal. For P magnetisations, the superconducting state in this structure is quenched down to the lowest measured temperature of 20 mK meaning that $ΔT_c/T_{c,AP}$ is practically 1. The key to this so-called absolute switching effect is a sizable spin-mixing conductance at the $EuS/Au$ interface which ensures a robust magnetic proximity effect, unlocking the potential of $F/S/F$ switches for low power electronics.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement
Authors:
Pushkar Shukla,
Dhruv Srikanth,
Lee Cohen,
Matthew Turk
Abstract:
We propose a novel approach to mitigate biases in computer vision models by utilizing counterfactual generation and fine-tuning. While counterfactuals have been used to analyze and address biases in DNN models, the counterfactuals themselves are often generated from biased generative models, which can introduce additional biases or spurious correlations. To address this issue, we propose using adv…
▽ More
We propose a novel approach to mitigate biases in computer vision models by utilizing counterfactual generation and fine-tuning. While counterfactuals have been used to analyze and address biases in DNN models, the counterfactuals themselves are often generated from biased generative models, which can introduce additional biases or spurious correlations. To address this issue, we propose using adversarial images, that is images that deceive a deep neural network but not humans, as counterfactuals for fair model training. Our approach leverages a curriculum learning framework combined with a fine-grained adversarial loss to fine-tune the model using adversarial examples. By incorporating adversarial images into the training data, we aim to prevent biases from propagating through the pipeline. We validate our approach through both qualitative and quantitative assessments, demonstrating improved bias mitigation and accuracy compared to existing methods. Qualitatively, our results indicate that post-training, the decisions made by the model are less dependent on the sensitive attribute and our model better disentangles the relationship between sensitive attributes and classification variables.
△ Less
Submitted 27 June, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Anyonic statistics and slow quasiparticle dynamics in a graphene fractional quantum Hall interferometer
Authors:
Noah L. Samuelson,
Liam A. Cohen,
Will Wang,
Simon Blanch,
Takashi Taniguchi,
Kenji Watanabe,
Michael P. Zaletel,
Andrea F. Young
Abstract:
Anyons are two dimensional particles with fractional exchange statistics that emerge as elementary excitations of fractional quantum Hall phases. Experimentally, anyonic statistics manifest directly in the edge-state Fabry-Pérot interferometer geometry, where the presence of $N_{qp}$ localized anyons in the interferometer bulk contributes a phase $N_{qp} θ_a$ to the observed interference pattern,…
▽ More
Anyons are two dimensional particles with fractional exchange statistics that emerge as elementary excitations of fractional quantum Hall phases. Experimentally, anyonic statistics manifest directly in the edge-state Fabry-Pérot interferometer geometry, where the presence of $N_{qp}$ localized anyons in the interferometer bulk contributes a phase $N_{qp} θ_a$ to the observed interference pattern, where $θ_a$ is twice the statistical exchange phase. Here, we report a measurement of $θ_a$ in a monolayer graphene Fabry-Pérot interferometer at $ν$ = 1/3. We find a preponderance of phase slips with magnitudes $Δθ\approx 2 π/ 3$, confirming the result of past experiments in GaAs quantum wells and consistent with expectations for the tunneling of Abelian anyons into the interferometer bulk. In contrast to prior work, however, single anyon tunneling events manifest as instantaneous and irreversible phase slips, indicative of quasiparticle equilibration times exceeding 20 minutes in some cases. We use the discrepancy between the quasiparticle equilibration rate and our measurement speed to vary the interferometer area and $N_{qp}$ independently, allowing us to precisely determine the interferometer phase and monitor the entry and exit of individual anyons to the interferometer loop in the time domain. Besides providing a replication of previous interferometric measurements sensitive to $θ_a$ in GaAs, our results bring anyon dynamics into the experimental regime and suggest that the average `topological charge' of a mesoscopic quantum Hall device can be held constant over hour long timescales.
△ Less
Submitted 28 May, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Silicon Photonic Microresonator-Based High-Resolution Line-by-Line Pulse Shaping
Authors:
Lucas M. Cohen,
Kaiyi Wu,
Karthik V. Myilswamy,
Saleha Fatema,
Navin B. Lingaraju,
Andrew M. Weiner
Abstract:
Optical pulse shaping stands as a formidable technique in ultrafast optics, radio-frequency photonics, and quantum communications. While existing systems rely on bulk optics or integrated platforms with planar waveguide sections for spatial dispersion, they face limitations in achieving finer (few- or sub-GHz) spectrum control. These methods either demand considerable space or suffer from pronounc…
▽ More
Optical pulse shaping stands as a formidable technique in ultrafast optics, radio-frequency photonics, and quantum communications. While existing systems rely on bulk optics or integrated platforms with planar waveguide sections for spatial dispersion, they face limitations in achieving finer (few- or sub-GHz) spectrum control. These methods either demand considerable space or suffer from pronounced phase errors and optical losses when assembled to achieve fine resolution. Addressing these challenges, we present a foundry-fabricated six-channel silicon photonic shaper using microresonator filter banks with inline phase control and high spectral resolution. Leveraging existing comb-based spectroscopic techniques, we devise a novel system to mitigate thermal crosstalk and enable the versatile use of our on-chip shaper. Our results demonstrate the shaper's ability to phase-compensate six comb lines at tunable channel spacings of 3, 4, and 5 GHz. Specifically, at a 3 GHz channel spacing, we showcase the generation of arbitrary waveforms in the time domain. This scalable design and control scheme holds promise in meeting future demands for high-precision spectral shaping capabilities.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Learnability Gaps of Strategic Classification
Authors:
Lee Cohen,
Yishay Mansour,
Shay Moran,
Han Shao
Abstract:
In contrast with standard classification tasks, strategic classification involves agents strategically modifying their features in an effort to receive favorable predictions. For instance, given a classifier determining loan approval based on credit scores, applicants may open or close their credit cards to fool the classifier. The learning goal is to find a classifier robust against strategic man…
▽ More
In contrast with standard classification tasks, strategic classification involves agents strategically modifying their features in an effort to receive favorable predictions. For instance, given a classifier determining loan approval based on credit scores, applicants may open or close their credit cards to fool the classifier. The learning goal is to find a classifier robust against strategic manipulations. Various settings, based on what and when information is known, have been explored in strategic classification. In this work, we focus on addressing a fundamental question: the learnability gaps between strategic classification and standard learning.
We essentially show that any learnable class is also strategically learnable: we first consider a fully informative setting, where the manipulation structure (which is modeled by a manipulation graph $G^\star$) is known and during training time the learner has access to both the pre-manipulation data and post-manipulation data. We provide nearly tight sample complexity and regret bounds, offering significant improvements over prior results. Then, we relax the fully informative setting by introducing two natural types of uncertainty. First, following Ahmadi et al. (2023), we consider the setting in which the learner only has access to the post-manipulation data. We improve the results of Ahmadi et al. (2023) and close the gap between mistake upper bound and lower bound raised by them. Our second relaxation of the fully informative setting introduces uncertainty to the manipulation structure. That is, we assume that the manipulation graph is unknown but belongs to a known class of graphs. We provide nearly tight bounds on the learning complexity in various unknown manipulation graph settings. Notably, our algorithm in this setting is of independent interest and can be applied to other problems such as multi-label learning.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
On the relationship between speech and hearing
Authors:
Srinivasan Umesh,
Leon Cohen,
Douglas Nelson
Abstract:
We present a framework for experimentally linking speech production and hearing. Using this approach, we describe experimental results, that lead to the concept that sounds made by different individuals and perceived to be the same can be transformed into each other by a "speech scale". The speech scale is empirically determined using only speech data. We show the similarity of the speech scale to…
▽ More
We present a framework for experimentally linking speech production and hearing. Using this approach, we describe experimental results, that lead to the concept that sounds made by different individuals and perceived to be the same can be transformed into each other by a "speech scale". The speech scale is empirically determined using only speech data. We show the similarity of the speech scale to the MEL scale of Stevens and Volkmann, which was derived only from hearing experiments. We thus experimentally link speech production and hearing.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
CMOS photonic integrated source of ultrabroadband polarization-entangled photons
Authors:
Alexander Miloshevsky,
Lucas M. Cohen,
Karthik V. Myilswamy,
Muneer Alshowkan,
Saleha Fatema,
Hsuan-Hao Lu,
Andrew M. Weiner,
Joseph M. Lukens
Abstract:
We showcase a fully on-chip CMOS-fabricated silicon photonic integrated circuit employing a bidirectionally pumped microring and polarization splitter-rotators tailored for the generation of ultrabroadband ($>$9 THz), high-fidelity (90-98%) polarization-entangled photons. Spanning the optical C+L-band and producing over 116 frequency-bin pairs on a 38.4 GHz-spaced grid, this source is ideal for fl…
▽ More
We showcase a fully on-chip CMOS-fabricated silicon photonic integrated circuit employing a bidirectionally pumped microring and polarization splitter-rotators tailored for the generation of ultrabroadband ($>$9 THz), high-fidelity (90-98%) polarization-entangled photons. Spanning the optical C+L-band and producing over 116 frequency-bin pairs on a 38.4 GHz-spaced grid, this source is ideal for flex-grid wavelength-multiplexed entanglement distribution in multiuser networks.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Bayesian Strategic Classification
Authors:
Lee Cohen,
Saeed Sharifi-Malvajerdi,
Kevin Stangl,
Ali Vakilian,
Juba Ziani
Abstract:
In strategic classification, agents modify their features, at a cost, to ideally obtain a positive classification from the learner's classifier. The typical response of the learner is to carefully modify their classifier to be robust to such strategic behavior. When reasoning about agent manipulations, most papers that study strategic classification rely on the following strong assumption: agents…
▽ More
In strategic classification, agents modify their features, at a cost, to ideally obtain a positive classification from the learner's classifier. The typical response of the learner is to carefully modify their classifier to be robust to such strategic behavior. When reasoning about agent manipulations, most papers that study strategic classification rely on the following strong assumption: agents fully know the exact parameters of the deployed classifier by the learner. This often is an unrealistic assumption when using complex or proprietary machine learning techniques in real-world prediction tasks.
We initiate the study of partial information release by the learner in strategic classification. We move away from the traditional assumption that agents have full knowledge of the classifier. Instead, we consider agents that have a common distributional prior on which classifier the learner is using. The learner in our model can reveal truthful, yet not necessarily complete, information about the deployed classifier to the agents. The learner's goal is to release just enough information about the classifier to maximize accuracy. We show how such partial information release can, counter-intuitively, benefit the learner's accuracy, despite increasing agents' abilities to manipulate.
We show that while it is intractable to compute the best response of an agent in the general case, there exist oracle-efficient algorithms that can solve the best response of the agents when the learner's hypothesis class is the class of linear classifiers, or when the agents' cost function satisfies a natural notion of submodularity as we define. We then turn our attention to the learner's optimization problem and provide both positive and negative results on the algorithmic problem of how much information the learner should release about the classifier to maximize their expected accuracy.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Improving Token-Based World Models with Parallel Observation Prediction
Authors:
Lior Cohen,
Kaixin Wang,
Bingyi Kang,
Shie Mannor
Abstract:
Motivated by the success of Transformers when applied to sequences of discrete symbols, token-based world models (TBWMs) were recently proposed as sample-efficient methods. In TBWMs, the world model consumes agent experience as a language-like sequence of tokens, where each observation constitutes a sub-sequence. However, during imagination, the sequential token-by-token generation of next observa…
▽ More
Motivated by the success of Transformers when applied to sequences of discrete symbols, token-based world models (TBWMs) were recently proposed as sample-efficient methods. In TBWMs, the world model consumes agent experience as a language-like sequence of tokens, where each observation constitutes a sub-sequence. However, during imagination, the sequential token-by-token generation of next observations results in a severe bottleneck, leading to long training times, poor GPU utilization, and limited representations. To resolve this bottleneck, we devise a novel Parallel Observation Prediction (POP) mechanism. POP augments a Retentive Network (RetNet) with a novel forward mode tailored to our reinforcement learning setting. We incorporate POP in a novel TBWM agent named REM (Retentive Environment Model), showcasing a 15.4x faster imagination compared to prior TBWMs. REM attains superhuman performance on 12 out of 26 games of the Atari 100K benchmark, while training in less than 12 hours. Our code is available at \url{https://github.com/leor-c/REM}.
△ Less
Submitted 29 May, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Spontaneous localization at a potential saddle point from edge state reconstruction in a quantum Hall point contact
Authors:
Liam A. Cohen,
Noah L. Samuelson,
Taige Wang,
Kai Klocke,
Cian C. Reeves,
Takashi Taniguchi,
Kenji Watanabe,
Sagar Vijay,
Michael P. Zaletel,
Andrea F. Young
Abstract:
Quantum point contacts (QPCs) are an essential component in mesoscopic devices. Here, we study the transmission of quantum Hall edge modes through a gate-defined QPC in monolayer graphene. We observe resonant tunneling peaks and a nonlinear conductance pattern characteristic of Coulomb-blockaded localized states. The in-plane electric polarizability reveals the states are localized at a classicall…
▽ More
Quantum point contacts (QPCs) are an essential component in mesoscopic devices. Here, we study the transmission of quantum Hall edge modes through a gate-defined QPC in monolayer graphene. We observe resonant tunneling peaks and a nonlinear conductance pattern characteristic of Coulomb-blockaded localized states. The in-plane electric polarizability reveals the states are localized at a classically-unstable electrostatic saddle point. We explain this unexpected finding within a self-consistent Thomas-Fermi model, finding that localization of a zero-dimensional state at the saddle point is favored whenever the applied confinement potential is sufficiently soft compared to the Coulomb energy. Our results provide a direct demonstration of Coulomb-driven reconstruction at the boundary of a quantum Hall system.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Pressure-induced superconductivity in a novel germanium allotrope
Authors:
Liangzi Deng,
Jianbo Zhang,
Yuki Sakai,
Zhongjia Tang,
Moein Adnani,
Rabin Dahal,
Alexander P. Litvinchuk,
James R. Chelikowsky,
Marvin L. Cohen,
Russell J. Hemley,
Arnold Guloy,
Yang Ding,
Ching-Wu Chu
Abstract:
High-pressure studies on elements play an essential role in superconductivity research, with implications for both fundamental science and applications. Here we report the experimental discovery of surprisingly low pressure driving a novel germanium allotrope into a superconducting state in comparison to that for alpha-Ge. Raman measurements revealed structural phase transitions and possible elect…
▽ More
High-pressure studies on elements play an essential role in superconductivity research, with implications for both fundamental science and applications. Here we report the experimental discovery of surprisingly low pressure driving a novel germanium allotrope into a superconducting state in comparison to that for alpha-Ge. Raman measurements revealed structural phase transitions and possible electronic topological transitions under pressure up to 58 GPa. Based on pressure-dependent resistivity measurements, superconductivity was induced above 2 GPa and the maximum Tc of 6.8 K was observed under 4.6 GPa. Interestingly, a superconductivity enhancement was discovered during decompression, indicating the possibility of maintaining pressure-induced superconductivity at ambient pressure with better superconducting performance. Density functional theory analysis further suggested that the electronic structure of Ge (oP32) is sensitive to its detailed geometry and revealed that disorder in the beta-tin structure leads to a higher Tc in comparison to the perfect beta-tin Ge.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
The Impact of Local Strain Fields in Non-Collinear Antiferromagnetic Films
Authors:
Freya Johnson,
Frederic Rendell-Bhatti,
Bryan D. Esser,
Aisling Hussey,
David W. McComb,
Jan Zemen,
David Boldrin,
Lesley Cohen
Abstract:
Antiferromagnets hosting structural or magnetic order that breaks time reversal symmetry are of increasing interest for 'beyond von Neumann computing' applications because the topology of their band structure allows for intrinsic physical properties, exploitable in integrated memory and logic function. One such group are the non-collinear antiferromagnets. Essential for domain manipulation is the…
▽ More
Antiferromagnets hosting structural or magnetic order that breaks time reversal symmetry are of increasing interest for 'beyond von Neumann computing' applications because the topology of their band structure allows for intrinsic physical properties, exploitable in integrated memory and logic function. One such group are the non-collinear antiferromagnets. Essential for domain manipulation is the existence of small net moments found routinely when the material is synthesised in thin film form and attributed to symmetry-breaking caused by spin canting, either from the Dzyaloshinskii-Moriya interaction or from strain. Although the spin arrangement of these materials makes them highly sensitive to strain, there is little understanding about the influence of local strain fields caused by lattice defects on global properties, such as magnetisation and anomalous Hall effect. This premise is investigated by examining non-collinear films that are either highly lattice mismatched or closely matched to their substrate. In either case, edge dislocation networks are generated and for the former case these extend throughout the entire film thickness, creating large local strain fields. These strain fields allow for finite intrinsic magnetisation in seemly structurally relaxed films and influence the antiferromagnetic domain state and the intrinsic anomalous Hall effect.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Fitting tree model with CNN and geodesics to track vesselsand application to Ultrasound Localization Microscopy data
Authors:
Théo Bertrand,
Laurent D. Cohen
Abstract:
Segmentation of tubular structures in vascular imaging is a well studied task, although it is rare that we try to infuse knowledge of the tree-like structure of the regions to be detected. Our work focuses on detecting the important landmarks in the vascular network (via CNN performing both localization and classification of the points of interest) and representing vessels as the edges in some min…
▽ More
Segmentation of tubular structures in vascular imaging is a well studied task, although it is rare that we try to infuse knowledge of the tree-like structure of the regions to be detected. Our work focuses on detecting the important landmarks in the vascular network (via CNN performing both localization and classification of the points of interest) and representing vessels as the edges in some minimal distance tree graph. We leverage geodesic methods relevant to the detection of vessels and their geometry, making use of the space of positions and orientations so that 2D vessels can be accurately represented as trees. We build our model to carry tracking on Ultrasound Localization Microscopy (ULM) data, proposing to build a good cost function for tracking on this type of data. We also test our framework on synthetic and eye fundus data. Results show that scarcity of well annotated ULM data is an obstacle to localization of vascular landmarks but the Orientation Score built from ULM data yields good geodesics for tracking blood vessels.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Incentivized Collaboration in Active Learning
Authors:
Lee Cohen,
Han Shao
Abstract:
In collaborative active learning, where multiple agents try to learn labels from a common hypothesis, we introduce an innovative framework for incentivized collaboration. Here, rational agents aim to obtain labels for their data sets while keeping label complexity at a minimum. We focus on designing (strict) individually rational (IR) collaboration protocols, ensuring that agents cannot reduce the…
▽ More
In collaborative active learning, where multiple agents try to learn labels from a common hypothesis, we introduce an innovative framework for incentivized collaboration. Here, rational agents aim to obtain labels for their data sets while keeping label complexity at a minimum. We focus on designing (strict) individually rational (IR) collaboration protocols, ensuring that agents cannot reduce their expected label complexity by acting individually. We first show that given any optimal active learning algorithm, the collaboration protocol that runs the algorithm as is over the entire data is already IR. However, computing the optimal algorithm is NP-hard. We therefore provide collaboration protocols that achieve (strict) IR and are comparable with the best known tractable approximation algorithm in terms of label complexity.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
A Human-Robot Mutual Learning System with Affect-Grounded Language Acquisition and Differential Outcomes Training
Authors:
Alva Markelius,
Sofia Sjöberg,
Zakaria Lemhauori,
Laura Cohen,
Martin Bergström,
Robert Lowe,
Lola Cañamero
Abstract:
This paper presents a novel human-robot interaction setup for robot and human learning of symbolic language for identifying robot homeostatic needs. The robot and human learn to use and respond to the same language symbols that convey homeostatic needs and the stimuli that satisfy the homeostatic needs, respectively. We adopted a differential outcomes training (DOT) protocol whereby the robot prov…
▽ More
This paper presents a novel human-robot interaction setup for robot and human learning of symbolic language for identifying robot homeostatic needs. The robot and human learn to use and respond to the same language symbols that convey homeostatic needs and the stimuli that satisfy the homeostatic needs, respectively. We adopted a differential outcomes training (DOT) protocol whereby the robot provides feedback specific (differential) to its internal needs (e.g. `hunger') when satisfied by the correct stimulus (e.g. cookie). We found evidence that DOT can enhance the human's learning efficiency, which in turn enables more efficient robot language acquisition. The robot used in the study has a vocabulary similar to that of a human infant in the linguistic ``babbling'' phase. The robot software architecture is built upon a model for affect-grounded language acquisition where the robot associates vocabulary with internal needs (hunger, thirst, curiosity) through interactions with the human. The paper presents the results of an initial pilot study conducted with the interactive setup, which reveal that the robot's language acquisition achieves higher convergence rate in the DOT condition compared to the non-DOT control condition. Additionally, participants reported positive affective experiences, feeling of being in control, and an empathetic connection with the robot. This mutual learning (teacher-student learning) approach offers a potential contribution of facilitating cognitive interventions with DOT (e.g. for people with dementia) through increased therapy adherence as a result of engaging humans more in training tasks by taking an active teaching-learning role. The homeostatic motivational grounding of the robot's language acquisition has potential to contribute to more ecologically valid and social (collaborative/nurturing) interactions with robots.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Fine-Resolution Silicon Photonic Wavelength-Selective Switch Using Hybrid Multimode Racetrack Resonators
Authors:
Lucas M. Cohen,
Saleha Fatema,
Vivek V. Wankhade,
Navin B. Lingaraju,
Bohan Zhang,
Deniz Onural,
Milos Popovic,
Andrew M. Weiner
Abstract:
In this work, we describe a procedure for synthesizing racetrack resonators with large quality factors and apply it to realize a multi-channel wavelength-selective switch (WSS) on a silicon photonic chip. We first determine the contribution of each component primitive to propagation loss in a racetrack resonator and use this data to develop a model for the frequency response of arbitrary order, co…
▽ More
In this work, we describe a procedure for synthesizing racetrack resonators with large quality factors and apply it to realize a multi-channel wavelength-selective switch (WSS) on a silicon photonic chip. We first determine the contribution of each component primitive to propagation loss in a racetrack resonator and use this data to develop a model for the frequency response of arbitrary order, coupled-racetrack channel dropping filters. We design second-order racetrack filters based on this model and cascade multiple such filters to form a 1x7 WSS. We find good agreement between our model and device performance with second-order racetrack that have ~1 dB of drop-port loss, ~2 GHz FWHM linewidth, and low optical crosstalk due to the quick filter roll-off of ~ 5.3 dB/GHz. Using a control algorithm, we show three-channel operation of our WSS with a channel spacing of only 10 GHz. Owing to the high quality factor and quick roll-off of our filter design, adjacent channel crosstalk is measured to be <-25 dB for channels spaced on a 10 GHz grid. As a further demonstration, we use five of seven WSS channels to perform a demultiplexing operation on both an 8 GHz and a 10 GHz grid. These results suggest that a low-loss WSS with fine channel resolution can be realized in a scalable manner using the silicon photonics platform.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
The Past, Present, and Future of the Brain Imaging Data Structure (BIDS)
Authors:
Russell A. Poldrack,
Christopher J. Markiewicz,
Stefan Appelhoff,
Yoni K. Ashar,
Tibor Auer,
Sylvain Baillet,
Shashank Bansal,
Leandro Beltrachini,
Christian G. Benar,
Giacomo Bertazzoli,
Suyash Bhogawar,
Ross W. Blair,
Marta Bortoletto,
Mathieu Boudreau,
Teon L. Brooks,
Vince D. Calhoun,
Filippo Maria Castelli,
Patricia Clement,
Alexander L Cohen,
Julien Cohen-Adad,
Sasha D'Ambrosio,
Gilles de Hollander,
María de la iglesia-Vayá,
Alejandro de la Vega,
Arnaud Delorme
, et al. (89 additional authors not shown)
Abstract:
The Brain Imaging Data Structure (BIDS) is a community-driven standard for the organization of data and metadata from a growing range of neuroscience modalities. This paper is meant as a history of how the standard has developed and grown over time. We outline the principles behind the project, the mechanisms by which it has been extended, and some of the challenges being addressed as it evolves.…
▽ More
The Brain Imaging Data Structure (BIDS) is a community-driven standard for the organization of data and metadata from a growing range of neuroscience modalities. This paper is meant as a history of how the standard has developed and grown over time. We outline the principles behind the project, the mechanisms by which it has been extended, and some of the challenges being addressed as it evolves. We also discuss the lessons learned through the project, with the aim of enabling researchers in other domains to learn from the success of BIDS.
△ Less
Submitted 8 January, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Grouping Boundary Proposals for Fast Interactive Image Segmentation
Authors:
Li Liu,
Da Chen,
Minglei Shu,
Laurent D. Cohen
Abstract:
Geodesic models are known as an efficient tool for solving various image segmentation problems. Most of existing approaches only exploit local pointwise image features to track geodesic paths for delineating the objective boundaries. However, such a segmentation strategy cannot take into account the connectivity of the image edge features, increasing the risk of shortcut problem, especially in the…
▽ More
Geodesic models are known as an efficient tool for solving various image segmentation problems. Most of existing approaches only exploit local pointwise image features to track geodesic paths for delineating the objective boundaries. However, such a segmentation strategy cannot take into account the connectivity of the image edge features, increasing the risk of shortcut problem, especially in the case of complicated scenario. In this work, we introduce a new image segmentation model based on the minimal geodesic framework in conjunction with an adaptive cut-based circular optimal path computation scheme and a graph-based boundary proposals grouping scheme. Specifically, the adaptive cut can disconnect the image domain such that the target contours are imposed to pass through this cut only once. The boundary proposals are comprised of precomputed image edge segments, providing the connectivity information for our segmentation model. These boundary proposals are then incorporated into the proposed image segmentation model, such that the target segmentation contours are made up of a set of selected boundary proposals and the corresponding geodesic paths linking them. Experimental results show that the proposed model indeed outperforms state-of-the-art minimal paths-based image segmentation approaches.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Computing Geodesic Paths Encoding a Curvature Prior
Authors:
Da Chen,
Jean-Marie Mirebeau,
Minglei Shu,
Laurent D. Cohen
Abstract:
In this paper, we introduce an efficient method for computing curves minimizing a variant of the Euler-Mumford elastica energy, with fixed endpoints and tangents at these endpoints, where the bending energy is enhanced with a user defined and data-driven scalar-valued term referred to as the curvature prior. In order to guarantee that the globally optimal curve is extracted, the proposed method in…
▽ More
In this paper, we introduce an efficient method for computing curves minimizing a variant of the Euler-Mumford elastica energy, with fixed endpoints and tangents at these endpoints, where the bending energy is enhanced with a user defined and data-driven scalar-valued term referred to as the curvature prior. In order to guarantee that the globally optimal curve is extracted, the proposed method involves the numerical computation of the viscosity solution to a specific static Hamilton-Jacobi-Bellman (HJB) partial differential equation (PDE). For that purpose, we derive the explicit Hamiltonian associated to this variant model equipped with a curvature prior, discretize the resulting HJB PDE using an adaptive finite difference scheme, and solve it in a single pass using a generalized Fast-Marching method. In addition, we also present a practical method for estimating the curvature prior values from image data, designed for the task of accurately tracking curvilinear structure centerlines. Numerical experiments on synthetic and real image data illustrate the advantages of the considered variant of the elastica model with a prior curvature enhancement in complex scenarios where challenging geometric structures appear.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Energy gap of the even-denominator fractional quantum Hall state in bilayer graphene
Authors:
Alexandre Assouline,
Taige Wang,
Haoxin Zhou,
Liam A. Cohen,
Fangyuan Yang,
Ruining Zhang,
Takashi Taniguchi,
Kenji Watanabe,
Roger S. K. Mong,
Michael P. Zaletel,
Andrea F. Young
Abstract:
Bernal bilayer graphene hosts even denominator fractional quantum Hall states thought to be described by a Pfaffian wave function with nonabelian quasiparticle excitations. Here we report the quantitative determination of fractional quantum Hall energy gaps in bilayer graphene using both thermally activated transport and by direct measurement of the chemical potential. We find a transport activati…
▽ More
Bernal bilayer graphene hosts even denominator fractional quantum Hall states thought to be described by a Pfaffian wave function with nonabelian quasiparticle excitations. Here we report the quantitative determination of fractional quantum Hall energy gaps in bilayer graphene using both thermally activated transport and by direct measurement of the chemical potential. We find a transport activation gap of 5.1K at B = 12T for a half-filled N=1 Landau level, consistent with density matrix renormalization group calculations for the Pfaffian state. However, the measured thermodynamic gap of 11.6K is smaller than theoretical expectations for the clean limit by approximately a factor of two. We analyze the chemical potential data near fractional filling within a simplified model of a Wigner crystal of fractional quasiparticles with long-wavelength disorder, explaining this discrepancy. Our results quantitatively establish bilayer graphene as a robust platform for probing the non-Abelian anyons expected to arise as the elementary excitations of the even-denominator state.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Simulation of quantum computation with magic states via Jordan-Wigner transformations
Authors:
Michael Zurel,
Lawrence Z. Cohen,
Robert Raussendorf
Abstract:
Negativity in certain quasiprobability representations is a necessary condition for a quantum computational advantage. Here we define a new quasiprobability representation exhibiting this property with respect to quantum computations in the magic state model. It is based on generalized Jordan-Wigner transformations and it has a close connection to the probability representation of universal quantu…
▽ More
Negativity in certain quasiprobability representations is a necessary condition for a quantum computational advantage. Here we define a new quasiprobability representation exhibiting this property with respect to quantum computations in the magic state model. It is based on generalized Jordan-Wigner transformations and it has a close connection to the probability representation of universal quantum computation based on the $Λ$ polytopes. For each number of qubits it defines a polytope contained in the $Λ$ polytope with some shared vertices. It leads to an efficient classical simulation algorithm for magic state quantum circuits for which the input state is positively represented, and it outperforms previous representations in terms of the states that can be positively represented.
△ Less
Submitted 29 July, 2023;
originally announced July 2023.
-
$\text{TT}^{\Box}_{\mathcal C}$: a Family of Extensional Type Theories with Effectful Realizers of Continuity
Authors:
Liron Cohen,
Vincent Rahli
Abstract:
$\text{TT}^{\Box}_{\mathcal C}$ is a generic family of effectful, extensional type theories with a forcing interpretation parameterized by modalities. This paper identifies a subclass of $\text{TT}^{\Box}_{\mathcal C}$ theories that internally realizes continuity principles through stateful computations, such as reference cells. The principle of continuity is a seminal property that holds for a nu…
▽ More
$\text{TT}^{\Box}_{\mathcal C}$ is a generic family of effectful, extensional type theories with a forcing interpretation parameterized by modalities. This paper identifies a subclass of $\text{TT}^{\Box}_{\mathcal C}$ theories that internally realizes continuity principles through stateful computations, such as reference cells. The principle of continuity is a seminal property that holds for a number of intuitionistic theories such as System T. Roughly speaking, it states that functions on real numbers only need approximations of these numbers to compute. Generally, continuity principles have been justified using semantical arguments, but it is known that the modulus of continuity of functions can be computed using effectful computations such as exceptions or reference cells. In this paper, the modulus of continuity of the functionals on the Baire space is directly computed using the stateful computations enabled internally in the theory.
△ Less
Submitted 25 June, 2024; v1 submitted 26 July, 2023;
originally announced July 2023.
-
Anisotropy and Isotope Effect in Superconducting Solid Hydrogen
Authors:
Mehmet Dogan,
James R. Chelikowsky,
Marvin L. Cohen
Abstract:
Elucidating the phase diagram of solid hydrogen is a key objective in condensed matter physics. Several decades ago, it was proposed that at low temperatures and high pressures, solid hydrogen would be a metal with a high superconducting transition temperature. This transition to a metallic state can happen through the closing of the energy gap in the molecular solid or through a transition to an…
▽ More
Elucidating the phase diagram of solid hydrogen is a key objective in condensed matter physics. Several decades ago, it was proposed that at low temperatures and high pressures, solid hydrogen would be a metal with a high superconducting transition temperature. This transition to a metallic state can happen through the closing of the energy gap in the molecular solid or through a transition to an atomic solid. Recent experiments have managed to reach pressures in the range of 400-500 GPa, providing valuable insights. There is strong evidence suggesting that metallization via either of these mechanisms occurs within this pressure range. Computational and experimental studies have identified multiple promising crystal phases, but the limited accuracy of calculations and the limited capabilities of experiments prevent us from determining unequivocally the observed phase or phases. Therefore, it is crucial to investigate the superconducting properties of all the candidate phases. Recently, we reported the superconducting properties of the C2/c-24, Cmca-12, Cmca-4 and I41/amd-2 phases, including anharmonic effects. Here, we report the effects of anisotropy on superconducting properties using Eliashberg theory. Then, we investigate the superconducting properties of deuterium and estimate the size of the isotope effect for each phase. We find that the isotope effect on superconductivity is diminished by anharmonicity in the C2/c-24 and Cmca-12 phases and enlarged in the Cmca-4 and I41/amd-2 phases. Our anharmonic calculations of the C2/c-24 phase of deuterium agree closely with the most recent experiment by Loubeyre et al. [Phys. Rev. Lett. 29, 035501 (2022)], indicating that the C2/c-24 phase remains the leading candidate in this pressure range, and has a strong anharmonic character. These characteristics can serve to distinguish among crystal phases in experiment.
△ Less
Submitted 6 July, 2023;
originally announced July 2023.
-
Fast Marching Energy CNN
Authors:
Nicolas Makaroff,
Théo Bertrand,
Laurent D. Cohen
Abstract:
Leveraging geodesic distances and the geometrical information they convey is key for many data-oriented applications in imaging. Geodesic distance computation has been used for long for image segmentation using Image based metrics. We introduce a new method by generating isotropic Riemannian metrics adapted to a problem using CNN and give as illustrations an example of application. We then apply t…
▽ More
Leveraging geodesic distances and the geometrical information they convey is key for many data-oriented applications in imaging. Geodesic distance computation has been used for long for image segmentation using Image based metrics. We introduce a new method by generating isotropic Riemannian metrics adapted to a problem using CNN and give as illustrations an example of application. We then apply this idea to the segmentation of brain tumours as unit balls for the geodesic distance computed with the metric potential output by a CNN, thus imposing geometrical and topological constraints on the output mask. We show that geodesic distance modules work well in machine learning frameworks and can be used to achieve state-of-the-art performances while ensuring geometrical and/or topological properties.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Chan-Vese Attention U-Net: An attention mechanism for robust segmentation
Authors:
Nicolas Makaroff,
Laurent D. Cohen
Abstract:
When studying the results of a segmentation algorithm using convolutional neural networks, one wonders about the reliability and consistency of the results. This leads to questioning the possibility of using such an algorithm in applications where there is little room for doubt. We propose in this paper a new attention gate based on the use of Chan-Vese energy minimization to control more precisel…
▽ More
When studying the results of a segmentation algorithm using convolutional neural networks, one wonders about the reliability and consistency of the results. This leads to questioning the possibility of using such an algorithm in applications where there is little room for doubt. We propose in this paper a new attention gate based on the use of Chan-Vese energy minimization to control more precisely the segmentation masks given by a standard CNN architecture such as the U-Net model. This mechanism allows to obtain a constraint on the segmentation based on the resolution of a PDE. The study of the results allows us to observe the spatial information retained by the neural network on the region of interest and obtains competitive results on the binary segmentation. We illustrate the efficiency of this approach for medical image segmentation on a database of MRI brain images.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
A Comparison of Neuroelectrophysiology Databases
Authors:
Priyanka Subash,
Alex Gray,
Misque Boswell,
Samantha L. Cohen,
Rachael Garner,
Sana Salehi,
Calvary Fisher,
Samuel Hobel,
Satrajit Ghosh,
Yaroslav Halchenko,
Benjamin Dichter,
Russell A. Poldrack,
Chris Markiewicz,
Dora Hermes,
Arnaud Delorme,
Scott Makeig,
Brendan Behan,
Alana Sparks,
Stephen R Arnott,
Zhengjia Wang,
John Magnotti,
Michael S. Beauchamp,
Nader Pouratian,
Arthur W. Toga,
Dominique Duncan
Abstract:
As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DAN…
▽ More
As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DANDI), OpenNeuro, and Brain-CODE. The aim of this review is to describe archives that provide researchers with tools to store, share, and reanalyze both human and non-human neurophysiology data based on criteria that are of interest to the neuroscientific community. The Brain Imaging Data Structure (BIDS) and Neurodata Without Borders (NWB) are utilized by these archives to make data more accessible to researchers by implementing a common standard. As the necessity for integrating large-scale analysis into data repository platforms continues to grow within the neuroscientific community, this article will highlight the various analytical and customizable tools developed within the chosen archives that may advance the field of neuroinformatics.
△ Less
Submitted 30 August, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Quantum fluctuations spatial mode profiler
Authors:
Charris Gabaldon,
Pratik Barge,
Savannah L. Cuozzo,
Irina Novikova,
Hwang Lee,
Lior Cohen,
Eugeniy E. Mikhailov
Abstract:
The spatial mode is an essential component of an electromagnetic field description, yet it is challenging to characterize it for optical fields with low average photon number, such as in a squeezed vacuum. We present a method for reconstruction of the spatial modes of such fields based on the homodyne measurements of their quadrature noise variance performed with a set of structured masks. We show…
▽ More
The spatial mode is an essential component of an electromagnetic field description, yet it is challenging to characterize it for optical fields with low average photon number, such as in a squeezed vacuum. We present a method for reconstruction of the spatial modes of such fields based on the homodyne measurements of their quadrature noise variance performed with a set of structured masks. We show theoretically that under certain conditions we can recover individual spatial mode distributions by using the weighted sum of the basis masks, where weights are determined using measured variance values and phases. We apply this approach to analyze the spatial structure of a squeezed vacuum field with various amount of excess thermal noise generated in Rb vapor.
△ Less
Submitted 9 June, 2023;
originally announced June 2023.
-
New Cascaded Architecture for Classical and Quantum Multiparameter Sensing
Authors:
Gregory Krueper,
Lior Cohen,
Juliet T. Gopinath
Abstract:
We present an innovative concept for quantum-enhanced multiparameter optical phase sensing that can be implemented in free space, optical fiber or on-chip. Our measurable phases are in series, or cascaded, enabling measurements as a function of position with only a single input and output. We have modeled up to 20 phases, and fitting shows near-linear scaling of the power requirements for addition…
▽ More
We present an innovative concept for quantum-enhanced multiparameter optical phase sensing that can be implemented in free space, optical fiber or on-chip. Our measurable phases are in series, or cascaded, enabling measurements as a function of position with only a single input and output. We have modeled up to 20 phases, and fitting shows near-linear scaling of the power requirements for additional phases. This novel approach represents a new paradigm in multiparameter quantum metrology, and can be applied to remote sensing, communications, and geophysics.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
The James Webb Space Telescope Mission
Authors:
Jonathan P. Gardner,
John C. Mather,
Randy Abbott,
James S. Abell,
Mark Abernathy,
Faith E. Abney,
John G. Abraham,
Roberto Abraham,
Yasin M. Abul-Huda,
Scott Acton,
Cynthia K. Adams,
Evan Adams,
David S. Adler,
Maarten Adriaensen,
Jonathan Albert Aguilar,
Mansoor Ahmed,
Nasif S. Ahmed,
Tanjira Ahmed,
Rüdeger Albat,
Loïc Albert,
Stacey Alberts,
David Aldridge,
Mary Marsha Allen,
Shaune S. Allen,
Martin Altenburg
, et al. (983 additional authors not shown)
Abstract:
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono…
▽ More
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
A model is worth tens of thousands of examples
Authors:
Thomas Dagès,
Laurent D. Cohen,
Alfred M. Bruckstein
Abstract:
Traditional signal processing methods relying on mathematical data generation models have been cast aside in favour of deep neural networks, which require vast amounts of data. Since the theoretical sample complexity is nearly impossible to evaluate, these amounts of examples are usually estimated with crude rules of thumb. However, these rules only suggest when the networks should work, but do no…
▽ More
Traditional signal processing methods relying on mathematical data generation models have been cast aside in favour of deep neural networks, which require vast amounts of data. Since the theoretical sample complexity is nearly impossible to evaluate, these amounts of examples are usually estimated with crude rules of thumb. However, these rules only suggest when the networks should work, but do not relate to the traditional methods. In particular, an interesting question is: how much data is required for neural networks to be on par or outperform, if possible, the traditional model-based methods? In this work, we empirically investigate this question in two simple examples, where the data is generated according to precisely defined mathematical models, and where well-understood optimal or state-of-the-art mathematical data-agnostic solutions are known. A first problem is deconvolving one-dimensional Gaussian signals and a second one is estimating a circle's radius and location in random grayscale images of disks. By training various networks, either naive custom designed or well-established ones, with various amounts of training data, we find that networks require tens of thousands of examples in comparison to the traditional methods, whether the networks are trained from scratch or even with transfer-learning or finetuning.
△ Less
Submitted 19 March, 2023;
originally announced March 2023.
-
ER network heterogeneity guides diffusive transport and kinetics
Authors:
Zubenelgenubi C. Scott,
Katherine Koning,
Molly Vanderwerp,
Lorna Cohen,
Laura M. Westrate,
Elena F. Koslover
Abstract:
The endoplasmic reticulum (ER) is a dynamic network of interconnected sheets and tubules that orchestrates the distribution of lipids, ions, and proteins throughout the cell. The impact of its complex, dynamic morphology on its function as an intracellular transport hub remains poorly understood. To elucidate the functional consequences of ER network structure and dynamics, we quantify how the het…
▽ More
The endoplasmic reticulum (ER) is a dynamic network of interconnected sheets and tubules that orchestrates the distribution of lipids, ions, and proteins throughout the cell. The impact of its complex, dynamic morphology on its function as an intracellular transport hub remains poorly understood. To elucidate the functional consequences of ER network structure and dynamics, we quantify how the heterogeneity of the peripheral ER in COS7 cells affects diffusive protein transport. In vivo imaging of photoactivated ER membrane proteins demonstrates their non-uniform spreading to adjacent regions, in a manner consistent with simulations of diffusing particles on extracted network structures. Using a minimal network model to represent tubule rearrangements, we demonstrate that ER network dynamics are sufficiently slow to have little effect on diffusive protein transport. Furthermore, stochastic simulations reveal a novel consequence of ER network heterogeneity: the existence of 'hot spots' where sparse diffusive reactants are more likely to find one another. Intriguingly, ER exit sites are disproportionately found in these highly accessible regions. Combining in vivo experiments with analytic calculations, quantitative image analysis, and computational modeling, we demonstrate how structure guides diffusive protein transport and reactions in the ER.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback
Authors:
Han Shao,
Lee Cohen,
Avrim Blum,
Yishay Mansour,
Aadirupa Saha,
Matthew R. Walter
Abstract:
In classic reinforcement learning (RL) and decision making problems, policies are evaluated with respect to a scalar reward function, and all optimal policies are the same with regards to their expected return. However, many real-world problems involve balancing multiple, sometimes conflicting, objectives whose relative priority will vary according to the preferences of each user. Consequently, a…
▽ More
In classic reinforcement learning (RL) and decision making problems, policies are evaluated with respect to a scalar reward function, and all optimal policies are the same with regards to their expected return. However, many real-world problems involve balancing multiple, sometimes conflicting, objectives whose relative priority will vary according to the preferences of each user. Consequently, a policy that is optimal for one user might be sub-optimal for another. In this work, we propose a multi-objective decision making framework that accommodates different user preferences over objectives, where preferences are learned via policy comparisons. Our model consists of a Markov decision process with a vector-valued reward function, with each user having an unknown preference vector that expresses the relative importance of each objective. The goal is to efficiently compute a near-optimal policy for a given user. We consider two user feedback models. We first address the case where a user is provided with two policies and returns their preferred policy as feedback. We then move to a different user feedback model, where a user is instead provided with two small weighted sets of representative trajectories and selects the preferred one. In both cases, we suggest an algorithm that finds a nearly optimal policy for the user using a small number of comparison queries.
△ Less
Submitted 31 October, 2023; v1 submitted 7 February, 2023;
originally announced February 2023.
-
Sequential Strategic Screening
Authors:
Lee Cohen,
Saeed Sharifi-Malvajerdi,
Kevin Stangl,
Ali Vakilian,
Juba Ziani
Abstract:
We initiate the study of strategic behavior in screening processes with multiple classifiers. We focus on two contrasting settings: a conjunctive setting in which an individual must satisfy all classifiers simultaneously, and a sequential setting in which an individual to succeed must satisfy classifiers one at a time. In other words, we introduce the combination of strategic classification with s…
▽ More
We initiate the study of strategic behavior in screening processes with multiple classifiers. We focus on two contrasting settings: a conjunctive setting in which an individual must satisfy all classifiers simultaneously, and a sequential setting in which an individual to succeed must satisfy classifiers one at a time. In other words, we introduce the combination of strategic classification with screening processes.
We show that sequential screening pipelines exhibit new and surprising behavior where individuals can exploit the sequential ordering of the tests to zig-zag between classifiers without having to simultaneously satisfy all of them. We demonstrate an individual can obtain a positive outcome using a limited manipulation budget even when far from the intersection of the positive regions of every classifier. Finally, we consider a learner whose goal is to design a sequential screening process that is robust to such manipulations, and provide a construction for the learner that optimizes a natural objective.
△ Less
Submitted 10 February, 2023; v1 submitted 30 January, 2023;
originally announced January 2023.
-
Uncertainty Estimation based on Geometric Separation
Authors:
Gabriella Chouraqui,
Liron Cohen,
Gil Einziger,
Liel Leman
Abstract:
In machine learning, accurately predicting the probability that a specific input is correct is crucial for risk management. This process, known as uncertainty (or confidence) estimation, is particularly important in mission-critical applications such as autonomous driving. In this work, we put forward a novel geometric-based approach for improving uncertainty estimations in machine learning models…
▽ More
In machine learning, accurately predicting the probability that a specific input is correct is crucial for risk management. This process, known as uncertainty (or confidence) estimation, is particularly important in mission-critical applications such as autonomous driving. In this work, we put forward a novel geometric-based approach for improving uncertainty estimations in machine learning models. Our approach involves using the geometric distance of the current input from existing training inputs as a signal for estimating uncertainty, and then calibrating this signal using standard post-hoc techniques. We demonstrate that our method leads to more accurate uncertainty estimations than recently proposed approaches through extensive evaluation on a variety of datasets and models. Additionally, we optimize our approach so that it can be implemented on large datasets in near real-time applications, making it suitable for time-sensitive scenarios.
△ Less
Submitted 11 January, 2023;
originally announced January 2023.
-
The James Webb Space Telescope Mission: Optical Telescope Element Design, Development, and Performance
Authors:
Michael W. McElwain,
Lee D. Feinberg,
Marshall D. Perrin,
Mark Clampin,
C. Matt Mountain,
Matthew D. Lallo,
Charles-Philippe Lajoie,
Randy A. Kimble,
Charles W. Bowers,
Christopher C. Stark,
D. Scott Acton,
Ken Aiello,
Charles Atkinson,
Beth Barinek,
Allison Barto,
Scott Basinger,
Tracy Beck,
Matthew D. Bergkoetter,
Marcel Bluth,
Rene A. Boucarut,
Gregory R. Brady,
Keira J. Brooks,
Bob Brown,
John Byard,
Larkin Carey
, et al. (104 additional authors not shown)
Abstract:
The James Webb Space Telescope (JWST) is a large, infrared space telescope that has recently started its science program which will enable breakthroughs in astrophysics and planetary science. Notably, JWST will provide the very first observations of the earliest luminous objects in the Universe and start a new era of exoplanet atmospheric characterization. This transformative science is enabled by…
▽ More
The James Webb Space Telescope (JWST) is a large, infrared space telescope that has recently started its science program which will enable breakthroughs in astrophysics and planetary science. Notably, JWST will provide the very first observations of the earliest luminous objects in the Universe and start a new era of exoplanet atmospheric characterization. This transformative science is enabled by a 6.6 m telescope that is passively cooled with a 5-layer sunshield. The primary mirror is comprised of 18 controllable, low areal density hexagonal segments, that were aligned and phased relative to each other in orbit using innovative image-based wavefront sensing and control algorithms. This revolutionary telescope took more than two decades to develop with a widely distributed team across engineering disciplines. We present an overview of the telescope requirements, architecture, development, superb on-orbit performance, and lessons learned. JWST successfully demonstrates a segmented aperture space telescope and establishes a path to building even larger space telescopes.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
Anomalous Nernst effect in Mn$_3$NiN thin films
Authors:
Sebastian Beckert,
João Godinho,
Freya Johnson,
Jozef Kimák,
Eva Schmoranzerová,
Jan Zemen,
Zbyněk Šobáň,
Kamil Olejník,
Jakub Železný,
Joerg Wunderlich,
Petr Němec,
Dominik Kriegner,
Andy Thomas,
Sebastian T. B. Goennenwein,
Lesley F Cohen,
Helena Reichlová
Abstract:
The observation of a sizable anomalous Hall effect in magnetic materials with vanishing magnetization has renewed interest in understanding and engineering this phenomenon. Antiferromagnetic antiperovskites are one of emerging material classes that exhibit a variety of interesting properties owing to a complex electronic band structure and magnetic ordering. Reports on the anomalous Nernst effect…
▽ More
The observation of a sizable anomalous Hall effect in magnetic materials with vanishing magnetization has renewed interest in understanding and engineering this phenomenon. Antiferromagnetic antiperovskites are one of emerging material classes that exhibit a variety of interesting properties owing to a complex electronic band structure and magnetic ordering. Reports on the anomalous Nernst effect and its magnitude in this class of materials are, however, very limited. This scarcity may be partly due to the experimental difficulty of reliably quantifying the anomalous Nernst coefficient. Here, we report experiments on the anomalous Nernst effect in antiferromagnetic antiperovskite Mn$_3$NiN thin films. Measurement of both the anomalous Hall and Nernst effects using the same sample and measurement geometry makes it possible to directly compare these two effects and quantify the anomalous Nernst coefficient and conductivity in Mn$_3$NiN. We carefully evaluate the spatial distribution of the thermal gradient in the sample and use finite element modeling to corroborate our experimental results.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Universal chiral Luttinger liquid behavior in a graphene fractional quantum Hall point contact
Authors:
Liam A. Cohen,
Noah L. Samuelson,
Taige Wang,
Takashi Taniguchi,
Kenji Watanabe,
Michael P. Zaletel,
Andrea F. Young
Abstract:
One dimensional conductors are described by Luttinger liquid theory, which predicts a power-law suppression of the density of states near the Fermi level. The scaling exponent is non-universal in the general case, but is predicted to be quantized for the chiral edge states of the fractional quantum Hall effect. Here, we report conductance measurements across a point contact linking integer and fra…
▽ More
One dimensional conductors are described by Luttinger liquid theory, which predicts a power-law suppression of the density of states near the Fermi level. The scaling exponent is non-universal in the general case, but is predicted to be quantized for the chiral edge states of the fractional quantum Hall effect. Here, we report conductance measurements across a point contact linking integer and fractional quantum Hall edge states. At weak coupling, we observe the predicted universal quadratic scaling with temperature and voltage. At strong coupling, the conductance saturates to e^2/2h, arising from perfect Andreev reflection of fractionalized quasi-particles at the point contact. We use the strong coupling physics to realize a nearly dissipationless DC voltage step-up transformer, whose gain of 3/2 arises directly from topological fractionalization of electrical charge.
△ Less
Submitted 2 December, 2022;
originally announced December 2022.
-
Deformable Voxel Grids for Shape Comparisons
Authors:
Raphaël Groscot,
Laurent D. Cohen
Abstract:
We present Deformable Voxel Grids (DVGs) for 3D shapes comparison and processing. It consists of a voxel grid which is deformed to approximate the silhouette of a shape, via energy-minimization. By interpreting the DVG as a local coordinates system, it provides a better embedding space than a regular voxel grid, since it is adapted to the geometry of the shape. It also allows to deform the shape b…
▽ More
We present Deformable Voxel Grids (DVGs) for 3D shapes comparison and processing. It consists of a voxel grid which is deformed to approximate the silhouette of a shape, via energy-minimization. By interpreting the DVG as a local coordinates system, it provides a better embedding space than a regular voxel grid, since it is adapted to the geometry of the shape. It also allows to deform the shape by moving the control points of the DVG, in a similar manner to the Free Form Deformation, but with easier interpretability of the control points positions. After proposing a computation scheme of the energies compatible with meshes and pointclouds, we demonstrate the use of DVGs in a variety of applications: correspondences via cubification, style transfer, shape retrieval and PCA deformations. The first two require no learning and can be readily run on any shapes in a matter of minutes on modest hardware. As for the last two, they require to first optimize DVGs on a collection of shapes, which amounts to a pre-processing step. Then, determining PCA coordinates is straightforward and brings a few parameters to deform a shape.
△ Less
Submitted 21 November, 2022;
originally announced November 2022.
-
Deep learning for enhanced free-space optical communications
Authors:
Manon P. Bart,
Nicholas J. Savino,
Paras Regmi,
Lior Cohen,
Haleh Safavi,
Harry C. Shaw,
Sanjaya Lohani,
Thomas A. Searles,
Brian T. Kirby,
Hwang Lee,
Ryan T. Glasser
Abstract:
Atmospheric effects, such as turbulence and background thermal noise, inhibit the propagation of coherent light used in ON-OFF keying free-space optical communication. Here we present and experimentally validate a convolutional neural network to reduce the bit error rate of free-space optical communication in post-processing that is significantly simpler and cheaper than existing solutions based o…
▽ More
Atmospheric effects, such as turbulence and background thermal noise, inhibit the propagation of coherent light used in ON-OFF keying free-space optical communication. Here we present and experimentally validate a convolutional neural network to reduce the bit error rate of free-space optical communication in post-processing that is significantly simpler and cheaper than existing solutions based on advanced optics. Our approach consists of two neural networks, the first determining the presence of coherent bit sequences in thermal noise and turbulence and the second demodulating the coherent bit sequences. All data used for training and testing our network is obtained experimentally by generating ON-OFF keying bit streams of coherent light, combining these with thermal light, and passing the resultant light through a turbulent water tank which we have verified mimics turbulence in the air to a high degree of accuracy. Our convolutional neural network improves detection accuracy over threshold classification schemes and has the capability to be integrated with current demodulation and error correction schemes.
△ Less
Submitted 15 August, 2022;
originally announced August 2022.
-
Wave-Front Reconstruction via Single-Pixel Homodyne Imaging
Authors:
Savannah L. Cuozzo,
Charris Gabaldon,
Pratik J. Barge,
Ziqi Niu,
Hwang Lee,
Lior Cohen,
Irina Novikova,
Eugeniy E. Mikhailov
Abstract:
We combine single-pixel imaging and homodyne detection to perform full object recovery (phase and amplitude). Our method does not require any prior information about the object or the illuminating fields. As a demonstration, we reconstruct the optical properties of several semi-transparent objects and find that the reconstructed complex transmission has a phase precision of 0.02 radians and a rela…
▽ More
We combine single-pixel imaging and homodyne detection to perform full object recovery (phase and amplitude). Our method does not require any prior information about the object or the illuminating fields. As a demonstration, we reconstruct the optical properties of several semi-transparent objects and find that the reconstructed complex transmission has a phase precision of 0.02 radians and a relative amplitude precision of 0.01.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
Efficient One Sided Kolmogorov Approximation
Authors:
Liat Cohen,
Tal Grinshpoun,
Gera Weiss
Abstract:
We present an efficient algorithm that, given a discrete random variable $X$ and a number $m$, computes a random variable whose support is of size at most $m$ and whose Kolmogorov distance from $X$ is minimal, also for the one-sided Kolmogorov approximation. We present some variants of the algorithm, analyse their correctness and computational complexity, and present a detailed empirical evaluatio…
▽ More
We present an efficient algorithm that, given a discrete random variable $X$ and a number $m$, computes a random variable whose support is of size at most $m$ and whose Kolmogorov distance from $X$ is minimal, also for the one-sided Kolmogorov approximation. We present some variants of the algorithm, analyse their correctness and computational complexity, and present a detailed empirical evaluation that shows how they performs in practice. The main application that we examine, which is our motivation for this work, is estimation of the probability missing deadlines in series-parallel schedules. Since exact computation of these probabilities is NP-hard, we propose to use the algorithms described in this paper to obtain an approximation.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Experimental Simulation of Loop Quantum Gravity on a Photonic Chip
Authors:
Reinier van der Meer,
Zichang Huang,
Malaquias Correa Anguita,
Dongxue Qu,
Peter Hooijschuur,
Hongguang Liu,
Muxin Han,
Jelmer J. Renema,
Lior Cohen
Abstract:
The unification of general relativity and quantum theory is one of the fascinating problems of modern physics. One leading solution is Loop Quantum Gravity (LQG). Simulating LQG may be important for providing predictions which can then be tested experimentally. However, such complex quantum simulations cannot run efficiently on classical computers, and quantum computers or simulators are needed. H…
▽ More
The unification of general relativity and quantum theory is one of the fascinating problems of modern physics. One leading solution is Loop Quantum Gravity (LQG). Simulating LQG may be important for providing predictions which can then be tested experimentally. However, such complex quantum simulations cannot run efficiently on classical computers, and quantum computers or simulators are needed. Here, we experimentally demonstrate quantum simulations of spinfoam amplitudes of LQG on an integrated photonics quantum processor. We simulate a basic transition of LQG and show that the derived spinfoam vertex amplitude falls within 4% error with respect to the theoretical prediction, despite experimental imperfections. We also discuss how to generalize the simulation for more complex transitions, in realistic experimental conditions, which will eventually lead to a quantum advantage demonstration as well as expand the toolbox to investigate LQG.
△ Less
Submitted 1 July, 2022;
originally announced July 2022.
-
A Geometric Method for Improved Uncertainty Estimation in Real-time
Authors:
Gabriella Chouraqui,
Liron Cohen,
Gil Einziger,
Liel Leman
Abstract:
Machine learning classifiers are probabilistic in nature, and thus inevitably involve uncertainty. Predicting the probability of a specific input to be correct is called uncertainty (or confidence) estimation and is crucial for risk management. Post-hoc model calibrations can improve models' uncertainty estimations without the need for retraining, and without changing the model. Our work puts forw…
▽ More
Machine learning classifiers are probabilistic in nature, and thus inevitably involve uncertainty. Predicting the probability of a specific input to be correct is called uncertainty (or confidence) estimation and is crucial for risk management. Post-hoc model calibrations can improve models' uncertainty estimations without the need for retraining, and without changing the model. Our work puts forward a geometric-based approach for uncertainty estimation. Roughly speaking, we use the geometric distance of the current input from the existing training inputs as a signal for estimating uncertainty and then calibrate that signal (instead of the model's estimation) using standard post-hoc calibration techniques. We show that our method yields better uncertainty estimations than recently proposed approaches by extensively evaluating multiple datasets and models. In addition, we also demonstrate the possibility of performing our approach in near real-time applications. Our code is available at our Github https://github.com/NoSleepDeveloper/Geometric-Calibrator.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Identifying the octupole Antiferromagnetic domain orientation in Mn$_{3}$NiN by scanning Anomalous Nernst Effect microscopy
Authors:
F. Johnson,
J. Kimák,
J. Zemen,
Z. Šobáň,
E. Schmoranzerová,
J. Godinho,
P. Němec,
S. Beckert,
H. Reichlová,
D. Boldrin,
J. Wunderlich,
L. F. Cohen
Abstract:
The intrinsic anomalous Nernst effect in a magnetic material is governed by the Berry curvature at the Fermi energy and can be realized in non-collinear antiferromagnets with vanishing magnetization. Thin films of (001)-oriented Mn$_{3}$NiN have their chiral antiferromagnetic structure located in the (111) plane facilitating the anomalous Nernst effect unusually in two orthogonal in-plane directio…
▽ More
The intrinsic anomalous Nernst effect in a magnetic material is governed by the Berry curvature at the Fermi energy and can be realized in non-collinear antiferromagnets with vanishing magnetization. Thin films of (001)-oriented Mn$_{3}$NiN have their chiral antiferromagnetic structure located in the (111) plane facilitating the anomalous Nernst effect unusually in two orthogonal in-plane directions. The sign of each component of the anomalous Nernst effect is determined by the local antiferromagnetic domain state. In this work, a temperature gradient is induced in a 50 nm thick Mn$_{3}$NiN two micron-size Hall cross by a focused scanning laser beam, and the spatial distribution of the anomalous Nernst voltage is used to image and identify the octupole macrodomain arrangement. Although the focused laser beam width may span many individual domains, cooling from room temperature through the antiferromagnetic transition temperature in an in-plane magnetic field prepares the domain state producing a checkerboard pattern resulting from the convolution of contributions from each domain. These images together with atomistic and micromagnetic simulations suggest an average macrodomain of the order of $1 μm^{2}$.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
Nanoscale electrostatic control in ultra clean van der Waals heterostructures by local anodic oxidation of graphite gates
Authors:
Liam A. Cohen,
Noah L. Samuelson,
Taige Wang,
Kai Klocke,
Cian C. Reeves,
Takashi Taniguchi,
Kenji Watanabe,
Sagar Vijay,
Michael P. Zaletel,
Andrea F. Young
Abstract:
In an all-van der Waals heterostructure, the active layer, gate dielectrics and gate electrodes are assembled from two-dimensional crystals that have a low density of atomic defects. This design allows two-dimensional electron systems with very low disorder to be created, particularly in heterostructures where the active layer also has intrinsically low disorder, such as crystalline graphene layer…
▽ More
In an all-van der Waals heterostructure, the active layer, gate dielectrics and gate electrodes are assembled from two-dimensional crystals that have a low density of atomic defects. This design allows two-dimensional electron systems with very low disorder to be created, particularly in heterostructures where the active layer also has intrinsically low disorder, such as crystalline graphene layers or metal dichalcogenide heterobilayers. A key missing ingredient has been nanoscale electrostatic control, with existing methods for fabricated local gates typically introducing unwanted contamination. Here we describe a resist-free local anodic oxidation process for patterning sub 100nm features in graphite gates, and their subsequent integration into an all-van der Waals heterostructure. We define a quantum point contact in the fractional quantum Hall regime as a benchmark device and observe signatures of chiral Luttinger liquid behaviour, indicating an absence of extrinsic scattering centres in the vicinity of the point contact. In the integer quantum Hall regime, we demonstrate in situ control of the edge confinement potential, a key requirement for the precision control of chiral edge states. This technique may enable the fabrication of devices capable of single anyon control and coherent edge-state interferometry in the fractional quantum Hall regime.
△ Less
Submitted 25 February, 2024; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Modeling Attrition in Recommender Systems with Departing Bandits
Authors:
Omer Ben-Porat,
Lee Cohen,
Liu Leqi,
Zachary C. Lipton,
Yishay Mansour
Abstract:
Traditionally, when recommender systems are formalized as multi-armed bandits, the policy of the recommender system influences the rewards accrued, but not the length of interaction. However, in real-world systems, dissatisfied users may depart (and never come back). In this work, we propose a novel multi-armed bandit setup that captures such policy-dependent horizons. Our setup consists of a fini…
▽ More
Traditionally, when recommender systems are formalized as multi-armed bandits, the policy of the recommender system influences the rewards accrued, but not the length of interaction. However, in real-world systems, dissatisfied users may depart (and never come back). In this work, we propose a novel multi-armed bandit setup that captures such policy-dependent horizons. Our setup consists of a finite set of user types, and multiple arms with Bernoulli payoffs. Each (user type, arm) tuple corresponds to an (unknown) reward probability. Each user's type is initially unknown and can only be inferred through their response to recommendations. Moreover, if a user is dissatisfied with their recommendation, they might depart the system. We first address the case where all users share the same type, demonstrating that a recent UCB-based algorithm is optimal. We then move forward to the more challenging case, where users are divided among two types. While naive approaches cannot handle this setting, we provide an efficient learning algorithm that achieves $\tilde{O}(\sqrt{T})$ regret, where $T$ is the number of users.
△ Less
Submitted 15 February, 2024; v1 submitted 24 March, 2022;
originally announced March 2022.