-
Micro-scale Electrostatic Structures formed on the Rough Surfaces of the Moon
Authors:
Y. Miyake,
J. Nakazono,
Y. Miyoshi,
Y. Harada,
M. N. Nishino,
S. Kurita,
S. Kasahara,
H. Usui,
A. Nagamatsu,
S. Nakamura
Abstract:
It is widely accepted that the surface potential of the lunar dayside is "on average" several to 10 V positive due to photoelectron emission in addition to the solar wind plasma precipitation. Recent studies, however, have shown that an insulating and rough regolith layer tends to make positive and negative charges separated and irregularly distributed on sub-Debye-length scales. The local charge…
▽ More
It is widely accepted that the surface potential of the lunar dayside is "on average" several to 10 V positive due to photoelectron emission in addition to the solar wind plasma precipitation. Recent studies, however, have shown that an insulating and rough regolith layer tends to make positive and negative charges separated and irregularly distributed on sub-Debye-length scales. The local charge separation then gives rise to an intense and structured electrostatic field. Such micro-scale electrostatic structures lie in the innermost part of the photoelectron sheath and may contribute to the mobilization of the charged dust particles. Since the electrostatic structures can take different states depending on the topography of the lunar surface, it is necessary to update the research approach. We have launched a research group to develop an integrated assessment framework that includes theoretical and numerical modeling, on-orbit observations, ground-based testing, and the development of charging measurement instruments, with the ultimate goal of comprehensively understanding the surface charging processes on the Moon.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
A generalized Legendre duality relation and Gaussian saturation
Authors:
Shohei Nakamura,
Hiroshi Tsuji
Abstract:
Motivated by the barycenter problem in optimal transportation theory, Kolesnikov--Werner recently extended the notion of the Legendre duality relation for two functions to the case for multiple functions. We further generalize the duality relation and then establish the centered Gaussian saturation property for a Blaschke--Santaló type inequality associated with it. Our approach to the understandi…
▽ More
Motivated by the barycenter problem in optimal transportation theory, Kolesnikov--Werner recently extended the notion of the Legendre duality relation for two functions to the case for multiple functions. We further generalize the duality relation and then establish the centered Gaussian saturation property for a Blaschke--Santaló type inequality associated with it. Our approach to the understanding such a generalized Legendre duality relation is based on our earlier observation that directly links Legendre duality with the inverse Brascamp--Lieb inequality. More precisely, for a large family of degenerate Brascamp--Lieb data, we prove that the centered Gaussian saturation property for the inverse Brascamp--Lieb inequality holds true when inputs are restricted to even and log-concave functions.
As an application to convex geometry, we establish the most important case of a conjecture of Kolesnikov and Werner about the Blaschke--Santaló inequality for multiple even functions as well as multiple symmetric convex bodies. Furthermore, in the direction of information theory and optimal transportation theory, this provides an affirmative answer to another conjecture of Kolesnikov--Werner about a Talagrand type inequality for multiple even probability measures that involves the Wasserstein barycenter.
△ Less
Submitted 8 October, 2024; v1 submitted 20 September, 2024;
originally announced September 2024.
-
Continuity method for the Mabuchi soliton on the extremal Fano manifolds
Authors:
Tomoyuki Hisamoto,
Satoshi Nakamura
Abstract:
We run the continuity method for Mabuchi's generalization of Kähler-Einstein metrics, assuming the existence of an extremal Kähler metric. It gives an analytic proof (without minimal model program) of the recent existence result obtained by Apostolov, Lahdili and Nitta. Our key observation is the boundedness of the energy functionals along the continuity method. The same argument can be applied to…
▽ More
We run the continuity method for Mabuchi's generalization of Kähler-Einstein metrics, assuming the existence of an extremal Kähler metric. It gives an analytic proof (without minimal model program) of the recent existence result obtained by Apostolov, Lahdili and Nitta. Our key observation is the boundedness of the energy functionals along the continuity method. The same argument can be applied to general $g$-solitons and $g$-extremal metrics.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
Authors:
Xi Chen,
Songyang Zhang,
Qibing Bai,
Kai Chen,
Satoshi Nakamura
Abstract:
We introduces LLaST, a framework for building high-performance Large Language model based Speech-to-text Translation systems. We address the limitations of end-to-end speech translation(E2E ST) models by exploring model architecture design and optimization techniques tailored for LLMs. Our approach includes LLM-based speech translation architecture design, ASR-augmented training, multilingual data…
▽ More
We introduces LLaST, a framework for building high-performance Large Language model based Speech-to-text Translation systems. We address the limitations of end-to-end speech translation(E2E ST) models by exploring model architecture design and optimization techniques tailored for LLMs. Our approach includes LLM-based speech translation architecture design, ASR-augmented training, multilingual data augmentation, and dual-LoRA optimization. Our approach demonstrates superior performance on the CoVoST-2 benchmark and showcases exceptional scaling capabilities powered by LLMs. We believe this effective method will serve as a strong baseline for speech translation and provide insights for future improvements of the LLM-based speech translation framework. We release the data, code and models in https://github.com/openaudiolab/LLaST.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Probing instantaneous quantum circuit refrigeration in the quantum regime
Authors:
Shuji Nakamura,
Teruaki Yoshioka,
Sergei Lemziakov,
Dmitrii Lvov,
Hiroto Mukai,
Akiyoshi Tomonaga,
Shintaro Takada,
Yuma Okazaki,
Nobu-Hisa Kaneko,
Jukka Pekola,
Jaw-Shen Tsai
Abstract:
Recent advancements in circuit quantum electrodynamics have enabled precise manipulation and detection of the single energy quantum in quantum systems. A quantum circuit refrigerator (QCR) is capable of electrically cooling the excited population of quantum systems, such as superconducting resonators and qubits, through photon-assisted tunneling of quasi-particles within a superconductor-insulator…
▽ More
Recent advancements in circuit quantum electrodynamics have enabled precise manipulation and detection of the single energy quantum in quantum systems. A quantum circuit refrigerator (QCR) is capable of electrically cooling the excited population of quantum systems, such as superconducting resonators and qubits, through photon-assisted tunneling of quasi-particles within a superconductor-insulator-normal metal junction. In this study, we demonstrated instantaneous QCR in the quantum regime. We performed the time-resolved measurement of the QCR-induced cooling of photon number inside the superconducting resonator by harnessing a qubit as a photon detector. From the enhanced photon loss rate of the resonator estimated from the amount of the AC Stark shift, the QCR was shown to have a cooling power of approximately 300 aW. Furthermore, even below the single energy quantum, the QCR can reduce the number of photons inside the resonator with 100 ns pulse from thermal equilibrium. Numerical calculations based on the Lindblad master equation successfully reproduced these experimental results.
△ Less
Submitted 13 August, 2024; v1 submitted 19 July, 2024;
originally announced July 2024.
-
An Automatic Quality Metric for Evaluating Simultaneous Interpretation
Authors:
Mana Makinae,
Katsuhito Sudoh,
Mararu Yamada,
Satoshi Nakamura
Abstract:
Simultaneous interpretation (SI), the translation of one language to another in real time, starts translation before the original speech has finished. Its evaluation needs to consider both latency and quality. This trade-off is challenging especially for distant word order language pairs such as English and Japanese. To handle this word order gap, interpreters maintain the word order of the source…
▽ More
Simultaneous interpretation (SI), the translation of one language to another in real time, starts translation before the original speech has finished. Its evaluation needs to consider both latency and quality. This trade-off is challenging especially for distant word order language pairs such as English and Japanese. To handle this word order gap, interpreters maintain the word order of the source language as much as possible to keep up with original language to minimize its latency while maintaining its quality, whereas in translation reordering happens to keep fluency in the target language. This means outputs synchronized with the source language are desirable based on the real SI situation, and it's a key for further progress in computational SI and simultaneous machine translation (SiMT). In this work, we propose an automatic evaluation metric for SI and SiMT focusing on word order synchronization. Our evaluation metric is based on rank correlation coefficients, leveraging cross-lingual pre-trained language models. Our experimental results on NAIST-SIC-Aligned and JNPC showed our metrics' effectiveness to measure word order synchronization between source and target language.
△ Less
Submitted 13 September, 2024; v1 submitted 9 July, 2024;
originally announced July 2024.
-
Effect of ground-state deformation on the Isoscalar Giant Monopole Resonance and the first observation of overtones of the Isoscalar Giant Quadrupole Resonance in rare-earth Nd isotopes
Authors:
M. Abdullah,
S. Bagchi,
M. N. Harakeh,
H. Akimune,
D. Das,
T. Doi,
L. M. Donaldson,
Y. Fujikawa,
M. Fujiwara,
T. Furuno,
U. Garg,
Y. K. Gupta,
K. B. Howard,
Y. Hijikata,
K. Inaba,
S. Ishida,
M. Itoh,
N. Kalantar-Nayestanaki,
D. Kar,
T. Kawabata,
S. Kawashima,
K. Khokhar,
K. Kitamura,
N. Kobayashi,
Y. Matsuda
, et al. (11 additional authors not shown)
Abstract:
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two co…
▽ More
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two components, while the nearly spherical nucleus 142Nd displays a single peak in the ISGMR strength distribution. A noteworthy achievement in this study is the first-time detection of overtones in the Isoscalar Giant Quadrupole Resonance (ISGQR) strength distributions within Nd isotopes at an excitation energy around 25 MeV obtained through Multipole Decomposition Analysis (MDA).
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
NAIST Simultaneous Speech Translation System for IWSLT 2024
Authors:
Yuka Ko,
Ryo Fukuda,
Yuta Nishikawa,
Yasumasa Kano,
Tomoya Yanagita,
Kosuke Doi,
Mana Makinae,
Haotian Tan,
Makoto Sakai,
Sakriani Sakti,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
This paper describes NAIST's submission to the simultaneous track of the IWSLT 2024 Evaluation Campaign: English-to-{German, Japanese, Chinese} speech-to-text translation and English-to-Japanese speech-to-speech translation. We develop a multilingual end-to-end speech-to-text translation model combining two pre-trained language models, HuBERT and mBART. We trained this model with two decoding poli…
▽ More
This paper describes NAIST's submission to the simultaneous track of the IWSLT 2024 Evaluation Campaign: English-to-{German, Japanese, Chinese} speech-to-text translation and English-to-Japanese speech-to-speech translation. We develop a multilingual end-to-end speech-to-text translation model combining two pre-trained language models, HuBERT and mBART. We trained this model with two decoding policies, Local Agreement (LA) and AlignAtt. The submitted models employ the LA policy because it outperformed the AlignAtt policy in previous models. Our speech-to-speech translation method is a cascade of the above speech-to-text model and an incremental text-to-speech (TTS) module that incorporates a phoneme estimation model, a parallel acoustic model, and a parallel WaveGAN vocoder. We improved our incremental TTS by applying the Transformer architecture with the AlignAtt policy for the estimation model. The results show that our upgraded TTS module contributed to improving the system performance.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Status of Xtend telescope onboard X-Ray Imaging and Spectroscopy Mission (XRISM)
Authors:
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takashi Okajima,
Hirofumi Noda,
Hiroyuki Uchida,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Nobukawa,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Manabu Ishida,
Yoshitomo Maeda,
Takayuki Hayashi
, et al. (38 additional authors not shown)
Abstract:
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is…
▽ More
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is realized by the combination of the SXI and XMA with a focal length of 5.6 m. The SXI employs four P-channel, back-illuminated type CCDs with a thick depletion layer of 200 $μ$m. The four CCD chips are arranged in a 2$\times$2 grid and cooled down to $-110$ $^{\circ}$C with a single-stage Stirling cooler. Before the launch of XRISM, we conducted a month-long spacecraft thermal vacuum test. The performance verification of the SXI was successfully carried out in a course of multiple thermal cycles of the spacecraft. About a month after the launch of XRISM, the SXI was carefully activated and the soundness of its functionality was checked by a step-by-step process. Commissioning observations followed the initial operation. We here present pre- and post-launch results verifying the Xtend performance. All the in-orbit performances are consistent with those measured on ground and satisfy the mission requirement. Extensive calibration studies are ongoing.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Circular polarization measurement for individual gamma rays in capture reactions with intense pulsed neutrons
Authors:
S. Endo,
R. Abe,
H. Fujioka,
T. Ino,
O. Iwamoto,
N. Iwamoto,
S. Kawamura,
A. Kimura,
M. Kitaguchi,
R. Kobayashi,
S. Nakamura,
T. Oku T. Okudaira,
M. Okuizumi,
M. Omer,
G. Rovira,
T. Shima,
H. M. Shimizu,
T. Shizuma,
Y. Taira,
S. Takada,
S. Takahashi,
H. Yoshikawa,
T. Yoshioka,
H. Zen
Abstract:
Measurements of circular polarization of $γ$-ray emitted from neutron capture reactions provide valuable information for nuclear physics studies. The spin and parity of excited states can be determined by measuring the circular polarization from polarized neutron capture reactions. Furthermore, the $γ$-ray circular polarization in a neutron capture resonance is crucial for studying the enhancement…
▽ More
Measurements of circular polarization of $γ$-ray emitted from neutron capture reactions provide valuable information for nuclear physics studies. The spin and parity of excited states can be determined by measuring the circular polarization from polarized neutron capture reactions. Furthermore, the $γ$-ray circular polarization in a neutron capture resonance is crucial for studying the enhancement effect of parity nonconservation in compound nuclei. The $γ$-ray circular polarization can be measured using a polarimeter based on magnetic Compton scattering. A polarimeter was constructed, and its performance indicators were evaluated using a circularly polarized $γ$-ray beam. Furthermore, as a demonstration, the $γ$-ray circular polarization was measured in $^{32}$S($\vec{\textrm{n}}$,$γ$)$^{33}$S reactions with polarized neutrons.
△ Less
Submitted 7 May, 2024;
originally announced June 2024.
-
A phase-space approach to weighted Fourier extension inequalities
Authors:
Jonathan Bennett,
Susana Gutierrez,
Shohei Nakamura,
Itamar Oliveira
Abstract:
The purpose of this paper is to expose and investigate natural phase-space formulations of two longstanding problems in the restriction theory of the Fourier transform. These problems, often referred to as the Stein and Mizohata--Takeuchi conjectures, assert that Fourier extension operators associated with rather general (codimension 1) submanifolds of euclidean space, may be effectively controlle…
▽ More
The purpose of this paper is to expose and investigate natural phase-space formulations of two longstanding problems in the restriction theory of the Fourier transform. These problems, often referred to as the Stein and Mizohata--Takeuchi conjectures, assert that Fourier extension operators associated with rather general (codimension 1) submanifolds of euclidean space, may be effectively controlled by the classical X-ray transform via weighted $L^2$ inequalities. Our phase-space formulations, which have their origins in recent work of Dendrinos, Mustata and Vitturi, expose close connections with a conjecture of Flandrin from time-frequency analysis, and rest on the identification of an explicit "geometric" Wigner transform associated with an arbitrary (smooth strictly convex) submanifold $S$ of $\mathbb{R}^n$. Our main results are certain natural "Sobolev variants" of the Stein and Mizohata--Takeuchi conjectures, and involve estimating the Sobolev norms of such Wigner transforms by geometric forms of classical bilinear fractional integrals. Our broad geometric framework allows us to explore the role of the curvature of the submanifold in these problems, and in particular we obtain bounds that are independent of any lower bound on the curvature; a feature that is uncommon in the wider restriction theory of the Fourier transform. Finally, we provide a further illustration of the effectiveness of our analysis by establishing a form of Flandrin's conjecture in the plane with an $\varepsilon$-loss. While our perspective comes primarily from euclidean harmonic analysis, the procedure used for constructing phase-space representations of extension operators is well-known in optics.
△ Less
Submitted 11 July, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
LLMs Are Zero-Shot Context-Aware Simultaneous Translators
Authors:
Roman Koshkin,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
The advent of transformers has fueled progress in machine translation. More recently large language models (LLMs) have come to the spotlight thanks to their generality and strong performance in a wide range of language tasks, including translation. Here we show that open-source LLMs perform on par with or better than some state-of-the-art baselines in simultaneous machine translation (SiMT) tasks,…
▽ More
The advent of transformers has fueled progress in machine translation. More recently large language models (LLMs) have come to the spotlight thanks to their generality and strong performance in a wide range of language tasks, including translation. Here we show that open-source LLMs perform on par with or better than some state-of-the-art baselines in simultaneous machine translation (SiMT) tasks, zero-shot. We also demonstrate that injection of minimal background information, which is easy with an LLM, brings further performance gains, especially on challenging technical subject-matter. This highlights LLMs' potential for building next generation of massively multilingual, context-aware and terminologically accurate SiMT systems that require no resource-intensive training or fine-tuning.
△ Less
Submitted 25 June, 2024; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Word Order in English-Japanese Simultaneous Interpretation: Analyses and Evaluation using Chunk-wise Monotonic Translation
Authors:
Kosuke Doi,
Yuka Ko,
Mana Makinae,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
This paper analyzes the features of monotonic translations, which follow the word order of the source language, in simultaneous interpreting (SI). Word order differences are one of the biggest challenges in SI, especially for language pairs with significant structural differences like English and Japanese. We analyzed the characteristics of chunk-wise monotonic translation (CMT) sentences using th…
▽ More
This paper analyzes the features of monotonic translations, which follow the word order of the source language, in simultaneous interpreting (SI). Word order differences are one of the biggest challenges in SI, especially for language pairs with significant structural differences like English and Japanese. We analyzed the characteristics of chunk-wise monotonic translation (CMT) sentences using the NAIST English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset and identified some grammatical structures that make monotonic translation difficult in English-Japanese SI. We further investigated the features of CMT sentences by evaluating the output from the existing speech translation (ST) and simultaneous speech translation (simulST) models on the NAIST English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset as well as on existing test sets. The results indicate the possibility that the existing SI-based test set underestimates the model performance. The results also suggest that using CMT sentences as references gives higher scores to simulST models than ST models, and that using an offline-based test set to evaluate the simulST models underestimates the model performance.
△ Less
Submitted 15 July, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory
Authors:
Kosuke Doi,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
This study examines the effect of grammatical features in automatic essay scoring (AES). We use two kinds of grammatical features as input to an AES model: (1) grammatical items that writers used correctly in essays, and (2) the number of grammatical errors. Experimental results show that grammatical features improve the performance of AES models that predict the holistic scores of essays. Multi-t…
▽ More
This study examines the effect of grammatical features in automatic essay scoring (AES). We use two kinds of grammatical features as input to an AES model: (1) grammatical items that writers used correctly in essays, and (2) the number of grammatical errors. Experimental results show that grammatical features improve the performance of AES models that predict the holistic scores of essays. Multi-task learning with the holistic and grammar scores, alongside using grammatical features, resulted in a larger improvement in model performance. We also show that a model using grammar abilities estimated using Item Response Theory (IRT) as the labels for the auxiliary task achieved comparable performance to when we used grammar scores assigned by human raters. In addition, we weight the grammatical features using IRT to consider the difficulty of grammatical items and writers' grammar abilities. We found that weighting grammatical features with the difficulty led to further improvement in performance.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Planar near-field measurements of specular and diffuse reflection of millimeter-wave absorbers
Authors:
Fumiya Miura,
Hayato Takakura,
Yutaro Sekimoto,
Junji Inatani,
Frederick Matsuda,
Shugo Oguri,
Shogo Nakamura
Abstract:
Mitigating the far sidelobes of a wide field-of-view telescope is one of the critical issues for polarization observation of the cosmic microwave background. Since even small reflections of stray light at the millimeter-wave absorbers inside the telescope may create nonnegligible far sidelobes, we have developed a method to measure the reflectance of millimeter-wave absorbers, including diffuse re…
▽ More
Mitigating the far sidelobes of a wide field-of-view telescope is one of the critical issues for polarization observation of the cosmic microwave background. Since even small reflections of stray light at the millimeter-wave absorbers inside the telescope may create nonnegligible far sidelobes, we have developed a method to measure the reflectance of millimeter-wave absorbers, including diffuse reflections. By applying the planar near-field measurement method to the absorbers, we have enabled two-dimensional diffuse-reflection measurements, in addition to characterizing specular reflection. We have measured the reflectance of five samples (TK RAM Large and Small Tiles and Eccosorb AN-72, HR-10, and LS-22) at two angles of incidence in the frequency range from 70 GHz to 110 GHz. Compared with conventional horn-to-horn measurements, we obtained a consistent specular reflectance with a higher precision, less affected by standing waves. We have demonstrated that the angular response and diffuse-to-specular reflectance ratio differ among various materials. The measurements also imply that some absorbers may affect the polarization direction when reflecting the incident waves.
△ Less
Submitted 23 August, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
A beam profile monitor for GeV photon with high spatial resolution and fast readout capability
Authors:
Ryoko Kino,
Takeru Akiyama,
Hiroyuki Fujioka,
Tomomasa Fujiwara,
Tatsuhiro Ishige,
Kosuke Itabashi,
Shunsuke Kajikawa,
Masashi Kaneta,
Masaya Mizuno,
Sho Nagao,
Satoshi N. Nakamura,
Kotaro Nishi,
Ken Nishida,
Kazuki Okuyama,
Fumiya Oura,
Koga Tachibana,
Yuichi Toyama,
Daigo Watanabe
Abstract:
A beam profile monitor (BPM) has been developed to measure photon beams at the BM4 beamline of the Mikamine site, Research Center for Accelerator and Radioisotope Science (RARIS-Mikamine; previously known as ELPH) at Tohoku University. The BPM comprises plastic scintillation fibers and SiPMs, enabling high-precision, real-time measurements of photon beams in the 1 GeV region. Data acquisition util…
▽ More
A beam profile monitor (BPM) has been developed to measure photon beams at the BM4 beamline of the Mikamine site, Research Center for Accelerator and Radioisotope Science (RARIS-Mikamine; previously known as ELPH) at Tohoku University. The BPM comprises plastic scintillation fibers and SiPMs, enabling high-precision, real-time measurements of photon beams in the 1 GeV region. Data acquisition utilized streaming TDC, a firmware commonly employed in the J-PARC Hadron-hall, enabling real-time detection of high-intensity photon beams with count rates reaching several tens of MHz. With sufficient statistical data, the BPM achieved a 1 s beam-profiling accuracy of 10 μm. The proposed BPM serves as a valuable resource for future physics experiments at the BM4 photon beamline and will contribute significantly to ongoing accelerator research endeavors.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Modified extremal Kähler metrics and multiplier Hermitian-Einstein metrics
Authors:
Yasuhiro Nakagawa,
Satoshi Nakamura
Abstract:
Motivated by the notion of multiplier Hermitian-Einstein metric of type $σ$ introduced by Mabuchi, we introduce the notion of $σ$-extremal Kähler metrics on compact Kähler manifolds, which generalizes Calabi's extremal Kähler metrics. We characterize the existence of this metric in terms of the coercivity of a certain functional on the space of Kähler metrics to show that, on a Fano manifold, the…
▽ More
Motivated by the notion of multiplier Hermitian-Einstein metric of type $σ$ introduced by Mabuchi, we introduce the notion of $σ$-extremal Kähler metrics on compact Kähler manifolds, which generalizes Calabi's extremal Kähler metrics. We characterize the existence of this metric in terms of the coercivity of a certain functional on the space of Kähler metrics to show that, on a Fano manifold, the existence of a multiplier Hermitian-Einstein metric of type $σ$ implies the existence of a $σ$-extremal Kähler metric.
△ Less
Submitted 12 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Projective Geometries and Simple Pointed Matroids as $\mathbb{F}_1$-modules
Authors:
Jonathan Beardsley,
So Nakamura
Abstract:
We describe a fully faithful embedding of projective geometries, given in terms of closure operators, into $\mathbb{F}_1$-modules, in the sense of Connes and Consani. This factors through a faithful functor out of simple pointed matroids. This follows from our construction of a fully faithful embedding of weakly unital, commutative hypermagmas into $\fun$-modules. This embedding is of independent…
▽ More
We describe a fully faithful embedding of projective geometries, given in terms of closure operators, into $\mathbb{F}_1$-modules, in the sense of Connes and Consani. This factors through a faithful functor out of simple pointed matroids. This follows from our construction of a fully faithful embedding of weakly unital, commutative hypermagmas into $\fun$-modules. This embedding is of independent interest as it generalizes the classical Eilenberg-MacLane embedding for commutative monoids and recovers Segal's nerve construction for commutative partial monoids. For this reason, we spend some time elaborating its structure.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Duality and Heat flow
Authors:
Dario Cordero-Erausquin,
Nathael Gozlan,
Shohei Nakamura,
Hiroshi Tsuji
Abstract:
We reveal the relation between the Legendre transform of convex functions and heat flow evolution, and how it applies to the functional Blaschke-Santalo inequality. We also describe local maximizers in this inequality.
We reveal the relation between the Legendre transform of convex functions and heat flow evolution, and how it applies to the functional Blaschke-Santalo inequality. We also describe local maximizers in this inequality.
△ Less
Submitted 13 June, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Electroproduction of the Lambda/Sigma^0 hyperons at Q^2~0.5 (GeV/c)^2 at forward angles
Authors:
K. Okuyama,
K. Itabashi,
S. Nagao,
S. N. Nakamura,
K. N. Suzuki,
T. Gogami,
B. Pandey,
L. Tang,
P. Bydžovský,
D. Skoupil,
T. Mart,
D. Abrams,
T. Akiyama,
D. Androic,
K. Aniol,
C. Ayerbe Gayoso,
J. Bane,
S. Barcus,
J. Barrow,
V. Bellini,
H. Bhatt,
D. Bhetuwal,
D. Biswas,
A. Camsonne,
J. Castellanos
, et al. (61 additional authors not shown)
Abstract:
In 2018, the E12-17-003 experiment was conducted at the Thomas Jefferson National Accelerator Facility (JLab) to explore the possible existence of an nnLambda state in the reconstructed missing mass distribution from a tritium gas target [K. N. Suzuki et al., Prog. Theor. Exp. Phys. 2022, 013D01 (2022), B. Pandey et al., Phys. Rev. C 105, L051001 (2022)]. As part of this investigation, data was al…
▽ More
In 2018, the E12-17-003 experiment was conducted at the Thomas Jefferson National Accelerator Facility (JLab) to explore the possible existence of an nnLambda state in the reconstructed missing mass distribution from a tritium gas target [K. N. Suzuki et al., Prog. Theor. Exp. Phys. 2022, 013D01 (2022), B. Pandey et al., Phys. Rev. C 105, L051001 (2022)]. As part of this investigation, data was also collected using a gaseous hydrogen target, not only for a precise absolute mass scale calibration but also for the study of Lambda/Sigma^0 electroproduction. This dataset was acquired at Q^2~0.5 (GeV/c)^2, W=2.14 GeV, and theta_{gamma K}^{c.m.}~8 deg. It covers forward angles where photoproduction data is scarce and a low-Q^2 region that is of interest for hypernuclear experiments. On the other hand, this kinematic region is at a slightly higher Q^2 than previous hypernuclear experiments, thus providing crucial information for understanding the Q^2 dependence of the differential cross sections for Lambda/Sigma^0 hyperon electroproduction. This paper reports on the Q^2 dependence of the differential cross section for the e + p -> e' + K^+ + Lambda/Sigma^0 reaction in the 0.2-0.8 (GeV/c)^2, and provides comparisons with the currently available theoretical models.
△ Less
Submitted 4 August, 2024; v1 submitted 2 March, 2024;
originally announced March 2024.
-
Equivalence of the staggered fermion Hamiltonan and the discrete Hodge-Dirac operator on square lattices
Authors:
Shu Nakamura
Abstract:
We show that the free massless staggered fermion (or the KS-fermion) Hamiltonian is equivalent to a discrete Hodge-Dirac operator on the $d$-dimensional square lattice $h\mathbb{Z}^d$. In fact, they are identical operator valued matrices under suitable choices of their representations on $\ell^2(2h\mathbb{Z}^d)\otimes\mathbb{C}^{2^d}$. We employ the formulations of the staggered fermion by Nakamur…
▽ More
We show that the free massless staggered fermion (or the KS-fermion) Hamiltonian is equivalent to a discrete Hodge-Dirac operator on the $d$-dimensional square lattice $h\mathbb{Z}^d$. In fact, they are identical operator valued matrices under suitable choices of their representations on $\ell^2(2h\mathbb{Z}^d)\otimes\mathbb{C}^{2^d}$. We employ the formulations of the staggered fermion by Nakamura (2024), and the discrete cohomology structure on the square lattices by Miranda-Parra (2023).
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
TransLLaMa: LLM-based Simultaneous Translation System
Authors:
Roman Koshkin,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
Decoder-only large language models (LLMs) have recently demonstrated impressive capabilities in text generation and reasoning. Nonetheless, they have limited applications in simultaneous machine translation (SiMT), currently dominated by encoder-decoder transformers. This study demonstrates that, after fine-tuning on a small dataset comprising causally aligned source and target sentence pairs, a p…
▽ More
Decoder-only large language models (LLMs) have recently demonstrated impressive capabilities in text generation and reasoning. Nonetheless, they have limited applications in simultaneous machine translation (SiMT), currently dominated by encoder-decoder transformers. This study demonstrates that, after fine-tuning on a small dataset comprising causally aligned source and target sentence pairs, a pre-trained open-source LLM can control input segmentation directly by generating a special "wait" token. This obviates the need for a separate policy and enables the LLM to perform English-German and English-Russian SiMT tasks with BLEU scores that are comparable to those of specific state-of-the-art baselines. We also evaluated closed-source models such as GPT-4, which displayed encouraging results in performing the SiMT task without prior training (zero-shot), indicating a promising avenue for enhancing future SiMT systems.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Response Generation for Cognitive Behavioral Therapy with Large Language Models: Comparative Study with Socratic Questioning
Authors:
Kenta Izumi,
Hiroki Tanaka,
Kazuhiro Shidara,
Hiroyoshi Adachi,
Daisuke Kanayama,
Takashi Kudo,
Satoshi Nakamura
Abstract:
Dialogue systems controlled by predefined or rule-based scenarios derived from counseling techniques, such as cognitive behavioral therapy (CBT), play an important role in mental health apps. Despite the need for responsible responses, it is conceivable that using the newly emerging LLMs to generate contextually relevant utterances will enhance these apps. In this study, we construct dialogue modu…
▽ More
Dialogue systems controlled by predefined or rule-based scenarios derived from counseling techniques, such as cognitive behavioral therapy (CBT), play an important role in mental health apps. Despite the need for responsible responses, it is conceivable that using the newly emerging LLMs to generate contextually relevant utterances will enhance these apps. In this study, we construct dialogue modules based on a CBT scenario focused on conventional Socratic questioning using two kinds of LLMs: a Transformer-based dialogue model further trained with a social media empathetic counseling dataset, provided by Osaka Prefecture (OsakaED), and GPT-4, a state-of-the art LLM created by OpenAI. By comparing systems that use LLM-generated responses with those that do not, we investigate the impact of generated responses on subjective evaluations such as mood change, cognitive change, and dialogue quality (e.g., empathy). As a result, no notable improvements are observed when using the OsakaED model. When using GPT-4, the amount of mood change, empathy, and other dialogue qualities improve significantly. Results suggest that GPT-4 possesses a high counseling ability. However, they also indicate that even when using a dialogue model trained with a human counseling dataset, it does not necessarily yield better outcomes compared to scenario-based dialogues. While presenting LLM-generated responses, including GPT-4, and having them interact directly with users in real-life mental health care services may raise ethical issues, it is still possible for human professionals to produce example responses or response templates using LLMs in advance in systems that use rules, scenarios, or example responses.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Picosecond Trajectory of Two-dimensional Vortex Motion in FeSe$_{0.5}$Te$_{0.5}$ Visualized by Terahertz Second Harmonic Generation
Authors:
Sachiko Nakamura,
Haruki Matsumoto,
Hiroki Ogawa,
Tomoki Kobayashi,
Fuyuki Nabeshima,
Atsutaka Maeda,
Ryo Shimano
Abstract:
We have investigated the vortex dynamics in a thin film of an iron-based superconductor FeSe$_{0.5}$Te$_{0.5}$ by observing second-harmonic generation (SHG) in the THz frequency range. We visualized the picosecond trajectory of two-dimensional vortex motion in a pinning potential tilted by Meissner shielding current. The SHG perpendicular to the driving field is observed, corresponding to the nonr…
▽ More
We have investigated the vortex dynamics in a thin film of an iron-based superconductor FeSe$_{0.5}$Te$_{0.5}$ by observing second-harmonic generation (SHG) in the THz frequency range. We visualized the picosecond trajectory of two-dimensional vortex motion in a pinning potential tilted by Meissner shielding current. The SHG perpendicular to the driving field is observed, corresponding to the nonreciprocal nonlinear Hall effect under the current-induced inversion symmetry breaking, whereas the linear Hall effect is negligible. The estimated vortex mass, as light as a bare electron, suggests that the vortex core moves independently from quasiparticles at such a high frequency and large velocity $\approx$300 km/s.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
The functional volume product under heat flow
Authors:
Shohei Nakamura,
Hiroshi Tsuji
Abstract:
We prove that the functional volume product for even functions is monotone increasing along the Fokker--Planck heat flow. This in particular yields a new proof of the functional Blaschke--Santaló inequality by K. Ball and also Artstein-Avidan--Klartag--Milman in the even case.
This result is the consequence of a new understanding of the regularizing property of the Ornstein--Uhlenbeck semigroup.…
▽ More
We prove that the functional volume product for even functions is monotone increasing along the Fokker--Planck heat flow. This in particular yields a new proof of the functional Blaschke--Santaló inequality by K. Ball and also Artstein-Avidan--Klartag--Milman in the even case.
This result is the consequence of a new understanding of the regularizing property of the Ornstein--Uhlenbeck semigroup. That is, we establish an improvement of Borell's reverse hypercontractivity inequality for even functions and identify the sharp range of the admissible exponents. As another consequence of successfully identifying the sharp range for the inequality, we derive the sharp $L^p$-$L^q$ inequality for the Laplace transform for even functions. The best constant of the inequality is attained by centered Gaussians, and thus this provides an analogous result to Beckner's sharp Hausdorff--Young inequality.
Our technical novelty in the proof is the use of the Brascamp--Lieb inequality for log-concave measures and Cramér--Rao's inequality in this context.
△ Less
Submitted 20 March, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
Spin current generation due to differential rotation
Authors:
Takumi Funato,
Shunichiro Kinoshita,
Norihiro Tanahashi,
Shin Nakamura,
Mamoru Matsuo
Abstract:
We study nonequilibrium spin dynamics in differentially rotating systems, deriving an effective Hamiltonian for conduction electrons in the comoving frame. In contrast to conventional spin current generation mechanisms that require vorticity, our theory describes spins and spin currents arising from differentially rotating systems regardless of vorticity. We demonstrate the generation of spin curr…
▽ More
We study nonequilibrium spin dynamics in differentially rotating systems, deriving an effective Hamiltonian for conduction electrons in the comoving frame. In contrast to conventional spin current generation mechanisms that require vorticity, our theory describes spins and spin currents arising from differentially rotating systems regardless of vorticity. We demonstrate the generation of spin currents in differentially rotating systems, such as liquid metals with Taylor-Couette flow. Our alternative mechanism will be important in the development of nanomechanical spin devices.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
-
Global coupled-channel analysis of $e^+e^-\to c\bar{c}$ processes in $\sqrt{s}=3.75-4.7$ GeV
Authors:
S. X. Nakamura,
X. -H. Li,
H. -P. Peng,
Z. -T. Sun,
X. -R. Zhou
Abstract:
Recent high-precision $e^+e^-\to c\bar{c}$ data from the BESIII and Belle are highly useful to understand the vector charmonium pole structure and puzzling lineshapes due to the exotic hadron candidates $Y$. We thus conduct a global coupled-channel analysis of most of the available data (9 two-body, 9 three-body, and 1 four-body final states) in $\sqrt{s}=3.75-4.7$ GeV. Not only cross sections but…
▽ More
Recent high-precision $e^+e^-\to c\bar{c}$ data from the BESIII and Belle are highly useful to understand the vector charmonium pole structure and puzzling lineshapes due to the exotic hadron candidates $Y$. We thus conduct a global coupled-channel analysis of most of the available data (9 two-body, 9 three-body, and 1 four-body final states) in $\sqrt{s}=3.75-4.7$ GeV. Not only cross sections but also invariant mass distributions of subsystems are fitted. Our model includes dozens of (quasi) two-body states that nonperturbatively couple with each other through bare charmonium excitations and particle-exchange mechanisms required by the three-body unitarity. The amplitudes obtained from the fits are analytically continued to vector charmonium and $Z_c$ poles. We do not find a $ψ(4160)$ pole that has been considered well-established. Instead, we find two poles of $\sim 4230$ MeV; $ψ(4230)$ with $Γ= 36$ MeV and a broader one with $Γ= 114$ MeV. Two $Z_c$ poles are found as virtual states $\sim 40$ MeV below the $D^*\bar{D}^{(*)}$ thresholds, being consistent with lattice QCD results. This work presents the first global analysis to determine the vector charmonium and $Z_c$ poles, thereby paving the way to extracting detailed properties of the prominent exotic hadron candidates from data.
△ Less
Submitted 8 February, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
Suppression of Electromagnetic Crosstalk by Differential Excitation for SAW Generation
Authors:
Shunsuke Ota,
Yuma Okazaki,
Shuji Nakamura,
Takehiko Oe,
Hermann Sellier,
Christopher Bäuerle,
Nobu-Hisa Kaneko,
Tetsuo Kodera,
Shintaro Takada
Abstract:
Surface acoustic waves (SAWs) hold a vast potential in various fields such as spintronics, quantum acoustics, and electron-quantum optics, but an electromagnetic wave emanating from SAW generation circuits has often been a major hurdle. Here, we investigate a differential excitation method of interdigital transducers (IDTs) to generate SAWs while reducing the electromagnetic wave. The results show…
▽ More
Surface acoustic waves (SAWs) hold a vast potential in various fields such as spintronics, quantum acoustics, and electron-quantum optics, but an electromagnetic wave emanating from SAW generation circuits has often been a major hurdle. Here, we investigate a differential excitation method of interdigital transducers (IDTs) to generate SAWs while reducing the electromagnetic wave. The results show that electromagnetic waves are suppressed by more than 90% in all directions. This suppression overcomes the operating limits and improves the scalability of SAW systems. Our results promise to facilitate the development of SAW-based applications in a wide range of research fields.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Strange hidden-charm pentaquark poles from $B^-\to J/ψΛ\bar{p}$
Authors:
Satoshi X. Nakamura,
Jia-Jun Wu
Abstract:
Recent LHCb data for $B^-\to J/ψΛ\bar{p}$ show a clear peak structure at the $Ξ_c\bar{D}$ threshold in the $J/ψΛ$ invariant mass ($M_{J/ψΛ}$) distribution. The LHCb's amplitude analysis identified the peak with the first hidden-charm pentaquark with strangeness $P_{ψs}^Λ(4338)$. We conduct a coupled-channel amplitude analysis of the LHCb data by simultaneously fitting the $M_{J/ψΛ}$,…
▽ More
Recent LHCb data for $B^-\to J/ψΛ\bar{p}$ show a clear peak structure at the $Ξ_c\bar{D}$ threshold in the $J/ψΛ$ invariant mass ($M_{J/ψΛ}$) distribution. The LHCb's amplitude analysis identified the peak with the first hidden-charm pentaquark with strangeness $P_{ψs}^Λ(4338)$. We conduct a coupled-channel amplitude analysis of the LHCb data by simultaneously fitting the $M_{J/ψΛ}$, $M_{J/ψ\bar{p}}$, $M_{Λ\bar{p}}$, and $\cosθ_{K^*}$ distributions. Rather than the Breit-Wigner fit employed in the LHCb analysis, we consider relevant threshold effects and a unitary $Ξ_c\bar{D}$-$Λ_c\bar{D}_s$ coupled-channel scattering amplitude from which $P_{ψs}^Λ$ poles are extracted for the first time. In our default fit, the $P_{ψs}^Λ(4338)$ pole is almost a $Ξ_c \bar{D}$ bound state at $( 4338.2\pm 1.4)-( 1.9\pm 0.5 )\,i$ MeV. Our default model also fits a large fluctuation at the $Λ_c\bar{D}_s$ threshold, giving a $Λ_c\bar{D}_s$ virtual state, $P_{ψs}^Λ(4255)$, at $4254.7\pm 0.4$ MeV. We also found that the $P_{ψs}^Λ(4338)$ peak cannot solely be a kinematical effect, and a nearby pole is needed.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
On-Demand Single-Electron Source via Single-Cycle Acoustic Pulses
Authors:
Shunsuke Ota,
Junliang Wang,
Hermann Edlbauer,
Yuma Okazaki,
Shuji Nakamura,
Takehiko Oe,
Arne Ludwig,
Andreas D. Wieck,
Hermann Sellier,
Christopher Bäuerle,
Nobu-Hisa Kaneko,
Tetsuo Kodera,
Shintaro Takada
Abstract:
Surface acoustic waves (SAWs) are a reliable solution to transport single electrons with precision in piezoelectric semiconductor devices. Recently, highly efficient single-electron transport with a strongly compressed single-cycle acoustic pulse has been demonstrated. This approach, however, requires surface gates constituting the quantum dots, their wiring, and multiple gate movements to load an…
▽ More
Surface acoustic waves (SAWs) are a reliable solution to transport single electrons with precision in piezoelectric semiconductor devices. Recently, highly efficient single-electron transport with a strongly compressed single-cycle acoustic pulse has been demonstrated. This approach, however, requires surface gates constituting the quantum dots, their wiring, and multiple gate movements to load and unload the electrons, which is very time-consuming. Here, on the contrary, we employ such a single-cycle acoustic pulse in a much simpler way - without any quantum dot at the entrance or exit of a transport channel - to perform single-electron transport between distant electron reservoirs. We observe the transport of a solitary electron in a single-cycle acoustic pulse via the appearance of the quantized acousto-electric current. The simplicity of our approach allows for on-demand electron emission with arbitrary delays on a ns time scale. We anticipate that enhanced synthesis of the SAWs will facilitate electron-quantum-optics experiments with multiple electron flying qubits.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Average Token Delay: A Duration-aware Latency Metric for Simultaneous Translation
Authors:
Yasumasa Kano,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
Simultaneous translation is a task in which the translation begins before the end of an input speech segment. Its evaluation should be conducted based on latency in addition to quality, and for users, the smallest possible amount of latency is preferable. Most existing metrics measure latency based on the start timings of partial translations and ignore their duration. This means such metrics do n…
▽ More
Simultaneous translation is a task in which the translation begins before the end of an input speech segment. Its evaluation should be conducted based on latency in addition to quality, and for users, the smallest possible amount of latency is preferable. Most existing metrics measure latency based on the start timings of partial translations and ignore their duration. This means such metrics do not penalize the latency caused by long translation output, which delays the comprehension of users and subsequent translations. In this work, we propose a novel latency evaluation metric for simultaneous translation called \emph{Average Token Delay} (ATD) that focuses on the duration of partial translations. We demonstrate its effectiveness through analyses simulating user-side latency based on Ear-Voice Span (EVS). In our experiment, ATD had the highest correlation with EVS among baseline latency metrics under most conditions.
△ Less
Submitted 27 November, 2023; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Exchange stiffness proportional to power of magnetization in permalloy co-doped with Mo and Cu
Authors:
Shiho Nakamura,
Nobuyuki Umetsu,
Michael Quinsat,
Masaki Kado
Abstract:
The exchange stiffness of magnetic materials is one of the essential parameters governing magnetic texture and its dynamics in magnetic devices. The effect of single-element doping on exchange stiffness has been investigated for several doping elements for permalloy (NiFe alloy), a soft magnetic material whose soft magnetic properties can be controlled by doping. However, the impact of more practi…
▽ More
The exchange stiffness of magnetic materials is one of the essential parameters governing magnetic texture and its dynamics in magnetic devices. The effect of single-element doping on exchange stiffness has been investigated for several doping elements for permalloy (NiFe alloy), a soft magnetic material whose soft magnetic properties can be controlled by doping. However, the impact of more practical multi-element doping on the exchange stiffness of permalloy is unknown. This study investigates the typical magnetic properties, including exchange stiffness, of permalloy systematically co-doped with Mo and Cu using broadband ferromagnetic resonance spectroscopy. We find that the exchange stiffness, which decreases with increasing doping levels, is proportional to a power of magnetization, which also decreases with increasing doping levels. The magnetization, $M_{\rm s}$, dependence of the exchange stiffness constant, $A$, of all the investigated samples, irrespective of the doping levels of each element, lies on a single curve expressed as $A\propto M_{\rm s}^n$ with exponent $n$ close to 2. This empirical power-law relationship provides a guideline for predicting unknown exchange stiffness in non-magnetic element-doped permalloy systems.
△ Less
Submitted 14 January, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
Three-body unitary coupled-channel approach to radiative $J/ψ$ decays and $η(1405/1475)$
Authors:
S. X. Nakamura,
Q. Huang,
J. -J. Wu,
H. P. Peng,
Y. Zhang,
Y. C. Zhu
Abstract:
Recent BESIII data on radiative $J/ψ$ decays from $\sim 10^{10}$ $J/ψ$ samples should significantly advance our understanding of the controversial nature of $η(1405/1475)$. This motivates us to develop a three-body unitary coupled-channel model for radiative $J/ψ$ decays to three-meson final states of any partial wave ($J^{PC}$). Basic building blocks of the model are bare resonance states such as…
▽ More
Recent BESIII data on radiative $J/ψ$ decays from $\sim 10^{10}$ $J/ψ$ samples should significantly advance our understanding of the controversial nature of $η(1405/1475)$. This motivates us to develop a three-body unitary coupled-channel model for radiative $J/ψ$ decays to three-meson final states of any partial wave ($J^{PC}$). Basic building blocks of the model are bare resonance states such as $η(1405/1475)$ and $f_1(1420)$, and $πK$, $K\bar{K}$, and $πη$ two-body interactions that generate resonances such as $K^*(892)$, $K^*_0(700)$, and $a_0(980)$. This model reasonably fits $K_SK_Sπ^0$ Dalitz plot pseudo data generated from the BESIII's $J^{PC}=0^{-+}$ amplitude for $J/ψ\toγK_SK_Sπ^0$. The experimental branching ratios of $η(1405/1475)\toηππ$ and $η(1405/1475)\toγρ$ relative to that of $η(1405/1475)\to K\bar{K}π$ are simultaneously fitted. Our $0^{-+}$ amplitude is analytically continued to find three poles, two of which correspond to $η(1405)$ on different Riemann sheets of the $K^*\bar{K}$ channel, and the third one for $η(1475)$. This is the first pole determination of $η(1405/1475)$ and, furthermore, the first-ever pole determination from analyzing experimental Dalitz plot distributions with a manifestly three-body unitary coupled-channel framework. Process-dependent $ηππ$, $γπ^+π^-$, and $πππ$ lineshapes of $J/ψ\toγ(0^{-+})\to γ(ηππ)$, $γ(γρ)$, and $γ(πππ)$ are predicted, and are in reasonable agreement with data. A triangle singularity is shown to play a crucial role to cause the large isospin violation of $J/ψ\toγ(πππ)$.
△ Less
Submitted 19 January, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
From Coupled Oscillators to Graph Neural Networks: Reducing Over-smoothing via a Kuramoto Model-based Approach
Authors:
Tuan Nguyen,
Hirotada Honda,
Takashi Sano,
Vinh Nguyen,
Shugo Nakamura,
Tan M. Nguyen
Abstract:
We propose the Kuramoto Graph Neural Network (KuramotoGNN), a novel class of continuous-depth graph neural networks (GNNs) that employs the Kuramoto model to mitigate the over-smoothing phenomenon, in which node features in GNNs become indistinguishable as the number of layers increases. The Kuramoto model captures the synchronization behavior of non-linear coupled oscillators. Under the view of c…
▽ More
We propose the Kuramoto Graph Neural Network (KuramotoGNN), a novel class of continuous-depth graph neural networks (GNNs) that employs the Kuramoto model to mitigate the over-smoothing phenomenon, in which node features in GNNs become indistinguishable as the number of layers increases. The Kuramoto model captures the synchronization behavior of non-linear coupled oscillators. Under the view of coupled oscillators, we first show the connection between Kuramoto model and basic GNN and then over-smoothing phenomenon in GNNs can be interpreted as phase synchronization in Kuramoto model. The KuramotoGNN replaces this phase synchronization with frequency synchronization to prevent the node features from converging into each other while allowing the system to reach a stable synchronized state. We experimentally verify the advantages of the KuramotoGNN over the baseline GNNs and existing methods in reducing over-smoothing on various graph deep learning benchmark tasks.
△ Less
Submitted 5 March, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Authors:
Shintaro Nakamura,
Masashi Sugiyama
Abstract:
We study the real-valued combinatorial pure exploration of the multi-armed bandit in the fixed-budget setting. We first introduce the Combinatorial Successive Asign (CSA) algorithm, which is the first algorithm that can identify the best action even when the size of the action class is exponentially large with respect to the number of arms. We show that the upper bound of the probability of error…
▽ More
We study the real-valued combinatorial pure exploration of the multi-armed bandit in the fixed-budget setting. We first introduce the Combinatorial Successive Asign (CSA) algorithm, which is the first algorithm that can identify the best action even when the size of the action class is exponentially large with respect to the number of arms. We show that the upper bound of the probability of error of the CSA algorithm matches a lower bound up to a logarithmic factor in the exponent. Then, we introduce another algorithm named the Minimax Combinatorial Successive Accepts and Rejects (Minimax-CombSAR) algorithm for the case where the size of the action class is polynomial, and show that it is optimal, which matches a lower bound. Finally, we experimentally compare the algorithms with previous methods and show that our algorithm performs better.
△ Less
Submitted 15 November, 2023; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Computational analyses of linguistic features with schizophrenic and autistic traits along with formal thought disorders
Authors:
Takeshi Saga,
Hiroki Tanaka,
Satoshi Nakamura
Abstract:
[See full abstract in the pdf] Formal Thought Disorder (FTD), which is a group of symptoms in cognition that affects language and thought, can be observed through language. FTD is seen across such developmental or psychiatric disorders as Autism Spectrum Disorder (ASD) or Schizophrenia, and its related Schizotypal Personality Disorder (SPD). This paper collected a Japanese audio-report dataset wit…
▽ More
[See full abstract in the pdf] Formal Thought Disorder (FTD), which is a group of symptoms in cognition that affects language and thought, can be observed through language. FTD is seen across such developmental or psychiatric disorders as Autism Spectrum Disorder (ASD) or Schizophrenia, and its related Schizotypal Personality Disorder (SPD). This paper collected a Japanese audio-report dataset with score labels related to ASD and SPD through a crowd-sourcing service from the general population. We measured language characteristics with the 2nd edition of the Social Responsiveness Scale (SRS2) and the Schizotypal Personality Questionnaire (SPQ), including an odd speech subscale from SPQ to quantify the FTD symptoms. We investigated the following four research questions through machine-learning-based score predictions: (RQ1) How are schizotypal and autistic measures correlated? (RQ2) What is the most suitable task to elicit FTD symptoms? (RQ3) Does the length of speech affect the elicitation of FTD symptoms? (RQ4) Which features are critical for capturing FTD symptoms? We confirmed that an FTD-related subscale, odd speech, was significantly correlated with both the total SPQ and SRS scores, although they themselves were not correlated significantly. Our regression analysis indicated that longer speech about a negative memory elicited more FTD symptoms. The ablation study confirmed the importance of function words and both the abstract and temporal features for FTD-related odd speech estimation. In contrast, content words were effective only in the SRS predictions, and content words were effective only in the SPQ predictions, a result that implies the differences between SPD-like and ASD-like symptoms. Data and programs used in this paper can be found here: https://sites.google.com/view/sagatake/resource.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
Threshold-cusp explanation for $X$ and $Z_{cs}$ in $B^+\to J/ψφK^+$
Authors:
S. X. Nakamura,
X. Luo
Abstract:
Several $X$ and $Z_{cs}$ exotic hadrons were claimed in the LHCb's amplitude analysis on $B^+\to J/ψφK^+$. The data shows that all the peaks and also dips in the spectra are located at thresholds of seemingly relevant meson-meson channels. While the LHCb analysis fitted the peaks with Breit-Wigner resonances, threshold kinematical cusps might be the cause of such structures. We thus analyze the LH…
▽ More
Several $X$ and $Z_{cs}$ exotic hadrons were claimed in the LHCb's amplitude analysis on $B^+\to J/ψφK^+$. The data shows that all the peaks and also dips in the spectra are located at thresholds of seemingly relevant meson-meson channels. While the LHCb analysis fitted the peaks with Breit-Wigner resonances, threshold kinematical cusps might be the cause of such structures. We thus analyze the LHCb data considering the threshold cusps. Our model is simultaneously fitted to $J/ψφ$, $J/ψK^+$, and $K^+φ$ invariant mass distributions. The threshold cusps fit well all the $X$, $Z_{cs}$, and dip structures. Our analysis indicates that spin-parity of the $X(4274)$ and $X(4500)$ structures are $0^-$ and $1^-$, respectively. This is different from the LHCb's spin-parity assignments ($1^+$ and $0^+$). The number of fitting parameters can be significantly reduced by considering the relevant threshold cusps. Our analysis shows that $D_s^{(*)}\bar{D}^{*}$ scattering lengths are consistent with zero. This disfavors an explanation of $Z_{cs}(4000)$ and $Z_{cs}(4220)$ as $D_s^{(*)}\bar{D}^{*}$ molecules, which is consistent with lattice QCD via the SU(3) relation.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Authors:
Shintaro Nakamura,
Masashi Sugiyama
Abstract:
We study the real-valued combinatorial pure exploration of the multi-armed bandit (R-CPE-MAB) problem. In R-CPE-MAB, a player is given $d$ stochastic arms, and the reward of each arm $s\in\{1, \ldots, d\}$ follows an unknown distribution with mean $μ_s$. In each time step, a player pulls a single arm and observes its reward. The player's goal is to identify the optimal \emph{action}…
▽ More
We study the real-valued combinatorial pure exploration of the multi-armed bandit (R-CPE-MAB) problem. In R-CPE-MAB, a player is given $d$ stochastic arms, and the reward of each arm $s\in\{1, \ldots, d\}$ follows an unknown distribution with mean $μ_s$. In each time step, a player pulls a single arm and observes its reward. The player's goal is to identify the optimal \emph{action} $\boldsymbolπ^{*} = \argmax_{\boldsymbolπ \in \mathcal{A}} \boldsymbolμ^{\top}\boldsymbolπ$ from a finite-sized real-valued \emph{action set} $\mathcal{A}\subset \mathbb{R}^{d}$ with as few arm pulls as possible. Previous methods in the R-CPE-MAB assume that the size of the action set $\mathcal{A}$ is polynomial in $d$. We introduce an algorithm named the Generalized Thompson Sampling Explore (GenTS-Explore) algorithm, which is the first algorithm that can work even when the size of the action set is exponentially large in $d$. We also introduce a novel problem-dependent sample complexity lower bound of the R-CPE-MAB problem, and show that the GenTS-Explore algorithm achieves the optimal sample complexity up to a problem-dependent constant factor.
△ Less
Submitted 15 November, 2023; v1 submitted 20 August, 2023;
originally announced August 2023.
-
Measurements of neutron total and capture cross sections of $^{139}$La and evaluation of resonance parameters
Authors:
Shunsuke Endo,
Shiori Kawamura,
Takuya Okudaira,
Hiromoto Yoshikawa,
Gerard Rovira,
Atsushi Kimura,
Shoji Nakamura,
Osamu Iwamoto,
Nobuyuki Iwamoto
Abstract:
Neutron total and capture cross sections of Lanthanum(La)-139 were measured at the Accurate Ne-utron-Nucleus Reaction measurement Instrument (ANNRI) of the Materials and Life Science Experimental Facility (MLF) in the Japan Proton Accelerator Research Complex (J-PARC). The total cross section was largely different from that in evaluated libraries, such as JENDL-5, in the energy range from 80 to 90…
▽ More
Neutron total and capture cross sections of Lanthanum(La)-139 were measured at the Accurate Ne-utron-Nucleus Reaction measurement Instrument (ANNRI) of the Materials and Life Science Experimental Facility (MLF) in the Japan Proton Accelerator Research Complex (J-PARC). The total cross section was largely different from that in evaluated libraries, such as JENDL-5, in the energy range from 80 to 900~eV. Resonance parameters for four resonances including one negative resonance were obtained using a resonance analysis code, REFIT. The resonance analysis revealed discrepancies in several resonance parameters with the evaluated libraries. Furthermore, the information about the scattering radius was also extracted from the results of the total cross section. The obtained scattering radius was larger than that recorded in the evaluated libraries.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Continuum limit for Laplace and Elliptic operators on lattices
Authors:
Keita Mikami,
Shu Nakamura,
Yukihide Tadano
Abstract:
Continuum limits of Laplace operators on general lattices are considered, and it is shown that these operators converge to elliptic operators on the Euclidean space in the sense of the generalized norm resolvent convergence. We then study operators on the hexagonal lattice, which does not apply the above general theory, but we can show its Laplace operator converges to the continuous Laplace opera…
▽ More
Continuum limits of Laplace operators on general lattices are considered, and it is shown that these operators converge to elliptic operators on the Euclidean space in the sense of the generalized norm resolvent convergence. We then study operators on the hexagonal lattice, which does not apply the above general theory, but we can show its Laplace operator converges to the continuous Laplace operator in the continuum limit. We also study discrete operators on the square lattice corresponding to second order strictly elliptic operators with variable coefficients, and prove the generalized norm resolvent convergence in the continuum limit.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Remarks on discrete Dirac operators and their continuum limits
Authors:
Shu Nakamura
Abstract:
We discuss possible definitions of discrete Dirac operators, and discuss their continuum limits. It is well-known in the lattice field theory that the straightforward discretization of the Dirac operator introduces unwanted spectral subspaces, and it is known as the fermion doubling. In oder to overcome this difficulty, two methods were proposed. The first one is to introduce a new term, called th…
▽ More
We discuss possible definitions of discrete Dirac operators, and discuss their continuum limits. It is well-known in the lattice field theory that the straightforward discretization of the Dirac operator introduces unwanted spectral subspaces, and it is known as the fermion doubling. In oder to overcome this difficulty, two methods were proposed. The first one is to introduce a new term, called the Wilson term, and the second one is the KS-fermion model or the staggered fermion model. We discuss mathematical formulations of these, and study their continuum limits.
△ Less
Submitted 15 August, 2023; v1 submitted 25 June, 2023;
originally announced June 2023.
-
Discovery of antiferromagnetic chiral helical ordered state in trigonal GdNi$_3$Ga$_9$
Authors:
Shota Nakamura,
Takeshi Matsumura,
Kazuma Ohashi,
Hiroto Suzuki,
Mitsuru Tsukagoshi,
Kenshin Kurauchi,
Hironori Nakao,
Shigeo Ohara
Abstract:
We have performed magnetic susceptibility, magnetization, and specific heat measurements on a chiral magnet GdNi$_3$Ga$_9$, belonging to the trigonal space group $R32$ (\#155). A magnetic phase transition takes place at $T_{\rm N}$ = 19.5 K. By applying a magnetic field along the $a$ axis at 2 K, the magnetization curve exhibits two jumps at $\sim$ 3 kOe and = 45 kOe. To determine the magnetic str…
▽ More
We have performed magnetic susceptibility, magnetization, and specific heat measurements on a chiral magnet GdNi$_3$Ga$_9$, belonging to the trigonal space group $R32$ (\#155). A magnetic phase transition takes place at $T_{\rm N}$ = 19.5 K. By applying a magnetic field along the $a$ axis at 2 K, the magnetization curve exhibits two jumps at $\sim$ 3 kOe and = 45 kOe. To determine the magnetic structure, we performed a resonant X-ray diffraction experiment by utilizing a circularly polarized beam. It is shown that a long-period antiferromagnetic (AFM) helical order is realized at zero field. The Gd spins in the honeycomb layer are coupled in an antiferromagnetic manner in the $c$ plane and rotate with a propagation vector $q$ = (0, 0, 1.485). The period of the helix is 66.7 unit cells ($\sim 180$~nm). In magnetic fields above 3~kOe applied perpendicular to the helical $c$ axis, the AFM helical order changes to an AFM order with $q$ = (0, 0, 1.5).
△ Less
Submitted 27 September, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Active Initialization Experiment of Superconducting Qubit Using Quantum-circuit Refrigerator
Authors:
Teruaki Yoshioka,
Hiroto Mukai,
Akiyoshi Tomonaga,
Shintaro Takada,
Yuma Okazaki,
Nobu-Hisa Kaneko,
Shuji Nakamura,
Jaw-Shen Tsai
Abstract:
The initialization of superconducting qubits is one of the essential techniques for the realization of quantum computation. In previous research, initialization above 99\% fidelity has been achieved at 280 ns. Here, we demonstrate the rapid initialization of a superconducting qubit with a quantum-circuit refrigerator (QCR). Photon-assisted tunneling of quasiparticles in the QCR can temporally incr…
▽ More
The initialization of superconducting qubits is one of the essential techniques for the realization of quantum computation. In previous research, initialization above 99\% fidelity has been achieved at 280 ns. Here, we demonstrate the rapid initialization of a superconducting qubit with a quantum-circuit refrigerator (QCR). Photon-assisted tunneling of quasiparticles in the QCR can temporally increase the relaxation time of photons inside the resonator and helps release energy from the qubit to the environment. Experiments using this protocol have shown that 99\% of initialization time is reduced to 180 ns. This initialization time depends strongly on the relaxation rate of the resonator, and faster initialization is possible by reducing the resistance of the QCR, which limits the ON/OFF ratio, and by strengthening the coupling between the QCR and the resonator.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab
Authors:
A. Accardi,
P. Achenbach,
D. Adhikari,
A. Afanasev,
C. S. Akondi,
N. Akopov,
M. Albaladejo,
H. Albataineh,
M. Albrecht,
B. Almeida-Zamora,
M. Amaryan,
D. Androić,
W. Armstrong,
D. S. Armstrong,
M. Arratia,
J. Arrington,
A. Asaturyan,
A. Austregesilo,
H. Avagyan,
T. Averett,
C. Ayerbe Gayoso,
A. Bacchetta,
A. B. Balantekin,
N. Baltzell,
L. Barion
, et al. (419 additional authors not shown)
Abstract:
This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron…
▽ More
This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron beams, CEBAF's potential for a higher energy upgrade presents a unique opportunity for an innovative nuclear physics program, which seamlessly integrates a rich historical background with a promising future. The proposed physics program encompass a diverse range of investigations centered around the nonperturbative dynamics inherent in hadron structure and the exploration of strongly interacting systems. It builds upon the exceptional capabilities of CEBAF in high-luminosity operations, the availability of existing or planned Hall equipment, and recent advancements in accelerator technology. The proposed program cover various scientific topics, including Hadron Spectroscopy, Partonic Structure and Spin, Hadronization and Transverse Momentum, Spatial Structure, Mechanical Properties, Form Factors and Emergent Hadron Mass, Hadron-Quark Transition, and Nuclear Dynamics at Extreme Conditions, as well as QCD Confinement and Fundamental Symmetries. Each topic highlights the key measurements achievable at a 22 GeV CEBAF accelerator. Furthermore, this document outlines the significant physics outcomes and unique aspects of these programs that distinguish them from other existing or planned facilities. In summary, this document provides an exciting rationale for the energy upgrade of CEBAF to 22 GeV, outlining the transformative scientific potential that lies within reach, and the remarkable opportunities it offers for advancing our understanding of hadron physics and related fundamental phenomena.
△ Less
Submitted 24 August, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
An Optimal Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Authors:
Shintaro Nakamura,
Masashi Sugiyama
Abstract:
We study the real-valued combinatorial pure exploration problem in the stochastic multi-armed bandit (R-CPE-MAB). We study the case where the size of the action set is polynomial with respect to the number of arms. In such a case, the R-CPE-MAB can be seen as a special case of the so-called transductive linear bandits. Existing methods in the R-CPE-MAB and transductive linear bandits have a gap of…
▽ More
We study the real-valued combinatorial pure exploration problem in the stochastic multi-armed bandit (R-CPE-MAB). We study the case where the size of the action set is polynomial with respect to the number of arms. In such a case, the R-CPE-MAB can be seen as a special case of the so-called transductive linear bandits. Existing methods in the R-CPE-MAB and transductive linear bandits have a gap of problem-dependent constant terms and logarithmic terms between the upper and lower bounds of the sample complexity, respectively. We close these gaps by proposing an algorithm named the combinatorial gap-based exploration (CombGapE) algorithm, whose sample complexity upper bound matches the lower bound. Finally, we numerically show that the CombGapE algorithm outperforms existing methods significantly.
△ Less
Submitted 14 December, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Authors:
Yuka Ko,
Ryo Fukuda,
Yuta Nishikawa,
Yasumasa Kano,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
Simultaneous speech translation (SimulST) translates partial speech inputs incrementally. Although the monotonic correspondence between input and output is preferable for smaller latency, it is not the case for distant language pairs such as English and Japanese. A prospective approach to this problem is to mimic simultaneous interpretation (SI) using SI data to train a SimulST model. However, the…
▽ More
Simultaneous speech translation (SimulST) translates partial speech inputs incrementally. Although the monotonic correspondence between input and output is preferable for smaller latency, it is not the case for distant language pairs such as English and Japanese. A prospective approach to this problem is to mimic simultaneous interpretation (SI) using SI data to train a SimulST model. However, the size of such SI data is limited, so the SI data should be used together with ordinary bilingual data whose translations are given in offline. In this paper, we propose an effective way to train a SimulST model using mixed data of SI and offline. The proposed method trains a single model using the mixed data with style tags that tell the model to generate SI- or offline-style outputs. Experiment results show improvements of BLEURT in different latency ranges, and our analyses revealed the proposed model generates SI-style outputs more than the baseline.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation
Authors:
Yuta Nishikawa,
Satoshi Nakamura
Abstract:
In end-to-end speech translation, speech and text pre-trained models improve translation quality. Recently proposed models simply connect the pre-trained models of speech and text as encoder and decoder. Therefore, only the information from the final layer of encoders is input to the decoder. Since it is clear that the speech pre-trained model outputs different information from each layer, the sim…
▽ More
In end-to-end speech translation, speech and text pre-trained models improve translation quality. Recently proposed models simply connect the pre-trained models of speech and text as encoder and decoder. Therefore, only the information from the final layer of encoders is input to the decoder. Since it is clear that the speech pre-trained model outputs different information from each layer, the simple connection method cannot fully utilize the information that the speech pre-trained model has. In this study, we propose an inter-connection mechanism that aggregates the information from each layer of the speech pre-trained model by weighted sums and inputs into the decoder. This mechanism increased BLEU by approximately 2 points in en-de, en-ja, and en-zh by increasing parameters by 2K when the speech pre-trained model was frozen. Furthermore, we investigated the contribution of each layer for each language by visualizing layer weights and found that the contributions were different.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Arukikata Travelogue Dataset
Authors:
Hiroki Ouchi,
Hiroyuki Shindo,
Shoko Wakamiya,
Yuki Matsuda,
Naoya Inoue,
Shohei Higashiyama,
Satoshi Nakamura,
Taro Watanabe
Abstract:
We have constructed Arukikata Travelogue Dataset and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before providing our dataset, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to pr…
▽ More
We have constructed Arukikata Travelogue Dataset and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before providing our dataset, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to prepare their own data. This hinders the replication of existing studies and fair comparative analysis of experimental results. Our dataset enables any researchers to conduct investigation on the same data and to ensure transparency and reproducibility in research. In this paper, we describe the academic significance, characteristics, and prospects of our dataset.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation
Authors:
Ryo Fukuda,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
Speech translation (ST) automatically converts utterances in a source language into text in another language. Splitting continuous speech into shorter segments, known as speech segmentation, plays an important role in ST. Recent segmentation methods trained to mimic the segmentation of ST corpora have surpassed traditional approaches. Tsiamas et al. proposed a segmentation frame classifier (SFC) b…
▽ More
Speech translation (ST) automatically converts utterances in a source language into text in another language. Splitting continuous speech into shorter segments, known as speech segmentation, plays an important role in ST. Recent segmentation methods trained to mimic the segmentation of ST corpora have surpassed traditional approaches. Tsiamas et al. proposed a segmentation frame classifier (SFC) based on a pre-trained speech encoder called wav2vec 2.0. Their method, named SHAS, retains 95-98% of the BLEU score for ST corpus segmentation. However, the segments generated by SHAS are very different from ST corpus segmentation and tend to be longer with multiple combined utterances. This is due to SHAS's reliance on length heuristics, i.e., it splits speech into segments of easily translatable length without fully considering the potential for ST improvement by splitting them into even shorter segments. Longer segments often degrade translation quality and ST's time efficiency. In this study, we extended SHAS to improve ST translation accuracy and efficiency by splitting speech into shorter segments that correspond to sentences. We introduced a simple segmentation algorithm using the moving average of SFC predictions without relying on length heuristics and explored wav2vec 2.0 fine-tuning for improved speech segmentation prediction. Our experimental results reveal that our speech segmentation method significantly improved the quality and the time efficiency of speech translation compared to SHAS.
△ Less
Submitted 18 December, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus
Authors:
Jinming Zhao,
Yuka Ko,
Kosuke Doi,
Ryo Fukuda,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
It remains a question that how simultaneous interpretation (SI) data affects simultaneous machine translation (SiMT). Research has been limited due to the lack of a large-scale training corpus. In this work, we aim to fill in the gap by introducing NAIST-SIC-Aligned, which is an automatically-aligned parallel English-Japanese SI dataset. Starting with a non-aligned corpus NAIST-SIC, we propose a t…
▽ More
It remains a question that how simultaneous interpretation (SI) data affects simultaneous machine translation (SiMT). Research has been limited due to the lack of a large-scale training corpus. In this work, we aim to fill in the gap by introducing NAIST-SIC-Aligned, which is an automatically-aligned parallel English-Japanese SI dataset. Starting with a non-aligned corpus NAIST-SIC, we propose a two-stage alignment approach to make the corpus parallel and thus suitable for model training. The first stage is coarse alignment where we perform a many-to-many mapping between source and target sentences, and the second stage is fine-grained alignment where we perform intra- and inter-sentence filtering to improve the quality of aligned pairs. To ensure the quality of the corpus, each step has been validated either quantitatively or qualitatively. This is the first open-sourced large-scale parallel SI dataset in the literature. We also manually curated a small test set for evaluation purposes. Our results show that models trained with SI data lead to significant improvement in translation quality and latency over baselines. We hope our work advances research on SI corpora construction and SiMT. Our data can be found at https://github.com/mingzi151/AHC-SI.
△ Less
Submitted 31 March, 2024; v1 submitted 23 April, 2023;
originally announced April 2023.