Skip to main content

Showing 1–26 of 26 results for author: Simon, I

  1. arXiv:2408.10529  [pdf

    cs.SE

    Automated Detection of Algorithm Debt in Deep Learning Frameworks: An Empirical Study

    Authors: Emmanuel Iko-Ojo Simon, Chirath Hettiarachchi, Alex Potanin, Hanna Suominen, Fatemeh Fard

    Abstract: Context: Previous studies demonstrate that Machine or Deep Learning (ML/DL) models can detect Technical Debt from source code comments called Self-Admitted Technical Debt (SATD). Despite the importance of ML/DL in software development, limited studies focus on automated detection for new SATD types: Algorithm Debt (AD). AD detection is important because it helps to identify TD early, facilitating… ▽ More

    Submitted 21 August, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted as Continuity Acceptance (CA) for a Stage 1 registration of the Registered Report Track at 40th IEEE International Conference on Software Maintenance and Evolution (ICSME 2024), Flagstaff, USA, October 6-11, 2024

    ACM Class: D.2.7; K.6.3

  2. arXiv:2401.13825  [pdf, other

    astro-ph.SR astro-ph.GA

    RR Lyrae Stars Belonging to the Candidate Globular Cluster Patchick 99

    Authors: Evan Butler, Andrea Kunder, Zdenek Prudil, Kevin R. Covey, Macy Ball, Carlos Campos, Kaylen Gollnick, Julio Olivares Carvajal, Joanne Hughes, Kathryn Devine, Christian I. Johnson, A. Katherina Vivas, Michael R. Rich, Meridith Joyce, Iulia T. Simon, Tommaso Marchetti, Andreas J. Koch-Hansen, William I. Clarkson, Rebekah Kuss

    Abstract: Patchick 99 is a candidate globular cluster located in the direction of the Galactic bulge, with a proper motion almost identical to the field and extreme field star contamination. A recent analysis suggests it is a low-luminosity globular cluster with a population of RR Lyrae stars. We present new spectra of stars in and around Patchick 99, targeting specifically the 3 RR Lyrae stars associated w… ▽ More

    Submitted 25 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted to The Astrophysical Journal Letters. Replaced due to a typo in the title

  3. arXiv:2311.06453  [pdf, other

    cs.SE cs.CL

    DocGen: Generating Detailed Parameter Docstrings in Python

    Authors: Vatsal Venkatkrishna, Durga Shree Nagabushanam, Emmanuel Iko-Ojo Simon, Melina Vidoni

    Abstract: Documentation debt hinders the effective utilization of open-source software. Although code summarization tools have been helpful for developers, most would prefer a detailed account of each parameter in a function rather than a high-level summary. However, generating such a summary is too intricate for a single generative model to produce reliably due to the lack of high-quality training data. Th… ▽ More

    Submitted 17 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

  4. arXiv:2301.12662  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    SingSong: Generating musical accompaniments from singing

    Authors: Chris Donahue, Antoine Caillon, Adam Roberts, Ethan Manilow, Philippe Esling, Andrea Agostinelli, Mauro Verzetti, Ian Simon, Olivier Pietquin, Neil Zeghidour, Jesse Engel

    Abstract: We present SingSong, a system that generates instrumental music to accompany input vocals, potentially offering musicians and non-musicians alike an intuitive new way to create music featuring their own voice. To accomplish this, we build on recent developments in musical source separation and audio generation. Specifically, we apply a state-of-the-art source separation algorithm to a large corpus… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

  5. arXiv:2209.14458  [pdf, other

    cs.SD cs.IR cs.LG eess.AS

    The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling

    Authors: Yusong Wu, Josh Gardner, Ethan Manilow, Ian Simon, Curtis Hawthorne, Jesse Engel

    Abstract: Data is the lifeblood of modern machine learning systems, including for those in Music Information Retrieval (MIR). However, MIR has long been mired by small datasets and unreliable labels. In this work, we propose to break this bottleneck using generative modeling. By pipelining a generative model of notes (Coconet trained on Bach Chorales) with a structured synthesis model of chamber ensembles (… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  6. arXiv:2206.13380  [pdf

    astro-ph.EP physics.geo-ph

    Properties of the Nili Fossae Olivine-clay-carbonate lithology: orbital and in situ at Séítah

    Authors: Adrian J. Brown, Linda Kah, Lucia Mandon, Roger Wiens, Patrick Pinet, Elise Clavé, Stéphane Le Mouélic, Arya Udry, Patrick J. Gasda, Clément Royer, Keyron Hickman-Lewis11, Agnes Cousin, Justin I. Simon, Jade Comellas14, Edward Cloutis, Thierry Fouchet, Alberto G. Fairén, Stephanie Connell, David Flannery, Briony Horgan, Lisa Mayhew, Allan Treiman, Jorge I. Núñez, Brittan Wogsland, Karim Benzerara , et al. (9 additional authors not shown)

    Abstract: We examine the observed properties of the Nili Fossae olivine-clay-carbonate lithology from orbital data and in situ by the Mars 2020 rover at the Séítah unit in Jezero crater, including: 1) composition (Liu, 2022) 2) grain size (Tice, 2022) 3) inferred viscosity (calculated based on geochemistry collected by SuperCam (Wiens, 2022)). Based on the low viscosity and distribution of the unit we postu… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 34 pages, 15 figures

  7. arXiv:2206.05408  [pdf, other

    cs.SD cs.LG eess.AS

    Multi-instrument Music Synthesis with Spectrogram Diffusion

    Authors: Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse Engel

    Abstract: An ideal music synthesizer should be both interactive and expressive, generating high-fidelity audio in realtime for arbitrary combinations of instruments and notes. Recent neural synthesizers have exhibited a tradeoff between domain-specific models that offer detailed control of only specific instruments, or raw waveform models that can train on any music but with minimal control and slow generat… ▽ More

    Submitted 12 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  8. arXiv:2202.07765  [pdf, other

    cs.LG cs.AI cs.CV cs.SD eess.AS

    General-purpose, long-context autoregressive modeling with Perceiver AR

    Authors: Curtis Hawthorne, Andrew Jaegle, Cătălina Cangea, Sebastian Borgeaud, Charlie Nash, Mateusz Malinowski, Sander Dieleman, Oriol Vinyals, Matthew Botvinick, Ian Simon, Hannah Sheahan, Neil Zeghidour, Jean-Baptiste Alayrac, João Carreira, Jesse Engel

    Abstract: Real-world data is high-dimensional: a book, image, or musical performance can easily contain hundreds of thousands of elements even after compression. However, the most commonly used autoregressive models, Transformers, are prohibitively expensive to scale to the number of inputs and layers needed to capture this long-range structure. We develop Perceiver AR, an autoregressive, modality-agnostic… ▽ More

    Submitted 14 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  9. arXiv:2111.03017  [pdf, other

    cs.SD cs.LG eess.AS

    MT3: Multi-Task Multitrack Music Transcription

    Authors: Josh Gardner, Ian Simon, Ethan Manilow, Curtis Hawthorne, Jesse Engel

    Abstract: Automatic Music Transcription (AMT), inferring musical notes from raw audio, is a challenging task at the core of music understanding. Unlike Automatic Speech Recognition (ASR), which typically focuses on the words of a single speaker, AMT often requires transcribing multiple instruments simultaneously, all while preserving fine-scale pitch and timing information. Further, many AMT datasets are "l… ▽ More

    Submitted 15 March, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: ICLR 2022 camera-ready version

  10. arXiv:2107.09142  [pdf, other

    cs.SD cs.LG eess.AS

    Sequence-to-Sequence Piano Transcription with Transformers

    Authors: Curtis Hawthorne, Ian Simon, Rigel Swavely, Ethan Manilow, Jesse Engel

    Abstract: Automatic Music Transcription has seen significant progress in recent years by training custom deep neural networks on large datasets. However, these models have required extensive domain-specific design of network architectures, input/output representations, and complex decoding schemes. In this work, we show that equivalent performance can be achieved using a generic encoder-decoder Transformer… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  11. arXiv:2103.16091  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Symbolic Music Generation with Diffusion Models

    Authors: Gautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon

    Abstract: Score-based generative models and diffusion probabilistic models have been successful at generating high-quality samples in continuous domains such as images and audio. However, due to their Langevin-inspired sampling mechanisms, their application to discrete and sequential data has been limited. In this work, we present a technique for training diffusion models on sequential data by parameterizin… ▽ More

    Submitted 25 November, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: ISMIR 2021

  12. arXiv:2008.01100  [pdf, ps, other

    astro-ph.SR astro-ph.GA

    A Search for Light Hydrides in the Envelopes of Evolved Stars

    Authors: Mark A. Siebert, Ignacio Simon, Christopher N. Shingledecker, P. Brandon Carroll, Andrew M. Burkhardt, Shawn Thomas Booth, Anthony J. Remijan, Rebeca Aladro, Carlos A. Duran, Brett A. McGuire

    Abstract: We report a search for the diatomic hydrides SiH, PH, and FeH along the line of sight toward the chemically rich circumstellar envelopes of IRC+10216 and VY Canis Majoris. These molecules are thought to form in high temperature regions near the photospheres of these stars, and may then further react via gas-phase and dust-grain interactions leading to more complex species, but have yet to be const… ▽ More

    Submitted 18 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted for publication in ApJ. 14 pages, 4 figures, 3 tables

  13. arXiv:1912.05537  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Encoding Musical Style with Transformer Autoencoders

    Authors: Kristy Choi, Curtis Hawthorne, Ian Simon, Monica Dinculescu, Jesse Engel

    Abstract: We consider the problem of learning high-level controls over the global structure of generated sequences, particularly in the context of symbolic music generation with complex language models. In this work, we present the Transformer autoencoder, which aggregates encodings of the input data across time to obtain a global representation of style from a given performance. We show it is possible to c… ▽ More

    Submitted 30 June, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

  14. arXiv:1906.02667  [pdf, other

    cs.LG stat.ML

    Application of Machine Learning to accidents detection at directional drilling

    Authors: Ekaterina Gurina, Nikita Klyuchnikov, Alexey Zaytsev, Evgenya Romanenkova, Ksenia Antipova, Igor Simon, Victor Makarov, Dmitry Koroteev

    Abstract: We present a data-driven algorithm and mathematical model for anomaly alarming at directional drilling. The algorithm is based on machine learning. It compares the real-time drilling telemetry with one corresponding to past accidents and analyses the level of similarity. The model performs a time-series comparison using aggregated statistics and Gradient Boosting classification. It is trained on h… ▽ More

    Submitted 12 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  15. Real-time data-driven detection of the rock type alteration during a directional drilling

    Authors: Evgenya Romanenkova, Alexey Zaytsev, Nikita Klyuchnikov, Arseniy Gruzdev, Ksenia Antipova, Leyla Ismailova, Evgeny Burnaev, Artyom Semenikhin, Vitaliy Koryabkin, Igor Simon, Dmitry Koroteev

    Abstract: During the directional drilling, a bit may sometimes go to a nonproductive rock layer due to the gap about 20m between the bit and high-fidelity rock type sensors. The only way to detect the lithotype changes in time is the usage of Measurements While Drilling (MWD) data. However, there are no general mathematical modeling approaches that both well reconstruct the rock type based on MWD data and c… ▽ More

    Submitted 12 December, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

  16. arXiv:1810.12247  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset

    Authors: Curtis Hawthorne, Andriy Stasyuk, Adam Roberts, Ian Simon, Cheng-Zhi Anna Huang, Sander Dieleman, Erich Elsen, Jesse Engel, Douglas Eck

    Abstract: Generating musical audio directly with neural networks is notoriously difficult because it requires coherently modeling structure at many different timescales. Fortunately, most music is also highly structured and can be represented as discrete note events played on musical instruments. Herein, we show that by using notes as an intermediate representation, we can train a suite of models capable of… ▽ More

    Submitted 17 January, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Examples available at https://goo.gl/magenta/maestro-examples

  17. arXiv:1810.05246  [pdf, other

    cs.LG cs.HC cs.SD eess.AS stat.ML

    Piano Genie

    Authors: Chris Donahue, Ian Simon, Sander Dieleman

    Abstract: We present Piano Genie, an intelligent controller which allows non-musicians to improvise on the piano. With Piano Genie, a user performs on a simple interface with eight buttons, and their performance is decoded into the space of plausible piano music in real time. To learn a suitable mapping procedure for this problem, we train recurrent neural network autoencoders with discrete bottlenecks: an… ▽ More

    Submitted 22 March, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: Published as a conference paper at ACM IUI 2019

  18. arXiv:1809.04281  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Music Transformer

    Authors: Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Simon, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu, Douglas Eck

    Abstract: Music relies heavily on repetition to build structure and meaning. Self-reference occurs on multiple timescales, from motifs to phrases to reusing of entire sections of music, such as in pieces with ABA structure. The Transformer (Vaswani et al., 2017), a sequence model based on self-attention, has achieved compelling results in many generation tasks that require maintaining long-range coherence.… ▽ More

    Submitted 12 December, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Improved skewing section and accompanying figures. Previous titles are "An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer"

  19. arXiv:1808.03715  [pdf, ps, other

    cs.SD cs.LG eess.AS

    This Time with Feeling: Learning Expressive Musical Performance

    Authors: Sageev Oore, Ian Simon, Sander Dieleman, Douglas Eck, Karen Simonyan

    Abstract: Music generation has generally been focused on either creating scores or interpreting them. We discuss differences between these two problems and propose that, in fact, it may be valuable to work in the space of direct $\it performance$ generation: jointly predicting the notes $\it and$ $\it also$ their expressive timing and dynamics. We consider the significance and qualities of the data set need… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: Includes links to urls for audio samples

  20. arXiv:1806.03218  [pdf, other

    cs.LG stat.ML

    Data-driven model for the identification of the rock type at a drilling bit

    Authors: Nikita Klyuchnikov, Alexey Zaytsev, Arseniy Gruzdev, Georgiy Ovchinnikov, Ksenia Antipova, Leyla Ismailova, Ekaterina Muravleva, Evgeny Burnaev, Artyom Semenikhin, Alexey Cherepanov, Vitaliy Koryabkin, Igor Simon, Alexey Tsurgan, Fedor Krasnov, Dmitry Koroteev

    Abstract: Directional oil well drilling requires high precision of the wellbore positioning inside the productive area. However, due to specifics of engineering design, sensors that explicitly determine the type of the drilled rock are located farther than 15m from the drilling bit. As a result, the target area runaways can be detected only after this distance, which in turn, leads to a loss in well product… ▽ More

    Submitted 25 March, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

  21. arXiv:1806.00195  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    Learning a Latent Space of Multitrack Measures

    Authors: Ian Simon, Adam Roberts, Colin Raffel, Jesse Engel, Curtis Hawthorne, Douglas Eck

    Abstract: Discovering and exploring the underlying structure of multi-instrumental music using learning-based approaches remains an open problem. We extend the recent MusicVAE model to represent multitrack polyphonic measures as vectors in a latent space. Our approach enables several useful operations such as generating plausible measures from scratch, interpolating between measures in a musically meaningfu… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  22. arXiv:1712.10326  [pdf, ps, other

    math.CA

    Strong convergence of two--dimensional Vilenkin-Fourier series

    Authors: N. Memiæ, I. Simon, G. Tephnadze

    Abstract: We prove that certain means of the quadratical partial sums of the two-dimensional Vilenkin-Fourier series are uniformly bounded operators from the Hardy space $H_{p}$ to the space $L_{p}$ for $0<p\leq 1.$ We also prove that the sequence in the denominator cannot be improved.

    Submitted 15 December, 2017; originally announced December 2017.

    MSC Class: 42C10

  23. arXiv:1710.11153  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Onsets and Frames: Dual-Objective Piano Transcription

    Authors: Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse Engel, Sageev Oore, Douglas Eck

    Abstract: We advance the state of the art in polyphonic piano music transcription by using a deep convolutional and recurrent neural network which is trained to jointly predict onsets and frames. Our model predicts pitch onset events and then uses those predictions to condition framewise pitch predictions. During inference, we restrict the predictions from the framewise detector by not allowing a new note t… ▽ More

    Submitted 5 June, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Examples available at https://goo.gl/magenta/onsets-frames-examples

  24. arXiv:1606.09092  [pdf, ps, other

    math.CA math.FA

    Density of the span of powers of a function à la Müntz-Szasz

    Authors: Philippe Jaming, Ilona Simon

    Abstract: The aim of this paper is to establish density properties in $L^p$ spaces of the span of powers of functions $\{ψ^λ\,:λ\inΛ\}$, $Λ\subset\N$ in the spirit of the Müntz-Szász Theorem. As density is almost never achieved, we further investigate the density of powers and a modulation of powers $\{ψ^λ,ψ^λe^{iαt}\,:λ\inΛ\}$. Finally, we establish a Müntz-Szász Theorem for density of translates of powers… ▽ More

    Submitted 29 June, 2016; originally announced June 2016.

  25. Self-regulating genes. Exact steady state solution by using Poisson Representation

    Authors: Istvan P. Sugar, Istvan Simon

    Abstract: Systems biology studies the structure and behavior of complex gene regulatory networks. One of its aims is to develop a quantitative understanding of the modular components that constitute such networks. The self-regulating gene is a type of auto regulatory genetic modules which appears in over 40% of known transcription factors in E. coli. In this work, using the technique of Poisson Representati… ▽ More

    Submitted 18 December, 2013; v1 submitted 13 December, 2013; originally announced December 2013.

    Comments: 10 pages, 2 figures, 1 table, 1 supplemental material (9 pages); additional reference to the work of Grima et al

  26. Mechanisms of B cell Synapse Formation Predicted by Stochastic Simulation

    Authors: Philippos K. Tsourkas, Nicole Baumgarth, Scott I. Simon, Subhadip Raychaudhuri

    Abstract: The clustering of B cell receptor (BCR) molecules and the formation of the protein segregation structure known as the immunological synapse appears to precede antigen (Ag) uptake by B cells. The mature B cell synapse is characterized by a central cluster of BCR/Ag molecular complexes surrounded by a ring of LFA-1/ICAM-1 complexes. Recent experimental evidence shows receptor clustering in B cells… ▽ More

    Submitted 19 October, 2006; v1 submitted 17 September, 2006; originally announced September 2006.

    Comments: 35 pages, 11 figures; Supplemental Materials added