-
A Unified Theory of Jetted Tidal Disruption Events: From Promptly Escaping Relativistic to Delayed Transrelativistic Jets
Authors:
Odelia Teboul,
Brian D. Metzger
Abstract:
Only a tiny fraction ~ 1% of stellar tidal disruption events (TDE) generate powerful relativistic jets evidenced by luminous hard X-ray and radio emissions. We propose that a key property responsible for both this surprisingly low rate and a variety of other observations is the typically large misalignment ψ between the orbital plane of the star and the spin axis of the supermassive black hole (SM…
▽ More
Only a tiny fraction ~ 1% of stellar tidal disruption events (TDE) generate powerful relativistic jets evidenced by luminous hard X-ray and radio emissions. We propose that a key property responsible for both this surprisingly low rate and a variety of other observations is the typically large misalignment ψ between the orbital plane of the star and the spin axis of the supermassive black hole (SMBH). Such misaligned disk/jet systems undergo Lense-Thirring precession together about the SMBH spin axis. We find that TDE disks precess sufficiently rapidly that winds from the accretion disk will encase the system on large scales in a quasi-spherical outflow. We derive the critical jet efficiency η > ηcrit for both aligned and misaligned precessing jets to successfully escape from the disk-wind ejecta. As ηcrit is higher for precessing jets, less powerful jets only escape after alignment with the SMBH spin. Alignment can occur through magneto-spin or hydrodynamic mechanisms, which we estimate occur on typical timescales of weeks and years, respectively. The dominant mechanism depends on η and the orbital penetration factor \b{eta}. Hence depending only on intrinsic parameters of the event {ψ,η,\b{eta}}, we propose that each TDE jet can either escape prior to alignment, thus exhibiting erratic X-ray light curve and two-component radio afterglow (e.g., Swift J1644+57) or escape after alignment. Relatively rapid magneto-spin alignments produce relativistic jets exhibiting X-ray power-law decay and bright afterglows (e.g., AT2022cmc), while long hydrodynamic alignments give rise to late jet escape and delayed radio flares (e.g., AT2018hyz).
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Loss Cone Shielding
Authors:
Odelia Teboul,
Nicholas C. Stone,
Jeremiah P. Ostriker
Abstract:
A star wandering close enough to a massive black hole (MBH) can be ripped apart by the tidal forces of the black hole. The advent of wide-field surveys at many wavelengths has quickly increased the number of tidal disruption events (TDEs) observed, and has revealed that i) observed TDE rates are lower than theoretical predictions and ii) E+A galaxies are significantly overrepresented. This overrep…
▽ More
A star wandering close enough to a massive black hole (MBH) can be ripped apart by the tidal forces of the black hole. The advent of wide-field surveys at many wavelengths has quickly increased the number of tidal disruption events (TDEs) observed, and has revealed that i) observed TDE rates are lower than theoretical predictions and ii) E+A galaxies are significantly overrepresented. This overrepresentation further worsens the tension between observed and theoretically predicted TDEs for non-E+A galaxies. Classical loss cone theory focuses on the cumulative effect of many weak scatterings. However, a strong scattering can remove a star from the distribution before it can get tidally disrupted. Most stars undergoing TDEs come from within the radius of influence, the densest environments of the universe. In such environments, close encounters rare elsewhere become non-negligible. We revise the standard loss cone theory to take into account classical two-body interactions as well as strong scattering, collisions, tidal captures, and study under which conditions close encounters can shield the loss cone. We i) analytically derive the impact of strong scattering and other close encounters, ii) compute time-dependent loss cone dynamics including both weak and strong encounters, and iii) derive analytical solutions to the Fokker-Planck equation with strong scattering. We find that i) TDE rates can be reduced to up to an order of magnitude and ii) strong shielding preferentially reduces deeply plunging stars. We also show that stellar overdensities, one possible explanation for the E+A preference, can fail to increase TDE rates when taking into account strong scattering.
△ Less
Submitted 26 October, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
AudioLM: a Language Modeling Approach to Audio Generation
Authors:
Zalán Borsos,
Raphaël Marinier,
Damien Vincent,
Eugene Kharitonov,
Olivier Pietquin,
Matt Sharifi,
Dominik Roblek,
Olivier Teboul,
David Grangier,
Marco Tagliasacchi,
Neil Zeghidour
Abstract:
We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in this representation space. We show how existing audio tokenizers provide different trade-offs between reconstruction quality and long-term structure, and we propose a hybrid tokenizati…
▽ More
We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in this representation space. We show how existing audio tokenizers provide different trade-offs between reconstruction quality and long-term structure, and we propose a hybrid tokenization scheme to achieve both objectives. Namely, we leverage the discretized activations of a masked language model pre-trained on audio to capture long-term structure and the discrete codes produced by a neural audio codec to achieve high-quality synthesis. By training on large corpora of raw audio waveforms, AudioLM learns to generate natural and coherent continuations given short prompts. When trained on speech, and without any transcript or annotation, AudioLM generates syntactically and semantically plausible speech continuations while also maintaining speaker identity and prosody for unseen speakers. Furthermore, we demonstrate how our approach extends beyond speech by generating coherent piano music continuations, despite being trained without any symbolic representation of music.
△ Less
Submitted 25 July, 2023; v1 submitted 7 September, 2022;
originally announced September 2022.
-
Learning strides in convolutional neural networks
Authors:
Rachid Riad,
Olivier Teboul,
David Grangier,
Neil Zeghidour
Abstract:
Convolutional neural networks typically contain several downsampling operators, such as strided convolutions or pooling layers, that progressively reduce the resolution of intermediate representations. This provides some shift-invariance while reducing the computational complexity of the whole architecture. A critical hyperparameter of such layers is their stride: the integer factor of downsamplin…
▽ More
Convolutional neural networks typically contain several downsampling operators, such as strided convolutions or pooling layers, that progressively reduce the resolution of intermediate representations. This provides some shift-invariance while reducing the computational complexity of the whole architecture. A critical hyperparameter of such layers is their stride: the integer factor of downsampling. As strides are not differentiable, finding the best configuration either requires cross-validation or discrete optimization (e.g. architecture search), which rapidly become prohibitive as the search space grows exponentially with the number of downsampling layers. Hence, exploring this search space by gradient descent would allow finding better configurations at a lower computational cost. This work introduces DiffStride, the first downsampling layer with learnable strides. Our layer learns the size of a cropping mask in the Fourier domain, that effectively performs resizing in a differentiable way. Experiments on audio and image classification show the generality and effectiveness of our solution: we use DiffStride as a drop-in replacement to standard downsampling layers and outperform them. In particular, we show that introducing our layer into a ResNet-18 architecture allows keeping consistent high performance on CIFAR10, CIFAR100 and ImageNet even when training starts from poor random stride configurations. Moreover, formulating strides as learnable variables allows us to introduce a regularization term that controls the computational complexity of the architecture. We show how this regularization allows trading off accuracy for efficiency on ImageNet.
△ Less
Submitted 3 February, 2022;
originally announced February 2022.
-
Optimal Transport Tools (OTT): A JAX Toolbox for all things Wasserstein
Authors:
Marco Cuturi,
Laetitia Meng-Papaxanthos,
Yingtao Tian,
Charlotte Bunne,
Geoff Davis,
Olivier Teboul
Abstract:
Optimal transport tools (OTT-JAX) is a Python toolbox that can solve optimal transport problems between point clouds and histograms. The toolbox builds on various JAX features, such as automatic and custom reverse mode differentiation, vectorization, just-in-time compilation and accelerators support. The toolbox covers elementary computations, such as the resolution of the regularized OT problem,…
▽ More
Optimal transport tools (OTT-JAX) is a Python toolbox that can solve optimal transport problems between point clouds and histograms. The toolbox builds on various JAX features, such as automatic and custom reverse mode differentiation, vectorization, just-in-time compilation and accelerators support. The toolbox covers elementary computations, such as the resolution of the regularized OT problem, and more advanced extensions, such as barycenters, Gromov-Wasserstein, low-rank solvers, estimation of convex maps, differentiable generalizations of quantiles and ranks, and approximate OT between Gaussian mixtures. The toolbox code is available at \texttt{https://github.com/ott-jax/ott}
△ Less
Submitted 28 January, 2022;
originally announced January 2022.
-
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding
Authors:
Neil Zeghidour,
Olivier Teboul,
David Grangier
Abstract:
We introduce DIVE, an end-to-end speaker diarization algorithm. Our neural algorithm presents the diarization task as an iterative process: it repeatedly builds a representation for each speaker before predicting the voice activity of each speaker conditioned on the extracted representations. This strategy intrinsically resolves the speaker ordering ambiguity without requiring the classical permut…
▽ More
We introduce DIVE, an end-to-end speaker diarization algorithm. Our neural algorithm presents the diarization task as an iterative process: it repeatedly builds a representation for each speaker before predicting the voice activity of each speaker conditioned on the extracted representations. This strategy intrinsically resolves the speaker ordering ambiguity without requiring the classical permutation invariant training loss. In contrast with prior work, our model does not rely on pretrained speaker representations and optimizes all parameters of the system with a multi-speaker voice activity loss. Importantly, our loss explicitly excludes unreliable speaker turn boundaries from training, which is adapted to the standard collar-based Diarization Error Rate (DER) evaluation. Overall, these contributions yield a system redefining the state-of-the-art on the standard CALLHOME benchmark, with 6.7% DER compared to 7.8% for the best alternative.
△ Less
Submitted 28 May, 2021;
originally announced May 2021.
-
Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking
Authors:
Andrew N Carr,
Quentin Berthet,
Mathieu Blondel,
Olivier Teboul,
Neil Zeghidour
Abstract:
Self-supervised pre-training using so-called "pretext" tasks has recently shown impressive performance across a wide range of modalities. In this work, we advance self-supervised learning from permutations, by pre-training a model to reorder shuffled parts of the spectrogram of an audio signal, to improve downstream classification performance. We make two main contributions. First, we overcome the…
▽ More
Self-supervised pre-training using so-called "pretext" tasks has recently shown impressive performance across a wide range of modalities. In this work, we advance self-supervised learning from permutations, by pre-training a model to reorder shuffled parts of the spectrogram of an audio signal, to improve downstream classification performance. We make two main contributions. First, we overcome the main challenges of integrating permutation inversions into an end-to-end training scheme, using recent advances in differentiable ranking. This was heretofore sidestepped by casting the reordering task as classification, fundamentally reducing the space of permutations that can be exploited. Our experiments validate that learning from all possible permutations improves the quality of the pre-trained representations over using a limited, fixed set. Second, we show that inverting permutations is a meaningful pretext task for learning audio representations in an unsupervised fashion. In particular, we improve instrument classification and pitch estimation of musical notes by reordering spectrogram patches in the time-frequency space.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
LEAF: A Learnable Frontend for Audio Classification
Authors:
Neil Zeghidour,
Olivier Teboul,
Félix de Chaumont Quitry,
Marco Tagliasacchi
Abstract:
Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limitations of handmade representations. In this work we show that we can train a single learnable frontend that outperforms mel-filterbanks on a wide range of audio…
▽ More
Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limitations of handmade representations. In this work we show that we can train a single learnable frontend that outperforms mel-filterbanks on a wide range of audio signals, including speech, music, audio events and animal sounds, providing a general-purpose learned frontend for audio classification. To do so, we introduce a new principled, lightweight, fully learnable architecture that can be used as a drop-in replacement of mel-filterbanks. Our system learns all operations of audio features extraction, from filtering to pooling, compression and normalization, and can be integrated into any neural network at a negligible parameter cost. We perform multi-task training on eight diverse audio classification tasks, and show consistent improvements of our model over mel-filterbanks and previous learnable alternatives. Moreover, our system outperforms the current state-of-the-art learnable frontend on Audioset, with orders of magnitude fewer parameters.
△ Less
Submitted 21 January, 2021;
originally announced January 2021.
-
Impact of the ISM magnetic field on GRB afterglow polarization
Authors:
O. Teboul,
N. Shaviv
Abstract:
Linear polarization has been measured in several GRB afterglows. After a few days, polarization arises from the forward shock emission which depends on the post-shock magnetic field. The latter can originate both from compression of existing fields, here the ISM magnetic field, and from shock generated instabilities. For short GRBs, previous modelling of the polarization arising from the forward s…
▽ More
Linear polarization has been measured in several GRB afterglows. After a few days, polarization arises from the forward shock emission which depends on the post-shock magnetic field. The latter can originate both from compression of existing fields, here the ISM magnetic field, and from shock generated instabilities. For short GRBs, previous modelling of the polarization arising from the forward shock considered a random field fully or partially confined to the shock plane. However, the ISM magnetic field likely consists of both random and ordered components. Here we study the impact of a more realistic magnetic field having both ordered and random components. We present our semi-analytical model and compute polarization curves arising for different magnetic field configurations. We find that the presence of an ordered component, even significantly weaker than the random one, has distinct signatures that could be detectable. In the presence of an ordered component not in the observer plane, we show that: i) for an observer inside the jet, the polarization angle $θ_p$ either remains constant during all the afterglow phase or exhibits variations smaller than the 90\textdegree swing expected from a random component solely, ii) for an observer outside the jet, the polarization angle evolves from $θ_p^{\max}$, before the jet break to its opposite after the jet break. We also find that the upper limit polarization for GRB170817 requires a random field not fully confined to the shock plane and is compatible with an ordered component as large as half the random one.
△ Less
Submitted 20 October, 2021; v1 submitted 24 August, 2020;
originally announced August 2020.
-
Noisy Adaptive Group Testing using Bayesian Sequential Experimental Design
Authors:
Marco Cuturi,
Olivier Teboul,
Quentin Berthet,
Arnaud Doucet,
Jean-Philippe Vert
Abstract:
When the infection prevalence of a disease is low, Dorfman showed 80 years ago that testing groups of people can prove more efficient than testing people individually. Our goal in this paper is to propose new group testing algorithms that can operate in a noisy setting (tests can be mistaken) to decide adaptively (looking at past results) which groups to test next, with the goal to converge to a g…
▽ More
When the infection prevalence of a disease is low, Dorfman showed 80 years ago that testing groups of people can prove more efficient than testing people individually. Our goal in this paper is to propose new group testing algorithms that can operate in a noisy setting (tests can be mistaken) to decide adaptively (looking at past results) which groups to test next, with the goal to converge to a good detection, as quickly, and with as few tests as possible. We cast this problem as a Bayesian sequential experimental design problem. Using the posterior distribution of infection status vectors for $n$ patients, given observed tests carried out so far, we seek to form groups that have a maximal utility. We consider utilities such as mutual information, but also quantities that have a more direct relevance to testing, such as the AUC of the ROC curve of the test. Practically, the posterior distributions on $\{0,1\}^n$ are approximated by sequential Monte Carlo (SMC) samplers and the utility maximized by a greedy optimizer. Our procedures show in simulations significant improvements over both adaptive and non-adaptive baselines, and are far more efficient than individual tests when disease prevalence is low. Additionally, we show empirically that loopy belief propagation (LBP), widely regarded as the SoTA decoder to decide whether an individual is infected or not given previous tests, can be unreliable and exhibit oscillatory behavior. Our SMC decoder is more reliable, and can improve the performance of other group testing algorithms.
△ Less
Submitted 22 July, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
Fast Differentiable Sorting and Ranking
Authors:
Mathieu Blondel,
Olivier Teboul,
Quentin Berthet,
Josip Djolonga
Abstract:
The sorting operation is one of the most commonly used building blocks in computer programming. In machine learning, it is often used for robust statistics. However, seen as a function, it is piecewise linear and as a result includes many kinks where it is non-differentiable. More problematic is the related ranking operator, often used for order statistics and ranking metrics. It is a piecewise co…
▽ More
The sorting operation is one of the most commonly used building blocks in computer programming. In machine learning, it is often used for robust statistics. However, seen as a function, it is piecewise linear and as a result includes many kinks where it is non-differentiable. More problematic is the related ranking operator, often used for order statistics and ranking metrics. It is a piecewise constant function, meaning that its derivatives are null or undefined. While numerous works have proposed differentiable proxies to sorting and ranking, they do not achieve the $O(n \log n)$ time complexity one would expect from sorting and ranking operations. In this paper, we propose the first differentiable sorting and ranking operators with $O(n \log n)$ time and $O(n)$ space complexity. Our proposal in addition enjoys exact computation and differentiation. We achieve this feat by constructing differentiable operators as projections onto the permutahedron, the convex hull of permutations, and using a reduction to isotonic optimization. Empirically, we confirm that our approach is an order of magnitude faster than existing approaches and showcase two novel applications: differentiable Spearman's rank correlation coefficient and least trimmed squares.
△ Less
Submitted 29 June, 2020; v1 submitted 20 February, 2020;
originally announced February 2020.
-
Learning with Differentiable Perturbed Optimizers
Authors:
Quentin Berthet,
Mathieu Blondel,
Olivier Teboul,
Marco Cuturi,
Jean-Philippe Vert,
Francis Bach
Abstract:
Machine learning pipelines often rely on optimization procedures to make discrete decisions (e.g., sorting, picking closest neighbors, or shortest paths). Although these discrete decisions are easily computed, they break the back-propagation of computational graphs. In order to expand the scope of learning problems that can be solved in an end-to-end fashion, we propose a systematic method to tran…
▽ More
Machine learning pipelines often rely on optimization procedures to make discrete decisions (e.g., sorting, picking closest neighbors, or shortest paths). Although these discrete decisions are easily computed, they break the back-propagation of computational graphs. In order to expand the scope of learning problems that can be solved in an end-to-end fashion, we propose a systematic method to transform optimizers into operations that are differentiable and never locally constant. Our approach relies on stochastically perturbed optimizers, and can be used readily together with existing solvers. Their derivatives can be evaluated efficiently, and smoothness tuned via the chosen noise amplitude. We also show how this framework can be connected to a family of losses developed in structured prediction, and give theoretical guarantees for their use in learning tasks. We demonstrate experimentally the performance of our approach on various tasks.
△ Less
Submitted 9 June, 2020; v1 submitted 20 February, 2020;
originally announced February 2020.
-
Supervised Quantile Normalization for Low-rank Matrix Approximation
Authors:
Marco Cuturi,
Olivier Teboul,
Jonathan Niles-Weed,
Jean-Philippe Vert
Abstract:
Low rank matrix factorization is a fundamental building block in machine learning, used for instance to summarize gene expression profile data or word-document counts. To be robust to outliers and differences in scale across features, a matrix factorization step is usually preceded by ad-hoc feature normalization steps, such as \texttt{tf-idf} scaling or data whitening. We propose in this work to…
▽ More
Low rank matrix factorization is a fundamental building block in machine learning, used for instance to summarize gene expression profile data or word-document counts. To be robust to outliers and differences in scale across features, a matrix factorization step is usually preceded by ad-hoc feature normalization steps, such as \texttt{tf-idf} scaling or data whitening. We propose in this work to learn these normalization operators jointly with the factorization itself. More precisely, given a $d\times n$ matrix $X$ of $d$ features measured on $n$ individuals, we propose to learn the parameters of quantile normalization operators that can operate row-wise on the values of $X$ and/or of its factorization $UV$ to improve the quality of the low-rank representation of $X$ itself. This optimization is facilitated by the introduction of a new differentiable quantile normalization operator built using optimal transport, providing new results on top of existing work by (Cuturi et al. 2019). We demonstrate the applicability of these techniques on synthetic and genomics datasets.
△ Less
Submitted 3 July, 2020; v1 submitted 8 February, 2020;
originally announced February 2020.
-
MULEX: Disentangling Exploitation from Exploration in Deep RL
Authors:
Lucas Beyer,
Damien Vincent,
Olivier Teboul,
Sylvain Gelly,
Matthieu Geist,
Olivier Pietquin
Abstract:
An agent learning through interactions should balance its action selection process between probing the environment to discover new rewards and using the information acquired in the past to adopt useful behaviour. This trade-off is usually obtained by perturbing either the agent's actions (e.g., e-greedy or Gibbs sampling) or the agent's parameters (e.g., NoisyNet), or by modifying the reward it re…
▽ More
An agent learning through interactions should balance its action selection process between probing the environment to discover new rewards and using the information acquired in the past to adopt useful behaviour. This trade-off is usually obtained by perturbing either the agent's actions (e.g., e-greedy or Gibbs sampling) or the agent's parameters (e.g., NoisyNet), or by modifying the reward it receives (e.g., exploration bonus, intrinsic motivation, or hand-shaped rewards). Here, we adopt a disruptive but simple and generic perspective, where we explicitly disentangle exploration and exploitation. Different losses are optimized in parallel, one of them coming from the true objective (maximizing cumulative rewards from the environment) and others being related to exploration. Every loss is used in turn to learn a policy that generates transitions, all shared in a single replay buffer. Off-policy methods are then applied to these transitions to optimize each loss. We showcase our approach on a hard-exploration environment, show its sample-efficiency and robustness, and discuss further implications.
△ Less
Submitted 1 July, 2019;
originally announced July 2019.
-
Differentiable Ranks and Sorting using Optimal Transport
Authors:
Marco Cuturi,
Olivier Teboul,
Jean-Philippe Vert
Abstract:
Sorting an array is a fundamental routine in machine learning, one that is used to compute rank-based statistics, cumulative distribution functions (CDFs), quantiles, or to select closest neighbors and labels. The sorting function is however piece-wise constant (the sorting permutation of a vector does not change if the entries of that vector are infinitesimally perturbed) and therefore has no gra…
▽ More
Sorting an array is a fundamental routine in machine learning, one that is used to compute rank-based statistics, cumulative distribution functions (CDFs), quantiles, or to select closest neighbors and labels. The sorting function is however piece-wise constant (the sorting permutation of a vector does not change if the entries of that vector are infinitesimally perturbed) and therefore has no gradient information to back-propagate. We propose a framework to sort elements that is algorithmically differentiable. We leverage the fact that sorting can be seen as a particular instance of the optimal transport (OT) problem on $\mathbb{R}$, from input values to a predefined array of sorted values (e.g. $1,2,\dots,n$ if the input array has $n$ elements). Building upon this link , we propose generalized CDFs and quantile operators by varying the size and weights of the target presorted array. Because this amounts to using the so-called Kantorovich formulation of OT, we call these quantities K-sorts, K-CDFs and K-quantiles. We recover differentiable algorithms by adding to the OT problem an entropic regularization, and approximate it using a few Sinkhorn iterations. We call these operators S-sorts, S-CDFs and S-quantiles, and use them in various learning settings: we benchmark them against the recently proposed neuralsort [Grover et al. 2019], propose applications to quantile regression and introduce differentiable formulations of the top-k accuracy that deliver state-of-the art performance.
△ Less
Submitted 2 November, 2019; v1 submitted 28 May, 2019;
originally announced May 2019.
-
Emission of cocoon afterglow for short Gamma Ray Burst : a counterpart of gravitational waves?
Authors:
O Teboul,
T Piran
Abstract:
The three gravitational wave events detected by LIGO are opening a new era for high- energy astrophysics. Nevertheless, location of such events remain unknown. A promising solution to the localization problem is to find an electromagnetic (EM) counterpart of GW- generating events, such as binary neutron star mergers (BNS). Indeed, their GW emission will be above sensitivity threshold in the near f…
▽ More
The three gravitational wave events detected by LIGO are opening a new era for high- energy astrophysics. Nevertheless, location of such events remain unknown. A promising solution to the localization problem is to find an electromagnetic (EM) counterpart of GW- generating events, such as binary neutron star mergers (BNS). Indeed, their GW emission will be above sensitivity threshold in the near future. BNS are also considered as short Gamma Ray Bursts (sGRB) progenitors. However, sGRB are highly beamed. In this study, we will therefore focus on another EM counterpart candidate: the cocoon afterglow. The propagation of the GRB jet inside the matter ejected by the BNS produces a cocoon. Then, similarly to the GRB afterglow, a cocoon afterglow is produced, but with a mildly relativistic velocity. Firstly, we propose a model that gives the full hydrodynamic evolution of the cocoon including the mildly relativistic regime. Then we calculate the cocoon afterglow emission in X-ray, optical and radio wavelengths. Finally, we compare the cocoon afterglow emission to the GRB afterglow emission and conclude that the cocoon afterglow is a promising EM counterpart.
△ Less
Submitted 16 October, 2017;
originally announced October 2017.