subscribe to arXiv mailings

doi 10.1145/3627673.3680081

PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Authors: Azin Ghazimatin, Ekaterina Garmash, Gustavo Penha, Kristen Sheets, Martin Achenbach, Oguz Semerci, Remi Galvez, Marcus Tannenberg, Sahitya Mantravadi, Divya Narayanan, Ofeliya Kalaydzhyan, Douglas Cole, Ben Carterette, Ann Clifton, Paul N. Bennett, Claudia Hauff, Mounia Lalmas

Abstract: Listeners of long-form talk-audio content, such as podcast episodes, often find it challenging to understand the overall structure and locate relevant sections. A practical solution is to divide episodes into chapters--semantically coherent segments labeled with titles and timestamps. Since most episodes on our platform at Spotify currently lack creator-provided chapters, automating the creation o… ▽ More Listeners of long-form talk-audio content, such as podcast episodes, often find it challenging to understand the overall structure and locate relevant sections. A practical solution is to divide episodes into chapters--semantically coherent segments labeled with titles and timestamps. Since most episodes on our platform at Spotify currently lack creator-provided chapters, automating the creation of chapters is essential. Scaling the chapterization of podcast episodes presents unique challenges. First, episodes tend to be less structured than written texts, featuring spontaneous discussions with nuanced transitions. Second, the transcripts are usually lengthy, averaging about 16,000 tokens, which necessitates efficient processing that can preserve context. To address these challenges, we introduce PODTILE, a fine-tuned encoder-decoder transformer to segment conversational data. The model simultaneously generates chapter transitions and titles for the input transcript. To preserve context, each input text is augmented with global context, including the episode's title, description, and previous chapter titles. In our intrinsic evaluation, PODTILE achieved an 11% improvement in ROUGE score over the strongest baseline. Additionally, we provide insights into the practical benefits of auto-generated chapters for listeners navigating episode content. Our findings indicate that auto-generated chapters serve as a useful tool for engaging with less popular podcasts. Finally, we present empirical evidence that using chapter titles can enhance effectiveness of sparse retrieval in search tasks. △ Less

Submitted 21 October, 2024; originally announced October 2024.

Comments: 9 pages, 4 figures, CIKM industry track 2024

MSC Class: 68P20 ACM Class: H.3.3

arXiv:2209.11871 [pdf, ps, other]

doi 10.1007/978-3-031-42448-9_5

Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research

Authors: Ekaterina Garmash, Edgar Tanaka, Ann Clifton, Joana Correia, Sharmistha Jat, Winstead Zhu, Rosie Jones, Jussi Karlgren

Abstract: In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes. We give an overview of how the data was sampled, descriptive statistics over the collection, as well as information about the distribution over Brazilian and Portuguese dialects. We give results from experiments on multi-lingual summarization, showing that summarizing podcast transcri… ▽ More In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes. We give an overview of how the data was sampled, descriptive statistics over the collection, as well as information about the distribution over Brazilian and Portuguese dialects. We give results from experiments on multi-lingual summarization, showing that summarizing podcast transcripts can be performed well by a system supporting both English and Portuguese. We also show experiments on Portuguese podcast genre classification using text metadata. Combining this collection with previously released English-language collection opens up the potential for multi-modal, multi-lingual and multi-dialect podcast information access research. △ Less

Submitted 13 December, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

Comments: 12 pages, 1 figure

Journal ref: Volume 14163 of Lecture Notes in Computer Science, pages 48-59, Springer, 2023

arXiv:2109.06333 [pdf, other]

Connecting degree and polarity: An artificial language learning study

Authors: Lisa Bylinina, Alexey Tikhonov, Ekaterina Garmash

Abstract: We investigate a new linguistic generalization in pre-trained language models (taking BERT (Devlin et al., 2019) as a case study). We focus on degree modifiers (expressions like slightly, very, rather, extremely) and test the hypothesis that the degree expressed by a modifier (low, medium or high degree) is related to the modifier's sensitivity to sentence polarity (whether it shows preference for… ▽ More We investigate a new linguistic generalization in pre-trained language models (taking BERT (Devlin et al., 2019) as a case study). We focus on degree modifiers (expressions like slightly, very, rather, extremely) and test the hypothesis that the degree expressed by a modifier (low, medium or high degree) is related to the modifier's sensitivity to sentence polarity (whether it shows preference for affirmative or negative sentences or neither). To probe this connection, we apply the Artificial Language Learning experimental paradigm from psycholinguistics to a neural language model. Our experimental results suggest that BERT generalizes in line with existing linguistic observations that relate degree semantics to polarity sensitivity, including the main one: low degree semantics is associated with preference towards positive polarity. △ Less

Submitted 19 October, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

arXiv:1007.0488 [pdf, ps, other]

doi 10.1209/0295-5075/92/17007

Mechanisms of magnetoelectricity in manganese doped incipient ferroelectrics

Authors: R. O. Kuzian, V. V. Laguta, A. -M. Dare, I. V. Kondakova, M. Marysko, L. Raymond, E. P. Garmash, V. N. Pavlikov, A. Tkach, P. M. Vilarinho, R. Hayn

Abstract: We report magnetization measurements and magnetic resonance data for SrTiO3 doped by manganese. We show that the recently reported coexistent spin and dipole glass (multiglass) behaviours are strongly affected by the distribution of Mn ions between the Sr and Ti sites. Motivated by this finding we calculate the magnetic interactions between Mn impurities of different kinds. Both LSDA+U and many-bo… ▽ More We report magnetization measurements and magnetic resonance data for SrTiO3 doped by manganese. We show that the recently reported coexistent spin and dipole glass (multiglass) behaviours are strongly affected by the distribution of Mn ions between the Sr and Ti sites. Motivated by this finding we calculate the magnetic interactions between Mn impurities of different kinds. Both LSDA+U and many-body perturbation theory evidence that magnetic and magnetoelectric interactions are mediated by Mn$_B^{4+}$ ions substituting for Ti. We propose two microscopic magnetoelectric coupling mechanisms, which can be involved in all magnetoelectric systems based on incipient ferroelectrics. In the first one, the electric field modifies the spin susceptibility via spin-strain coupling of Mn$_{B}^{4+}$. The second mechanism concerns Mn pairs coupled by the position-dependent exchange interaction. △ Less

Submitted 27 September, 2010; v1 submitted 3 July, 2010; originally announced July 2010.

Comments: 6 pages, 2 figures; Revised version, accepted for publication in EPL. The physics underlying the proposed mechanisms is more detailed

Journal ref: EPL,vol. 92, p. 17007 (2010)

Showing 1–4 of 4 results for author: Garmash, E