Showing 1–6 of 6 results for author: Bylinina, L

  1. arXiv:2409.18868  [pdf, other]

    cs.CL cs.AI cs.LG

    Individuation in Neural Models with and without Visual Grounding

    Authors: Alexey Tikhonov, Lisa Bylinina, Ivan P. Yamshchikov

    Abstract: We show differences between a language-and-vision model CLIP and two text-only models - FastText and SBERT - when it comes to the encoding of individuation information. We study latent representations that CLIP provides for substrates, granular aggregates, and various numbers of objects. We demonstrate that CLIP embeddings capture quantitative differences in individuation better than models traine…

    Submitted 27 September, 2024; originally announced September 2024.

    ACM Class: I.2.4; J.4; I.6.8; I.2.10

  2. arXiv:2407.02136  [pdf, other]

    cs.CL

    Black Big Boxes: Do Language Models Hide a Theory of Adjective Order?

    Authors: Jaap Jumelet, Lisa Bylinina, Willem Zuidema, Jakub Szymanik

    Abstract: In English and other languages, multiple adjectives in a complex noun phrase show intricate ordering patterns that have been a target of much linguistic theory. These patterns offer an opportunity to assess the ability of language models (LMs) to learn subtle rules of language involving factors that cross the traditional divisions of syntax, semantics, and pragmatics. We review existing hypotheses…

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2311.01955  [pdf, other]

    cs.CL

    Too Much Information: Keeping Training Simple for BabyLMs

    Authors: Lukas Edman, Lisa Bylinina

    Abstract: This paper details the work of the University of Groningen for the BabyLM Challenge. We follow the idea that, like babies, language models should be introduced to simpler concepts first and build off of that knowledge to understand more complex concepts. We examine this strategy of simple-then-complex through a variety of lenses, namely context size, vocabulary, and overall linguistic complexity o…

    Submitted 3 November, 2023; originally announced November 2023.

  4. arXiv:2306.02348  [pdf, other]

    cs.CL

    Leverage Points in Modality Shifts: Comparing Language-only and Multimodal Word Representations

    Authors: Aleksey Tikhonov, Lisa Bylinina, Denis Paperno

    Abstract: Multimodal embeddings aim to enrich the semantic information in neural representations of language compared to text-only models. While different embeddings exhibit different applicability and performance on downstream tasks, little is known about the systematic representation differences attributed to the visual modality. Our paper compares word embeddings from three vision-and-language models (CL…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted for StarSEM 2023

  5. arXiv:2109.06333  [pdf, other]

    cs.CL

    Connecting degree and polarity: An artificial language learning study

    Authors: Lisa Bylinina, Alexey Tikhonov, Ekaterina Garmash

    Abstract: We investigate a new linguistic generalization in pre-trained language models (taking BERT (Devlin et al., 2019) as a case study). We focus on degree modifiers (expressions like slightly, very, rather, extremely) and test the hypothesis that the degree expressed by a modifier (low, medium or high degree) is related to the modifier's sensitivity to sentence polarity (whether it shows preference for…

    Submitted 19 October, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

  6. arXiv:2109.03926  [pdf, other]

    cs.CL

    Transformers in the loop: Polarity in neural models of language

    Authors: Lisa Bylinina, Alexey Tikhonov

    Abstract: Representation of linguistic phenomena in computational language models is typically assessed against the predictions of existing linguistic theories of these phenomena. Using the notion of polarity as a case study, we show that this is not always the most adequate set-up. We probe polarity via so-called 'negative polarity items' (in particular, English 'any') in two pre-trained Transformer-based…

    Submitted 17 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to ACL 2022 main conference