Skip to main content

Showing 1–2 of 2 results for author: Hajimirsadegh, H

  1. arXiv:2410.01201  [pdf, other

    cs.LG cs.AI

    Were RNNs All We Needed?

    Authors: Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, Hossein Hajimirsadegh

    Abstract: The scalability limitations of Transformers regarding sequence length have renewed interest in recurrent sequence models that are parallelizable during training. As a result, many novel recurrent architectures, such as S4, Mamba, and Aaren, have been proposed that achieve comparable performance. In this work, we revisit traditional recurrent neural networks (RNNs) from over a decade ago: LSTMs (19… ▽ More

    Submitted 4 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  2. arXiv:2310.02473  [pdf, other

    cs.LG

    Prompting-based Temporal Domain Generalization

    Authors: Sepidehsadat Hosseini, Mengyao Zhai, Hossein Hajimirsadegh, Frederick Tung

    Abstract: Machine learning traditionally assumes that the training and testing data are distributed independently and identically. However, in many real-world settings, the data distribution can shift over time, leading to poor generalization of trained models in future time periods. This paper presents a novel prompting-based approach to temporal domain generalization that is parameter-efficient, time-effi… ▽ More

    Submitted 15 February, 2024; v1 submitted 3 October, 2023; originally announced October 2023.