Skip to main content

Showing 1–25 of 25 results for author: Eisenschlos, J M

  1. arXiv:2410.04739  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    TableRAG: Million-Token Table Understanding with Language Models

    Authors: Si-An Chen, Lesly Miculicich, Julian Martin Eisenschlos, Zifeng Wang, Zilong Wang, Yanfei Chen, Yasuhisa Fujii, Hsuan-Tien Lin, Chen-Yu Lee, Tomas Pfister

    Abstract: Recent advancements in language models (LMs) have notably enhanced their ability to reason with tabular data, primarily through program-aided mechanisms that manipulate and analyze tables. However, these methods often require the entire table as input, leading to scalability challenges due to the positional bias or context length constraints. In response to these challenges, we introduce TableRAG,… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted to NeurIPS 2024

  2. arXiv:2406.00980  [pdf, other

    cs.CL cs.CV

    Selectively Answering Visual Questions

    Authors: Julian Martin Eisenschlos, Hernán Maina, Guido Ivetta, Luciana Benotti

    Abstract: Recently, large multi-modal models (LMMs) have emerged with the capacity to perform vision tasks such as captioning and visual question answering (VQA) with unprecedented accuracy. Applications such as helping the blind or visually impaired have a critical need for precise answers. It is specially important for models to be well calibrated and be able to quantify their uncertainty in order to sele… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To be published in the findings of the 2024 Annual Meeting of the Association for Computational Linguistics

  3. arXiv:2405.19094  [pdf, other

    cs.CL cs.AI

    Faithful Chart Summarization with ChaTS-Pi

    Authors: Syrine Krichene, Francesco Piccinno, Fangyu Liu, Julian Martin Eisenschlos

    Abstract: Chart-to-summary generation can help explore data, communicate insights, and help the visually impaired people. Multi-modal generative models have been used to produce fluent summaries, but they can suffer from factual and perceptual errors. In this work we present CHATS-CRITIC, a reference-free chart summarization metric for scoring faithfulness. CHATS-CRITIC is composed of an image-to-text model… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: To be published in the proceedings of the 2024 Annual Meeting of the Association for Computational Linguistics

  4. arXiv:2405.07765  [pdf, other

    cs.CL

    TANQ: An open domain dataset of table answered questions

    Authors: Mubashara Akhtar, Chenxi Pang, Andreea Marzoca, Yasemin Altun, Julian Martin Eisenschlos

    Abstract: Language models, potentially augmented with tool usage such as retrieval are becoming the go-to means of answering questions. Understanding and answering questions in real-world settings often requires retrieving information from different sources, processing and aggregating data to extract insights, and presenting complex findings in form of structured artifacts such as novel tables, charts, or i… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages

  5. arXiv:2401.04398  [pdf, other

    cs.CL

    Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

    Authors: Zilong Wang, Hao Zhang, Chun-Liang Li, Julian Martin Eisenschlos, Vincent Perot, Zifeng Wang, Lesly Miculicich, Yasuhisa Fujii, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

    Abstract: Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning requires the extraction of underlying semantics from both free-form questions and semi-structured tabular data. Chain-of-Thought and its similar approaches inco… ▽ More

    Submitted 18 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  6. arXiv:2305.14926  [pdf, other

    cs.CL cs.AI cs.LG

    Universal Self-Adaptive Prompting

    Authors: Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Hanjun Dai, Julian Martin Eisenschlos, Sercan O. Arik, Tomas Pfister

    Abstract: A hallmark of modern large language models (LLMs) is their impressive general zero-shot and few-shot abilities, often elicited through in-context learning (ICL) via prompting. However, while highly coveted and being the most general, zero-shot performances in LLMs are still typically weaker due to the lack of guidance and the difficulty of applying existing automatic prompt design methods in gener… ▽ More

    Submitted 20 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (Main). 10 pages, 5 figures, 4 tables (26 pages, 9 figures and 13 tables including references and appendices)

  7. arXiv:2305.14613  [pdf, other

    cs.CL cs.AI

    Selectively Answering Ambiguous Questions

    Authors: Jeremy R. Cole, Michael J. Q. Zhang, Daniel Gillick, Julian Martin Eisenschlos, Bhuwan Dhingra, Jacob Eisenstein

    Abstract: Trustworthy language models should abstain from answering questions when they do not know the answer. However, the answer to a question can be unknown for a variety of reasons. Prior research has focused on the case in which the question is clear and the answer is unambiguous but possibly unknown, but the answer to a question can also be unclear due to uncertainty of the questioner's intent or con… ▽ More

    Submitted 14 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear in EMNLP 2023. 9 pages, 5 figures, 2 pages of appendix

  8. arXiv:2303.00242  [pdf, other

    cs.CL

    DIFFQG: Generating Questions to Summarize Factual Changes

    Authors: Jeremy R. Cole, Palak Jain, Julian Martin Eisenschlos, Michael J. Q. Zhang, Eunsol Choi, Bhuwan Dhingra

    Abstract: Identifying the difference between two versions of the same article is useful to update knowledge bases and to understand how articles evolve. Paired texts occur naturally in diverse situations: reporters write similar news stories and maintainers of authoritative websites must keep their information up to date. We propose representing factual changes between paired documents as question-answer pa… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 14 pages. Accepted at EACL 2023 (main, long)

  9. arXiv:2212.10505  [pdf, other

    cs.CL cs.AI cs.CV

    DePlot: One-shot visual language reasoning by plot-to-table translation

    Authors: Fangyu Liu, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Wenhu Chen, Nigel Collier, Yasemin Altun

    Abstract: Visual language such as charts and plots is ubiquitous in the human world. Comprehending plots and charts requires strong reasoning skills. Prior state-of-the-art (SOTA) models require at least tens of thousands of training examples and their reasoning capabilities are still much limited, especially on complex human-written queries. This paper presents the first one-shot solution to visual languag… ▽ More

    Submitted 23 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (Findings)

  10. arXiv:2212.09662  [pdf, other

    cs.CL cs.AI cs.CV

    MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering

    Authors: Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Yasemin Altun, Nigel Collier, Julian Martin Eisenschlos

    Abstract: Visual language data such as plots, charts, and infographics are ubiquitous in the human world. However, state-of-the-art vision-language models do not perform well on these data. We propose MatCha (Math reasoning and Chart derendering pretraining) to enhance visual language models' capabilities in jointly modeling charts/plots and language data. Specifically, we propose several pretraining tasks… ▽ More

    Submitted 23 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  11. arXiv:2211.12641  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Leveraging Data Recasting to Enhance Tabular Reasoning

    Authors: Aashna Jena, Vivek Gupta, Manish Shrivastava, Julian Martin Eisenschlos

    Abstract: Creating challenging tabular inference data is essential for learning complex reasoning. Prior work has mostly relied on two data generation strategies. The first is human annotation, which yields linguistically diverse data but is difficult to scale. The second category for creation is synthetic generation, which is scalable and cost effective but lacks inventiveness. In this research, we present… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 14 pages, 10 tables, 3 figues, EMNLP 2022 (Findings)

  12. arXiv:2210.09162  [pdf, other

    cs.CL cs.LG

    Table-To-Text generation and pre-training with TabT5

    Authors: Ewa Andrejczuk, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Yasemin Altun

    Abstract: Encoder-only transformer models have been successfully applied to different table understanding tasks, as in TAPAS (Herzig et al., 2020). A major limitation of these architectures is that they are constrained to classification-like tasks such as cell selection or entailment detection. We present TABT5, an encoder-decoder model that generates natural language text based on tables and textual inputs… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022

  13. arXiv:2210.07993  [pdf, other

    cs.CL

    MiQA: A Benchmark for Inference on Metaphorical Questions

    Authors: Iulia-Maria Comsa, Julian Martin Eisenschlos, Srini Narayanan

    Abstract: We propose a benchmark to assess the capability of large language models to reason with conventional metaphors. Our benchmark combines the previously isolated topics of metaphor detection and commonsense reasoning into a single task that requires a model to make inferences by accurately selecting between the literal and metaphorical register. We examine the performance of state-of-the-art pre-trai… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022 conference paper

  14. arXiv:2209.12786  [pdf, other

    cs.CL cs.AI

    Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour

    Authors: Fangyu Liu, Julian Martin Eisenschlos, Jeremy R. Cole, Nigel Collier

    Abstract: Language models (LMs) trained on raw texts have no direct access to the physical world. Gordon and Van Durme (2013) point out that LMs can thus suffer from reporting bias: texts rarely report on common facts, instead focusing on the unusual aspects of a situation. If LMs are only trained on text corpora and naively memorise local co-occurrence statistics, they thus naturally would learn a biased v… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: AACL 2022

  15. arXiv:2209.12153  [pdf, other

    cs.CL cs.AI

    WinoDict: Probing language models for in-context word acquisition

    Authors: Julian Martin Eisenschlos, Jeremy R. Cole, Fangyu Liu, William W. Cohen

    Abstract: We introduce a new in-context learning paradigm to measure Large Language Models' (LLMs) ability to learn novel words during inference. In particular, we rewrite Winograd-style co-reference resolution problems by replacing the key concept word with a synthetic but plausible word that the model must understand to complete the task. Solving this task requires the model to make use of the dictionary… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

  16. arXiv:2109.04312  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    MATE: Multi-view Attention for Table Transformer Efficiency

    Authors: Julian Martin Eisenschlos, Maharshi Gor, Thomas Müller, William W. Cohen

    Abstract: This work presents a sparse-attention Transformer architecture for modeling documents that contain large tables. Tables are ubiquitous on the web, and are rich in information. However, more than 20% of relational tables on the web have 20 or more rows (Cafarella et al., 2008), and these large tables present a challenge for current Transformer models, which are typically limited to 512 tokens. Here… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

  17. Time-Aware Language Models as Temporal Knowledge Bases

    Authors: Bhuwan Dhingra, Jeremy R. Cole, Julian Martin Eisenschlos, Daniel Gillick, Jacob Eisenstein, William W. Cohen

    Abstract: Many facts come with an expiration date, from the name of the President to the basketball team Lebron James plays for. But language models (LMs) are trained on snapshots of data collected at a specific moment in time, and this can limit their utility, especially in the closed-book setting where the pretraining corpus must contain the facts the model should memorize. We introduce a diagnostic datas… ▽ More

    Submitted 23 April, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: Version accepted to TACL

    Journal ref: Transactions of the Association for Computational Linguistics 2022; 10 257-273

  18. arXiv:2106.00479  [pdf, other

    cs.CL

    DoT: An efficient Double Transformer for NLP tasks with tables

    Authors: Syrine Krichene, Thomas Müller, Julian Martin Eisenschlos

    Abstract: Transformer-based approaches have been successfully used to obtain state-of-the-art accuracy on natural language processing (NLP) tasks with semi-structured tables. These model architectures are typically deep, resulting in slow training and inference, especially for long inputs. To improve efficiency while maintaining a high accuracy, we propose a new architecture, DoT, a double transformer model… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 11 pages, 4 figures, to be published in Findings of ACL-IJCNLP 2021

    MSC Class: 68-06 ACM Class: I.2.7

  19. arXiv:2104.04725  [pdf, other

    cs.CL

    Fool Me Twice: Entailment from Wikipedia Gamification

    Authors: Julian Martin Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger, Jordan Boyd-Graber

    Abstract: We release FoolMeTwice (FM2 for short), a large dataset of challenging entailment pairs collected through a fun multi-player game. Gamification encourages adversarial examples, drastically lowering the number of examples that can be solved using "shortcuts" compared to other popular entailment datasets. Players are presented with two tasks. The first task asks the player to write a plausible claim… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: Published in NAACL 2021

  20. arXiv:2104.01099  [pdf, other

    cs.CL

    TAPAS at SemEval-2021 Task 9: Reasoning over tables with intermediate pre-training

    Authors: Thomas Müller, Julian Martin Eisenschlos, Syrine Krichene

    Abstract: We present the TAPAS contribution to the Shared Task on Statement Verification and Evidence Finding with Tables (SemEval 2021 Task 9, Wang et al. (2021)). SEM TAB FACT Task A is a classification task of recognizing if a statement is entailed, neutral or refuted by the content of a given table. We adopt the binary TAPAS model of Eisenschlos et al. (2020) to this task. We learn two binary classifica… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

  21. arXiv:2103.12011  [pdf, other

    cs.CL

    Open Domain Question Answering over Tables via Dense Retrieval

    Authors: Jonathan Herzig, Thomas Müller, Syrine Krichene, Julian Martin Eisenschlos

    Abstract: Recent advances in open-domain QA have led to strong models based on dense retrieval, but only focused on retrieving textual passages. In this work, we tackle open-domain QA over tables for the first time, and show that retrieval can be improved by a retriever designed to handle tabular context. We present an effective pre-training procedure for our retriever and improve retrieval quality with min… ▽ More

    Submitted 9 June, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: NAACL 2021 camera ready

  22. arXiv:2010.00571  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Understanding tables with intermediate pre-training

    Authors: Julian Martin Eisenschlos, Syrine Krichene, Thomas Müller

    Abstract: Table entailment, the binary classification task of finding if a sentence is supported or refuted by the content of a table, requires parsing language and table structure as well as numerical and discrete reasoning. While there is extensive work on textual entailment, table entailment is less well studied. We adapt TAPAS (Herzig et al., 2020), a table-based BERT model, to recognize entailment. Mot… ▽ More

    Submitted 5 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP Findings 2020

  23. arXiv:2006.16038  [pdf, other

    cs.LG stat.ML

    SoftSort: A Continuous Relaxation for the argsort Operator

    Authors: Sebastian Prillo, Julian Martin Eisenschlos

    Abstract: While sorting is an important procedure in computer science, the argsort operator - which takes as input a vector and returns its sorting permutation - has a discrete image and thus zero gradients almost everywhere. This prohibits end-to-end, gradient-based learning of models that rely on the argsort operator. A natural way to overcome this problem is to replace the argsort operator with a continu… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures. Accepted at ICML 2020

  24. arXiv:2004.02349  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    TAPAS: Weakly Supervised Table Parsing via Pre-training

    Authors: Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno, Julian Martin Eisenschlos

    Abstract: Answering natural language questions over tables is usually seen as a semantic parsing task. To alleviate the collection cost of full logical forms, one popular approach focuses on weak supervision consisting of denotations instead of logical forms. However, training semantic parsers from weak supervision poses difficulties, and in addition, the generated logical forms are only used as an intermed… ▽ More

    Submitted 21 April, 2020; v1 submitted 5 April, 2020; originally announced April 2020.

    Comments: Accepted to ACL 2020

  25. arXiv:1909.04761  [pdf, other

    cs.CL cs.LG

    MultiFiT: Efficient Multi-lingual Language Model Fine-tuning

    Authors: Julian Martin Eisenschlos, Sebastian Ruder, Piotr Czapla, Marcin Kardas, Sylvain Gugger, Jeremy Howard

    Abstract: Pretrained language models are promising particularly for low-resource languages as they only require unlabelled data. However, training existing models requires huge amounts of compute, while pretrained cross-lingual models often underperform on low-resource languages. We propose Multi-lingual language model Fine-Tuning (MultiFiT) to enable practitioners to train and fine-tune language models eff… ▽ More

    Submitted 3 June, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Proceedings of EMNLP-IJCNLP 2019