Computer Science > Computation and Language

arXiv:2308.13467 (cs)

[Submitted on 25 Aug 2023]

Title: Leveraging Knowledge and Reinforcement Learning for Enhanced Reliability of Language Models

Authors: Nancy Tyagi, Surjodeep Sarkar, Manas Gaur


Abstract: The Natural Language Processing (NLP) community has been using crowdsourcing techniques to create benchmark datasets such as General Language Understanding Evaluation (GLUE) for training modern Language Models (LMs) such as BERT. GLUE tasks measure reliability scores using inter-annotator metrics, i.e., Cohen's Kappa. However, the reliability aspect of LMs has often been overlooked. To counter this problem, we explore a knowledge-guided LM ensembling approach that leverages reinforcement learning to integrate knowledge from ConceptNet and Wikipedia as knowledge graph embeddings. This approach mimics human annotators resorting to external knowledge to compensate for information deficits in the datasets. Across nine GLUE datasets, our research shows that ensembling strengthens reliability and accuracy scores, outperforming the state of the art.
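
For readers unfamiliar with the inter-annotator metric named in the abstract, the following is a minimal sketch of the textbook definition of Cohen's Kappa for two raters, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e is the agreement expected by chance. It is not taken from the paper: the function name `cohens_kappa`, the example labels, and the convention of returning 1.0 when chance agreement is already total are all assumptions made for illustration.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items both raters labeled identically.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if p_e == 1.0:
        # Assumed convention: both raters used a single identical label.
        return 1.0
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical annotations for four items from two raters.
rater_1 = ["entail", "entail", "contra", "contra"]
rater_2 = ["entail", "contra", "contra", "contra"]
kappa = cohens_kappa(rater_1, rater_2)
print(kappa)
```

In this toy case the raters agree on three of four items (p_o = 0.75) while chance agreement is 0.5, so the score is well below the raw agreement rate. That gap is why kappa, instead of plain accuracy, is commonly used as a reliability measure.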

Comments:	Accepted at CIKM'23
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2308.13467 [cs.CL]
	(or arXiv:2308.13467v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.13467

Submission history

From: Nancy Tyagi
[v1] Fri, 25 Aug 2023 16:11:08 UTC (1,865 KB)
