Skip to main content

Showing 1–8 of 8 results for author: Parisien, C

  1. arXiv:2410.01174  [pdf, other

    cs.CL cs.AI

    Towards Inference-time Category-wise Safety Steering for Large Language Models

    Authors: Amrita Bhattacharjee, Shaona Ghosh, Traian Rebedea, Christopher Parisien

    Abstract: While large language models (LLMs) have seen unprecedented advancements in capabilities and applications across a variety of use-cases, safety alignment of these models is still an area of active research. The fragile nature of LLMs, even models that have undergone extensive alignment and safety training regimes, warrants additional safety steering steps via training-free, inference-time methods.… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  2. arXiv:2406.15214  [pdf, other

    cs.CL

    Unsupervised Extraction of Dialogue Policies from Conversations

    Authors: Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien

    Abstract: Dialogue policies play a crucial role in developing task-oriented dialogue systems, yet their development and maintenance are challenging and typically require substantial effort from experts in dialogue modeling. While in many situations, large amounts of conversational data are available for the task at hand, people lack an effective solution able to extract dialogue policies from this data. In… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2404.05993  [pdf, other

    cs.LG cs.CL cs.CY

    AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts

    Authors: Shaona Ghosh, Prasoon Varshney, Erick Galinkin, Christopher Parisien

    Abstract: As Large Language Models (LLMs) and generative AI become more widespread, the content safety risks associated with their use also increase. We find a notable deficiency in high-quality content safety datasets and benchmarks that comprehensively cover a wide range of critical safety areas. To address this, we define a broad content safety risk taxonomy, comprising 13 critical risk and 9 sparse risk… ▽ More

    Submitted 11 September, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2404.03820  [pdf, other

    cs.CL

    CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

    Authors: Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Jiaqi Zeng, Christopher Parisien

    Abstract: Recent advancements in instruction-tuning datasets have predominantly focused on specific tasks like mathematical or logical reasoning. There has been a notable gap in data designed for aligning language models to maintain topic relevance in conversations - a critical aspect for deploying chatbots to production. We introduce the CantTalkAboutThis dataset to help language models remain focused on t… ▽ More

    Submitted 21 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  6. arXiv:2310.10501  [pdf, other

    cs.CL cs.AI

    NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails

    Authors: Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, Jonathan Cohen

    Abstract: NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. Guardrails (or rails for short) are a specific way of controlling the output of an LLM, such as not talking about topics considered harmful, following a predefined dialogue path, using a particular language style, and more. There are several mechanisms that allow LLM providers a… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 - Demo track

  7. arXiv:2211.05596  [pdf, other

    cs.CL cs.AI

    Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

    Authors: Makesh Narsimhan Sreedhar, Christopher Parisien

    Abstract: Conversation designers continue to face significant obstacles when creating production quality task-oriented dialogue systems. The complexity and cost involved in schema development and data collection is often a major barrier for such designers, limiting their ability to create natural, user-friendly experiences. We frame the classification of user intent as the generation of a canonical form, a… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at SereTOD Workshop - EMNLP 2022

  8. arXiv:2203.03540  [pdf

    cs.CL cs.AI cs.LG

    GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records

    Authors: Xi Yang, Aokun Chen, Nima PourNejatian, Hoo Chang Shin, Kaleb E Smith, Christopher Parisien, Colin Compas, Cheryl Martin, Mona G Flores, Ying Zhang, Tanja Magoc, Christopher A Harle, Gloria Lipori, Duane A Mitchell, William R Hogan, Elizabeth A Shenkman, Jiang Bian, Yonghui Wu

    Abstract: There is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is compar… ▽ More

    Submitted 16 December, 2022; v1 submitted 2 February, 2022; originally announced March 2022.

    Comments: 24 pages, 2 figures, 3 tables