Organizations

@allenai @BrachioLab


3D LEGO models and mosaics from images using R and #tidyverse

R 419 57 Updated Nov 27, 2023

AI Logging for Interpretability and Explainability🔬

Python 89 7 Updated Jun 7, 2024

LLM training code for Databricks foundation models

Python 4,044 528 Updated Nov 13, 2024

A Survey on Data Selection for Language Models

178 10 Updated Oct 13, 2024

vietnews dataset for Vietnamese summarization benchmark

22 5 Updated Sep 24, 2019

Probabilistic programming with HuggingFace language models

Python 88 15 Updated Oct 23, 2024

[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.

Python 92 10 Updated Mar 15, 2023

Apéro is a Hugo theme for personal websites. A Hugo theme you'll want to hang out with 🌌. This is the source for the theme files to install.

SCSS 184 58 Updated Sep 16, 2024

Mamba SSM architecture

Python 13,188 1,124 Updated Nov 5, 2024

Fine-tune Mistral-7B on 3090s, A100s, H100s

Python 701 62 Updated Oct 11, 2023

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Python 25 4 Updated May 23, 2024

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

Python 538 29 Updated Oct 3, 2023

[ASE 2023] Software Entity Recognition with Noise-Robust Learning

Python 3 1 Updated Apr 14, 2024

Gram2Vec is a document embedding algorithm that embeds documents into a higher-dimensional space based on grammatical style.

Python 7 1 Updated Oct 17, 2024

[ACL 2023] Explanation-based Finetuning Makes Models More Robust to Spurious Correlation

Python 6 Updated May 19, 2023

Syntax Regex Matcher is a package for applying regular expressions to parse trees to look for syntactic constructions in English sentences

Python 1 Updated Aug 23, 2023

Multilingual syllable annotation pipeline component for spaCy

Python 37 2 Updated Mar 8, 2023

The original Backpack Language Model implementation, a fork of FlashAttention

Python 64 6 Updated May 29, 2023

In-context Example Selection with Influences

Python 13 1 Updated May 12, 2023

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Python 265 23 Updated Aug 5, 2023

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 969 72 Updated Aug 21, 2024

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,865 135 Updated Aug 25, 2024

A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.

Jupyter Notebook 43 1 Updated Jul 23, 2022

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,703 668 Updated Jan 14, 2024
Python 67 5 Updated Aug 24, 2022

Few-shot Learning of GPT-3

Python 342 50 Updated Sep 18, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 6,278 630 Updated Nov 11, 2024

Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021

Python 55 7 Updated Dec 11, 2021

Code for paper "CrossFit 🏋️: A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)

Python 105 6 Updated Apr 28, 2022