Skip to content
View shijie-wu's full-sized avatar
Block or Report

Block or report shijie-wu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

All things prompt engineering

Python 5,284 291 Updated Jun 4, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,408 493 Updated Aug 1, 2024
Python 7,041 545 Updated Jul 25, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,133 48 Updated Jul 9, 2024

LLM training code for Databricks foundation models

Python 3,898 511 Updated Aug 1, 2024

Tools for managing datasets for governance and training.

HTML 76 49 Updated Jul 29, 2024

A tiny library for coding with large language models.

Python 1,197 74 Updated Jul 10, 2024
Python 1,430 150 Updated Jul 30, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 90,084 14,248 Updated Aug 1, 2024
9 5 Updated Oct 20, 2022

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,367 771 Updated Jul 10, 2024

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

397 22 Updated Jun 25, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,805 240 Updated Aug 1, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 5,463 318 Updated Jul 5, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 2,963 242 Updated Jul 15, 2024

Beautiful calculator app for macOS, Linux & Windows

JavaScript 5,433 201 Updated Jul 25, 2024

A word2vec negative sampling implementation with correct CBOW update.

C++ 261 18 Updated Nov 8, 2021

A bilingual NLI dataset annotated in Spanish and human translated into English

8 4 Updated Apr 14, 2020

This repository contains the FewGLUE dataset for few-shot natural language understanding.

160 25 Updated Sep 16, 2020

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,102 3,991 Updated Aug 1, 2024

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 1,991 203 Updated Jan 9, 2024

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 29,596 7,358 Updated Jul 31, 2024

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Jupyter Notebook 6,930 1,294 Updated Jan 18, 2023

On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines

Python 130 21 Updated Sep 6, 2023

BERT models for many languages created from Wikipedia texts

34 1 Updated May 25, 2020

Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.

Python 112 16 Updated Nov 10, 2020

Efficient Low-Memory Aligner

C 135 30 Updated Jun 20, 2024

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 345 47 Updated Nov 7, 2023

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,393 433 Updated Aug 1, 2024
Next