Coder Social home page Coder Social logo

devpod's Projects

agents icon agents

An Open-source Framework for Autonomous Language Agents

alpaca_farm icon alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

argilla icon argilla

✨Argilla: the open-source data curation platform for LLMs

datasetgpt icon datasetgpt

A command-line interface to generate textual and conversational datasets with LLMs.

discus icon discus

Generate and enrich datasets on-demand to fine-tune LLMs. Discord: https://discord.gg/t6ADqBKrdZ

dromedary icon dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

flipped-learning icon flipped-learning

[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners

h2ogpt icon h2ogpt

Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports LLaMa2, llama.cpp, and more. Demo: https://gpt.h2o.ai/

label-studio icon label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

lilac icon lilac

Curate better data for LLMs

llm-eval-survey icon llm-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

llmboxing icon llmboxing

this is to generate anonymous responses. Useful for RL

longform icon longform

Instruction Tuning Dataset and Models for Long Text Generation with Corpus Extraction

p-tuning-v2 icon p-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

prompt-tuning icon prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

prompttools icon prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate).

ragas icon ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

self-instruct icon self-instruct

Aligning pretrained language models with instruction data generated by themselves.

stanford_alpaca icon stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

trlx icon trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

wizardlm icon wizardlm

Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.