Coder Social home page Coder Social logo

johncruyff14 / llama2-accessory Goto Github PK

View Code? Open in Web Editor NEW

This project forked from alpha-vllm/llama2-accessory

0.0 0.0 0.0 23.61 MB

An Open-source Toolkit for LLM Development

Home Page: https://llama2-accessory.readthedocs.io/

License: Other

Shell 8.67% Python 90.77% Batchfile 0.56%

llama2-accessory's Introduction

LLaMA2-Accessory: An Open-source Toolkit for LLM Development ๐Ÿš€


๐Ÿš€LLaMA2-Accessory is an open-source toolkit for pre-training, fine-tuning and deployment of Large Language Models (LLMs) and mutlimodal LLMs. This repo is mainly inherited from LLaMA-Adapter with more advanced features.๐Ÿง 

News

  • [2023.09.15] We now support Falcon 180B!๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ
  • [2023.09.14] WeMix-LLaMA2-70B shows excellent performance on the OpenCompass benchmark!๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ
  • [2023.09.02] We now support InternLM๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ
  • [2023.08.28] We release quantized LLM with OmniQuant, which is an efficient, accurate, and omnibearing (even extremely low bit) quantization algorithm. Multimodal version is coming soon๐Ÿ”ฅ๐Ÿ”ฅ
  • [2023.08.27] We now support CodeLLaMA and instruction fine-tuning on evol-code-alpaca๐Ÿ”ฅ๐Ÿ”ฅ
  • [2023.08.27] We release our documentation in a webbook format ๐Ÿ”—Check it out here
  • [2023.08.21] We release the Quantization codes and Evaluation result๐Ÿ”ฅ
  • [2023.08.05] We release the multimodel fine-tuning codes and checkpoints๐Ÿ”ฅ
  • [2023.07.23] Initial release ๐Ÿ“Œ

Features

Setup

โš™๏ธ For environment installation, please refer to Environment Setup.

Model Usage

๐Ÿค– Instructions for model pre-training, fine-tuning, inference, and other related topics are all available in the document.

Frequently Asked Questions (FAQ)

โ“ Encountering issues or have further questions? Find answers to common inquiries here. We're here to assist you!

Demos

Core Contributors

Chris Liu, Ziyi Lin, Guian Fang, Jiaming Han, Yijiang Liu, Renrui Zhang

Project Leader

Peng Gao, Wenqi Shao, Shanghang Zhang

Hiring Announcement

๐Ÿ”ฅ We are hiring interns, postdocs, and full-time researchers at the General Vision Group, Shanghai AI Lab, with a focus on multi-modality and vision foundation models. If you are interested, please contact [email protected].

Citation

If you find our code and paper useful, please kindly cite:

@article{zhang2023llamaadapter,
  title = {LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention},
  author={Zhang, Renrui and Han, Jiaming and Liu, Chris and Gao, Peng and Zhou, Aojun and Hu, Xiangfei and Yan, Shilin and Lu, Pan and Li, Hongsheng and Qiao, Yu},
  journal={arXiv preprint arXiv:2303.16199},
  year={2023}
}
@article{gao2023llamaadapterv2,
  title = {LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model},
  author={Gao, Peng and Han, Jiaming and Zhang, Renrui and Lin, Ziyi and Geng, Shijie and Zhou, Aojun and Zhang, Wei and Lu, Pan and He, Conghui and Yue, Xiangyu and Li, Hongsheng and Qiao, Yu},
  journal={arXiv preprint arXiv:2304.15010},
  year={2023}
}

Acknowledgement

Show More

License

Llama 2 is licensed under the LLAMA 2 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.

llama2-accessory's People

Contributors

enderfga avatar chrisliu6 avatar kriskrisliu avatar csuhan avatar lloongx avatar linziyi96 avatar zrrskywalker avatar tmm1 avatar eltociear avatar lupantech avatar theia-4869 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.