balanced-dataparallel's Introduction

Hi there 👋, I'm Yang Jianxin

I'm an NLP practitioner interested in large language models. I graduated from SYSU with a master's degree.

In my free time, I like to write technical blogs on my WeChat Official Account (YeungNLP) and on Zhihu (红雨瓢泼).

🔭 Experiences:

  • Shopee: building NLP algorithm capabilities for customer service (2022-04 to present).
  • Tencent: building NLP algorithm capabilities for product understanding (2021-06 to 2022-04).
  • Alibaba: internship (2020-06 to 2020-09).

⚙ Here are some of my public projects:

  • Firefly: One-stop training for LLMs. Some achievements:
      1. firefly-llama2-13b ranked 3rd among all 13B models on the Open LLM Leaderboard, only 0.5 points behind 1st place.
      2. firefly-llama-30b, trained on a single V100, ranked 10th among all 30B models on the Open LLM Leaderboard.
      3. firefly-baichuan-13b has over 1.63 million downloads.
      4. firefly-qwen1.5-en-7b-dpo improves on the official chat model by 7.21 points.
      5. firefly-gemma-7b improves on the official chat model by 9.37 points.
  • GPT2-chitchat: Chinese GPT2 for chitchat.
  • Firefly-LLaMA2-Chinese: Chinese Llama2 with an efficient and effective training method.
  • LongQLoRA: An efficient and effective method for extending the context length of Llama2 to 8192 with a single V100. (Technical Report)
  • CPM: Chinese composition model based on CPM.
  • CLIP-Chinese: Chinese CLIP model trained with 1.4 million image-text pairs.
  • ClipCap-Chinese: Chinese image caption model based on CLIP and Mengzi.
  • OFA-Chinese: Chinese multi-modal unified pre-training model.
  • LLMPruner: Prunes the vocabulary of LLMs to save memory in training.

📁 Here are some of my technical blogs:
