Coder Social home page Coder Social logo

korean-openorca's Introduction

🐳Korean-OpenOrca🐳

Korean-OpenOrca

Korean-Open-platypus 데이터셋을 활용하여 llama-2-ko를 fine-tuning한 Korean-Platypus model

🐳KoR-Orca-Platypus-13B🥮: Hugging Face
🐳Korean-OpenOrca-13B: Hugging Face

🐳OpenOrca-KO: Hugging Face
🐳KOR-OpenOrca-Platypus: Hugging Face
본 연구는 (주)마커와 (주)미디어그룹사람과숲의 오픈소스 LLM 연구 컨소시엄에서 진행되었습니다.


Model BenchMark(KO-LLM; will update new version)

Model Average Ko-ARC Ko-HellaSwag Ko-MMLU Ko-TruthfulQA Ko-CommonGen V2 Dataset Base_model
🐳KoR-Orca-Platypus-13B 50.13 42.06 53.95 42.28 43.55 68.78 KOR-OpenOrca-Platypus ko-en-llama2-13b
🐳Korean-OpenOrca-13B 47.85 43.09 54.13 40.24 45.22 56.57 🐳OpenOrca-KO ko-en-llama2-13b
KoT-Platypus2-13B 49.55 43.69 53.05 42.29 43.34 65.38 KoCoT KO-platypus2-13B
KO-platypus2-13B 47.90 44.20 54.31 42.47 44.41 54.11 KOpen-platyus ko-en-llama2-13b

News

Quick start

### KO-Platypus
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "kyujinpy/Korean-OpenOrca-13B"
OpenOrca = AutoModelForCausalLM.from_pretrained(
        repo,
        return_dict=True,
        torch_dtype=torch.float16,
        device_map='auto'
)
OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)

Datasets

from datasets import load_dataset

# dataset testing
dataset = load_dataset("kyujinpy/OpenOrca-KO") # But currently, private repo. Please wait!

🐳OpenOrca-KO: Hugging Face

It is public state!

References

🐳OpenOrca
Kopen-Platypus🥮
🐳OpenOrca-KO
Platypus
llama-2-ko
ko-en-llama2
🐳Korean-OpenOrca-13B

TODO

  • Make KOR-OpenOrca
  • Share huggingface repo
  • Combined platypus+OpenOrca datasets
  • Make KOR-OpenOrca-Platypus
  • Share evaluation results
  • Share datasets

korean-openorca's People

Contributors

kyujinhan avatar

Stargazers

 avatar devjun7 avatar  avatar acer avatar OrigamiDream avatar  avatar orange fritters avatar  avatar Jinhyeong avatar Jioh L. Jung avatar  avatar HyunjunJeon avatar Sumin Kim avatar MwP avatar Insu Jeon avatar Yeong-Joon Ju (주영준) avatar ZanD avatar  avatar

Watchers

cheolisable2 avatar  avatar

korean-openorca's Issues

inference 시 필요한 GPU

안녕하세요,

먼저 리더보드에서 좋은 순위 달성 하신 것 축하드립니다.

해당 모델을 사용해보려고 하는데 inference 시 GPU가 몇장정도 필요한지 궁금합니다.

감사합니다.

[질문] 파인튜닝 방법

먼저, leaderboard에서 좋은 성적 축하드립니다!
SFT 시 사용한 프롬프트 공유가능한가요?
추가적으로 학습시 Hyperparameter도 궁금합니다. (lora config 관련)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.