Coder Social home page Coder Social logo

Yi-34B 需要的资源是多少? about yi HOT 13 CLOSED

01-ai avatar 01-ai commented on August 26, 2024
Yi-34B 需要的资源是多少?

from yi.

Comments (13)

wangye01inf avatar wangye01inf commented on August 26, 2024 3

@xgysigned 4090/3090 的显存应该在 24 GB,34B 参数以 float16/bfloat16 加载需要 34 GB*2=68 GB 左右显存,需要上多卡

多卡可以考虑使用仓库中的 TP Demo:https://github.com/01-ai/Yi/blob/main/demo/text_generation_tp.py

也可以考虑使用 vllm/llamacpp 等社区开源的推理框架的一些特性来进一步降低显存的需求以及提升推理性能:

from yi.

crapthings avatar crapthings commented on August 26, 2024
image

let's find out

from yi.

crapthings avatar crapthings commented on August 26, 2024
image

3 块48g或者a100 80g的够不够不知道

from yi.

crapthings avatar crapthings commented on August 26, 2024
image image

from yi.

ericjank avatar ericjank commented on August 26, 2024

image image

真有钱

from yi.

ericjank avatar ericjank commented on August 26, 2024
image 3 块48g或者a100 80g的够不够不知道

哈尔滨的朋友?

from yi.

xihajun avatar xihajun commented on August 26, 2024

可以考虑支持autotrain-advanced,似乎支持int4推理

from yi.

xain avatar xain commented on August 26, 2024

希望量化后的版本支持24G显卡。

from yi.

waltcow avatar waltcow commented on August 26, 2024

希望量化后的版本支持24G显卡。

from yi.

m1105550 avatar m1105550 commented on August 26, 2024

卡一個量化後版本

from yi.

Samge0 avatar Samge0 commented on August 26, 2024

4块2080ti魔改22g的显卡(22g*4=88g)可以跑吗?目前我有两块

from yi.

AmeowCAT avatar AmeowCAT commented on August 26, 2024

4bit量化版本应该正好支持24G显存的显卡

from yi.

255doesnotexist avatar 255doesnotexist commented on August 26, 2024

q4_k_s is suitable for deploying on Tesla P40 (24G VRAM).

from yi.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.