Coder Social home page Coder Social logo

Comments (7)

ysyx2008 avatar ysyx2008 commented on May 17, 2024

经测试,单卡非量化模式运行也是一样的问题。

from codegeex2.

ysyx2008 avatar ysyx2008 commented on May 17, 2024

硬件环境:Tesla T4 16G * 4

from codegeex2.

Stanislas0 avatar Stanislas0 commented on May 17, 2024

不管改成多少,都只输出很短一节内容,如图所示。

使用4卡部署,启动参数为:python run_demo.py --model-path "/home/dl/data/codegeex2-6b-model" --n-gpus 4

图片

Tesla T4不支持BF16,是否启用了.half()?

from codegeex2.

ysyx2008 avatar ysyx2008 commented on May 17, 2024

不管改成多少,都只输出很短一节内容,如图所示。
使用4卡部署,启动参数为:python run_demo.py --model-path "/home/dl/data/codegeex2-6b-model" --n-gpus 4
图片

Tesla T4不支持BF16,是否启用了.half()?

查看源代码,未启用.half():
图片

刚刚使用int4权重文件,似乎可以正常输出。启动参数如下:
python gyzq_demo.py --model-path "THUDM/codegeex2-6b-int4" --n-gpus 4

难道是权重文件的问题?用git ssh方式下载的,不应该有损坏才对。加载过程也无报错。

from codegeex2.

ysyx2008 avatar ysyx2008 commented on May 17, 2024

确认使用int4量化后的权重文件可以正确输出:
图片

from codegeex2.

ivankxt avatar ivankxt commented on May 17, 2024

hi,我从https://huggingface.co/THUDM/codegeex2-6b-int4/tree/main下载的codegeex2-6b-int4;然后在V100 GPU机器上加载模型,输出结果完全不对,请问这是什么原因?Stanislas0

image
image

from codegeex2.

xd-Nanan avatar xd-Nanan commented on May 17, 2024

hi,我从https://huggingface.co/THUDM/codegeex2-6b-int4/tree/main下载的codegeex2-6b-int4;然后在V100 GPU机器上加载模型,输出结果完全不对,请问这是什么原因?Stanislas0

image image

我的测试也频繁出现此问题,而且无法控制输出,请问有解决嘛?

from codegeex2.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.