Comments (9)

ravenscroftj commented on July 18, 2024

Great thanks for looking into the AVX thing - in hindsight that was the obvious problem - I should have known 🤦

So I think the short-term solution would be to build the image on the system you're targeting, so that the C preprocessor picks up which instruction sets are supported. The slightly longer-term solution is for me to add CI builds that compile with different sets of CPU instructions and make them available as part of the release.

from turbopilot.

ravenscroftj commented on July 18, 2024

hi @oshanz - please try changing the image uri to ghcr.io/ravenscroftj/turbopilot:latest - the path from your log above is the old build.

ravenscroftj commented on July 18, 2024

Hi there thanks for your ticket. Can I ask what operating system and processor architecture you are running your k8s cluster on?

voarsh2 commented on July 18, 2024

Hi there thanks for your ticket. Can I ask what operating system and processor architecture you are running your k8s cluster on?

I am running Ubuntu 22.04 LTS.
Kubernetes V1.23
The CPUs support AVX (not AVX2) - some examples of my node CPUs:
(48 x Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (2 Sockets), 32 x Intel(R) Xeon(R) CPU E5-2650 v2 @ 2.60GHz (2 Sockets))

  • I haven't tried the container on my Ryzen system that has AVX2 support..... will try that now......

Update: no complaints on the K8s node that has (what I assume is) AVX2 support. You might want to add AVX support (plenty of powerful enough CPUs don't have AVX2 support) - gotta love bulky RAM. :D

D32vd commented on July 18, 2024

It happened to me too, when I run:
docker run --rm -it -v /home/yang/models:/models -e MODEL="/models/codegen-2B-multi-ggml-4bit-quant.bin" -p 80:18080 ghcr.io/ravenscroftj/turbopilot/turbopilot:latest
Terminal output:
main: seed = 1681348410
gptj_model_load: loading model from '/models/codegen-2B-multi-ggml-4bit-quant.bin' - please wait ...
gptj_model_load: n_vocab = 51200
gptj_model_load: n_ctx = 2048
gptj_model_load: n_embd = 2560
gptj_model_load: n_head = 32
gptj_model_load: n_layer = 32
gptj_model_load: n_rot = 64
gptj_model_load: f16 = 3
Illegal instruction (core dumped)

My operating environment:
docker 20.10.18 Community
Ubuntu 22.04.1 LTS
Intel(R) Celeron(R) CPU J1900 @ 1.99GHz
I pulled the docker image directly

oshanz commented on July 18, 2024
docker run --rm -it -v ./models:/models -e THREADS=6 -e MODEL="/models/codegen-6B-multi-ggml-4bit-quant.bin" -p 18080:18080 ghcr.io/ravenscroftj/turbopilot/turbopilot:latest

WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested

main: seed = 1681194918
gptj_model_load: loading model from '/models/codegen-6B-multi-ggml-4bit-quant.bin' - please wait ...
gptj_model_load: n_vocab = 51200
gptj_model_load: n_ctx   = 2048
gptj_model_load: n_embd  = 4096
gptj_model_load: n_head  = 16
gptj_model_load: n_layer = 33
gptj_model_load: n_rot   = 64
gptj_model_load: f16     = 2
gptj_model_load: ggml ctx size = 6325.92 MB
Illegal instruction (core dumped)

Apple M1 Pro
Mac 13.2
colima 0.5.4

4j4y commented on July 18, 2024

@ravenscroftj same issue with above image on mac m1.

ravenscroftj commented on July 18, 2024

hi @4j4y - just to clarify when you say image above you are referring to ghcr.io/ravenscroftj/turbopilot:latest (as opposed to the old image with /turbopilot/turbopilot in it)?

I don't have an Apple silicon device, so I'm struggling to work out what is working and what isn't. Are you running Docker Desktop on your M1 in order to run the image? Could you share the output from your docker pull or docker run command? (I'm after the image hash that you are using.)

Finally - have you tried downloading the binary zip file from the releases page and running that? Does it give the same error?

ravenscroftj commented on July 18, 2024

Support for older instruction sets is provided in release 0.0.5.
