Comments (8)
This error happens when the binary is compiled for x86_64 instead of arm64.
This is a common mistake if you are compiling inside a conda-like environment.
Check the quantize binary with:
file quantize
On Apple Silicon Macs it should print:
quantize: Mach-O 64-bit executable arm64
If it shows x86_64, you have to recompile after exiting the conda environment:
make clean
make
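The check above can be scripted. This is only a minimal sketch: classify_arch is a hypothetical helper (not part of llama.cpp), and it assumes the quantize binary sits in the current directory. It maps the text printed by file to an architecture name and prints it next to what uname -m reports for the current shell.

```shell
# Map one line of `file` output to an architecture name.
# classify_arch is a hypothetical helper, not part of llama.cpp.
classify_arch() {
  case "$1" in
    *arm64*|*aarch64*) echo arm64 ;;
    *x86_64*|*x86-64*) echo x86_64 ;;
    *) echo unknown ;;
  esac
}

# Only inspect the binary if it exists in the current directory.
if [ -f ./quantize ]; then
  bin_arch=$(classify_arch "$(file ./quantize)")
  echo "quantize is built for: $bin_arch"
  echo "this shell reports:    $(uname -m)"
fi
```

If both lines say x86_64 on an Apple Silicon Mac, the shell itself is probably running as an x86_64 process (for example one started from an x86_64 conda install), so anything built from it will target x86_64 as well.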
from llama.cpp.
What do cc --version and c++ --version say?
from llama.cpp.
@prusnak it said
Target: x86_64-apple-darwin21.6.0
Thread model: posix
InstalledDir: /Library/Developer/CommandLineTools/usr/bin
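The Target line is the part that matters here: x86_64-apple-darwin means the compiler is producing x86_64 binaries. A small sketch for pulling the architecture out of that output, where target_arch is a hypothetical helper name and the Target: line format is assumed to match what is shown above:

```shell
# Print the architecture prefix of the "Target:" line in `cc --version` output.
# target_arch is a hypothetical helper; $1 is the full text of that output.
target_arch() {
  printf '%s\n' "$1" | sed -n 's/^Target: *\([^-]*\)-.*/\1/p'
}

# Example with the compiler that is on PATH right now.
if command -v cc >/dev/null 2>&1; then
  echo "cc targets: $(target_arch "$(cc --version 2>&1)")"
fi
```

On an Apple Silicon Mac a native toolchain would be expected to print arm64 here.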
from llama.cpp.
Solved by reinstalling the conda env. Thanks, guys.
from llama.cpp.
Thank you, it works as you said.
from llama.cpp.
@brozkrut I got the error in Windows 7:
ERROR: /app/.devops/tools.sh: line 40 6 Illegal instruction ./main $arg2
when I executed:
--quantize "/models/7B/ggml-model-f16.bin" "/models/7B/ggml-model-q4_0.bin" 2
and I checked my quantize file:
quantize: ELF 64-bit LSB shared object, x86-64, version 1 <GNU/Linux> ...... for GNU/Linux 3.2.0, not stripped
But the architecture of the Docker image ghcr.io/ggerganov/llama.cpp:full is amd64:
docker image inspect ghcr.io/ggerganov/llama.cpp:full | grep Architecture
"Architecture": "amd64"
and my environment is:
Docker Toolbox 1.13.1
docker client: 1.13.1 os/arch: windows 7 /amd64
docker server:19.03.12 os/arch:ubuntu 22.04 /amd64
So I'm confused:
1. Must I recompile quantize?
2. How do I recompile if I use the Docker image?
3. Is there anywhere I can download it directly?
from llama.cpp.
Quoting the first comment ("This error happens when the binary is compiled for x86_64 instead of arm64 ... recompile after exiting the conda environment: make clean, make"):
Can you explain this more clearly?
from llama.cpp.
Quoting "Solved by reinstalling the conda env. Thanks, guys":
Can you please explain?
from llama.cpp.