Comments (3)
TORCH_CUDA_ARCH_LIST="6.0 6.1 7.0 7.5 8.0 8.6+PTX" pip3 install bmtrain
from bmtrain.
Did you set the TORCH_CUDA_ARCH_LIST environment variable?
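To check this quickly, here is a minimal sketch that reads `TORCH_CUDA_ARCH_LIST` and reports the lowest architecture it requests. The helper names `parse_arch_list` and `describe_arch_list` are made up for illustration and are not part of bmtrain or PyTorch. It assumes the variable holds only numeric entries such as `7.5` or `8.6+PTX`; named architectures are not handled.

```python
import os


def parse_arch_list(value):
    """Split a TORCH_CUDA_ARCH_LIST string into (major, minor, ptx) tuples.

    Assumption: entries are numeric ("7.5", "8.6+PTX") and separated by
    spaces or semicolons. Named architectures are not supported here.
    """
    archs = []
    for token in value.replace(";", " ").split():
        ptx = token.endswith("+PTX")
        number = token[:-len("+PTX")] if ptx else token
        major, minor = number.split(".")
        archs.append((int(major), int(minor), ptx))
    return archs


def describe_arch_list(env=None):
    """Return a short message about the variable in the given environment."""
    env = os.environ if env is None else env
    value = env.get("TORCH_CUDA_ARCH_LIST", "").strip()
    if not value:
        return "TORCH_CUDA_ARCH_LIST is not set"
    lowest = min((major, minor) for major, minor, _ in parse_arch_list(value))
    return "lowest requested arch: {}.{}".format(*lowest)
```

Calling `print(describe_arch_list())` in the same shell where you run `pip3 install bmtrain` shows whether the variable is actually visible to the build.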
Same issue here. Did you resolve it?
CH_EXTENSION_NAME=_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_50,code=sm_50 -std=c++17
csrc/cuda/has_inf_nan.cu(11): error: identifier "__heq" is undefined
1 error detected in the compilation of "csrc/cuda/has_inf_nan.cu".
error: command '/usr/local/cuda-11.7/bin/nvcc' failed with exit code 1
[end of output]
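The `-gencode=arch=compute_50,code=sm_50` flag in the log suggests the build targeted compute capability 5.0. As far as I know, half-precision comparison intrinsics such as `__heq` are only available from compute capability 5.3 upward, which would explain the "undefined" error. The sketch below pulls the `-gencode` targets out of an nvcc command line so you can see what was actually requested. The function names and the threshold constant are my own assumptions, not something taken from bmtrain.

```python
import re

# Matches flags such as "-gencode=arch=compute_50,code=sm_50".
GENCODE_RE = re.compile(
    r"-gencode[= ]arch=compute_(\d+)[a-z]?,code=(?:sm|compute)_(\d+)[a-z]?"
)

# Assumption: __heq and related half intrinsics need compute capability 5.3+.
MIN_HALF_COMPARE_ARCH = 53


def gencode_targets(command_line):
    """Return (virtual_arch, real_arch) integer pairs found in the command."""
    return [(int(v), int(r)) for v, r in GENCODE_RE.findall(command_line)]


def archs_below_half_support(command_line):
    """Return target archs that are too old for half comparison intrinsics."""
    return [
        real
        for _, real in gencode_targets(command_line)
        if real < MIN_HALF_COMPARE_ARCH
    ]
```

If `archs_below_half_support` returns anything for your failing nvcc line, the arch list used by the build probably included an architecture older than 5.3, and setting `TORCH_CUDA_ARCH_LIST` as suggested above is worth trying.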