Comments (7)
Hey @full-stack-ai! There are 2 implementations in llm.c:
- A PyTorch reference implementation, which lives in train_gpt2.py. We only use it as a sanity check that our C implementation is correct.
- A C/CUDA implementation, which is probably the one you care about. If you want the CPU implementation, build it with make and then just run ./train_gpt2.
Let me know if you need any additional help, your feedback is welcome!
Otherwise feel free to close the issue.
from llm.c.
It's saying that you don't have NVCC, which is expected because you're on a Mac. This is okay. I see the cc line, so is it building, or are there other errors?
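To tell the harmless NVCC message apart from a real build failure, a quick check like the one below can help. It is only a sketch: describe_build is a hypothetical helper name, not part of llm.c, and it assumes that a missing nvcc on PATH simply means only the CPU build applies, as described above.

```shell
#!/bin/sh
# Sketch: decide which build path applies on this machine.
# describe_build is a hypothetical helper, not something llm.c ships.
describe_build() {
    if command -v nvcc >/dev/null 2>&1; then
        echo "cuda"   # nvcc is on PATH, so CUDA targets can be attempted
    else
        echo "cpu"    # no nvcc (e.g. on a Mac): only the CPU build applies
    fi
}

case "$(describe_build)" in
    cuda) echo "nvcc found: a CUDA build can be attempted" ;;
    cpu)  echo "nvcc not found: build with 'make train_gpt2', then run ./train_gpt2" ;;
esac
```

If this prints the "nvcc not found" line and the cc step still completes, the NVCC message in the make output can be ignored.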
Are the executable files there in the directory? Also, which version of gcc are you using?
Here are the repo files in the same directory:
Everything is there. train_gpt2 is built. You are good to go. Feel free to close this issue. Thanks.
So why do the instructions say to run make train_gpt2? If python train_gpt2.py trains the model, then the documentation should change.