- Add the following environment variables to your project:
worker_cpu=8
worker_gpu=1
num_workers=1
worker_ram_memory=16
- Go to Site Administration and add a new profile with 8 CPUs and 16 GB of RAM.
- Add the following CML runtime (to use your own image, refer to the Runtime section for further steps [tba]):
luismap/cml:pbjcuda-V2.0
- Install the requirements:
pip install -r requirements.txt
- Create the Ray cluster. The following script creates a Ray cluster with 2 nodes:
python3 ray_start_cluster_python.py
- Run the script mistral_vllm.py.
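
Before starting the cluster, it can help to confirm that the four environment variables from the first step are set and well formed. The sketch below is only an illustration: `WorkerConfig` and `load_worker_config` are hypothetical names, not part of this project, and it assumes all four values are non-negative integers and that `worker_ram_memory` is expressed in GB. The actual `ray_start_cluster_python.py` may read these variables differently.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class WorkerConfig:
    """Hypothetical container for the worker settings listed above."""
    cpu: int
    gpu: int
    num_workers: int
    ram_gb: int  # assumption: worker_ram_memory is given in GB


# Maps each dataclass field to the environment variable it is read from.
ENV_NAMES = {
    "cpu": "worker_cpu",
    "gpu": "worker_gpu",
    "num_workers": "num_workers",
    "ram_gb": "worker_ram_memory",
}


def load_worker_config(env=None):
    """Read the four worker settings and fail early on bad input."""
    env = os.environ if env is None else env
    values = {}
    for field, var in ENV_NAMES.items():
        raw = env.get(var)
        if raw is None:
            raise KeyError(f"missing environment variable: {var}")
        try:
            number = int(raw)
        except ValueError:
            raise ValueError(f"{var} must be an integer, got {raw!r}") from None
        if number < 0:
            raise ValueError(f"{var} must not be negative, got {number}")
        values[field] = number
    return WorkerConfig(**values)
```

Calling `load_worker_config()` with no argument reads from the process environment; passing a plain dict makes the check easy to try without touching the project settings.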