Comments (5)
Update: I fixed my issue by rolling back my Nvidia driver to 552.44 and restarting my machine.
from wandb.
Update: I fixed my issue by rolling back my Nvidia driver to 552.44 and restarting my machine.
I second this, downgraded to the Nvidia 552.44 studio driver, it and now is working again.
from wandb.
I've recently started experiencing the same exact issue that I suspect is related to a recent NVIDIA driver update. Yesterday, I updated my NVIDIA driver to version 555.85, and since then, I've been encountering errors in both Ray Tune and wand.
Initially, I encountered the error in Ray Tune, but after modifying the nvidia_gpu.py file in python3.11/site-packages/ray/_private/accelerators/ to use Latin-1 encoding instead of UTF-8, I was able to get my Ray Tune project working again. The modified code is as follows:
try:
pynvml.nvmlInit()
except pynvml.NVMLError:
return None # pynvml init failed
device_count = pynvml.nvmlDeviceGetCount()
cuda_device_type = None
if device_count > 0:
handle = pynvml.nvmlDeviceGetHandleByIndex(0)
device_name = pynvml.nvmlDeviceGetName(handle)
if isinstance(device_name, bytes):
device_name = device_name.decode("latin1") # Changed from "utf-8" to "latin1"
cuda_device_type = (
NvidiaGPUAcceleratorManager._gpu_name_to_accelerator_type(device_name)
)
pynvml.nvmlShutdown()
return cuda_device_type
However, I'm still experiencing issues with W&B, where I'm receiving errors and my metrics are not being monitored as intended.
from wandb.
same here , worked.
from wandb.
Has anyone tested out if it's safe to update the driver now?
from wandb.
Related Issues (20)
- [Solved][App]: Unable to see the runs in workspace even if the run is taking place successfully HOT 2
- [App]: runs showing from link but not in project page HOT 3
- [Q] Upgrade to gqlparser 2.5.14 in next release? HOT 1
- [App]: All tests fail to run because some dependencies are missing HOT 8
- [Feature]: Halving Random Grid Search for Hyperparameter Tuning HOT 1
- Issue showing runs in the groups from mobile browsers HOT 6
- [Q] "None" was logged when wandb.sweep using pytorch_lightning HOT 3
- [CLI]: wandb write data error HOT 2
- [Q] Download an artifact without the API/CLI HOT 7
- [CLI]: Offline wandb sync failed HOT 7
- [CLI]: init wandb sdk but the /tmp/code directory was not created HOT 5
- [Q] why some steps didn't be logged during training? HOT 2
- [Q] wandb stream ID error HOT 2
- [Q]wandb: ERROR Internal wandb error: file data was not synced wandb: While tearing down the service manager. The following error has occurred: Python int too large to convert to C long HOT 7
- [Q] Does per-sample logging bottleneck batching? HOT 2
- [CLI]: Logging an external artifact folder in Azure Storage Account (HNS) results in a directory stub being logged HOT 6
- [CLI]: wandb.errors.UsageError: Agent user not valid HOT 2
- [Q] Any docs on the settings argument to `wandb.init`? HOT 7
- [App]: api.wandb.ai returns 403 forbidden in some regions HOT 7
- [Feature]: Add sum aggregation to panel grouping ui
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from wandb.