Comments (11)
Conda can be really annoying. To be honest we don't really need anything specific from there. The only reason we used it was to avoid having to specify installation that vary from linux distribution to linux distribution but it might be doing more harm than good...
from ffcv.
Hi @mbsariyildiz ! Do you mind posting the version of CUDA and the NVIDIA Drivers that you have installed? (e.g., the output of nvidia-smi)?
from ffcv.
Thanks a lot for your quick response.
I use NVIDIA driver version: 470.57.02 and CUDA Version: 11.4
Just to note, I am able to install PyTorch alone with cudatoolkit=11.3.
from ffcv.
Any idea which package is causing the conflict? I would try without cupy and maybe without numba first and if it works we see from there.
from ffcv.
Here is what I tried:
- By removing the python=3.9 constraint, I managed to create an environment peacefully. But then I got opencv-related issues (to be more specific, about ffmpeg decoders). ffmpeg was installed from the pytorch channel, and forcing conda to replace it from the version available in the default conda channel, or the conda-forge channel again resulted in a bunch of conflicts.
- Tried to create a conda environment from a fresh Anaconda installation, but again ended up with the same conflicts.
- Tried to use different GCC versions (some of the issues were GCC-related), again, no success.
After 2 hours of struggle, I gave up.
Interestingly, one of my colleagues managed to install ffcv on the same machine successfully. I simply cloned his conda environment, and installed ffcv on top, which worked.
Couple of things I noticed though, the version of major libraries, like PyTorch, CuPy, Numba was different, and ffmpeg was from the conda-forge channel.
from ffcv.
Thanks for letting us know @mbsariyildiz ! What version of conda did they have?
from ffcv.
Sorry for not being clear @andrewilyas , he had the same conda version 4.10.3.
I find that a bit odd, as we were on the same machine, using the same CUDA & NVIDIA drivers and LD_LIBRARY_PATH PATH variables.
Just to add, upgrading my conda to 4.11.0, or installing the latest ffmpeg via pip also didn't help. I guess I couldn't link the correct ffmpeg lib.
Thanks for your attention @andrewilyas and @GuillaumeLeclerc . I'll use this conda env that I cloned from my colleague for now. Please let me know if I can provide more information for you to diagnose. Otherwise, please feel free to close this issue.
from ffcv.
HI, could you please also mention the steps your colleague took to resolve the ffmpeg conflict? I have been trying to install opencv with python 3.9 and conda version 4.10.3 (I am working with a remote server and so can not downgrade conda) and am running into an incomplete installation (the process terminates midway when ffmpeg is being installed).
from ffcv.
Based on the stack trace @mbsariyildiz the problem might be with conda's strict channel priority mode. Try something like this in the created env?
conda config --env --set channel_priority flexible
(Edited!)
from ffcv.
@andrewilyas updating the channel priority of conda as you said solved the issue! 🎆 Wow, great, thank you very much.
@PeriklisTheodoropoulos, he didn't have any issue with ffmpeg actually. He was able to create a working environment.
from ffcv.
Great! Closing this issue and added a note to the README regarding the fix.
from ffcv.
Related Issues (20)
- Indexing HOT 1
- Changing Indices during training leads to much slower training HOT 2
- Memory Leak in Ffcv Loader? HOT 4
- Import error - libopencv_impgproc missing HOT 2
- Installation issues HOT 6
- Small bug (improvement suggestion) in the quickstart doc HOT 1
- stuck in the loader when using only cpu HOT 4
- Grayscale Image Datasets HOT 2
- Top-1 accuracy on ImageNet drops between runs -- only difference is FFCV HOT 2
- Large .beton files slow down or even freeze learning during loading [possible bug] HOT 2
- [General question] FFCV scope
- ModuleNotFoundError: No module named 'ffcv.compiler
- doubt about mutli-gpu train when use imagenet 4 gpus HOT 1
- Default num_workers is incompatible with SLURM
- Installing FFCV on CPU-only node HOT 1
- Exact performance improvement
- Unable to save anything in the Fields HOT 1
- Compression error causes performance drop
- Reuse memory?
- Installing ffcv fails HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ffcv.