Comments (31)
I found that applying this patch fixes the error. The drivers may be thinking that the maximum amount of nvenc streams are constantly being used, hence nvenc being unavailable.
Tested on Arch 5.17.9
Driver version: 515.43.04-7
GPU chip: GA106
from open-gpu-kernel-modules.
Well, but however on my RTX 3050 for example obs-studio fails with nvenc, it says no supported hardware found.
from open-gpu-kernel-modules.
I could say I've fixed all this issues by replacing Nvidia with AMD (I was happy with AMD last 10 years).
from open-gpu-kernel-modules.
Thanks @rustatian, that log has been very helpful. Problem identified, dev team working on a solution.
from open-gpu-kernel-modules.
Closing this, as i am not interested anymore, sold Nvidia RTX 3050 in order to replace it with AMD RX6750XT.
Nvidia+Linux = Pain in the Ass!
from open-gpu-kernel-modules.
@vans163 & @atirut-w please visit or subscribe to #378
I will let my issue open until it's solved since this issue over here was closed too soon without a real solution.
from open-gpu-kernel-modules.
Same on 3060ti
OpenEncodeSessionEx failed: out of memory (10): (no details)
Problem doesn't exits on regular 515.
from open-gpu-kernel-modules.
Yeah, you may laugh to my comment, but since a long time Nvidia getting the worst on Linux compared to AMD.
Out of box hardware acceleration works with MESA GL (Not Nvidia ones). Already saw how bad is nvenc on Linux as it doesn't cover my expectations, and lack of hardware support for HEVC on Nvidia (Where AMD has won again as they have Hardware Support for HEVC).
Long story in short. AMD HEVC for me works better than Nvidia Nvenc (Maybe in prior to quality on same bit rate nvenc is better, but overall it's opposite).
My next card would definitely be AMD, though, considering how perfectly AMD seems to work with Linux.
AMD on Linux it is a way better last years (since they opened the source, and migrated to MESA GL Stack)
I am sorry, but RTX 3050 will be replaced with RX 6750 XT :)
I am so tired of struggling with Nvidia GL stack which limit my experience in Linux.
from open-gpu-kernel-modules.
AMD has no nvenc tho lol their video sdk is a mess and quality is worse than ffmpeg ultrafast. Its like you needed a shovel but bought a ticket to the Caribbean instead and said f*** digging.
from open-gpu-kernel-modules.
Hi, as mentioned in #378 we do know the root cause of this issue, but we can't give an ETA just yet, sorry. It's being actively looked it.
from open-gpu-kernel-modules.
@sandikata wouldn't it be better to let it open and solve this issue, you can at least unsubscribe if you are not interested in a real solution. Buying a new card is no option for me.
I'd argue it's best to keep this open as well in the interest of getting it fixed - but -
as indicated by the lack of engagement from NVIDIAas NVIDIAdon't know what is causing the issue andcan't provide a timeframe - I think their preference is clearly for us to use the hack to get around this issue.Does the application of the hack not resolve the issue for you?
EDIT: Yes, bit of a snarky tone. The known-bad driver should be withdrawn, or at least very very clearly noted that it doesn't actually work. Spending a little over a week to find this issue with zero engagement from
NV
is not what I expect from them.
Well, keep waiting, I think you're naive if you think they will fix something for free. This is Nvidia, if they're going to do something it will be for an amount of money. If driver is the same (it is really) they for sure know the root of this issue, but not interested on fixing it for free.
And even more, Nvidia on Linux do not have any acceleration as it doesn't support proper va-api. So i'ts not only one point.
PS: even unsubscribe doesn't help and still receiving email's for this closed and unsubscribed ticket.
from open-gpu-kernel-modules.
Well, keep waiting, I think you're naive if you think they will fix something for free. This is Nvidia, if they're going to do something it will be for an amount of money. If driver is the same (it is really) they for sure know the root of this issue, but not interested on fixing it for free.
And even more, Nvidia on Linux do not have any acceleration as it doesn't support proper va-api. So i'ts not only one point.
PS: even unsubscribe doesn't help and still receiving email's for this closed and unsubscribed ticket.
-
This isn't NVIDIA, this is NVIDIA open, as in open source. Yes NV is a shitty company in many ways but the Linux and FOSS community is able to fix issues here.
-
Again incorrect, nvidia-vaapi-driver is a thing and with a bit of reading its more than possible to get NVDEC support working in userspace. My laptop right now is using NVDEC to decode videos in Firefox on Wayland.
from open-gpu-kernel-modules.
@Dungeonseeker I think you've missed that it's already fixed... 😃
from open-gpu-kernel-modules.
Hello and thank you for your interest!
The opensource kernel driver has the same support for nvenc and tools that use it as the proprietary variant. The firmware/microcode that runs on the nvdec microcontrollers is not published, however.
I hope this answers your question, but please elaborate if this is not what you meant. Thanks.
from open-gpu-kernel-modules.
I do believe that is a bug and we will investigate. Thank you for the report!
from open-gpu-kernel-modules.
Hi. I have the same problem with the GTX 1650 (TU117).
[h264_nvenc @ 0x5594ec491d40] OpenEncodeSessionEx failed: out of memory (10): (no details)
[h264_nvenc @ 0x5594ec491d40] No capable devices found
warning: [NVENC encoder: 'recording_h264'] Failed to open NVENC codec: Generic error in an external library
I have not had such a problem with the version 510 driver in the past.
OBS Log:
OBS.log
dmesg:
dmesg_nvidia.log
from open-gpu-kernel-modules.
Hello and thank you for your interest!
The opensource kernel driver has the same support for nvenc and tools that use it as the proprietary variant. The firmware/microcode that runs on the nvdec microcontrollers is not published, however.
I hope this answers your question, but please elaborate if this is not what you meant. Thanks.
Does this mean nvenc is currently unavailable?
from open-gpu-kernel-modules.
Internal bug 3661377 for tracking
from open-gpu-kernel-modules.
It's a little off-topic, but…
Is someone suffering a choppy record with nvenc? Even on 120fps ?
It is always choppy (not smooth as expected, but choppy like 20fps).
from open-gpu-kernel-modules.
still not fixed in 515.48
from open-gpu-kernel-modules.
from open-gpu-kernel-modules.
@sandikata, or others still seeing this issue in the latest driver drops - could you please capture a log file (sudo nvidia-bug-report.sh) and include the output with NVreg_RmMsg=":"?
This issue should have been resolved in 515.48.07 driver version. Thanks.
from open-gpu-kernel-modules.
@PAR2020 Hello. The issue still exists in the latest nvidia-open
drivers from the archlinux
repo (Driver Version: 515.57
).
Obs-studio:
[hevc_nvenc @ 0x55ae6c8aa600] OpenEncodeSessionEx failed: out of memory (10): (no details)
[hevc_nvenc @ 0x55ae6c8aa600] No capable devices found
from open-gpu-kernel-modules.
I have tested the 515.65.01-4
on arch and seems the issue is gone, but maybe I am wrong.
from open-gpu-kernel-modules.
Hello. I face the same issue on a 3080 TI.
driver version:
> pacman -Q | grep nvidia
nvidia-open 515.65.01-7
obs log:
15:27:55.216: [NVENC encoder: 'simple_h264_recording'] Failed to open NVENC codec: Generic error in an external library
15:27:56.702: ---------------------------------
15:27:56.702: [NVENC encoder: 'simple_h264_recording'] settings:
15:27:56.702: rate_control: CQP
15:27:56.702: bitrate: 0
15:27:56.702: cqp: 21
15:27:56.702: keyint: 250
15:27:56.702: preset: hq
15:27:56.702: profile: high
15:27:56.702: width: 1276
15:27:56.702: height: 720
15:27:56.702: 2-pass: false
15:27:56.702: b-frames: 2
15:27:56.702: psycho-aq: 1
15:27:56.702: GPU: 0
output of sudo NVreg_RmMsg=":" nvidia-bug-report.sh
is attached
nvidia-bug-report.log.gz
If desired, I'll start the X server with startx -- -logverbose 6
for more details. Just let me know
from open-gpu-kernel-modules.
I found that applying this patch fixes the error. The drivers may be thinking that the maximum amount of nvenc streams are constantly being used, hence nvenc being unavailable.
Tested on
Arch 5.17.9
Driver version:515.43.04-7
&520.61.05
GPU chip:GA106
Note
520.61.05 can not yet be patched. Do not upgrade until there is a patch. 520 does not resolve the issue.
I've lost a little over a week to this. Thank you @GamingLiamStudios
This was the only way I could get this working.
- esxi 7
- Ubuntu 22.04.1 LTS
# Ensure secure boot is disabled - ie mokutil --disable-validation
sudo apt -y install build-essential curl dkms
echo blacklist nouveau | sudo tee /etc/modprobe.d/blacklist-nvidia-nouveau.conf
echo options nouveau modeset=0 | sudo tee -a /etc/modprobe.d/blacklist-nvidia-nouveau.conf
echo options nvidia NVreg_OpenRmEnableUnsupportedGpus=1 | sudo tee /etc/modprobe.d/nvidia-unsupported-gpu.conf
sudo update-initramfs -u
sudo /sbin/reboot
git clone https://github.com/keylase/nvidia-patch.git
curl -LO https://developer.download.nvidia.com/compute/cuda/11.7.1/local_installers/cuda_11.7.1_515.65.01_linux.run
chmod +x ./cuda_11.7.1_515.65.01_linux.run
sudo ./cuda_11.7.1_515.65.01_linux.run -m=kernel-open
sudo /sbin/reboot
# Confirm working
nvidia-smi
cd nvidia-patch
sudo bash ./patch.sh
sudo /sbin/reboot
# Check can encode now
curl -LO http://commondatastorage.googleapis.com/gtv-videos-bucket/sample/BigBuckBunny.mp4
ffmpeg -i BigBuckBunny.mp4 -c:v h264_nvenc output.mp4
from open-gpu-kernel-modules.
I could say I've fixed all this issues by replacing Nvidia with AMD (I was happy with AMD last 10 years).
I can't get an AMD card unless I use getting into uni or this new year as an excuse :(
My next card would definitely be AMD, though, considering how perfectly AMD seems to work with Linux.
from open-gpu-kernel-modules.
I can also confirm that NVENC is not working in combination with the Nvidia Container Runtime.
Seems like the Nvidia Open Source driver is not allowed to use NVENC in conjunction with the libraries from the proprietary driver.
from open-gpu-kernel-modules.
@sandikata wouldn't it be better to let it open and solve this issue, you can at least unsubscribe if you are not interested in a real solution.
Buying a new card is no option for me.
from open-gpu-kernel-modules.
I thought this got fixed or smth. This is pretty rude considering the reason imo.
from open-gpu-kernel-modules.
@sandikata wouldn't it be better to let it open and solve this issue, you can at least unsubscribe if you are not interested in a real solution. Buying a new card is no option for me.
I'd argue it's best to keep this open as well in the interest of getting it fixed - but - as indicated by the lack of engagement from NVIDIA as NVIDIA don't know what is causing the issue and can't provide a timeframe - I think their preference is clearly for us to use the hack to get around this issue.
Does the application of the hack not resolve the issue for you?
EDIT: Yes, bit of a snarky tone. The known-bad driver should be withdrawn, or at least very very clearly noted that it doesn't actually work. Spending a little over a week to find this issue with zero engagement from NV
is not what I expect from them.
from open-gpu-kernel-modules.
Related Issues (20)
- linux 6.8. - NOT a release candidate - Module is failing to build - 390xx HOT 1
- kernel memory not released
- unconditional use of hmm_pfn_to_page() which was introduced in Linux 5.8
- Dynamic boost can't be enabled HOT 5
- Atomic commit fails if IN_FENCE_FD is set HOT 2
- PCI PM fails with modesetting enabled HOT 1
- 550.76 release missing HOT 1
- Failed to get power and usage information in nvidia-smi HOT 7
- nvidia_ioctl frequent dynamic memory allocation HOT 1
- NVRM: krcWatchdogCallbackVblankRecovery_IMPL: NVRM-RC: RM has detected that 7 Seconds without a Vblank Counter Update on head:D0 HOT 3
- using Clang to build and go error HOT 3
- Can't build on fedora 40 HOT 6
- -gcc-sections required
- Failed to display anything when early KMS HOT 1
- nvidia-drm Direct firmware load for nvidia/550.76/gsp_ga10x.bin failed with error -2 HOT 2
- [555.42.02] D3cold on Turing Mobile not working with kernel 6.9.2. Works with closed driver. HOT 3
- soc_isr_lock is missing a NV_SPIN_LOCK_INIT HOT 2
- `follow_pfn()` is removed from kernel HOT 7
- Chromium GPU Process Cannot Start HOT 4
- Black screen on 555.x.x, Gnome 46, Fedora 40
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from open-gpu-kernel-modules.