Hello, Thank you for this great library! Is there any way we can

Try running the model with 8-bit quantization: <div class="highlight highlight-sou

Hi, The Whisper tranion loop already handles long files using

Thank you. So is it normal that the tranion time is considerably long for long f

Inference on long files about faster-whisper HOT 8 CLOSED

systran commented on July 22, 2024

Inference on long files

from faster-whisper.

Comments (8)

guillaumekln commented on July 22, 2024 1

Yes, the transcription time depends on the audio file duration. Long files will take longer.

from faster-whisper.

guillaumekln commented on July 22, 2024 1

Try running the model with 8-bit quantization:

model = WhisperModel(model_path, device="cuda", compute_type="int8")

from faster-whisper.

guillaumekln commented on July 22, 2024

Hi,

The Whisper transcription loop already handles long files using a sliding 30-second window while keeping the context. So you don't need to do anything to transcribe long files.

from faster-whisper.

databill86 commented on July 22, 2024

Thank you. So is it normal that the transcription time is considerably long for long files ?

from faster-whisper.

databill86 commented on July 22, 2024

Sorry I closed and reopened the issue. I just have one last thing about the longer files.
If we use the "gpu" as a device, is there any way we can avoid OOM for these longer files ?

from faster-whisper.

guillaumekln commented on July 22, 2024

What is your GPU and what model size are you running?

from faster-whisper.

databill86 commented on July 22, 2024

It's a NVIDIA GeForce GTX 1070 Ti 8Go, I was running the large-v2 model on a 18min file. But even with 4min file I have OOM.

from faster-whisper.

databill86 commented on July 22, 2024

Wow, just like that! it's a lot faster, and no OOM!!!
Thank you!
I will close the issue now for good :)

from faster-whisper.

Recommend Projects

Inference on long files about faster-whisper HOT 8 CLOSED

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent