I have two s, where the large-v3 model hallucinates, for instance by making up t

Large-v3 model hallucinates, large-v2 doesn't (faster-whisper issue, open, 8 comments)

Arche151 commented on May 24, 2024

Large-v3 model hallucinates, large-v2 doesn't

from faster-whisper.

Comments (8)

Purfview commented on May 24, 2024

> Large-v3 model hallucinates, large-v2 doesn't

It's known that large-v3 hallucinates much more than large-v2; read here:
Whisper-v3 Hallucinations on Real World Data

trungkienbkhn commented on May 24, 2024

@Arche151, could you try again with compute_type="default" (or remove this argument when initializing the Whisper model)?
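
For anyone following along, the suggestion could look roughly like the sketch below. It assumes the model is created through faster-whisper's `WhisperModel` class; the helpers `model_kwargs` and `load_model` are hypothetical names made up for illustration, and the original script is not shown in this thread, so the `compute_type` it used before is unknown.

```python
def model_kwargs(compute_type=None):
    """Build keyword arguments for WhisperModel (hypothetical helper).

    Passing None leaves compute_type out entirely, so the library falls
    back to its own default, which is compute_type="default".
    """
    kwargs = {"device": "auto"}
    if compute_type is not None:
        kwargs["compute_type"] = compute_type
    return kwargs


def load_model(name="large-v3", compute_type=None):
    # Imported lazily so model_kwargs() works without faster-whisper installed.
    from faster_whisper import WhisperModel

    return WhisperModel(name, **model_kwargs(compute_type))
```

Calling `load_model("large-v3", compute_type="default")` or simply `load_model("large-v3")` should both end up with the default compute type, which is the comparison being asked for here.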

Arche151 commented on May 24, 2024

> @Arche151, could you try again with compute_type="default" (or remove this argument when initializing the Whisper model)?

Thanks for the quick reply and suggestion!

I'll try that and report back.

Arche151 commented on May 24, 2024

> Large-v3 model hallucinates, large-v2 doesn't
>
> It's known that large-v3 hallucinates much more than large-v2; read here: Whisper-v3 Hallucinations on Real World Data

Damn, that sucks hard. In that case, there's ofc nothing that faster-whisper can change about that. Then I guess I'll stay with large-v2.

Thanks for linking the article!

Purfview commented on May 24, 2024

> Then I guess I'll stay with large-v3.

Did you mean "large-v2"?

In my Standalone Faster-Whisper I've added auto-offsets to Whisper's pseudo-VAD thresholds when "v3" is in use. You can try these parameters when using large-v3:

compression_ratio_threshold=2.2
log_prob_threshold=-0.7
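
As a rough sketch of how these two values could be passed through faster-whisper's Python API (this is not the Standalone Faster-Whisper implementation): both are keyword arguments of `WhisperModel.transcribe`, whose defaults are 2.4 and -1.0. The names `V3_THRESHOLDS`, `transcribe_options` and `transcribe_file` are hypothetical and exist only for this illustration.

```python
# Thresholds suggested above for large-v3. faster-whisper's own defaults
# are compression_ratio_threshold=2.4 and log_prob_threshold=-1.0.
V3_THRESHOLDS = {
    "compression_ratio_threshold": 2.2,
    "log_prob_threshold": -0.7,
}


def transcribe_options(model_name):
    """Return extra transcribe() options; only "v3" models get the offsets."""
    return dict(V3_THRESHOLDS) if "v3" in model_name else {}


def transcribe_file(audio_path, model_name="large-v3"):
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_name)
    segments, info = model.transcribe(audio_path, **transcribe_options(model_name))
    # segments is a generator; joining the text forces the transcription to run.
    return "".join(segment.text for segment in segments)
```

Both values are stricter than the defaults: a segment counts as failed sooner (compression ratio above 2.2, or average log probability below -0.7), so the decoder retries it at a higher temperature more often.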

terryops commented on May 24, 2024

> Then I guess I'll stay with large-v3.
>
> Did you mean "large-v2"?
>
> In my Standalone Faster-Whisper I've added auto-offsets to Whisper's pseudo-VAD thresholds when "v3" is in use. You can try these parameters when using large-v3:
>
> compression_ratio_threshold=2.2
> log_prob_threshold=-0.7

Does it yield better results than large-v2 using your parameters with large-v3?

Arche151 commented on May 24, 2024

> > Then I guess I'll stay with large-v3.
> >
> > Did you mean "large-v2"?
> > In my Standalone Faster-Whisper I've added auto-offsets to Whisper's pseudo-VAD thresholds when "v3" is in use. You can try these parameters when using large-v3:
> > compression_ratio_threshold=2.2
> > log_prob_threshold=-0.7
>
> Does it yield better results than large-v2 using your parameters with large-v3?

I didn't try it for long enough to be able to say. I just went back to large-v2 after reading the Deepgram article.

Purfview commented on May 24, 2024

> Does it yield better results than large-v2 using your parameters with large-v3?

You tell me, as I don't use large-v3. IMO large-v2 is better.
