Replies: 6 comments 1 reply
-
same , the prompt is helpful when transcribe some technical terms ,but will cause specific hallucinations and repetitions . Is there any way to avoid this problem ? Thanks a lot ! |
Beta Was this translation helpful? Give feedback.
-
I am also experiencing them. How is deepgram able to handle it? |
Beta Was this translation helpful? Give feedback.
-
I face the same issue, anyone can help? Thanks. |
Beta Was this translation helpful? Give feedback.
-
Not sure about what others did but what I did was to postprocess the output with the following methods.
These methods would reduce some hallucination. |
Beta Was this translation helpful? Give feedback.
-
I split audio up to 2,000 chunks to transcribe around 2m or 3 min chunks so if there is a repeat it's time is limited. Gonna use large-v2-q8_0 instead of large-v3-turbo as it's less prone to hallucinate. Even though turbo is 5x realtime speed compared to 3x realtime speed with large-v2-q8_0. After stitch all 2,000 subs together into one sub then fix any overlapping timecodes with this code. Actually have it done automatically after each time I stitch them.
Then I process for proper nouns any names, countries, cities, etc. that weren't capitalized. Have a huge list it goes through. Then finally check for any repeating words.
|
Beta Was this translation helpful? Give feedback.
-
Hi,
I see a lot of repetitions and hallucinations in the output when i use prompt feature, could you any one figured how to overcome this behaviour?, please let me know.
Beta Was this translation helpful? Give feedback.
All reactions