Replies: 5 comments 2 replies
-
Hi.
-
I read somewhere that to fine-tune for this task you can follow this guide on fine-tuning Whisper, changing only the dataset and setting the task to translate instead of transcribe. I need to do the same, but I haven't tried it yet. My worry is the WER calculation: how can WER be computed for a translation task? There are many cases where the predicted text has the same meaning as the reference text but uses different words, and then the WER will be misleading. Did you manage it?
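To make the "set translation as the task" part concrete, here is a rough sketch of what changes in the training target. It assumes Whisper's usual special-token layout (start token, audio language token, task token, no-timestamps token). The helper `build_target`, the field names, and the sample sentence are made up for illustration. In Hugging Face transformers the prefix is normally set for you by passing `task="translate"` to the Whisper tokenizer or processor, so treat this as a picture of the idea, not as the guide's code.

```python
# Sketch only: shows how the decoder target differs between the two tasks.
# build_target and the example fields are hypothetical names, not a library API.

def build_target(text: str, audio_lang: str, task: str) -> str:
    """Return the token string the decoder would be trained to produce."""
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unknown task: {task}")
    # The language token names the language spoken in the audio,
    # even when the target text is English.
    prefix = f"<|startoftranscript|><|{audio_lang}|><|{task}|><|notimestamps|>"
    return f"{prefix}{text}<|endoftext|>"


# One made-up training example: Hindi audio with both kinds of label.
example = {
    "audio": "clip_0001.wav",          # hypothetical file name
    "hi_text": "मेरा नाम राहुल है",      # same-language transcript
    "en_text": "My name is Rahul",     # English translation
}

# Transcription fine-tuning: label is the Hindi transcript.
transcribe_target = build_target(example["hi_text"], "hi", "transcribe")

# Translation fine-tuning: same audio, but the label is the English text
# and the task token is <|translate|>.
translate_target = build_target(example["en_text"], "hi", "translate")

print(transcribe_target)
print(translate_target)
```

So the dataset needs (audio, English text) pairs rather than (audio, same-language text) pairs, and the task token switches from transcribe to translate.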
-
@emanueleielo WER isn't a good metric for evaluating translation. Use one of the translation metrics instead, such as BLEU, METEOR, or COMET. Hope that helps!
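A small illustration of why WER misleads here, using only the standard library. The sentence pair is made up, and `wer` and `bleu` are minimal single-reference sketches written for this example (no smoothing, no tokenization beyond whitespace splitting). For real evaluation you would likely want a maintained package such as sacrebleu instead of hand-rolled code.

```python
import math
from collections import Counter


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(reference: str, hypothesis: str, max_n: int = 4) -> float:
    """Unsmoothed single-reference BLEU with brevity penalty (sketch)."""
    ref, hyp = reference.split(), hypothesis.split()
    if not hyp:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = ngrams(hyp, n), ngrams(ref, n)
        total = sum(hyp_counts.values())
        # Clipped overlap: each n-gram counts at most as often as in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        if total == 0 or overlap == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Penalize hypotheses that are shorter than the reference.
    bp = 1.0 if len(hyp) > len(ref) else math.exp(1 - len(ref) / len(hyp))
    return bp * math.exp(sum(log_precisions) / max_n)


# Made-up pair: same meaning, different wording.
reference = "the doctor will see you tomorrow morning"
hypothesis = "the doctor is going to see you tomorrow morning"

print("WER :", round(wer(reference, hypothesis), 3))
print("BLEU:", round(bleu(reference, hypothesis, max_n=2), 3))
```

WER counts every reworded token as an error even though the translation is fine. BLEU is still a surface-overlap score, so it is not immune to paraphrase either, but it is the conventional baseline for translation and is usually reported over a whole test set. Model-based metrics like COMET try to score meaning more directly.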
-
Hi @AmgadHasan
-
Hi @AmgadHasan, can you share how you prepare custom data? For example, let's say the language pair is en->hi.
would be
right?
-
I have a domain-specific audio dataset in Hindi, and I want to improve my Whisper model's translation capabilities. Currently, it can take non-English audio input and translate it into English text.
I understand how to fine-tune Whisper for transcription, where the output text is in the same language as the audio, but I'm not sure how to fine-tune it for cross-lingual translation, where the audio is in another language and the goal is to improve translation into English. Could you provide guidance on how to fine-tune the model for this purpose, or share a repo?