Replies: 4 comments
-
|
Beta Was this translation helpful? Give feedback.
-
Thanks for your answers. I'd like to better understand the codebase before contributing.
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
Thanks! I like the project and I'll do my best to contribute. |
Beta Was this translation helpful? Give feedback.
-
Hi! I have a few questions about how fish speech works:
I understand that there are two steps to generate the audio: the semantic tokens and then the audio. Do you generate all the tokens and then only stream the audio or do you stream everything: text -> tokens -> audio ?
I took a look at the code and it looks like it could handle receiving a stream of text as an input but couldn't find an implementation of it. Is it currently possible?
If I need to generate a lot of text using the same reference audios, is there a way to process them only once to save time?
There currently doesn't seem to be a simple way to use fish speech by importing it as a python library. Would you be interested by a PR that implement it?
Beta Was this translation helpful? Give feedback.
All reactions