Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

task: Use VieTTS if output language is Vietnamese #123

Open
4 tasks
Tracked by #116
hahuyhoang411 opened this issue Nov 19, 2024 · 2 comments
Open
4 tasks
Tracked by #116

task: Use VieTTS if output language is Vietnamese #123

hahuyhoang411 opened this issue Nov 19, 2024 · 2 comments
Assignees
Labels
P2: nice to have Nice to have feature
Milestone

Comments

@hahuyhoang411
Copy link
Contributor

hahuyhoang411 commented Nov 19, 2024

Goal

Making the vieTTS for the learning then use this to expand on multilingual TTS

Tasklist

  • Chose architecture (MaskGCT, F5, etc)
  • Setup training pipeline
  • Experiment with model hyperparameters
  • Draft research report
@hahuyhoang411 hahuyhoang411 added the P2: nice to have Nice to have feature label Nov 19, 2024
@hahuyhoang411 hahuyhoang411 changed the title VietTTS (Issue: ) task: VieTTS Nov 19, 2024
@bachvudinh bachvudinh self-assigned this Nov 20, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Jan & Cortex Nov 22, 2024
@tikikun tikikun moved this from Investigating to In Progress in Jan & Cortex Nov 25, 2024
@dan-homebrew
Copy link
Contributor

dan-homebrew commented Nov 27, 2024

Discussion

  • How will we detect outputs in Vietnamese, and switch to VietTTS?
  • Future architecture: Ichigo will integrated speech decoder (i.e. built-in TTS)

@dan-homebrew dan-homebrew changed the title task: VieTTS task: Use VieTTS if output language is Vietnamese Nov 27, 2024
@tikikun
Copy link
Collaborator

tikikun commented Nov 27, 2024

Most model that comes with Vietnamese language will have English in it as well. The most realistic solution to this is just having a toggle for multi-lingual capability and use the model that is having Vietnamese language vs just English (English quality will be down a bit but this is more realistic to achieve within this milestone).

I will deep dive more on owning over the TTS part with my f5-tts research.

@tikikun tikikun self-assigned this Nov 28, 2024
@hahuyhoang411 hahuyhoang411 moved this from In Progress to Scheduled in Jan & Cortex Dec 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P2: nice to have Nice to have feature
Projects
Status: Scheduled
Development

No branches or pull requests

4 participants