Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

downlaod_lexicon.py #41

Open
HHAOi opened this issue Dec 6, 2024 · 4 comments
Open

downlaod_lexicon.py #41

HHAOi opened this issue Dec 6, 2024 · 4 comments

Comments

@HHAOi
Copy link

HHAOi commented Dec 6, 2024

Hello, sorry to bother you. I have some questions. I've downloaded the signsuisse dataset online, but I don't know how to change download_lexicon.py. Hope you can give me a hint, I would appreciate it!

@AmitMY
Copy link
Collaborator

AmitMY commented Dec 6, 2024

What are you trying to do? download_lexicon allows you to download the signsuisse lexicon.
Once you have it, you can for example run:

text_to_gloss_to_pose \
  --text "Kleine Kinder essen Pizza in Zürich." \
  --glosser "simple" \
  --lexicon "<directory to lexicon>" \
  --spoken-language "de" \
  --signed-language "sgg" \
  --pose "quick_test.pose"

@HHAOi
Copy link
Author

HHAOi commented Dec 6, 2024

What I mean is that when I run download_lexicon.py, a network connection error is reported, which causes the dataset to fail to download (because I am in China), so I downloaded the signsuisse dataset on Google Datasets. Now I want to run download_lexicon.py to get index.csv.

@AmitMY
Copy link
Collaborator

AmitMY commented Dec 6, 2024

Ah! China is not a use case I thought about.

For now, I assume you downloaded from https://www.swissubase.ch/en/catalogue/studies/20452/latest/datasets/2327/2685/files

You'll have to unpack the mediapipe files, and then use metadata_train.csv to create a csv smilar to the dummy lexicon https://github.com/sign-language-processing/spoken-to-signed-translation/blob/main/assets/dummy_lexicon/index.csv

Sorry you are facing these issues, I will consider hosting the database on Cloudflare R2 instead of GCP in order to make it available from china. (or you could use a VPN)

@HHAOi
Copy link
Author

HHAOi commented Dec 6, 2024

Thanks,I'll try it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants