
Question about sign-language-processing/spoken-to-signed-translation/issues/26 #2

Closed
AmitMY opened this issue Mar 15, 2024 · 1 comment

Comments


AmitMY commented Mar 15, 2024

Hi, nice work.

If I may ask, which of the pose-to-video scripts do you use for the diffusion models?
Do they take the holistic bones as input? Or how do you know what the expected output of the diffusion model should be?

Originally posted by @florianbaer in sign-language-processing/spoken-to-signed-translation#26 (comment)


AmitMY commented Mar 15, 2024

The command used to generate the video outputs there is

```
pose_to_video --type=controlnet --model=sign/sd-controlnet-mediapipe --pose=assets/testing-reduced.pose --video=assets/outputs/controlnet-animatediff.mp4 --processors animatediff
```
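For illustration, here is a minimal sketch, not the project's actual pipeline, of how a ControlNet conditioned on rendered pose frames can drive Stable Diffusion via the diffusers library. The ControlNet id `sign/sd-controlnet-mediapipe` is taken from the command above; the base model, prompt, and input frame path are assumptions. The `--processors animatediff` flag in the command additionally applies a temporal-consistency step across frames, which this per-frame sketch omits.

```python
# Hedged sketch: generate one video frame from a rendered pose skeleton
# using a ControlNet-conditioned Stable Diffusion pipeline.
import torch
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
from PIL import Image

# ControlNet id from the command above; assumed to be a Hugging Face checkpoint.
controlnet = ControlNetModel.from_pretrained(
    "sign/sd-controlnet-mediapipe", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # assumed base model, not confirmed by the thread
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

# A single rendered holistic skeleton frame (hypothetical path).
skeleton = Image.open("holistic_frame.png")

frame = pipe(
    prompt="a person signing",  # hypothetical prompt
    image=skeleton,
    num_inference_steps=20,
).images[0]
frame.save("generated_frame.png")
```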

Yes, it is based on holistic (MediaPipe Holistic) pose input.
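If it helps, a minimal sketch, assuming the pose-format library, of reading a `.pose` file of holistic landmarks (the same format as `assets/testing-reduced.pose` in the command above) and rendering it to skeleton frames:

```python
# Hedged sketch: load a .pose file and render the holistic skeleton to video.
from pose_format import Pose
from pose_format.pose_visualizer import PoseVisualizer

with open("assets/testing-reduced.pose", "rb") as f:
    pose = Pose.read(f.read())  # MediaPipe Holistic keypoints

# Draw the skeleton frame by frame and write it out as a video;
# frames like these are what condition the ControlNet.
visualizer = PoseVisualizer(pose)
visualizer.save_video("skeleton.mp4", visualizer.draw())
```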

@AmitMY AmitMY changed the title Hi, nice work. Question about sign-language-processing/spoken-to-signed-translation/issues/26 Mar 15, 2024
@AmitMY AmitMY closed this as completed Mar 22, 2024