[Text-to-video] Update modality #857

NielsRogge · 2024-08-19T10:37:03Z

I noticed the "text-to-video" task is currently under computer vision; it should be multimodal.

pcuenca · 2024-08-19T12:01:05Z

text-to-image is currently CV as well. There have been various discussions about what "multimodal" should include; the latest update was done in this PR, where some comments were shared. cc @merveenoyan as well for opinions / clarifications.

osanseviero · 2024-08-19T12:01:07Z

packages/tasks/src/pipelines.ts

@@ -585,7 +585,7 @@ export const PIPELINE_DATA = {
 	},
 	"text-to-video": {
 		name: "Text-to-Video",
-		modality: "cv",
+		modality: "multimodal",


This was done in #477 as part of existing multimodal discussions in Slack (search for this PR). I'm happy to reconsider this specific case.

NielsRogge · Aug 19, 2024

For me, multimodal is anything that is not a single modality:

If only images are involved => cv

If only text is involved => nlp

If image + text, or any other combination of modalities (like text and video) => multimodal

osanseviero · Aug 19, 2024

Image generation and captioning involve both text and images. I think the discussion had converged on the cardinality of modalities of the input or output. Please refer to the existing discussions.
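
For reference, the rule of thumb proposed in this thread (only images => cv, only text => nlp, any combination => multimodal) can be sketched as a small TypeScript helper. This is an illustration only, not the convention the maintainers settled on: `classifyModality`, `Medium`, and `Modality` are hypothetical names that do not exist in `packages/tasks`, and treating video as a visual medium (so that video-only tasks land in cv) is an assumption.

```typescript
// Hypothetical helper illustrating the rule of thumb from this thread.
// Not part of packages/tasks; all names here are assumptions.
type Medium = "text" | "image" | "video";
type Modality = "cv" | "nlp" | "multimodal";

// Assumption: image and video both count as the visual modality.
const VISUAL: ReadonlySet<Medium> = new Set<Medium>(["image", "video"]);

function classifyModality(inputs: Medium[], outputs: Medium[]): Modality {
	// Pool every medium the task touches, on either side.
	const media: Medium[] = [...inputs, ...outputs];
	if (media.length === 0) {
		throw new Error("a task needs at least one input or output medium");
	}
	const hasText = media.some((m) => m === "text");
	const hasVisual = media.some((m) => VISUAL.has(m));
	// Text combined with anything visual counts as more than one modality.
	if (hasText && hasVisual) return "multimodal";
	return hasText ? "nlp" : "cv";
}
```

Under this sketch, `classifyModality(["text"], ["video"])` gives `"multimodal"`, which is the change this PR makes, and `classifyModality(["text"], ["image"])` would also give `"multimodal"`, which is where it departs from the current `cv` label for text-to-image.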

Update modality

609d1a1

NielsRogge requested review from osanseviero, SBrandeis, gary149, Wauplin, julien-c and pcuenca as code owners August 19, 2024 10:37

NielsRogge changed the title ~~[Text-to-vide] Update modality~~ [Text-to-video] Update modality Aug 19, 2024

osanseviero reviewed Aug 19, 2024

coyotte508 requested a review from ngxson as a code owner November 14, 2024 22:22
