SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
Baselines for the Zero Resource Speech Challenge using visually grounded models of spoken language, 2021 edition
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data. Accepted to ICASSP 2024, Self-supervision in Audio, Speech, and Beyond (SASB) workshop.
Library for training visually-grounded models of spoken language understanding.
Code for the paper "Textual supervision for visually grounded spoken language understanding".
Code used in my Master's thesis