Necessary libraries:
# install python libs
$ python3 -m pip install -r requirements.txt
# install cmake and sndfile lib
$ sudo apt install libsndfile1 cmake
(Optional) If you want to prepare the dataset yourself, the Montreal Forced Aligner (MFA) must be installed. (Errors may occur during installation; watch the output carefully.)
$ bash scripts/install_mkl.sh
$ bash scripts/install_kaldi.sh
$ bash scripts/install_mfa.sh
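Before moving on, it can help to confirm the required tools ended up on your PATH. A minimal sketch (the command names `cmake` and `mfa` are assumptions; adjust them to your setup):

```python
import shutil

def missing_tools(tools):
    """Return the subset of command names not found on PATH."""
    return [t for t in tools if shutil.which(t) is None]

if __name__ == "__main__":
    # cmake is always required; mfa only if you prepare the dataset yourself
    for tool in missing_tools(["cmake", "mfa"]):
        print(f"warning: '{tool}' not found on PATH")
```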
Download the pretrained model from Google Drive, unzip it, and put it in ./pretrained_models/dgrad.
Modify and run the evaluation script: bash evaluate.sh
Download VOCASET from https://voca.is.tue.mpg.de/ and unzip the following directories:
| VOCASET
-| unposedcleaneddata
-| sentencestext
-| templates
-| audio
Run the preload Python script:
python3 -m saberspeech.datasets.voca.preload \
--source_root <ROOT_VOCASET> \
--output_root <ROOT_PROCESSED>
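Equivalently, the preload step can be driven from Python, e.g. when scripting over several roots. A sketch (the paths are placeholders; only the module name and flags come from the command above):

```python
import subprocess
import sys

def build_preload_cmd(source_root, output_root):
    """Assemble the preload invocation shown above as an argument list."""
    return [
        sys.executable, "-m", "saberspeech.datasets.voca.preload",
        "--source_root", str(source_root),
        "--output_root", str(output_root),
    ]

if __name__ == "__main__":
    cmd = build_preload_cmd("/data/VOCASET", "/data/processed")
    print(" ".join(cmd))
    # to actually run it: subprocess.run(cmd, check=True)
```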
The preload script produces:
- dgrad
- offsets
- PCA of dgrad and offsets
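The PCA step can be illustrated with plain NumPy. This is a sketch of fitting a linear basis to per-frame feature vectors (offsets or flattened dgrad); the shapes and component count are illustrative, not the repo's actual settings:

```python
import numpy as np

def fit_pca(X, n_components):
    """Fit PCA via SVD of the centered data matrix X (frames x features).

    Returns the mean and the top principal directions (components x features).
    """
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(X, mean, components):
    """Encode frames as low-dimensional PCA coefficients."""
    return (X - mean) @ components.T

def reconstruct(coeffs, mean, components):
    """Decode PCA coefficients back to full feature vectors."""
    return coeffs @ components + mean

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    offsets = rng.standard_normal((100, 30))  # 100 frames, 30-dim features
    mean, comps = fit_pca(offsets, n_components=10)
    coeffs = project(offsets, mean, comps)
    print(coeffs.shape)  # (100, 10)
```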
If you find this work useful, please cite:

@article{chai2022speech,
title={Speech-driven facial animation with spectral gathering and temporal attention},
author={Chai, Yujin and Weng, Yanlin and Wang, Lvdi and Zhou, Kun},
journal={Frontiers of Computer Science},
volume={16},
number={3},
pages={1--10},
year={2022},
publisher={Springer}
}