VideoGenHub

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

We define 2 prominent generation tasks (Text-to-Video and Image-to-Video).
We built a unified inference pipeline to ensure fair comparison. We currently support around 10 models.

📰 News

2024 Jun 10: GenAI-Arena Paper is out. It is featured on Huggingface Daily Papers.

🛠️ Installation 🔝

To install from pypi:

pip install videogen-hub

To install from github:

git clone https://github.com/TIGER-AI-Lab/VideoGenHub.git
cd VideoGenHub
cd env_cfg
pip install -r requirements.txt
cd ..
pip install -e .

The requirement of opensora is in env_cfg/opensora.txt

For some models like show one, you need to login through huggingface-cli.

huggingface-cli login

👨‍🏫 Get Started 🔝

Benchmarking

To reproduce our experiment using benchmark.

For text-to-video generation:

./t2v_inference.sh --<model_name> --<device>

Infering one model

import videogen_hub

model = videogen_hub.load('VideoCrafter2')
video = model.infer_one_video(prompt="A child excitedly swings on a rusty swing set, laughter filling the air.")

# Here video is a torch tensor of shape torch.Size([16, 3, 320, 512])

See Google Colab here: https://colab.research.google.com/drive/145UMsBOe5JLqZ2m0LKqvvqsyRJA1IeaE?usp=sharing

🧠 Philosophy 🔝

By streamlining research and collaboration, VideoGenHub plays a pivotal role in propelling the field of Video Generation.

Purity of Evaluation: We ensure a fair and consistent evaluation for all models, eliminating biases.
Research Roadmap: By defining tasks and curating datasets, we provide clear direction for researchers.
Open Collaboration: Our platform fosters the exchange and cooperation of related technologies, bringing together minds and innovations.

Implemented Models

We included more than 10 Models in video generation.

Method	Venue	Type
LaVie	-	Text-To-Video Generation
VideoCrafter2	-	Text-To-Video Generation
ModelScope	-	Text-To-Video Generation
StreamingT2V	-	Text-To-Video Generation
Show 1	-	Text-To-Video Generation
OpenSora	-	Text-To-Video Generation
OpenSora-Plan	-	Text-To-Video Generation
T2V-Turbo	-	Text-To-Video Generation
DynamiCrafter2	-	Image-To-Video Generation
SEINE	ICLR'24	Image-To-Video Generation
Consisti2v	-	Image-To_Video Generation
I2VGenXL	-	Image-To_Video Generation

TODO

Add ComfyUI Support
Add Metrics Support
Add Visualization Support (Similar to ImagenHub)
Add Video Editing Task

🎫 License 🔝

This project is released under the License.

🖊️ Citation 🔝

This work is a part of GenAI-Arena work.

Please kindly cite our paper if you use our code, data, models or results:

@misc{jiang2024genai,
      title={GenAI Arena: An Open Evaluation Platform for Generative Models}, 
      author={Dongfu Jiang and Max Ku and Tianle Li and Yuansheng Ni and Shizhuo Sun and Rongqi Fan and Wenhu Chen},
      year={2024},
      eprint={2406.04485},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Name		Name	Last commit message	Last commit date
Latest commit History 252 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
docs		docs
env_cfg		env_cfg
src/videogen_hub		src/videogen_hub
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build_pypi.sh		build_pypi.sh
conftest.py		conftest.py
correct_mp4.py		correct_mp4.py
download_models.sh		download_models.sh
load_models.py		load_models.py
pyproject.toml		pyproject.toml
setup.py		setup.py
t2v_inference.sh		t2v_inference.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VideoGenHub

📰 News

📄 Table of Contents

🛠️ Installation 🔝

👨‍🏫 Get Started 🔝

Benchmarking

Infering one model

🧠 Philosophy 🔝

Implemented Models

TODO

🎫 License 🔝

🖊️ Citation 🔝

About

Releases 2

Packages

Contributors 8

Languages

License

TIGER-AI-Lab/VideoGenHub

Folders and files

Latest commit

History

Repository files navigation

VideoGenHub

📰 News

📄 Table of Contents

🛠️ Installation 🔝

👨‍🏫 Get Started 🔝

Benchmarking

Infering one model

🧠 Philosophy 🔝

Implemented Models

TODO

🎫 License 🔝

🖊️ Citation 🔝

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 8

Languages

Packages