-
TensorRT Public
Forked from NVIDIA/TensorRTNVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…
C++ Apache License 2.0 UpdatedSep 26, 2024 -
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 UpdatedSep 11, 2024 -
onnx-tensorrt Public
Forked from onnx/onnx-tensorrtONNX-TensorRT: TensorRT backend for ONNX
C++ Apache License 2.0 UpdatedApr 25, 2024 -
optimum Public
Forked from huggingface/optimum🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Python Apache License 2.0 UpdatedApr 18, 2024 -
MONAI Public
Forked from Project-MONAI/MONAIAI Toolkit for Healthcare Imaging
Python Apache License 2.0 UpdatedApr 13, 2024 -
NeMo Public
Forked from NVIDIA/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedJun 25, 2023 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
Python Apache License 2.0 UpdatedJun 20, 2023 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedJan 4, 2023