
A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 391 30 Updated Dec 16, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,601 160 Updated Dec 20, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 28,802 2,815 Updated Dec 22, 2024

Lightweight C++ command line option parser

C++ 4,275 591 Updated Dec 21, 2024

LLM101n: Let's build a Storyteller

30,601 1,674 Updated Aug 1, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,290 4,921 Updated Dec 22, 2024

我的电视 (My TV): live TV streaming software, ready to use once installed

C 30,947 3,469 Updated Jun 20, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,705 216 Updated Dec 22, 2024

Adlik: Toolkit for Accelerating Deep Learning Inference

C++ 794 83 Updated Dec 27, 2023

Deliver web apps with confidence 🚀

TypeScript 96,507 25,640 Updated Dec 21, 2024

Network Analysis in Python

Python 15,160 3,276 Updated Dec 21, 2024

A C++ library for interacting with JSON.

C++ 8,230 2,652 Updated Dec 5, 2024

Argument Parser for Modern C++

C++ 2,804 256 Updated Nov 20, 2024

Inference code for CodeLlama models

Python 16,104 1,878 Updated Aug 12, 2024

library to read/write .npy and .npz files in C/C++

C++ 1,345 306 Updated Jan 18, 2023

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,941 1,032 Updated Dec 17, 2024

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

Jupyter Notebook 352 52 Updated Aug 20, 2024

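A personal Chinese translation of the book "C++17 the complete guide", for study and exchange only; it will be removed on request if it infringes copyright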

TeX 1,620 257 Updated Sep 22, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 6,227 699 Updated Dec 20, 2024

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python 12,239 2,961 Updated Dec 20, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,435 639 Updated Dec 18, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,593 27,349 Updated Dec 21, 2024

Foundation Architecture for (M)LLMs

Python 3,037 211 Updated Apr 11, 2024

How to optimize some algorithms in CUDA.

Cuda 1,754 145 Updated Dec 20, 2024

Development repository for the Triton language and compiler

C++ 13,751 1,684 Updated Dec 22, 2024

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

C++ 1,678 170 Updated Dec 19, 2024

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,695 6,849 Updated Nov 17, 2024

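A commonly used C++ DAG framework: a general-purpose, cross-platform parallel computing framework based on flow graphs, with no third-party dependencies, listed in awesome-cpp. Stars, forks, and discussion are welcome.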

C++ 1,828 329 Updated Dec 17, 2024

SmartMatrix Library for Teensy 3, Teensy 4, and ESP32

C++ 641 165 Updated Jan 10, 2024

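PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the PaddlePaddle core framework: high-performance single-machine and distributed training, plus cross-platform deployment, for deep learning and machine learning)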

C++ 22,364 5,635 Updated Dec 21, 2024