yuanlehome

💭

coding

Yuanle Liu yuanlehome

💭

coding

19 followers · 9 following

Achievements

x2 x3

Achievements

x2 x3

Stars

zhihu / ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 391 30 Updated Dec 16, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 1,601 160 Updated Dec 20, 2024

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 28,802 2,815 Updated Dec 22, 2024

jarro2783 / cxxopts

Lightweight C++ command line option parser

C++ 4,275 591 Updated Dec 21, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

30,601 1,674 Updated Aug 1, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,290 4,921 Updated Dec 22, 2024

lizongying / my-tv

我的电视电视直播软件，安装即可使用

C 30,947 3,469 Updated Jun 20, 2024

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,705 216 Updated Dec 22, 2024

Adlik / Adlik

Adlik: Toolkit for Accelerating Deep Learning Inference

C++ 794 83 Updated Dec 27, 2023

angular / angular

Deliver web apps with confidence 🚀

TypeScript 96,507 25,640 Updated Dec 21, 2024

networkx / networkx

Network Analysis in Python

Python 15,160 3,276 Updated Dec 21, 2024

open-source-parsers / jsoncpp

A C++ library for interacting with JSON.

C++ 8,230 2,652 Updated Dec 5, 2024

p-ranav / argparse

Argument Parser for Modern C++

C++ 2,804 256 Updated Nov 20, 2024

meta-llama / codellama

Inference code for CodeLlama models

Python 16,104 1,878 Updated Aug 12, 2024

rogersce / cnpy

library to read/write .npy and .npz files in C/C++

C++ 1,345 306 Updated Jan 18, 2023

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,941 1,032 Updated Dec 17, 2024

PaddlePaddle / ERNIE-SDK

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

Jupyter Notebook 352 52 Updated Aug 20, 2024

MeouSker77 / Cpp17

本书为《C++17 the complete guide》的个人中文翻译，仅供学习和交流使用，侵删

TeX 1,620 257 Updated Sep 22, 2024

Oneflow-Inc / oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 6,227 699 Updated Dec 20, 2024

PaddlePaddle / PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python 12,239 2,961 Updated Dec 20, 2024

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 6,435 639 Updated Dec 18, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,593 27,349 Updated Dec 21, 2024

microsoft / torchscale

Foundation Architecture for (M)LLMs

Python 3,037 211 Updated Apr 11, 2024

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 1,754 145 Updated Dec 20, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ 13,751 1,684 Updated Dec 22, 2024

NVIDIA / stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

C++ 1,678 170 Updated Dec 19, 2024

gpt-engineer-org / gpt-engineer

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,695 6,849 Updated Nov 17, 2024

ChunelFeng / CGraph

【A common used C++ DAG framework】一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流

C++ 1,828 329 Updated Dec 17, 2024

pixelmatix / SmartMatrix

SmartMatrix Library for Teensy 3, Teensy 4, and ESP32

C++ 641 165 Updated Jan 10, 2024

PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

C++ 22,364 5,635 Updated Dec 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuanle Liu yuanlehome

Achievements

Achievements

Block or report yuanlehome

Stars

zhihu / ZhiLight

flashinfer-ai / flashinfer

lutzroeder / netron

jarro2783 / cxxopts

karpathy / LLM101n

vllm-project / vllm

lizongying / my-tv

ModelTC / lightllm

Adlik / Adlik

angular / angular

networkx / networkx

open-source-parsers / jsoncpp

p-ranav / argparse

meta-llama / codellama

rogersce / cnpy

NVIDIA / TensorRT-LLM

PaddlePaddle / ERNIE-SDK

MeouSker77 / Cpp17

Oneflow-Inc / oneflow

PaddlePaddle / PaddleNLP

bitsandbytes-foundation / bitsandbytes

huggingface / transformers

microsoft / torchscale

BBuf / how-to-optim-algorithm-in-cuda

triton-lang / triton

NVIDIA / stdexec

gpt-engineer-org / gpt-engineer

ChunelFeng / CGraph

pixelmatix / SmartMatrix

PaddlePaddle / Paddle