From e96b496a33778c9cecdc690e423f78fb5a0fca07 Mon Sep 17 00:00:00 2001 From: "github-actions[bot]" Date: Wed, 24 Jul 2024 01:14:15 +0000 Subject: [PATCH] Update README.md --- README.md | 26 +++++++++++++------------- 1 file changed, 13 insertions(+), 13 deletions(-) diff --git a/README.md b/README.md index 8c83f566..5f94e3f3 100644 --- a/README.md +++ b/README.md @@ -14,6 +14,7 @@ openllm repo add nightly https://github.com/bentoml/openllm-models@nightly ### Table of Contents +- [Llama-3.1](#llama3.1) - [Llama-3](#llama3) - [Phi-3](#phi3) - [Mistral](#mistral) @@ -21,7 +22,18 @@ openllm repo add nightly https://github.com/bentoml/openllm-models@nightly - [Gemma](#gemma) - [Llama-2](#llama2) - [Mixtral](#mixtral) -- [](#llama3.1) + +--- + + +### Llama-3.1 + +| Model | Version | Huggingface Link | +| --- | --- | --- | +| llama3.1 | 70b-instruct-awq-4bit-f55b | [HF Link](https://huggingface.co/hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4) | +| llama3.1 | 70b-instruct-fp16-2eb2 | [HF Link](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | +| llama3.1 | 8b-instruct-awq-4bit-f737 | [HF Link](https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4) | +| llama3.1 | 8b-instruct-fp16-6d7b | [HF Link](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) | --- @@ -105,15 +117,3 @@ openllm repo add nightly https://github.com/bentoml/openllm-models@nightly --- - -### - -| Model | Version | Huggingface Link | -| --- | --- | --- | -| llama3.1 | 70b-instruct-awq-4bit-f55b | [HF Link](https://huggingface.co/hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4) | -| llama3.1 | 70b-instruct-fp16-2eb2 | [HF Link](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | -| llama3.1 | 8b-instruct-awq-4bit-f737 | [HF Link](https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4) | -| llama3.1 | 8b-instruct-fp16-6d7b | [HF Link](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) | - ---- -