How to deploy Llava in triton? #913

Open
forrestjgq opened this issue Jan 19, 2024 · 6 comments

@forrestjgq

Hello:

Glad to see that Llava is supported now. We're trying to deploy it on Triton. How can we do that?

@byshiue
Collaborator

byshiue commented Jan 19, 2024

You can refer to the Triton backend documentation at https://github.com/triton-inference-server/tensorrtllm_backend/blob/main/docs/llama.md and replace the engine and tokenizer with the Llava ones, if you already have a Llava engine.
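Once a server set up along the lines of that guide is running, a text-only request could be sketched as below. This is a minimal sketch under stated assumptions, not a confirmed recipe for Llava: the model name `ensemble`, the port `8000`, and the `text_input` / `max_tokens` fields are assumed to match the setup in the llama guide and should be checked against your backend version. It does not pass an image.

```python
import json
import urllib.request


def build_generate_request(prompt, max_tokens=64, host="localhost:8000", model="ensemble"):
    """Build (but do not send) an HTTP POST for Triton's generate endpoint.

    Assumptions: the model name, host, and JSON field names below mirror the
    text-only llama setup; adjust them to your own model repository.
    """
    url = f"http://{host}/v2/models/{model}/generate"
    payload = {"text_input": prompt, "max_tokens": max_tokens}
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30.0):
    """Send the request and decode the JSON response (needs a running server)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Usage, with a server listening locally: `send(build_generate_request("Describe Triton in one sentence.", max_tokens=32))`.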

@byshiue byshiue self-assigned this Jan 19, 2024
@byshiue byshiue added question Further information is requested triaged Issue has been triaged by maintainers labels Jan 19, 2024
@isaac-vidas

Does tensorrt-llm-backend support multimodal models? Is there an example of passing a prompt and an image in a request?

@byshiue
Collaborator

byshiue commented Feb 19, 2024

There is no such example now.

@DefTruth

Same question. We need some docs on how to deploy a multimodal model (such as LLaVA) via the Triton server tensorrtllm_backend.

@Iven2132

@DefTruth Did you figure it out? I'm looking for the same thing.

@JeremySun1224

There is no such example now.

multimodal
