Skip to content

Update TRT-LLM example, achieve 28k tok/s (#991) #758

Update TRT-LLM example, achieve 28k tok/s (#991)

Update TRT-LLM example, achieve 28k tok/s (#991) #758

Workflow file for this run

name: Deploy
on:
push:
branches:
- main
workflow_dispatch:
jobs:
deploy:
name: Deploy example apps
if: github.ref == 'refs/heads/main'
runs-on: ubuntu-20.04
env:
MODAL_TOKEN_ID: ${{ secrets.MODAL_MODAL_LABS_TOKEN_ID }}
MODAL_TOKEN_SECRET: ${{ secrets.MODAL_MODAL_LABS_TOKEN_SECRET }}
MODAL_ENVIRONMENT: main
steps:
- uses: actions/checkout@v3
with:
fetch-depth: 1
- uses: ./.github/actions/setup
- name: Run deployment script
run: |
python3 internal/deploy.py --no-dry-run