Add Llama3.2 1B HTP to benchmark #7398
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7398
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures as of commit 2c5f9dc with merge base b2a680b.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Force-pushed from 32a44a8 to e15fd50.
Force-pushed from e15fd50 to fc3ccb4.
The HTP path is expected to take much longer due to calibration, so the timeout threshold needs to be bumped up. There is no harm in bumping it up for ALL configs, since most finish within 30 minutes anyway.
Force-pushed from 3287b18 to 0565c7e.
Force-pushed from 0565c7e to 361ca85.
  docker-image: executorch-ubuntu-22.04-qnn-sdk
  submodules: 'true'
- timeout: 60
+ timeout: 240
Setting it to 120 still timed out.
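For context, here is a minimal sketch of where this timeout sits in the benchmark workflow; only the docker-image, submodules, and timeout lines come from the diff above, while the job name and the reusable-workflow reference are assumptions for illustration, not the actual file.

```yaml
# Hypothetical excerpt of the benchmark workflow.
# Only docker-image, submodules, and timeout are taken from the diff above;
# the job name and the reusable workflow it calls are assumed.
jobs:
  export-models:
    uses: pytorch/test-infra/.github/workflows/linux_job.yml@main  # assumed
    with:
      docker-image: executorch-ubuntu-22.04-qnn-sdk
      submodules: 'true'
      timeout: 240  # was 60; 120 still timed out because HTP calibration is slow
```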
@@ -132,10 +132,10 @@ jobs:
  matrix: ${{ fromJson(needs.set-parameters.outputs.benchmark_configs) }}
  fail-fast: false
  with:
- runner: linux.2xlarge.memory
+ runner: linux.4xlarge.memory
Locally on a devserver, quantization takes about 1204 seconds ("INFO:root:Time for quantizing: 1203.9422521591187"), but on CI it is roughly 4x slower, so bump up to the 4x runner.
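That works out to roughly 20 minutes locally, so at about 4x slower the CI run is on the order of 80 minutes, which is consistent with both the 60- and 120-minute timeouts being exceeded. Below is a minimal sketch of the runner change in context; the matrix, fail-fast, and runner lines come from the diff, while the job name and reusable-workflow reference are assumptions.

```yaml
# Hypothetical job excerpt. matrix, fail-fast, and runner come from the diff;
# the job name and the reusable workflow reference are placeholders.
jobs:
  benchmark:
    needs: set-parameters
    strategy:
      matrix: ${{ fromJson(needs.set-parameters.outputs.benchmark_configs) }}
      fail-fast: false
    uses: pytorch/test-infra/.github/workflows/linux_job.yml@main  # assumed
    with:
      runner: linux.4xlarge.memory  # was linux.2xlarge.memory; quantization is ~4x slower on CI
```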
Force-pushed from 361ca85 to 88a9267.
Force-pushed from 88a9267 to 2c5f9dc.
Llama3.2 QNN HTP: https://github.com/pytorch/executorch/actions/runs/12426136559/job/34693953714