Refactor pytest collection to anchor on test case files, not .mlir files. #282

Merged: 2 commits merged into nod-ai:main on Jul 11, 2024

Conversation

ScottTodd (Member):

This allows us to detect test cases that store their `.mlir` or `.mlirbc` file remotely, since the `test_data_flags.txt` or `test_cases.json` file will always exist locally. This also changes test case names in summaries from e.g.

```
PASSED onnx/node/generated/test_sub_uint8/model.mlir::cpu_llvm_sync_test
PASSED onnx/node/generated/test_sub_example/model.mlir::cpu_llvm_sync_test
PASSED onnx/node/generated/test_sub_bcast/model.mlir::cpu_llvm_sync_test

XFAIL pytorch/models/opt-125M/opt-125M.mlirbc::cpu_llvm_task_splats - Expected compilation to fail (included in 'expected_compile_failures')
```

to e.g.

```
PASSED onnx/node/generated/test_sub_example/test_data_flags.txt::model.mlir::cpu_llvm_sync
PASSED onnx/node/generated/test_sub_uint8/test_data_flags.txt::model.mlir::cpu_llvm_sync
PASSED onnx/node/generated/test_sub_bcast/test_data_flags.txt::model.mlir::cpu_llvm_sync

XFAIL pytorch/models/opt-125M/test_cases.json::opt-125M.mlirbc::cpu_llvm_task::splats - Expected compilation to fail (included in 'expected_compile_failures')
```

(Since names are now anchored on the `.txt` or `.json` file, the `.mlir` file name was added back into the test ID, and the redundant "test" suffix was dropped.)

This is a bit hacky either way. I took a look at https://docs.pytest.org/en/stable/example/customdirectory.html as an alternative to https://docs.pytest.org/en/stable/example/nonpython.html and that could help.
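
For context, a minimal sketch of what anchoring collection on `test_cases.json` can look like with pytest's non-Python collection hooks (class and field names below are illustrative assumptions, not the actual `conftest.py`):

```python
# Illustrative conftest.py sketch; names here are hypothetical, and the
# test_cases.json schema assumed is the one shown later in this thread.
import json
import pytest

TEST_CASES_FILE_NAME = "test_cases.json"


def pytest_collect_file(parent, file_path):
    # Anchor collection on test_cases.json, which always exists locally,
    # instead of on .mlir/.mlirbc files that may only exist remotely.
    if file_path.name == TEST_CASES_FILE_NAME:
        return TestCasesFile.from_parent(parent, path=file_path)


class TestCasesFile(pytest.File):
    def collect(self):
        data = json.loads(self.path.read_text())
        for case in data.get("test_cases", []):
            # Test IDs then embed the anchor file, e.g.
            # "test_cases.json::opt-125M.mlirbc::cpu_llvm_task::splats".
            yield TestCaseItem.from_parent(
                self, name=case["name"] or "default", case=case
            )


class TestCaseItem(pytest.Item):
    def __init__(self, *, case, **kwargs):
        super().__init__(**kwargs)
        self.case = case

    def runtest(self):
        pass  # Compile and run the referenced MLIR file here (omitted).
```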

Comment on lines +225 to 233
```diff
 if self.path.name == TEST_DATA_FLAGFILE_NAME:
     test_cases.append(
-        MlirFile.TestCase(
-            name="test",
-            runtime_flagfile=test_data_flagfile_name,
+        MlirCompileRunTest.TestCase(
+            name="",
+            mlir_file=mlir_file,
+            runtime_flagfile=TEST_DATA_FLAGFILE_NAME,
+            enabled=have_lfs_files,
         )
     )
```
ScottTodd (Member, Author):

Could remove the need for this branch by generating boilerplate test_cases.json files in the ONNX test suite:

{
  "file_format": "test_cases_v0",
  "test_cases": [
    {
      "name": "",
      "runtime_flagfile": "test_data_flags.txt",
      "remote_files": []
    }
  ]
}

Trying to keep this conftest file somewhat simple and a bit concerned this is sliding the wrong way 🤔
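
For illustration, a one-off script along these lines could stamp out that boilerplate next to every flag file (the `onnx/node/generated` root is taken from the test names above; the script itself is an assumption, not part of the PR):

```python
# Hypothetical generator: write a boilerplate test_cases.json next to
# every test_data_flags.txt in the generated ONNX test tree.
import json
from pathlib import Path

BOILERPLATE = {
    "file_format": "test_cases_v0",
    "test_cases": [
        {
            "name": "",
            "runtime_flagfile": "test_data_flags.txt",
            "remote_files": [],
        }
    ],
}

for flagfile in Path("onnx/node/generated").rglob("test_data_flags.txt"):
    out_path = flagfile.with_name("test_cases.json")
    out_path.write_text(json.dumps(BOILERPLATE, indent=2) + "\n")
```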

saienduri (Contributor):

Yeah, it would probably be best to have `test_cases.json` files for all the ONNX tests too. That way all our tests can follow a unified format and the conftest won't get confusing. We should probably be anchoring on a test configuration file rather than the runtime flag file for ONNX tests.

ScottTodd (Member, Author):

Okay, I can make that change too. How does a follow-up PR sound? That will require touching lots of files.

ScottTodd (Member, Author):

Might also be able to replace `test_cases.json` with `tests.py` and then further simplify (or remove) the `conftest.py` file... 🤔 Mainly trying to keep the large test suites autogenerated, concise, and directly compatible with `iree-compile` -> `iree-run-module`.
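
(Purely speculative, but such a `tests.py` might carry the same data that `test_cases.json` holds, just as importable Python; everything below is an assumption, not part of the PR:)

```python
# Hypothetical tests.py sketch mirroring the test_cases.json schema;
# a conftest.py could import this module instead of parsing JSON.
test_cases = [
    {
        "name": "splats",
        "runtime_flagfile": "splats_data_flags.txt",  # assumed file name
        "remote_files": [],
    },
]
```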

ScottTodd marked this pull request as ready for review July 9, 2024 16:44
ScottTodd requested a review from saienduri July 9, 2024 16:44
ScottTodd added a commit that referenced this pull request Jul 9, 2024

See https://huggingface.co/amd-shark/sdxl-quant-models, which is now a
source of truth for developers interfacing with this model. For now the
files are pinned to a commit hash as changes are expected over the
coming days.

Some details:

* I needed to update the regex in `download_remote_files.py` to support subdirectories. The `/` search was greedy, capturing e.g. `fe57fe12eeb6eac83f469793984f6ad4c06a478c/unet/fp16/export` and `sdxl_unet_fp16_dataset.irpa` instead of `fe57fe12eeb6eac83f469793984f6ad4c06a478c` and `unet/fp16/export/sdxl_unet_fp16_dataset.irpa` (see the regex sketch after this list).
* This benefits from #282, since the `.mlir` file that the test collector looks for is a remote file not downloaded with a regular Git LFS checkout. That PR changes the collector to look for `test_cases.json` instead of `.mlir` files.
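
The greedy-capture problem in the first bullet above is easy to reproduce with a toy pattern (illustrative only, not the actual regex in `download_remote_files.py`):

```python
import re

path = "fe57fe12eeb6eac83f469793984f6ad4c06a478c/unet/fp16/export/sdxl_unet_fp16_dataset.irpa"

# Greedy: "(.*)" consumes as many "/" as it can, so the split lands on
# the *last* slash, folding the subdirectories into the first group.
print(re.match(r"(.*)/(.*)", path).groups())
# ('fe57fe12eeb6eac83f469793984f6ad4c06a478c/unet/fp16/export',
#  'sdxl_unet_fp16_dataset.irpa')

# Non-greedy: "(.*?)" stops at the *first* slash, keeping the commit
# hash and the subdirectory-relative file path separate.
print(re.match(r"(.*?)/(.*)", path).groups())
# ('fe57fe12eeb6eac83f469793984f6ad4c06a478c',
#  'unet/fp16/export/sdxl_unet_fp16_dataset.irpa')
```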
saienduri (Contributor) left a comment:

Follow-up is fine.

ScottTodd merged commit 1da89a1 into nod-ai:main Jul 11, 2024
3 checks passed
ScottTodd deleted the refactor-collection branch July 11, 2024 15:57
renxida pushed a commit that referenced this pull request Jul 18, 2024
renxida pushed a commit that referenced this pull request Jul 18, 2024
Refactor pytest collection to anchor on test case files, not .mlir files. (#282)
