Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[bugfix] don't allow qlinearmatmul to match output Q/DQ block meant for next quantizable node #1700

Closed
wants to merge 2 commits into from

Conversation

bfineran
Copy link
Contributor

currently, if a Q/DQ -> MatMul -> Q/DQ -> MatMul series of nodes occurs, the first matmul may be matched as a target for a QLinearMatMul conversion even though the output Q/DQ block is meant for the second matmul

this adds a simple check to skip these cases. this issue was brought up in testing of quantizing distilbert by @rahul-tuli

test-plan:
to be tested by @rahul-tuli

@bfineran bfineran self-assigned this Aug 10, 2023
@rahul-tuli
Copy link
Member

[Noting] the commits from this diff have been verified and cherrypicked to #1701

@bfineran
Copy link
Contributor Author

#1701 rebases onto this - closing

@bfineran bfineran closed this Aug 14, 2023
@bfineran bfineran deleted the qlinear-matmul-transform-patch branch August 14, 2023 14:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants