Loop auto-vectorization #22

dubiousconst282 · 2023-03-14T03:57:59Z

Auto-vectorization is a very interesting transform because it can have a significant impact on the few cases where it works. In reality, only a few loops are effectively vectorizable due to unprevalent memory accesses and branching patterns present in normal code. The lack of efficient gather instructions makes this even more problematic with the involvement of non-sequential objects/structs rather than flat arrays.

TODOs

Extras:

Basic if-conversion
- Code gen support for SelectInst
- Handle non-diamond graphs through path duplication (for empty blocks only)
Consider introducing a GetElementPtr/LEA instruction, because recognizing indexing expressions is tricky. Having this could also help consolidation of load/store instructions for arrays and fields.
Consider first-class support for vector types in the IR. This may not be that valuable outside of bringing the ability to perform basic peepholes.
Consider supporting basic transcendental math functions: Sin, Cos, Log, Exp (port from DirectXMath lib?)
Consider supporting some basic cost-model

The text was updated successfully, but these errors were encountered:

Contributes to #22

dubiousconst282 added a commit that referenced this issue Mar 14, 2023

Preliminary loop vectorization pass

a652908

Contributes to #22

dubiousconst282 added a commit that referenced this issue Mar 15, 2023

AutoVec: initial support for reductions

057485e

Contributes to #22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loop auto-vectorization #22

Loop auto-vectorization #22

dubiousconst282 commented Mar 14, 2023 •

edited

Loading

Loop auto-vectorization #22

Loop auto-vectorization #22

Comments

dubiousconst282 commented Mar 14, 2023 • edited Loading

TODOs

dubiousconst282 commented Mar 14, 2023 •

edited

Loading