Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Dec 20, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
A header only library implementing common mathematical functions using SIMD intrinsics
Optimized Recursive Bilateral Filter
The Dlang SIMD library
miniRT is the final C project of the 42 Common Core: our very first ray-tracer. Our miniRT focused on optimising CPU-rendered graphics, to achieve a real-time renderer with movement controls and extra options
DR3 enables users to write vectorised code using generic lambdas and filters. Switch instruction set just by changing enclosing namespace
Template SIMD Library (+Generator)
C++ template for generating small sorting networks compatible with SIMD intrinsics
C++ interface for SIMD instruction sets
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
Vectroized String Helper Functions
Winning submission for StartHack 2024: HPC optimized multi-GPU/CPU inference
Simple neural network microkernels in C accelerated with ARMv8.2-a Neon vector intrinsics.
high-speed math functions based on AVX-512 intrinsics
Add a description, image, and links to the simd-intrinsics topic page so that developers can more easily learn about it.
To associate your repository with the simd-intrinsics topic, visit your repo's landing page and select "manage topics."