Algorithmic improvements in the Chebyshev filter
Improvements
- Integrated axpy in the call to HEMM when executing the 3-terms recurrence relation in the Chebyshev filter.
- Moved the shift of the A matrix in the 3-terms recurrent relation for the GPU build within the accelerator.