CUDA: fix matrix multiplication algorithm choice #8102

JohannesGaessler · 2024-06-24T20:39:38Z

I made a mistake in #8075 . The checks for the matrix multiplication algorithms capable of handling non-contiguous data have to be done first or else dequantize_mul_mat_vec could be used incorrectly.

)" This reverts commit 2df373a.

CUDA: fix matrix multiplication algorithm choice

85f60a0

slaren approved these changes Jun 24, 2024

View reviewed changes

JohannesGaessler mentioned this pull request Jun 24, 2024

Bug: Crashes at the end of startup during first prompt processing #8096

Closed

JohannesGaessler merged commit 2df373a into ggerganov:master Jun 24, 2024
57 of 62 checks passed

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Jun 26, 2024

Revert "CUDA: fix matrix multiplication algorithm choice (ggerganov#8102

e575da5

)" This reverts commit 2df373a.

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jun 30, 2024

CUDA: fix matrix multiplication algorithm choice (ggerganov#8102)

1a6bf92

MagnusS0 pushed a commit to MagnusS0/llama.cpp-normistral-tokenizer that referenced this pull request Jul 1, 2024

CUDA: fix matrix multiplication algorithm choice (ggerganov#8102)

f806496

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA: fix matrix multiplication algorithm choice #8102

CUDA: fix matrix multiplication algorithm choice #8102

JohannesGaessler commented Jun 24, 2024

CUDA: fix matrix multiplication algorithm choice #8102

CUDA: fix matrix multiplication algorithm choice #8102

Conversation

JohannesGaessler commented Jun 24, 2024