Skip to content

Commit

Permalink
metal : do not use mul_mm kernels when ne00 < 64 (ggerganov#3542)
Browse files Browse the repository at this point in the history
  • Loading branch information
ggerganov authored Oct 9, 2023
1 parent db3abcc commit dcc09d2
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion ggml-metal.m
Original file line number Diff line number Diff line change
Expand Up @@ -1040,7 +1040,7 @@ void ggml_metal_graph_compute(
!ggml_is_transposed(src0) &&
!ggml_is_transposed(src1) &&
src1t == GGML_TYPE_F32 &&
ne00 % 32 == 0 &&
ne00 % 32 == 0 && ne00 >= 64 &&
ne11 > ne11_mm_min) {
//printf("matrix: ne00 = %6d, ne01 = %6d, ne02 = %6d, ne11 = %6d, ne12 = %6d\n", ne00, ne01, ne02, ne11, ne12);
switch (src0->type) {
Expand Down

0 comments on commit dcc09d2

Please sign in to comment.