Commit b213fce
authored
metal : improve F32, F16 and BF16 mat-vec multiplication (ggml-org#16057)
* metal : improve F32, F16 and BF16 mat-vec multiplication
ggml-ci
* metal : make the NSG a function constant in mul_mv kernels
ggml-ci1 parent e00f3fd commit b213fce
File tree
6 files changed
+355
-288
lines changed- ggml/src/ggml-metal
6 files changed
+355
-288
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
37 | 41 | | |
38 | 42 | | |
39 | 43 | | |
| |||
467 | 471 | | |
468 | 472 | | |
469 | 473 | | |
| 474 | + | |
| 475 | + | |
470 | 476 | | |
471 | | - | |
472 | | - | |
473 | | - | |
474 | | - | |
475 | | - | |
476 | 477 | | |
| 478 | + | |
477 | 479 | | |
| 480 | + | |
478 | 481 | | |
479 | | - | |
480 | | - | |
481 | | - | |
482 | | - | |
483 | | - | |
484 | | - | |
485 | | - | |
486 | | - | |
487 | | - | |
488 | | - | |
489 | | - | |
490 | | - | |
491 | | - | |
492 | | - | |
493 | | - | |
494 | | - | |
495 | | - | |
496 | | - | |
497 | | - | |
498 | | - | |
| 482 | + | |
| 483 | + | |
| 484 | + | |
| 485 | + | |
| 486 | + | |
| 487 | + | |
499 | 488 | | |
500 | | - | |
| 489 | + | |
| 490 | + | |
| 491 | + | |
| 492 | + | |
501 | 493 | | |
502 | 494 | | |
503 | 495 | | |
| |||
623 | 615 | | |
624 | 616 | | |
625 | 617 | | |
626 | | - | |
| 618 | + | |
| 619 | + | |
| 620 | + | |
| 621 | + | |
| 622 | + | |
| 623 | + | |
| 624 | + | |
627 | 625 | | |
628 | 626 | | |
629 | 627 | | |
| |||
689 | 687 | | |
690 | 688 | | |
691 | 689 | | |
| 690 | + | |
| 691 | + | |
692 | 692 | | |
693 | 693 | | |
694 | 694 | | |
695 | | - | |
696 | | - | |
697 | | - | |
698 | | - | |
699 | | - | |
700 | 695 | | |
701 | | - | |
702 | | - | |
703 | | - | |
704 | | - | |
705 | | - | |
706 | 696 | | |
707 | 697 | | |
708 | | - | |
709 | | - | |
710 | | - | |
| 698 | + | |
| 699 | + | |
| 700 | + | |
| 701 | + | |
| 702 | + | |
| 703 | + | |
| 704 | + | |
| 705 | + | |
| 706 | + | |
| 707 | + | |
| 708 | + | |
| 709 | + | |
711 | 710 | | |
712 | 711 | | |
713 | 712 | | |
| |||
824 | 823 | | |
825 | 824 | | |
826 | 825 | | |
827 | | - | |
| 826 | + | |
828 | 827 | | |
829 | 828 | | |
830 | 829 | | |
831 | 830 | | |
832 | 831 | | |
833 | 832 | | |
834 | 833 | | |
835 | | - | |
| 834 | + | |
| 835 | + | |
| 836 | + | |
| 837 | + | |
| 838 | + | |
| 839 | + | |
| 840 | + | |
836 | 841 | | |
837 | 842 | | |
838 | 843 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
| 25 | + | |
25 | 26 | | |
26 | 27 | | |
27 | 28 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
54 | 58 | | |
55 | 59 | | |
56 | 60 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
11 | 14 | | |
12 | 15 | | |
13 | 16 | | |
| |||
72 | 75 | | |
73 | 76 | | |
74 | 77 | | |
| 78 | + | |
75 | 79 | | |
76 | 80 | | |
77 | 81 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1564 | 1564 | | |
1565 | 1565 | | |
1566 | 1566 | | |
1567 | | - | |
| 1567 | + | |
| 1568 | + | |
| 1569 | + | |
| 1570 | + | |
1568 | 1571 | | |
1569 | 1572 | | |
1570 | 1573 | | |
| |||
1772 | 1775 | | |
1773 | 1776 | | |
1774 | 1777 | | |
1775 | | - | |
| 1778 | + | |
| 1779 | + | |
| 1780 | + | |
| 1781 | + | |
1776 | 1782 | | |
1777 | 1783 | | |
1778 | 1784 | | |
| |||
0 commit comments