Commit 2fffc52
llama : fix Roberta embeddings (ggml-org#10856)
* fix: Use gpt2 tokenizer for roberta and add eos/bos tokens
Branch: RobertaTokenizer
Signed-off-by: Gabe Goodhart <[email protected]>
* fixes to position embeddings
Signed-off-by: Sukriti-Sharma4 <[email protected]>
* map roberta-bpe to gpt-2
Signed-off-by: Sukriti-Sharma4 <[email protected]>
* fix linting
Signed-off-by: Sukriti-Sharma4 <[email protected]>
---------
Signed-off-by: Gabe Goodhart <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Co-authored-by: Gabe Goodhart <[email protected]>
1 parent 7585edb commit 2fffc52
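
The "fixes to position embeddings" item is not spelled out in the commit message. For context, Hugging Face RoBERTa checkpoints assign position ids starting at `padding_idx + 1`, so the leading rows of the position embedding table are never used for real tokens. The following is a minimal sketch, assuming a conversion step accounts for this by dropping those leading rows. The function `trim_position_embeddings` and the toy table are hypothetical and are not taken from the diff.

```python
# Hypothetical sketch: illustrates the RoBERTa position offset, not the code from this commit.

def trim_position_embeddings(rows, pad_token_id):
    """Drop the leading rows that RoBERTa-style checkpoints reserve for padding.

    rows: list of embedding vectors, one per position id.
    pad_token_id: padding index; real positions start at pad_token_id + 1.
    """
    offset = pad_token_id + 1
    if offset >= len(rows):
        raise ValueError("position table is too small for the given pad_token_id")
    return rows[offset:]


# Toy table: 6 position ids, 2-dimensional vectors.
table = [[float(i), float(i) + 0.5] for i in range(6)]
trimmed = trim_position_embeddings(table, pad_token_id=1)
print(len(trimmed))   # 4
print(trimmed[0])     # [2.0, 2.5]
```

With `pad_token_id=1`, the first two rows are dropped, so position 0 in the trimmed table corresponds to position id 2 in the original checkpoint.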
2 files changed: 48 additions, 2 deletions.

First file: line 2631 changed; 45 lines added after line 2703 (new lines 2704-2748).
Second file: line 6595 replaced by two lines (new lines 6595-6596).