Skip to content

Commit 7e098f1

Browse files
smallv0221LiuChiachiFrostML
authored
Fix tokenizer bug (#893)
* fix unified transformer dtype problem * fix win dtype bug * Fix plato-2 and plato-mini dtype bug * Fix plato-2 tokenization * Refine some doc * Add general k support for topk sampling * fix seed * minor fix * Fix unitransformer readme * topk kernel optimization * add unimo model and fix generate api * add 3 datasets for unimo-text * fix tokenizer bug Co-authored-by: Jiaqi Liu <[email protected]> Co-authored-by: liu zhengxi <[email protected]>
1 parent 740f5e2 commit 7e098f1

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

paddlenlp/transformers/unimo/tokenizer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -75,7 +75,7 @@ class UNIMOTokenizer(PretrainedTokenizer):
7575
"unimo-text-1.0":
7676
"https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-vocab.txt",
7777
"unimo-text-1.0-large":
78-
"https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-vocab-large.txt",
78+
"https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-large-vocab.txt",
7979
}
8080
}
8181
pretrained_init_configuration = {

0 commit comments

Comments
 (0)