Skip to content

Commit 7a4d35b

Browse files
authored
Update README.md
1 parent f1e66e0 commit 7a4d35b

File tree

1 file changed

+6
-2
lines changed

1 file changed

+6
-2
lines changed

README.md

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,11 @@ Available models (Training ATM):
2828
- KoGPT2-medium(345M)
2929
- KoGPT2-large(774M)
3030
- KoGPT2-xlarge(1.5B)
31-
- KoGPT2-2.7B
31+
- KoGPT2-2.7B *TBA*
32+
33+
Models are available as TF checkpoint files [Training script](https://github.com/ksjae/KoGPT2-train) or [Huggingface transformers](https://github.com/huggingface/transformers.git) compatible ones
34+
35+
n_ctx available : 1024 2048 384
3236

3337
## Intended uses & limitations
3438
Intended for **Korean** text generation for ai-text-adventure(https://github.com/ksjae/ai-text-adventure) with PPLM.
@@ -48,7 +52,7 @@ or go to [KoGPT2-train](https://github.com/ksjae/KoGPT2-train) and use scripts/d
4852
v0.1 may have faulty tokenizers, producing bad outputs.
4953

5054
v0.2+ be GPT2 with n_ctx of 2048. True form of GPT-3 implementation(alternating layers) will not be available within the year.
51-
v0.2-story is producing hashtags (which were not finetuned for)
55+
5256
If other limitations or errors are found, please open an issue.
5357

5458
## Training data

0 commit comments

Comments
 (0)