Commit 9c8521f

electra config: update vocab size & max pos for protein seq
1 parent 7048cd0 commit 9c8521f

1 file changed: +2 -2 lines changed

configs/model/electra.yml

Lines changed: 2 additions & 2 deletions
@@ -3,8 +3,8 @@ init_args:
   optimizer_kwargs:
     lr: 1e-3
   config:
-    vocab_size: 1400
-    max_position_embeddings: 1800
+    vocab_size: 31 # 21 amino acids (when n_gram=1) + 10 special tokens of LLM
+    max_position_embeddings: 1000 # max default sequence length for protein
     num_attention_heads: 8
     num_hidden_layers: 6
     type_vocab_size: 1
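
The arithmetic behind the two new values can be checked with a short sketch. It rests on assumptions the commit does not confirm: that the 21 residue symbols are the 20 standard amino acids plus a catch-all `X`, that the 10 special tokens carry the placeholder names shown, and that over-long sequences are simply truncated. `build_vocab` and `truncate` are hypothetical helpers, not functions from this repository.

```python
# Hypothetical sketch: none of these names come from the repository.
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
EXTRA_RESIDUE = "X"  # assumption: the 21st symbol is a catch-all for unknown residues
# Assumption: placeholder names standing in for the 10 special tokens.
SPECIAL_TOKENS = [f"[SPECIAL_{i}]" for i in range(10)]

MAX_POSITION_EMBEDDINGS = 1000  # value set in configs/model/electra.yml


def build_vocab():
    """Map each token to an integer id, special tokens first."""
    tokens = SPECIAL_TOKENS + list(STANDARD_AMINO_ACIDS + EXTRA_RESIDUE)
    return {tok: idx for idx, tok in enumerate(tokens)}


def truncate(sequence, max_len=MAX_POSITION_EMBEDDINGS):
    """Cut a protein sequence so it fits the position-embedding table."""
    return sequence[:max_len]


vocab = build_vocab()
print(len(vocab))                 # 31, matching vocab_size
print(len(truncate("M" * 1500)))  # 1000, matching max_position_embeddings
```

With single-residue tokens (n_gram=1), every token id stays below `vocab_size`, and any sequence passed through `truncate` fits within the position-embedding table.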
