Skip to content

Commit b1c5249

Browse files
committed
Update docs
1 parent 74a0c6a commit b1c5249

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

STT/CONTRIBUTING.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -45,6 +45,7 @@ Some parameters for the model itself:
4545
- `duplicate_sentence_count` to control if Common Voice dataset might need
4646
to be regenerated with more duplicated allowed using Corpora Creator
4747
**USE WITH CAUTION**
48+
- `enable_augments` to help the model to better genralise on noisy data by augmenting the data in various ways.
4849
- `cv_personal_first_url` to download only your own voice instead of all Common Voice dataset (first url)
4950
- `cv_personal_second_url` to download only your own voice instead of all Common Voice dataset (second url)
5051

@@ -84,6 +85,8 @@ files, with proper `checkpoint` descriptor as TensorFlow produces.
8485
To use an existing checkpoint, just ensure the `docker run` includes a mount such as:
8586
`type=bind,src=PATH/TO/CHECKPOINTS,dst=/transfer-checkpoint`. Upon running, the checkpoints will be automatically used as starting point.
8687

88+
Checkpoints don't typically use automatic mixed precision nor fully-connected layer normalization and mostly use a standard number of hidden layers (2048 unless specified otherwise). So don't change those parameters to fine-tune from them.
89+
8790
## Hardware
8891

8992
Training successfull on:

0 commit comments

Comments
 (0)