Commit 37e02e3

Added XTC to README
1 parent ed535bb commit 37e02e3

1 file changed: +16 -0 lines changed

examples/main/README.md

Lines changed: 16 additions & 0 deletions
@@ -241,6 +241,22 @@ The `--mirostat-ent` option sets the Mirostat target entropy (tau), which repres

Example usage: `--mirostat 2 --mirostat-lr 0.05 --mirostat-ent 3.0`

### XTC Sampling

- `--xtc-p N`: Sets the chance of token removal (checked once on sampler start) (default: 0.5).
- `--xtc-t N`: Sets a minimum probability threshold for tokens to be removed (default: 0.1).
- `--xtc-t-max N`: Sets a maximum probability threshold for tokens to be removed (highly experimental) (default: 1.0).

Exclude Top Choices (XTC) is a sampler designed to remove top tokens from consideration and so avoid the most obvious and repetitive answers. With a chance of `xtc-p`, it searches for tokens with probabilities at or above the `xtc-t` threshold, then removes all such tokens except the least probable one.
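As a rough illustration of that rule, here is a minimal Python sketch. It is not the actual implementation: the function name `xtc_filter`, the dict-based input, and the decision to skip renormalization are all assumptions made for the example.

```python
import random

def xtc_filter(probs, xtc_p=0.5, xtc_t=0.1, rng=random):
    """Return the token probabilities that survive one XTC pass.

    probs is a hypothetical {token: probability} dict; the result is
    not renormalized, to keep the sketch short.
    """
    # Roll the removal chance once; if the roll fails, change nothing.
    if rng.random() >= xtc_p:
        return dict(probs)

    # Tokens at or above the threshold count as "top choices".
    top = [tok for tok, p in probs.items() if p >= xtc_t]
    if len(top) < 2:
        return dict(probs)  # fewer than two top choices: nothing to exclude

    # Keep only the least probable top choice and drop the others.
    keep = min(top, key=lambda tok: probs[tok])
    removed = set(top) - {keep}
    return {tok: p for tok, p in probs.items() if tok not in removed}

probs = {"the": 0.50, "a": 0.30, "one": 0.12, "some": 0.05, "any": 0.03}
print(xtc_filter(probs, xtc_p=1.0))
```

With `xtc_p=1.0` the pass always runs, so `the` and `a` are dropped while `one` (the least probable token above 0.1) and everything below the threshold remain.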

By removing top tokens, XTC can improve the variety of answers, break writing clichés and inhibit repetition, since clichés and repeated phrases are usually more likely to appear. By keeping the least probable of the top tokens, XTC ensures that the answer is still coherent. XTC is meant to be used for creative tasks, but feel free to experiment with different settings for different models.

The additional `xtc-t-max` parameter may help with finetuned models that already give relatively creative output, meaning that clichés and repetitive phrases may appear at lower probabilities. It allows removing tokens from a middle range, which will always be specific to a model and requires careful experimentation. Leave `xtc-t-max` at its default of 1.0 for all base/instruct models.
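One way to picture the middle range is the Python sketch below. It reflects one reading of the description, not the actual code: the name `xtc_window`, the inclusive bounds, and the omission of the `xtc-p` roll are assumptions made for illustration.

```python
def xtc_window(probs, xtc_t=0.1, xtc_t_max=1.0):
    """Drop tokens whose probability lies in [xtc_t, xtc_t_max],
    except the least probable token inside that range.

    probs is a hypothetical {token: probability} dict; inclusive
    bounds are an assumption of this sketch.
    """
    in_range = [tok for tok, p in probs.items() if xtc_t <= p <= xtc_t_max]
    if len(in_range) < 2:
        return dict(probs)  # nothing to remove
    keep = min(in_range, key=lambda tok: probs[tok])
    return {tok: p for tok, p in probs.items()
            if tok not in in_range or tok == keep}

probs = {"the": 0.50, "a": 0.30, "one": 0.12, "some": 0.05, "any": 0.03}
print(xtc_window(probs, xtc_t=0.1, xtc_t_max=0.35))
```

Here the most probable token `the` lies above `xtc_t_max` and is left alone, while `a` is removed from the 0.1 to 0.35 band and `one` is kept as the least probable token in it.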

Because it is experimental, XTC is not included in the default sampling queue. A recommended starting point is Min-P followed by XTC at its default settings: `--sampling-seq mx --min-p 0.02`.

Example usage: `--xtc-p 0.5 --xtc-t 0.1 --xtc-t-max 1.0`

### Logit Bias

- `-l TOKEN_ID(+/-)BIAS, --logit-bias TOKEN_ID(+/-)BIAS`: Modify the likelihood of a token appearing in the generated text completion.
