generated from shenxiangzhuang/mppt
-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
enhancementNew feature or requestNew feature or request
Description
LLM basic
- LLM: Perplexity #43
- BPE #33
- LLM: BN or LN #41
- LLM: BLEU from scratch #42
- LLM: Model Parameter counting #39
- Sampling from logits: TopK, TopP, Temparature #36
LLM architectures
- GPT2 training & inference
- llama model structure #100
- deepseek model structure #101
- qwen model structure #102
- LoRA, QLoRA and DoRA #38
Fast inference
Reinforce learning
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request