|
| 1 | +# High-Level APIs |
1 | 2 |
|
2 | | -### AutoModel |
| 3 | +## AutoModel |
3 | 4 |
|
4 | 5 | | **AutoModel Variant** | **API** | |
5 | | -|-----------|---------| |
| 6 | +|------------------------|---------| |
6 | 7 | | AutoModelForCausalLM | `liger_kernel.transformers.AutoLigerKernelForCausalLM` | |
7 | 8 |
|
8 | 9 | This API extends the implementation of the `AutoModelForCausalLM` within the `transformers` library from Hugging Face. |
9 | 10 |
|
| 11 | +::: liger_kernel.transformers.AutoLigerKernelForCausalLM |
| 12 | + options: |
| 13 | + extra: |
| 14 | + show_docstring: true |
| 15 | + show_signature: true |
| 16 | + show_source: true |
| 17 | + |
10 | 18 | !!! Example "Try it Out" |
11 | 19 | You can experiment as shown in this example [here](https://github.com/linkedin/Liger-Kernel?tab=readme-ov-file#1-use-autoligerkernelforcausallm). |
12 | 20 |
|
13 | | -### Patching |
| 21 | +--- |
14 | 22 |
|
15 | | -You can also use the Patching APIs to use the kernels for a specific model architecture. |
| 23 | +## Patching |
16 | 24 |
|
17 | | -!!! Example "Try it Out" |
18 | | - You can experiment as shown in this example [here](https://github.com/linkedin/Liger-Kernel?tab=readme-ov-file#2-apply-model-specific-patching-apis). |
| 25 | +You can also use the Patching APIs to use the kernels for a specific model architecture. |
19 | 26 |
|
20 | 27 | | **Model** | **API** | **Supported Operations** | |
21 | 28 | |-------------|--------------------------------------------------------------|-------------------------------------------------------------------------| |
22 | | -| LLaMA 2 & 3 | `liger_kernel.transformers.apply_liger_kernel_to_llama` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
23 | | -| LLaMA 3.2-Vision | `liger_kernel.transformers.apply_liger_kernel_to_mllama` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
24 | | -| Mistral | `liger_kernel.transformers.apply_liger_kernel_to_mistral` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
25 | | -| Mixtral | `liger_kernel.transformers.apply_liger_kernel_to_mixtral` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
26 | | -| Gemma1 | `liger_kernel.transformers.apply_liger_kernel_to_gemma` | RoPE, RMSNorm, GeGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
27 | | -| Gemma2 | `liger_kernel.transformers.apply_liger_kernel_to_gemma2` | RoPE, RMSNorm, GeGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
28 | | -| Qwen2, Qwen2.5, & QwQ | `liger_kernel.transformers.apply_liger_kernel_to_qwen2` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
29 | | -| Qwen2-VL | `liger_kernel.transformers.apply_liger_kernel_to_qwen2_vl` | RMSNorm, LayerNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
30 | | -| Phi3 & Phi3.5 | `liger_kernel.transformers.apply_liger_kernel_to_phi3` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 29 | +| LLaMA 2 & 3 | `liger_kernel.transformers.apply_liger_kernel_to_llama` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 30 | +| LLaMA 3.2-Vision | `liger_kernel.transformers.apply_liger_kernel_to_mllama` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 31 | +| Mistral | `liger_kernel.transformers.apply_liger_kernel_to_mistral` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 32 | +| Mixtral | `liger_kernel.transformers.apply_liger_kernel_to_mixtral` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 33 | +| Gemma1 | `liger_kernel.transformers.apply_liger_kernel_to_gemma` | RoPE, RMSNorm, GeGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 34 | +| Gemma2 | `liger_kernel.transformers.apply_liger_kernel_to_gemma2` | RoPE, RMSNorm, GeGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 35 | +| Qwen2, Qwen2.5, & QwQ | `liger_kernel.transformers.apply_liger_kernel_to_qwen2` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 36 | +| Qwen2-VL | `liger_kernel.transformers.apply_liger_kernel_to_qwen2_vl` | RMSNorm, LayerNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 37 | +| Phi3 & Phi3.5 | `liger_kernel.transformers.apply_liger_kernel_to_phi3` | RoPE, RMSNorm, SwiGLU, CrossEntropyLoss, FusedLinearCrossEntropy | |
| 38 | + |
| 39 | +### Function Signatures |
| 40 | + |
| 41 | +::: liger_kernel.transformers.apply_liger_kernel_to_llama |
| 42 | + options: |
| 43 | + extra: |
| 44 | + show_docstring: true |
| 45 | + show_signature: true |
| 46 | + |
| 47 | +::: liger_kernel.transformers.apply_liger_kernel_to_mllama |
| 48 | + options: |
| 49 | + extra: |
| 50 | + show_docstring: true |
| 51 | + show_signature: true |
| 52 | + |
| 53 | +::: liger_kernel.transformers.apply_liger_kernel_to_mistral |
| 54 | + options: |
| 55 | + extra: |
| 56 | + show_docstring: true |
| 57 | + show_signature: true |
| 58 | + |
| 59 | +::: liger_kernel.transformers.apply_liger_kernel_to_mixtral |
| 60 | + options: |
| 61 | + extra: |
| 62 | + show_docstring: true |
| 63 | + show_signature: true |
| 64 | + |
| 65 | +::: liger_kernel.transformers.apply_liger_kernel_to_gemma |
| 66 | + options: |
| 67 | + extra: |
| 68 | + show_docstring: true |
| 69 | + show_signature: true |
| 70 | + |
| 71 | +::: liger_kernel.transformers.apply_liger_kernel_to_gemma2 |
| 72 | + options: |
| 73 | + extra: |
| 74 | + show_docstring: true |
| 75 | + show_signature: true |
| 76 | + |
| 77 | +::: liger_kernel.transformers.apply_liger_kernel_to_qwen2 |
| 78 | + options: |
| 79 | + extra: |
| 80 | + show_docstring: true |
| 81 | + show_signature: true |
| 82 | + |
| 83 | +::: liger_kernel.transformers.apply_liger_kernel_to_qwen2_vl |
| 84 | + options: |
| 85 | + extra: |
| 86 | + show_docstring: true |
| 87 | + show_signature: true |
| 88 | + |
| 89 | +::: liger_kernel.transformers.apply_liger_kernel_to_phi3 |
| 90 | + options: |
| 91 | + extra: |
| 92 | + show_docstring: true |
| 93 | + show_signature: true |
0 commit comments