Conversation
@sfc-gh-mwyatt commented Dec 18, 2025

Adds support for the FP8 autocasting feature from the Transformer Engine library.

  • Replace nn.Linear layers in loaded models with Transformer Engine linear layers, for layers whose names substring-match an entry in fp8_target_modules (see the sketch after this list)
  • Wrap the loss calculation with the FP8 autocast context
  • Add config support for fp8_recipe:
    • Requires specifying a type that maps to a Recipe subclass
    • Requires specifying fp8_target_modules
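
A minimal sketch of the layer swap described in the first bullet, assuming the usual "replace nn.Linear in place" pattern for Transformer Engine; the helper name replace_fp8_target_modules and the weight-copy details are assumptions, not the PR's actual implementation.

import torch
import torch.nn as nn
import transformer_engine.pytorch as te

def replace_fp8_target_modules(model: nn.Module, fp8_target_modules: list[str]) -> nn.Module:
    # Walk every parent module and swap nn.Linear children whose qualified
    # name contains one of the target substrings (e.g. "q_proj", "up_proj").
    for parent_name, parent in model.named_modules():
        for child_name, child in parent.named_children():
            full_name = f"{parent_name}.{child_name}" if parent_name else child_name
            if isinstance(child, nn.Linear) and any(t in full_name for t in fp8_target_modules):
                te_linear = te.Linear(child.in_features, child.out_features, bias=child.bias is not None)
                with torch.no_grad():
                    # Reuse the pretrained parameters so the swap itself does not change the model.
                    te_linear.weight.copy_(child.weight)
                    if child.bias is not None:
                        te_linear.bias.copy_(child.bias)
                setattr(parent, child_name, te_linear)
    return model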

Example Usage:

model:
  name_or_path: Qwen/Qwen3-8B
  fp8_recipe:
    type: delayedscaling
    fp8_format: hybrid
    amax_history_len: 16
    amax_compute_algo: max
  fp8_target_modules:
    - q_proj
    - k_proj
    - v_proj
    - up_proj
    - down_proj
    - gate_proj

data:
  sources:
    - type: huggingface_instruct
      name_or_path: HuggingFaceH4/ultrachat_200k:train[:1000]
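
A sketch of how the fp8_recipe block above could be mapped onto a Transformer Engine Recipe subclass, as described in the config bullets; the build_fp8_recipe helper and its lookup tables are assumptions, and only the delayedscaling type from the example is shown.

from transformer_engine.common.recipe import DelayedScaling, Format

_FP8_FORMATS = {"hybrid": Format.HYBRID, "e4m3": Format.E4M3}
_RECIPE_TYPES = {"delayedscaling": DelayedScaling}

def build_fp8_recipe(cfg: dict):
    # Pop the `type` key and use it to pick the Recipe subclass; the remaining
    # keys are forwarded as keyword arguments.
    kwargs = dict(cfg)
    recipe_cls = _RECIPE_TYPES[kwargs.pop("type").lower()]
    if "fp8_format" in kwargs:
        kwargs["fp8_format"] = _FP8_FORMATS[kwargs["fp8_format"].lower()]
    # For the example config this yields:
    # DelayedScaling(fp8_format=Format.HYBRID, amax_history_len=16, amax_compute_algo="max")
    return recipe_cls(**kwargs)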

hf_config_kwargs: Dict = Field(default_factory=dict)
"""Optional kwargs to override in the HF model config object created by `AutoConfig.from_pretrained(model.name_or_path)`."""

fp8_recipe: Optional[Any] = None
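
A sketch of how the parsed recipe might be consumed when wrapping the loss calculation (the second bullet above), using Transformer Engine's fp8_autocast context manager; the training_step function and its surrounding structure are illustrative, not the PR's actual training loop.

import transformer_engine.pytorch as te

def training_step(model, batch, recipe):
    # Run the forward pass and loss computation under FP8 autocast; the backward
    # pass stays outside the context, as Transformer Engine recommends.
    with te.fp8_autocast(enabled=True, fp8_recipe=recipe):
        outputs = model(**batch)
        loss = outputs.loss
    loss.backward()
    return loss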

@sfc-gh-mwyatt (Collaborator, Author) commented:

Perhaps this should be named differently. TE supports both FP8 and FP4, and there are other libraries we may wish to support. Better names might be:

  • quant_recipe
  • quant_target_modules
