feat(policies): add autoregressive VLAs with tokenization PiFast #2734
Pull request overview
This PR introduces autoregressive Vision-Language-Action (VLA) models to LeRobot, implementing PiFast alongside existing flow-matching policies. Unlike flow matching which predicts actions in parallel over a horizon, this implementation models actions sequentially as discrete tokens using the FAST (Fast Action Sequence Tokenization) tokenizer. The PR provides a complete reference implementation including model architecture, training scripts, and processor pipelines.
Key Changes:
- Implements PI0Fast policy with autoregressive action token prediction using cross-entropy loss
- Adds FAST (Frequency-space Action Sequence Tokenization) tokenizer integration for converting continuous actions to discrete tokens via DCT coefficients and BPE
- Introduces custom attention masking patterns supporting bidirectional attention for images/language and causal attention for action tokens
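The masking pattern described above can be sketched as follows. This is an illustrative numpy reconstruction, not the PR's actual implementation; `make_prefix_lm_mask` is a hypothetical helper name.

```python
import numpy as np

def make_prefix_lm_mask(num_prefix: int, num_action: int) -> np.ndarray:
    """Build a (T, T) boolean attention mask where True means "may attend".

    Image/language tokens form a prefix that attends bidirectionally among
    itself; action tokens attend to the full prefix and causally to earlier
    action tokens only.
    """
    T = num_prefix + num_action
    mask = np.zeros((T, T), dtype=bool)
    # Prefix (images + language): full bidirectional attention within itself.
    mask[:num_prefix, :num_prefix] = True
    # Action tokens attend to the whole prefix...
    mask[num_prefix:, :num_prefix] = True
    # ...and causally to themselves (lower-triangular, including self).
    mask[num_prefix:, num_prefix:] = np.tril(
        np.ones((num_action, num_action), dtype=bool)
    )
    return mask

mask = make_prefix_lm_mask(num_prefix=3, num_action=2)
```

In a real model this boolean mask would be converted to additive form (0 / -inf) and passed to the attention layers.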
Reviewed changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 12 comments.
Show a summary per file
| File | Description |
|---|---|
| src/lerobot/utils/constants.py | Adds constants for action tokens and token masks |
| src/lerobot/processor/tokenizer_processor.py | Implements ActionTokenizerProcessorStep for tokenizing actions using FAST with PaliGemma token space conversion |
| src/lerobot/processor/__init__.py | Exports ActionTokenizerProcessorStep for use in pipelines |
| src/lerobot/policies/pi0_fast/train_fast_tokenizer.py | Provides training script for FAST tokenizer with delta transforms, normalization, and compression statistics |
| src/lerobot/policies/pi0_fast/processor_pi0_fast.py | Creates pre/post-processor pipelines including state discretization and language tokenization |
| src/lerobot/policies/pi0_fast/modeling_pi0_fast.py | Implements core PI0FastPytorch model with PaliGemma+Gemma expert architecture and autoregressive decoding |
| src/lerobot/policies/pi0_fast/configuration_pi0_fast.py | Defines PI0FastConfig with model hyperparameters and training settings |
| src/lerobot/policies/pi0_fast/__init__.py | Exports PI0Fast components for module access |
| src/lerobot/policies/factory.py | Registers PI0FastPolicy in the policy factory |
| src/lerobot/policies/__init__.py | Exports PI0FastConfig at package level |
This PR brings autoregressive Vision-Language-Action (VLA) models back to LeRobot, alongside the existing flow-matching–based policies.
Unlike flow matching, which predicts actions in parallel over a horizon, autoregressive VLAs model actions sequentially as discrete tokens.
As a first step toward supporting multiple action tokenizers, this PR introduces PiFast together with a training script for FAST tokenization; together, these provide a concrete reference implementation for autoregressive action modeling in LeRobot.
Future work will extend this framework to additional tokenizers and autoregressive variants.
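To make the tokenization idea concrete: FAST compresses an action chunk into discrete tokens by taking DCT coefficients, quantizing them, and then applying BPE. The sketch below is a simplified numpy illustration of the DCT + quantization round trip only (the BPE compression stage and the trained tokenizer are omitted); the helper names and the `scale` parameter are assumptions, not the PR's API.

```python
import numpy as np

def dct_matrix(n: int) -> np.ndarray:
    # Orthonormal DCT-II basis; orthonormality means the inverse transform
    # is simply the transpose.
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    M = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    M[0] /= np.sqrt(2.0)
    return M

def tokenize_chunk(actions: np.ndarray, scale: float = 10.0) -> np.ndarray:
    """Quantized DCT coefficients of a 1-D action trajectory.

    In FAST, these integer streams are further compressed with BPE before
    being mapped into the language model's token space.
    """
    coeffs = dct_matrix(len(actions)) @ actions
    return np.round(coeffs * scale).astype(int)

def detokenize_chunk(tokens: np.ndarray, scale: float = 10.0) -> np.ndarray:
    return dct_matrix(len(tokens)).T @ (tokens / scale)

actions = np.sin(np.linspace(0, np.pi, 8))
reconstructed = detokenize_chunk(tokenize_chunk(actions))
```

Because low-frequency DCT coefficients dominate smooth trajectories, quantization keeps reconstruction error small while making the sequence compressible.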
TODO:
1- Provide PiFast pretrained checkpoints, and unveil the new HF LeRobot AR VLA work.
2- Add testing and docs.
DONE:
1- Trained and evaluated successfully on LIBERO; we will share the checkpoints along with the results.
2- Support KV-caching for faster inference (a must for this PR): https://mett29.github.io/posts/kv-cache/
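The KV-caching decode loop can be sketched as below. This is a toy illustration of the control flow, not the PR's code: `step_fn(token_ids, cache)` is a hypothetical interface that returns the logits for the last position plus an updated cache, so that after prefilling the prefix each call only processes the newly appended token.

```python
import numpy as np

def decode_with_kv_cache(step_fn, prefix, max_new_tokens, eos_id):
    """Greedy autoregressive decoding with a key/value cache."""
    tokens = list(prefix)
    logits, cache = step_fn(tokens, None)           # prefill the whole prefix once
    for _ in range(max_new_tokens):
        next_id = int(np.argmax(logits))            # greedy pick
        tokens.append(next_id)
        if next_id == eos_id:
            break
        logits, cache = step_fn([next_id], cache)   # decode one token per step
    return tokens

def toy_step(token_ids, cache):
    # Stand-in "model": the cache is just the tokens seen so far, and the
    # logits always favor (last token + 1) mod 4.
    cache = (cache or []) + list(token_ids)
    logits = np.zeros(4)
    logits[(cache[-1] + 1) % 4] = 1.0
    return logits, cache

out = decode_with_kv_cache(toy_step, prefix=[0], max_new_tokens=10, eos_id=3)
# out == [0, 1, 2, 3]
```

In the real model the cache holds per-layer key/value tensors, so each decode step costs O(T) attention against cached states instead of O(T^2) recomputation over the full sequence.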