Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search

This repository contains implementation for the paper "Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search".

Installation

Clone the repository:

git clone https://github.com/cyrusneary/vlaps
cd vlaps

This repository requires git submodules to be initialized. Don't forget to run:

git submodule update --init --recursive

Install UV, a package and project manager: https://docs.astral.sh/uv/getting-started/installation/

Create and activate a new virtual environment:

cd VLAPS
uv venv .venv --python 3.10
source .venv/bin/activate

Install Octo:

cd ../third_party/octo
uv pip install -e .
uv pip install -r requirements.txt
uv pip install --upgrade "jax[cuda11_pip]==0.4.20" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

Then install LIBERO:

cd ../LIBERO
uv pip install cmake==3.24.3
uv pip install -r requirements.txt
uv pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
uv pip install -e .

Install VLAPS requirements and libraries.

cd ../../VLAPS
uv pip install -r requirements.txt
uv pip install -e .

Add libero to the python path

cd ..
export PYTHONPATH=$PYTHONPATH:$PWD/third_party/LIBERO

Usage

Running experiments

uv run VLAPS/run_libero_experiment.py --config-name=config_vlaps_octo.yaml

Configuring experiments

This repository uses Hydra to configure expeirments. To edit the configurations, edit the yaml files in the VLAPS/config directory.

Before running the first experiment, you will need to edit the Octo model checkpoint path in VLAPS/config/agent_config/octo_agent.yaml and in VLAPS/config/agent_config/vlaps_octo_agent_config.yaml.

Finetuning Octo

To Finetune octo, navigate to the octo repository and run the finetuning script.

cd third_party/octo
uv run scripts/finetune_libero.py

Please refer to the README.md file in third_party/octo for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
VLAPS		VLAPS
plotting		plotting
third_party		third_party
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search

Installation

Usage

Running experiments

Configuring experiments

Finetuning Octo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search

Installation

Usage

Running experiments

Configuring experiments

Finetuning Octo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages