IE801 Project (Team 11)

Team 11: JeongWoo Park (20243347), Sojeong Rhee (20243606)

Title: Test-Time Adaptation in Offline RL

This repository is forked from Offbench.

Main Idea

The project builds on OPEX (on-the-fly policy extraction), introduced in Park et al., "Is Value Learning Really the Main Bottleneck in Offline RL?" (NeurIPS 2024 Workshop). Since the paper reports only single-step IQL results and gives no implementation details, we implemented multi-step OPEX and added gradient-norm normalization settings.
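As a rough illustration of the idea, the sketch below adjusts a policy action at test time by stepping along the gradient of a critic with respect to the action, either once (single-step) or several times (multi-step), with optional normalization of the gradient to unit norm. It is not the code in this repository: the function name `opex_adjust`, its parameters, the toy critic, and the assumption that actions are bounded in [-1, 1] are all hypothetical and chosen only to keep the example self-contained.

```python
import numpy as np


def opex_adjust(action, q_grad_fn, step_size=0.1, num_steps=1,
                normalize=True, eps=1e-8):
    """Nudge an action along dQ/da at test time (hypothetical sketch).

    action:    action proposed by the learned policy for the current state
    q_grad_fn: callable returning the gradient of Q w.r.t. the action
    num_steps: 1 gives a single-step update, >1 a multi-step update
    normalize: if True, rescale each gradient to unit L2 norm
    """
    a = np.asarray(action, dtype=np.float64).copy()
    for _ in range(num_steps):
        g = np.asarray(q_grad_fn(a), dtype=np.float64)
        if normalize:
            g = g / (np.linalg.norm(g) + eps)
        # Assumption: the action space is the box [-1, 1].
        a = np.clip(a + step_size * g, -1.0, 1.0)
    return a


# Toy critic Q(a) = -||a - target||^2, used only so the example runs
# without a trained model. Its gradient is -2 * (a - target).
target = np.array([0.5, -0.2])


def toy_q_grad(a):
    return -2.0 * (a - target)


a0 = np.zeros(2)
one_step = opex_adjust(a0, toy_q_grad, step_size=0.1, num_steps=1)
multi_step = opex_adjust(a0, toy_q_grad, step_size=0.1, num_steps=5)
print("single-step action:", one_step)
print("multi-step action: ", multi_step)
```

With normalization enabled, each update moves the action by roughly `step_size` regardless of the critic's scale, which makes the step size easier to compare across environments. In a real agent, `q_grad_fn` would come from automatic differentiation of the trained critic at the current state.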

Methods

Hyperparameter Search

Set Up the Environment

To set up the environment, we recommend using Docker. Simply run:

./docker_run.sh

Run Experiments

Inside the Docker container, simply run:

./run.sh

You can edit run.sh to select specific environments and algorithms.

Weights and Biases Online Visualization Integration

This codebase can also log to the W&B online visualization platform. To do so, set your W&B API key as an environment variable and add --logging.online when launching the script. Instead of setting the environment variable, you can simply run wandb login.
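The environment-variable route can be checked before launching with a small helper like the one below. This is only a sketch: the helper name `online_logging_flags` is hypothetical, and it assumes the key is stored in `WANDB_API_KEY`, the variable the wandb client reads. It does not detect credentials saved by wandb login.

```python
import os


def online_logging_flags(env=None):
    """Return the extra launch flag only when a W&B API key is set.

    Hypothetical helper: it looks at WANDB_API_KEY and nothing else,
    so credentials stored by `wandb login` are not detected.
    """
    env = os.environ if env is None else env
    return ["--logging.online"] if env.get("WANDB_API_KEY") else []


# Example with explicit dictionaries instead of the real environment.
print(online_logging_flags({"WANDB_API_KEY": "dummy-key"}))
print(online_logging_flags({}))
```

The returned list can be appended to whatever command run.sh uses to launch an experiment, so that runs fall back to offline logging when no key is available.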

Results

Overall Results on Antmaze dataset

Results on Antmaze-Umaze-Diverse-v2, num_steps = 1
