GitHub - AIS-Bonn/LIAM

This is the code repository for the following publication:

Yihao Wang, Raphael Memmesheimer, and Sven Behnke: LIAM: Multimodal Transformer for Language Instructions, Images, Actions and Semantic Maps at the 19th International Conference on Intelligent Autonomous Systems (IAS) in Genoa, Italy

A preprint can be found on arXiv

Dataset and installation

essential packages for the code, please check requirements.txt

Download Alfred dataset, please check: https://github.com/askforalfred/alfred

If you want to try a lighter backbone, i.e., MobileCLIP, please install from their official repo: https://github.com/apple/ml-mobileclip

Repo structure:

pretraining: All the pretraining and preprocessing code

model: end-to-end model for generating action sequence

dataset: Dataset for end-to-end training.

Reference:

@inproceedings{Wang2023LIAM,
  title={LIAM: Multimodal Transformer for Language Instructions, Images, Actions and Semantic Maps},
  author={Wang, Yihao and Memmesheimer, Raphael and Behnke, Sven},
  conference={19th International Conference on Intelligent Autonomous Systems (IAS)},
  year={2023},
  location={Genoa, Italy}
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
dataset		dataset
model		model
pretraining		pretraining
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dataset and installation

Repo structure:

Reference:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

AIS-Bonn/LIAM

Folders and files

Latest commit

History

Repository files navigation

Dataset and installation

Repo structure:

Reference:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages