This project aims to reduce inference time while maintaining performance for monocular depth estimation. Report
- Added `PixelFormer_new.py` to accommodate the new attention mechanisms.
- Added `SAM_cosine.py`, which replaces the dot-product window attention with cosine-similarity window attention.
- Replaced the 7×7 window attention used in the original work with the two attention mechanisms below, which fuse encoder and decoder features with global context and improve on the baseline model's performance.
- Added `SAM_efficient.py`, which implements Efficient Attention inside the skip attention module.
- Added `SAM_fast.py`, which implements FAVOR+ attention inside the skip attention module.
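The cosine-similarity variant differs from standard window attention only in how the logits are computed: queries and keys are L2-normalized so the scores are bounded cosine similarities, scaled by a temperature. This is a minimal NumPy sketch of that idea, not the code in `SAM_cosine.py`; the 49 tokens correspond to one 7×7 window, and the fixed temperature `tau` is an illustrative stand-in for what would be a learnable parameter in practice.

```python
import numpy as np

def dot_product_attention(q, k, v):
    # Standard scaled dot-product attention (the original window attention).
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

def cosine_attention(q, k, v, tau=0.1):
    # Cosine-similarity attention: L2-normalize queries and keys so logits
    # are cosine similarities in [-1, 1], then divide by a temperature tau
    # (learnable in practice; fixed here for illustration).
    qn = q / np.linalg.norm(q, axis=-1, keepdims=True)
    kn = k / np.linalg.norm(k, axis=-1, keepdims=True)
    scores = (qn @ kn.T) / tau
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
q, k, v = rng.normal(size=(3, 49, 32))  # 49 tokens = one 7x7 window
out = cosine_attention(q, k, v)
print(out.shape)  # (49, 32)
```

Bounding the logits this way keeps the softmax from saturating when feature magnitudes grow, which is the usual motivation for swapping dot products for cosine similarity.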
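Both global mechanisms avoid forming the full n×n attention matrix by associating the key-value product first, which is what makes them cheaper than windowed softmax attention at large resolutions. The sketch below is a simplified NumPy illustration of the two ideas, not the implementations in `SAM_efficient.py` or `SAM_fast.py`; the shapes and the random-feature count `m` are arbitrary choices for the example.

```python
import numpy as np

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def efficient_attention(q, k, v):
    # Efficient Attention: softmax over the feature axis of Q and the token
    # axis of K, then compute (K^T V) first. Cost drops from O(n^2 d) to
    # O(n d^2) -- linear in the token count n.
    qs = softmax(q, axis=-1)   # (n, d): each row sums to 1
    ks = softmax(k, axis=0)    # (n, d): each column sums to 1
    return qs @ (ks.T @ v)     # (n, d_v) without an n x n matrix

def favor_plus_attention(q, k, v, m=256, seed=0):
    # FAVOR+ (Performer-style): approximate the softmax kernel with positive
    # random features phi(x) = exp(w.x - |x|^2 / 2) / sqrt(m).
    d = q.shape[-1]
    q, k = q / d ** 0.25, k / d ** 0.25     # split the usual 1/sqrt(d) scaling
    w = np.random.default_rng(seed).normal(size=(m, d))
    phi = lambda x: np.exp(x @ w.T - 0.5 * (x ** 2).sum(-1, keepdims=True)) / np.sqrt(m)
    qp, kp = phi(q), phi(k)                 # (n, m) non-negative features
    num = qp @ (kp.T @ v)                   # numerator, again linear in n
    den = qp @ kp.sum(axis=0)[:, None]      # per-token softmax normalizer
    return num / den

rng = np.random.default_rng(1)
q, k, v = rng.normal(size=(3, 49, 32))
print(efficient_attention(q, k, v).shape)   # (49, 32)
print(favor_plus_attention(q, k, v).shape)  # (49, 32)
```

Efficient Attention changes the attention semantics (it is not an approximation of softmax attention), while FAVOR+ is an unbiased estimator of it whose variance shrinks as `m` grows; that trade-off is why the two are worth comparing as skip-attention replacements.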
- Download all the models from this link and place them in the `pretrained` folder.
- Environment setup is described in `PixelFormer.ipynb`.
- By default, the code is configured to train and evaluate the variant that uses Efficient Attention.
- Changes to the training config can be made in `configs/arguments_train_nyu.txt`.
- Run the following command to train the model on the NYU Depth V2 dataset.

  ```
  python pixelformer/train.py configs/arguments_train_nyu.txt
  ```
- Run the following two commands for evaluation and testing.

  ```
  python pixelformer/eval.py configs/arguments_eval_nyu.txt
  python pixelformer/test.py configs/arguments_test_nyu.txt
  ```
The code utilized in this project has been adapted from the PixelFormer repository. For a comprehensive view of the entire architecture, please refer to the original repository.