Skip to content

sovinbirla/Multimodal-Vision-Transformer

Repository files navigation

Multimodal-Vision-Transformer

Coding a vision language model from scratch

Batch Normalization

Drastic Input Batch Changes -> Input changes -> loss changes -> gradient changes -> weight updating changes -> Big change in the weights of the network -> Network will learn very slowly

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages