You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This demo shows how to implement and train a ConvMixer architecture for image classification with MATLAB®, as described in the paper "Patches are all you need?" https://openreview.net/forum?id=TVHS5Y4dNvM
4
+
5
+
The ConvMixer architecture employs a Patch Embedding representation of the input followed by repeated fully-convolutional blocks.
6
+
7
+

8
+
9
+
## How to get started
10
+
11
+
Start the project ConvMixer.prj to add to the path the relevant functions. There are examples in the `convmixer/examples` folder to get you started with training a ConvMixer for the digits dataset and the CIFAR-10 dataset [1].
12
+
13
+
The latter employs the ADAM algorithm with fixed weight decay regularization, as described in [2].
14
+
15
+
Training a ConvMixer for the CIFAR-10 architecture can be demanding in terms of computational resources: in the same `convmixer/examples` folder you can find a pretrained network. This model was trained on the CIFAR-10, available at https://www.cs.toronto.edu/~kriz/cifar-10-matlab.tar.gz
16
+
17
+
The source code for building the architecture is in the `convmixer/convmixer` directory.
18
+
19
+
## Requirements
20
+
21
+
- MATLAB® R2021b or later
22
+
- Deep Learning Toolbox™
23
+
24
+
## License
25
+
26
+
The license is available in the license file within this repository.
27
+
28
+
Copyright 2021 The MathWorks, Inc.
29
+
30
+
[1] Krizhevsky, Alex. "Learning multiple layers of features from tiny images." (2009). https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf
31
+
[2] Loshchilov, Ilya, and Frank Hutter. "Fixing weight decay regularization in ADAM." (2018). https://openreview.net/forum?id=rk6qdGgCZ
0 commit comments