Easier way to automatically figure out the input shape after the nn.Flatten()
layer in a CNN?
#313
-
In section 2, in the video "Model 2: Using a Trick to Find the Input and Output Shapes of Each of Our Layers", a method is shown where the shape is printed after each convolutional block. This is done in the video to figure out the shape to enter into the linear layer after the flatten layer, and we are then walked through multiplying the hidden units by 7 twice. Is there a way around this, or do we need to figure out the shape this way every time we build a CNN?
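(For anyone reading later, the "trick" referred to is roughly a dummy forward pass like the sketch below. The `hidden_units=10` value and the 1x28x28 FashionMNIST-style input are assumptions for illustration, not taken from the video.)

```python
# Minimal sketch of the "print the shape after each block" trick.
# Assumes a 1x28x28 input and two conv blocks that each halve H and W with MaxPool2d.
import torch
from torch import nn

hidden_units = 10

conv_block_1 = nn.Sequential(
    nn.Conv2d(in_channels=1, out_channels=hidden_units, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=2)  # 28x28 -> 14x14
)
conv_block_2 = nn.Sequential(
    nn.Conv2d(in_channels=hidden_units, out_channels=hidden_units, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=2)  # 14x14 -> 7x7
)

dummy = torch.randn(1, 1, 28, 28)     # fake batch of one image
x = conv_block_1(dummy)
print(x.shape)                        # torch.Size([1, 10, 14, 14])
x = conv_block_2(x)
print(x.shape)                        # torch.Size([1, 10, 7, 7])
print(nn.Flatten()(x).shape)          # torch.Size([1, 490]) -> hidden_units*7*7
```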
Replies: 1 comment 2 replies
-
Hey @ZachCalkins,

Good question!

And yes there is, you can use torch.nn.LazyLinear(). Check out the torch.nn.LazyLinear documentation for an example.

But in essence, the "Lazy" means "figure out the in_features parameter automatically".

For example:

```python
# Original
self.classifier = nn.Sequential(
    nn.Flatten(),
    nn.Linear(in_features=hidden_units*7*7,
              out_features=output_shape)
)
```

Becomes:

```python
# New with LazyLinear
self.classifier = nn.Sequential(
    nn.Flatten(),
    nn.LazyLinear(out_features=output_shape)  # notice there's no "in_features" (this is inferred by the layer)
)
```

Try it out and see how you go!

In fact, there are many "Lazy" layers (these are quite new in PyTorch), try searching the documentation for "lazy layers" - https://pytorch.org/docs/stable/search.html?q=lazy&check_keywords=yes&area=default