Predictions for last (irregular) batch that is smaller than batch_size in examples/ogbn_products_gat.py #4233
-
Hello! I have one question regarding the code in the example script `ogbn_products_gat.py`. At first, batches are sampled with `NeighborSampler`:

```python
train_loader = NeighborSampler(data.edge_index, node_idx=train_idx,
                               sizes=[10, 10, 10], batch_size=512,
                               shuffle=True, num_workers=12)
```

Here, the `drop_last` argument inherited from `DataLoader` is not specified and therefore defaults to `False`, so the last batch is smaller than the others. Later on, when iterating over the batches, the predictions and ground-truth values for the target nodes in a batch are indexed with `batch_size`:

```python
for batch_size, n_id, adjs in train_loader:
    # [...]
    loss = F.nll_loss(out, y[n_id[:batch_size]])
    # [...]
```

Doesn't this lead to incorrect loss values in the last batch, since it contains fewer than `batch_size` target nodes? Or did I get something wrong here? Thanks a lot for your help in advance!
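To make the concern concrete, here is a tiny toy calculation with made-up loss values (assuming the default mean reduction of `F.nll_loss`): if the per-batch losses are averaged over an epoch, examples in a smaller final batch get more weight than examples in full batches.

```python
# Hypothetical numbers: 5 examples with batch_size=2, so the last batch
# holds a single example.
per_example_losses = [1.0, 1.0, 1.0, 1.0, 4.0]
batches = [per_example_losses[0:2], per_example_losses[2:4], per_example_losses[4:5]]

# Mean-reduced loss per batch (the default behavior of F.nll_loss):
batch_means = [sum(b) / len(b) for b in batches]          # [1.0, 1.0, 4.0]

# Averaging the batch losses over-weights the lone example in the last batch:
print(sum(batch_means) / len(batch_means))                # 2.0
print(sum(per_example_losses) / len(per_example_losses))  # 1.6 (true mean)
```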
-
Yes, that is correct. In theory, it's better to use `drop_last=True` (since, otherwise, the examples in the last batch will have more impact on the loss). However, I doubt that this leads to any noticeable difference in practice. At least I've never had bad experiences when not dropping the last batch. Furthermore, I don't think dropping it is very common to see in PyTorch projects.
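For reference, here is a sketch of how to enable it (assuming the loader setup from the example script). As far as I can tell, `NeighborSampler` forwards extra keyword arguments to `torch.utils.data.DataLoader`, so `drop_last` can be passed directly:

```python
# Same setup as in examples/ogbn_products_gat.py, with drop_last=True added.
# The keyword argument is forwarded to torch.utils.data.DataLoader.
train_loader = NeighborSampler(data.edge_index, node_idx=train_idx,
                               sizes=[10, 10, 10], batch_size=512,
                               shuffle=True, num_workers=12,
                               drop_last=True)  # skip the smaller final batch
```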