
@sugary199

Previously, pad_embeds was incorrectly constructed by repeating the pad_embed tensor along the wrong dimension, leading to a size mismatch when attempting to concatenate it with inputs['inputs_embeds'].
The error message is as follows:

Process Process-1:
Traceback (most recent call last):
  File "/ML-A100/team/mm/shuyu/anaconda3/envs/intern_clean/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/ML-A100/team/mm/shuyu/anaconda3/envs/intern_clean/lib/python3.9/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/ML-A100/team/mm/shuyu/workspace/projects/InternLM-XComposer/cap_train.py", line 159, in inferCaptionsAndSave
    inputs = torch.cat([pad_embeds, inputs['inputs_embeds']], dim=1)
RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 57 but got size 1 for tensor number 1 in the list.
FINISHED!

Modification: specify dimension 1 when repeating pad_embed to build pad_embeds, so that it can be concatenated with inputs['inputs_embeds'] along the sequence dimension.

This issue was not triggered by the official examples because their token counts differ by only 1 across the batch. To reproduce it, I increased the difference in token counts between the two examples.
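For illustration, here is a minimal sketch of the shape mismatch and the fix. The variable names follow the traceback, while the concrete shapes (batch size, sequence length, hidden size, number of pad positions) are assumptions, not the values used in cap_train.py:

```python
import torch

# Assumed, illustrative shapes -- the real script may differ.
batch, seq_len, hidden = 1, 57, 4096
pad_embed = torch.randn(1, 1, hidden)                   # embedding of a single pad token
inputs_embeds = torch.randn(batch, seq_len, hidden)     # stands in for inputs['inputs_embeds']
num_pad = 7                                             # pad positions needed to align lengths

# Before: repeating along dim 0 gives shape (num_pad, 1, hidden), which cannot be
# concatenated with (batch, seq_len, hidden) along dim 1, since dim 0 must match.
# pad_embeds = pad_embed.repeat(num_pad, 1, 1)

# After: repeat along dim 1 so the batch and hidden dims line up and only the
# sequence dimension (dim 1) differs.
pad_embeds = pad_embed.repeat(batch, num_pad, 1)        # shape (batch, num_pad, hidden)

padded = torch.cat([pad_embeds, inputs_embeds], dim=1)  # (batch, seq_len + num_pad, hidden)
print(padded.shape)                                     # torch.Size([1, 64, 4096])
```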
