You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Error when finetuning Det V5 model: ((PreconditionNotMet) The element size of transformed_input should be <= INT_MAX(2147483647), but got 2463918080)
#16083
I tried adding the limit_side_len for the DetResizeForTest (suggestion from Gemini) to 2048 or 1024, but the same error keeps appearing: It yields the same number exceeds INT_MAX value, i.e., 2463918080, to any limit_side_len I configure.
As Gemini suggests, the limit_side_len should reduce the longest side of the image to the target size, here is 2048, so I think if that happened, then the max pixel length we have is 2048 x 2048 x 256 (OUT_CHANNELS)= 1,073,741,824, still < INT_MAX(2147483647). But as you see, regardless of 1024 or 2048, the exact same 2463918080 appears in the error.
My question:
My question is is there any way to overcome this? Some of my picture are large (over 3000 pixels). While training stage crop the image, the eval maybe use the full size for detection.
The next question is that somehow the error occurred from the det_mobilenet_v3.py backbone? Even I am using PP_OCR_v5 model, it is supposed to be the PPHGNetV2_B4 like in the yaml config file? I tried to look up the PPHGNetV2_B4 backbone in the git, but found only source code for the recognition model.
Sorry if I make any misunderstanding or mistake, I am still learning. Any help is highly appreciated.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone and Paddle devs,
First, I want to thank you for open-sourcing this project, much appreciated.
I am following this tutorial on how to fine-tune the PP-OCRv5 detection module. I have modified the yaml file to fit with my setup (as show below):
Yet, when finetuning, I got this error in the first evaluation stage:
I tried adding the
limit_side_len
for theDetResizeForTest
(suggestion from Gemini) to 2048 or 1024, but the same error keeps appearing: It yields the same number exceeds INT_MAX value, i.e., 2463918080, to anylimit_side_len
I configure.As Gemini suggests, the
limit_side_len
should reduce the longest side of the image to the target size, here is 2048, so I think if that happened, then the max pixel length we have is 2048 x 2048 x 256 (OUT_CHANNELS)= 1,073,741,824, still < INT_MAX(2147483647). But as you see, regardless of 1024 or 2048, the exact same 2463918080 appears in the error.My question:
PPHGNetV2_B4
like in the yaml config file? I tried to look up the PPHGNetV2_B4 backbone in the git, but found only source code for the recognition model.Sorry if I make any misunderstanding or mistake, I am still learning. Any help is highly appreciated.
Thanks
Beta Was this translation helpful? Give feedback.
All reactions