To use PaddleOCR from other languages, I run the command through the terminal or operating system and parse the output log with RegExp.
For example, in Node.js using child_process, something like this:
The same approach applies in other languages.
Even if we set aside the complexity of running commands and handling stdout, stderr, and so on, this approach still has many problems.
For example, the output does not always follow a fixed pattern, so any RegExp is prone to errors.
In these two lines, the text is preceded once by (" and once by (':
[2024/08/17 14:14:11] ppocr INFO: [[[207.0,130.0],[601.0,130.0],[601.0,156.0],[207.0,156.0]],("salam' khobi' che khabar ",0.9787972569465637)]
[2024/08/17 14:14:44] ppocr INFO: [[[236.0,303.0],[626.0,303.0],[626.0,334.0],[236.0,334.0]],('salam "khobi" che khabar ',0.9980602860450745)]
Or here, where even the characters of the recognized text are escaped:
[2024/08/17 14:15:38] ppocr INFO: [[[204.0,126.0],[1025.0,128.0],[1025.0,160.0],[203.0,158.0]],('salam\' khobi\' che khabar salam "khobi" che khabar',0.9854654669761658)]
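A rough sketch of one alternative to RegExp, assuming the part after "ppocr INFO: " is always a Python literal as in the lines above: Python's `ast.literal_eval` can parse it directly, so both quote styles and the escaped characters are handled by the parser. The `payloads` list is just the three payloads copied from this post; nothing here is a PaddleOCR API.

```python
import ast

# Payloads copied from the log lines above (everything after "ppocr INFO: ").
payloads = [
    r"""[[[207.0,130.0],[601.0,130.0],[601.0,156.0],[207.0,156.0]],("salam' khobi' che khabar ",0.9787972569465637)]""",
    r"""[[[236.0,303.0],[626.0,303.0],[626.0,334.0],[236.0,334.0]],('salam "khobi" che khabar ',0.9980602860450745)]""",
    r"""[[[204.0,126.0],[1025.0,128.0],[1025.0,160.0],[203.0,158.0]],('salam\' khobi\' che khabar salam "khobi" che khabar',0.9854654669761658)]""",
]

texts = []
for payload in payloads:
    # literal_eval only parses literals (lists, tuples, strings, numbers);
    # it never executes code from the log.
    box, (text, score) = ast.literal_eval(payload)
    texts.append(text)
    print(repr(text), score)
```

This only moves the problem into Python, of course; from Node.js the parsed result would still have to be passed back in some neutral form such as JSON.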
Or in cases where there is no text in the photo, like this:
I don't really know what criterion to use. Is checking for words like NoneType, or something like dt_boxes num : 0, enough?
But the big problem is that if the text in my photo really contains such words, the result becomes unpredictable.
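To illustrate the concern, here is a minimal sketch that decides by the structure of each line instead of by keywords, assuming result lines always carry a `[box, (text, score)]` payload after "ppocr INFO: ". The function name `extract_texts`, the `MARKER` constant, and both sample logs are made up for illustration; the real no-text output may look different.

```python
import ast

MARKER = "ppocr INFO: "  # assumed prefix, based on the sample lines in this post


def extract_texts(log_output):
    """Collect recognized texts from lines whose payload is [box, (text, score)]."""
    texts = []
    for line in log_output.splitlines():
        _, found, payload = line.partition(MARKER)
        if not found:
            continue
        try:
            value = ast.literal_eval(payload.strip())
        except (ValueError, SyntaxError, TypeError):
            continue  # a diagnostic message, not a recognition result
        if (isinstance(value, list) and len(value) == 2
                and isinstance(value[1], tuple) and len(value[1]) == 2
                and isinstance(value[1][0], str)):
            texts.append(value[1][0])
    return texts


# Hypothetical log lines, made up for illustration only.
empty_log = "[2024/08/17 14:16:00] ppocr INFO: dt_boxes num : 0\n"
tricky_log = (
    "[2024/08/17 14:16:05] ppocr INFO: "
    "[[[1.0,2.0],[9.0,2.0],[9.0,5.0],[1.0,5.0]],('NoneType dt_boxes num : 0',0.99)]\n"
)

print(extract_texts(empty_log))   # []
print(extract_texts(tricky_log))  # ['NoneType dt_boxes num : 0']
```

With this approach an empty list would mean "no text found", and recognized text that happens to contain NoneType or dt_boxes stays inside the string and is not mistaken for a diagnostic.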
Output like this seems intended for direct use in code, but for me it is only a log and cannot be interpreted directly.
These are only examples; even if all of these cases were resolved, many unforeseen cases would remain.
It really surprises me that a task that should be easy to solve wastes so much energy here.
So my question is: is this approach correct in principle?
Doesn't PaddleOCR offer a more basic method for getting exactly the desired text from other languages? Like an API or whatever...
Do you have a better, more accurate idea for interpreting the log?