训练demo, 使用ic15_data, cpu 一直接近100%, 显存一直占用,但是GPU一直未使用,一直卡在第一步,有谁知道怎么回事 #14935
Unanswered
AlvinZheng
asked this question in
Q&A
Replies: 1 comment 2 replies
-
你的问题描述表明 PaddleOCR 训练时 CPU 占用 100%,但 GPU 未使用,并且卡在第一步。参考相关的讨论(分布式训练卡住),可能的原因和解决方案如下: 可能的原因:
可能的解决方案:
如果这些方法都无法解决问题,可以查看 Response generated by 🤖 feifei-bot | chatgpt-4o-latest |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
[2025/03/24 21:43:01] ppocr INFO: train with paddle 2.6.2 and device Place(gpu:0)
[2025/03/24 21:43:01] ppocr INFO: Initialize indexes of datasets:['./train_data/ic15_data/rec_gt_train.txt']
[2025/03/24 21:43:01] ppocr INFO: Initialize indexes of datasets:['./train_data/ic15_data/rec_gt_test.txt']
W0324 21:43:01.156922 78954 gpu_resources.cc:119] Please NOTE: device: 0, GPU Compute Capability: 6.1, Driver API Version: 12.4, Runtime API Version: 11.8
W0324 21:43:01.157807 78954 gpu_resources.cc:164] device: 0, cuDNN Version: 8.8.
[2025/03/24 21:43:01] ppocr INFO: train dataloader has 34 iters
[2025/03/24 21:43:01] ppocr INFO: valid dataloader has 17 iters
[2025/03/24 21:43:01] ppocr INFO: load pretrain successful from ./pretrain_models/en_PP-OCRv3_rec_train/best_accuracy
[2025/03/24 21:43:01] ppocr INFO: During the training process, after the 0th iteration, an evaluation is run every 2000 iterations
就一直卡这儿。确定use_gpu True
Beta Was this translation helpful? Give feedback.
All reactions