The characters "傈僳" (Lisu) are not recognized in images, and the demo at https://aistudio.baidu.com/community/app/91660/webUI fails to recognize them as well #14617

PikachuGits · 2025-02-05T02:34:44Z

PikachuGits
Feb 5, 2025

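When running recognition on images, the two characters "傈僳" are never recognized. The demo at https://aistudio.baidu.com/community/app/91660/webUI fails to recognize them as well.
Here is an arbitrary screenshot that contains 傈僳:

Here is the recognition result:

I have tried many approaches, including the solution given in issue
#14601

but none of them solved the problem. I checked the dictionary file shipped in the package and it does contain both characters, yet they are still not recognized at inference time.

I have only just started using PaddleOCR and am not sure how to handle this kind of problem. I would appreciate any suggested solution.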
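The dictionary check described above can be sketched as a small helper. It assumes the dictionary is a UTF-8 text file with one character per line; the function name `missing_from_dict` and the path in the usage note are illustrative, not part of PaddleOCR.

```python
def missing_from_dict(dict_path, chars):
    """Return the characters in `chars` that have no line of their own in the dictionary file.

    Assumption: the file is UTF-8 with one dictionary entry per line.
    """
    entries = set()
    with open(dict_path, encoding="utf-8") as f:
        for line in f:
            # Strip only the line terminator so entries such as a space survive.
            entries.add(line.rstrip("\r\n"))
    return [ch for ch in chars if ch not in entries]
```

For example, `missing_from_dict("path/to/ppocr_keys_v1.txt", "傈僳")` returning an empty list would confirm that both characters have entries. Note that a dictionary entry only means the model has an output class for the character; whether the model actually predicts it depends on how often it appeared in the training data.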

🏃‍♂️ Environment
macOS on an Intel i7, Python 3.9

🌰 Minimal Reproducible Example
from PIL import Image, ImageEnhance, ImageFilter
from paddleocr import PaddleOCR
import cv2
import numpy as np
import json

# Initialize the OCR engine
ocr = PaddleOCR(use_angle_cls=True)

img_path = "Snipaste_2025-01-27_16-12-27.png"
try:
    # Open the image
    image = Image.open(img_path).convert("RGB")

    # Boost contrast (adjust the factor as needed)
    enhancer = ImageEnhance.Contrast(image)
    image = enhancer.enhance(1.5)

    # Sharpen the image
    image = image.filter(ImageFilter.SHARPEN)

    # Convert the PIL image to a NumPy array for OpenCV (channel order stays RGB)
    image_cv = np.array(image)

    # Convert to grayscale; the array is RGB, so use COLOR_RGB2GRAY rather than COLOR_BGR2GRAY
    gray_image = cv2.cvtColor(image_cv, cv2.COLOR_RGB2GRAY)

    # Denoise with a median filter
    denoised_image = cv2.medianBlur(gray_image, 3)

    # Apply a Gaussian blur
    blurred_image = cv2.GaussianBlur(denoised_image, (5, 5), 0)

    # Binarize. As posted, the threshold is applied to gray_image, so the
    # denoised and blurred images above never reach the OCR step.
    _, binary_image = cv2.threshold(gray_image, 150, 255, cv2.THRESH_BINARY)

    # Run OCR
    results = ocr.ocr(binary_image, cls=True)

    # results[0] is None when no text is detected, so guard before iterating
    lines = results[0] if results and results[0] else []

    # Extract the non-empty text lines
    text_lines = [line[1][0] for line in lines if line[1][0].strip()]
    print(json.dumps(text_lines, ensure_ascii=False))
    if lines:
        for line in lines:
            text = line[1][0]
            confidence = line[1][1]
            print(f"Recognized text: {text}, confidence: {confidence:.2f}")
    else:
        print("No text was recognized.")

except FileNotFoundError:
    print(f"Image file not found: {img_path}")
except Exception as e:
    print(f"An error occurred: {e}")
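When comparing preprocessing variants or models, it may help to check the output for the target characters programmatically instead of reading the printed lines. The following is a minimal sketch; `find_target_chars` is a hypothetical helper, and it assumes the PaddleOCR 2.x result layout used in the script above, where `results[0]` is a list of `[box, (text, confidence)]` entries or `None`.

```python
def find_target_chars(results, targets):
    """Map each target character to the recognized lines that contain it.

    Assumption: results[0] is a list of [box, (text, confidence)] entries,
    or None when nothing was detected.
    """
    lines = results[0] if results and results[0] else []
    hits = {ch: [] for ch in targets}
    for _box, (text, confidence) in lines:
        for ch in targets:
            if ch in text:
                hits[ch].append((text, confidence))
    return hits
```

Calling `find_target_chars(results, "傈僳")` once on the raw image and once on the binarized image would show whether the preprocessing changes anything for these two characters. An empty list for a character means no recognized line contained it.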

GreatV · 2025-02-05T02:38:44Z

GreatV
Feb 5, 2025
Maintainer

The recognition model probably needs some fine-tuning.

0 replies

GreatV · 2025-02-05T03:04:58Z

GreatV
Feb 5, 2025
Maintainer

You could also try the high-accuracy model.

2 replies

PikachuGits Feb 5, 2025
Author

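I am trying
det_model_dir="ch_PP-OCRv4_det_server_infer", # path to the detection model
rec_model_dir="ch_PP-OCRv4_rec_server_infer", # path to the recognition model

However, both seem to skip the two characters 傈僳 entirely. The characters can be found in the dictionary, but I don't understand why they are not recognized.
I have only just started with this and did not expect to run into a problem like this.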
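For reference, here is one way the two directories above might be passed to the engine. This is a sketch, not the author's full script: `build_ocr_kwargs` and `load_ocr` are hypothetical helpers, while `det_model_dir`, `rec_model_dir`, `rec_char_dict_path`, and `use_angle_cls` are PaddleOCR 2.x constructor arguments. The optional dictionary path matters mainly for a custom or fine-tuned recognition model, which needs the same dictionary it was trained with.

```python
def build_ocr_kwargs(det_dir, rec_dir, dict_path=None):
    """Collect PaddleOCR constructor arguments for explicitly chosen models."""
    kwargs = {
        "use_angle_cls": True,
        "det_model_dir": det_dir,   # detection model directory
        "rec_model_dir": rec_dir,   # recognition model directory
    }
    if dict_path is not None:
        # Must match the dictionary the recognition model was trained with.
        kwargs["rec_char_dict_path"] = dict_path
    return kwargs


def load_ocr(det_dir, rec_dir, dict_path=None):
    """Create the engine; paddleocr is imported lazily so the helper above stays importable."""
    from paddleocr import PaddleOCR
    return PaddleOCR(**build_ocr_kwargs(det_dir, rec_dir, dict_path))
```

With the directories from the post, the call would be `load_ocr("ch_PP-OCRv4_det_server_infer", "ch_PP-OCRv4_rec_server_infer")`.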

jingsongliujing Feb 6, 2025
Collaborator

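There may simply be too little training data for these two characters. You could collect some relevant data and fine-tune the model on it.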
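As a starting point for collecting such data, the sketch below writes a recognition label file in the tab-separated `image_path<TAB>text` layout that PaddleOCR uses for recognition training lists, keeping only samples that contain at least one of the under-represented characters. The helper name `write_rec_labels` and any file names are assumptions for illustration; the cropped text-line images themselves still have to be gathered and transcribed separately.

```python
def write_rec_labels(samples, label_path, required_chars=""):
    """Write (image_path, text) pairs as 'image_path<TAB>text' lines.

    If required_chars is non-empty, only samples whose text contains at
    least one of those characters are kept. Returns the number of lines written.
    """
    kept = 0
    with open(label_path, "w", encoding="utf-8") as f:
        for image_path, text in samples:
            if required_chars and not any(ch in text for ch in required_chars):
                continue
            f.write(f"{image_path}\t{text}\n")
            kept += 1
    return kept
```

In practice a fine-tuning set usually mixes these targeted samples with general text lines so the model does not lose accuracy on other characters.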
