小编Mad*_*ddy的帖子

如何在python中使用OCR获取从Image识别的文本坐标

我正在尝试使用 Tesseract 从图像中获取文本字符的坐标或位置。我想知道确切的像素位置，以便我可以使用其他工具单击该文本。

编辑：

import pytesseract
from pytesseract import pytesseract
import PIL
from PIL import Image
import cv2
import csv

img = 'E:\\OCR-DATA\\sample.jpg'
imge = Image.open(img)
    data=pytesseract.image_to_string(imge,lang='eng',boxes=True,config='hocr')

print(data)

Run Code Online (Sandbox Code Playgroud)

data包含具有框边界值的识别文本。但我不确定，如何使用该边界值来获取文本的坐标。

data变量的值如下：

O 100 356 115 373 0

u 117 356 127 368 0

t 130 356 138 372 0

p 141 351 152 368 0

u 154 356 164 368 0

t 167 356 175 371 0

Run Code Online (Sandbox Code Playgroud)

python ocr tesseract image-processing python-tesseract

Mad*_*ddy

2018 02-22

8
推荐指数

2
解决办法

7784
查看次数

标签统计

image-processing ×1

ocr ×1

python ×1

python-tesseract ×1

tesseract ×1

如何在python中使用OCR获取从Image识别的文本坐标

标签 统计

小编Mad_ddy的帖子

标签统计