当我读取 15 页的多页 Tiff 图像并且是白色背景的黑色字母/单词的文档时,PyTesseract 在我循环页面并转换为字符串的步骤中抛出“OSError:-9”错误。
我将 pytesseract 包与 pyocr.builders 一起使用。单页似乎工作正常,但我相信当图像不是 RGB 时程序会转换为 RGB 时会出现错误。
img = Image.open(r'\users\ai\text.tiff')
img.load()
txt = ""
for frame in range(0, img.n_frames):
img.seek(frame)
txt += tool.image_to_string(img,builder=pyocr.builders.TextBuilder())
Run Code Online (Sandbox Code Playgroud)
预期输出是 jupyter 窗口中显示的所有 15 页。
错误信息
---------------------------------------------------------------------------
OSError Traceback (most recent call last)
<ipython-input-17-e59bdf3b773c> in <module>
2 for frame in range(0, img.n_frames):
3 img.seek(frame)
----> 4 txt += tool.image_to_string(img,builder=pyocr.builders.TextBuilder())
5
~\AppData\Local\Continuum\anaconda3\lib\site-packages\pyocr\tesseract.py in image_to_string(image, lang, builder)
357 with tempfile.TemporaryDirectory() as tmpdir:
358 if image.mode != "RGB":
--> 359 image = …Run Code Online (Sandbox Code Playgroud) 我正在努力绘制 PDF 和 CDF 图
Sn=X1+X2+X3+....+Xn 使用中心极限定理,其中 n = 1;2;3;4;5;10;20;40 我将 Xi 视为 (0,3) 之间值的均匀连续随机变量。
Here is what i have done so far -
close all
%different sizes of input X
%N=[1 5 10 50];
N = [1 2 3 4 5 10 20 40];
%interval (1,6) for random variables
a=0;
b=3;
%to store sum of differnet sizes of input
for i=1:length(N)
%generates uniform random numbers in the interval
X = a + (b-a).*rand(N(i),1);
S=zeros(1,length(X));
S=cumsum(X);
cd=cdf('Uniform',S,0,3);
plot(cd);
hold on; …Run Code Online (Sandbox Code Playgroud)