How to Use Tesseract OCR

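I tried doing the same thing as the following page, this time with Tesseract.
How to Use Microsoft Computer Vision API OCR (Japanese)
Tesseract was used as installed, with no additional training at all.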

Installing on Arch Linux

sudo pacman -S tesseract
sudo pacman -S tesseract-data-jpn

Installing on Ubuntu

sudo apt install tesseract-ocr tesseract-ocr-jpn
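
After either install, it may be worth confirming that the Japanese language data is visible to Tesseract before running OCR. The following is a minimal sketch using `tesseract --list-langs`. The helper name `has_lang` is hypothetical, and `2>&1` is included only in case a given Tesseract version prints the list on standard error.

```shell
# Succeed if the language code given as $1 appears as a whole line
# in the `tesseract --list-langs` output read from stdin.
has_lang() {
    grep -qx "$1"
}

if command -v tesseract >/dev/null 2>&1; then
    if tesseract --list-langs 2>&1 | has_lang jpn; then
        echo "jpn language data is installed"
    else
        echo "jpn language data is missing"
    fi
else
    echo "tesseract is not on PATH"
fi
```

If `jpn` is reported as missing, the `-l jpn` option used in the run further down will fail to load the language.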

Input image

Result

$ tesseract hensel_gretel.png out01 -l jpn
Tesseract Open Source OCR Engine v4.1.0 with Leptonica

out01.txt

大 き な 怪 の す く 近 く に 、 木 と り が お か み さ ん と 依
た ち と 一 細 に 住 ん で い ま し た 畑 の チ は ヘ ン ゼ ル で 如
の 子 は ダ レ ー テ ル と い う 名 前 で し た 、 本 こ り に は ほ
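
The recognized text comes out with a space between every character. If that gets in the way when comparing it against the original text, the spaces can be stripped afterward. This is a minimal sketch, assuming every ASCII space in the file is unwanted. Full-width spaces are left alone, and `strip_spaces` and `out01_nospace.txt` are hypothetical names.

```shell
# Remove every ASCII space from stdin; all other bytes pass through unchanged.
strip_spaces() {
    tr -d ' '
}

# out01.txt is the file written by the tesseract command above.
# out01_nospace.txt is a hypothetical name for the cleaned copy.
if [ -f out01.txt ]; then
    strip_spaces < out01.txt > out01_nospace.txt
fi
```

This only changes spacing. Misrecognized characters in the output stay as they are.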

Tested in the following environment:

Linux *** 5.4.7-arch1-1 #1 SMP PREEMPT Tue, 31 Dec 2019 17:20:16 +0000 x86_64 GNU/Linux

Author And Source

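Original article: Tesseract OCR の使い方 (How to Use Tesseract OCR), https://qiita.com/ekzemplaro/items/f7dc38bb2a35be9f9f10

Author attribution: information about the original author is available at the original URL. Copyright belongs to the original author.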
