开发者

Fraktur recognition with OCRopus/Tesseract on Linux

I am trying to perform recognition of a german text with fraktur typeface with ocropus but It doesn't seem to be using deu-f package.

Here are the steps I performed.

  1. Compiled and installed tesseract and ocropus.
  2. Downloaded http://tesseract-ocr.googlecode.com/files/tesseract-2.01.deu-f.开发者_JS百科tar.gz, unpacked it to tessdata/.

But when I call

 $ ocroscript recognize --tessLanguage=deu-f --output-mode=text image.png

the results are the same as when I call

$ ocroscript recognize --tessLanguage=eng --output-mode=text image.png

Any ideas what the problem is?


The problem is described in http://code.google.com/p/ocropus/issues/detail?id=87. Just need to apply the patch to ocropus and rebuild it.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜