Version 3.03 release candidate of tesseract is now available (source only so far) for download and contains many new features. (See the ReleaseNotes[1] for a full list.) Please add it to portage. [1] https://code.google.com/p/tesseract-ocr/wiki/ReleaseNotes
I can't find a tarball, what kind of release is this?
(In reply to Thomas Kahle from comment #1) > I can't find a tarball, what kind of release is this? From the home page[1]: A Note about Downloads With the discontinuation of downloads at code.google.com, new source downloads will be posted to GoogleDrive[2]. Other download folders will be setup as new files are uploaded, and the original Downloads page will go away. During the transition, other downloads can still be found at the Old Downloads[3] page. This Download link[4] works for me. [1] https://code.google.com/p/tesseract-ocr/ [2] https://drive.google.com/folderview?id=0B7l10Bj_LprhQnpSRkpGMGV2eE0&usp=sharing [3] https://code.google.com/p/tesseract-ocr/downloads/list [4] https://doc-14-1k-docs.googleusercontent.com/docs/securesc/8lndjh1gjd9neuhf43mscfj1bui7d3r1/bdfj86sfals0icn88r8sfl0spceh45e8/1399752000000/03152839789340657286/05143121894911104607/0B7l10Bj_LprhSGN2bTYwemVRREU?e=download&h=16653014193614665626&nonce=lpop567kj8vgm&user=05143121894911104607&hash=aka2rt08shjlte927ntrth72vuvgrr9l
We probably need to download the tarball and re-host it on dev.gentoo.org
New version is in tree. Points that remain open - I did not test if it works as expected for users - We need a new version of leptonica which I also bumped, but it has no alpha keywords, so tesseract lost ~alpha too (bug 510240) - The language files seem all outdated and we don't even ship some of the newer ones like "ancient greek" and "esperanto" because I could not fit them in the LINGUAS scheme. Any ideas? -> New enhancement bug please. Patches welcome. Also: Please test.