A new version of this software was released today: Changes: better threshold value detection fix pnm reads for 2 byte pixels (--with-libpbm=no) update man-page (mail me your suggestions) fix g++ warnings, float-OPs replaced by int-OPs spacing reviewed; make distance() more sensitive xml-objects (barcode, melted chars) handled with weights fix division by zero bug for vertical positioned chars default output is UTF8 now, UTF-encoding bug fixed added certainty option added uninstall to Makefile debug image format changed to png (using pipe) much better word spacing (line-by-line based) better DOT_ABOVE recognition fix output of char groups or strings stored in database, utf8 input fix buffer overflow in barcode decode39 fix lost comma on end of line internal vector format added for future use (faster, scalable, rotable) line detection extended internal list management rewritten to fix memory leaks and segfaults I have tested this with my portage overlay directory with a simple rename of the gocr-0.40.ebuild -> gocr-0.41.ebuild and all built and installed with no problems.
It's buggy! Please apply the pgm-patch (message at http://jocr.sourceforge.net/ about this release)
Ok, I missed the patch or it was released after the initial release. I have updated my ebuild and am attaching it here as well as the patch that needs to be applied.
Created attachment 96073 [details] proposed ebuild
Created attachment 96074 [details] pgm patch
GOCR 0.42 is now available from http://jocr.sourceforge.net/
The new version should have tk in IUSE, tk? ( dev-lang/tk ) in RDEPEND and install gocr.tcl only if tk is enabled.
GOCR 0.43 is now available from http://jocr.sourceforge.net/
fixed in cvs. The new version is definitely better on recognizing spam.