Bug 245230 - Stabilize app-text/tesseract-2.03
Summary: Stabilize app-text/tesseract-2.03
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: New packages
Hardware: All Linux
Importance: Normal enhancement
Assignee: Patrick McLean
URL:
Whiteboard:
Keywords: STABLEREQ
Depends on:
Blocks:
 
Reported: 2008-11-02 06:29 UTC by SpanKY
Modified: 2009-05-21 20:18 UTC

See Also:
Package list:
Runtime testing required: ---


Description SpanKY gentoo-dev 2008-11-02 06:29:34 UTC
no open bugs and has been in the tree for a while now ... works fine for me with OCRing some pdfs i needed
Comment 1 Ferris McCormick (RETIRED) gentoo-dev 2008-11-02 14:59:58 UTC
Sparc stable.
Comment 2 Markus Meier gentoo-dev 2008-11-05 20:44:44 UTC
amd64/x86 stable
Comment 3 Brent Baude (RETIRED) gentoo-dev 2008-11-07 15:08:46 UTC
ppc stable
Comment 4 Tobias Klausmann (RETIRED) gentoo-dev 2008-11-09 16:00:28 UTC
Stable on alpha.

You might want to look into emitting a warning if LINGUAS is empty (i.e. contains only -*) since the whole program refuses to work if that's the case (and it doesn't really complain in a helpful way).
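The check suggested in comment 4 could look roughly like the sketch below. This is only an illustration in plain shell, not code from the tesseract ebuild: the function names linguas_has_language and warn_if_no_linguas are hypothetical, the warning text is made up, and treating an unset LINGUAS the same as an empty one is an assumption. A real ebuild would presumably use Portage's ewarn rather than echo.

```shell
# Hypothetical helper, not taken from the tesseract ebuild.
# Succeeds if LINGUAS names at least one language, fails otherwise.
# The body runs in a subshell so that "set -f" does not leak to the caller.
linguas_has_language() (
    set -f  # disable globbing so a literal "-*" token is not expanded
    # Assumption: an unset LINGUAS is treated like an empty one.
    for token in ${LINGUAS:-}; do
        case "${token}" in
            -*) ;;        # negated entries such as "-*" or "-de" select nothing
            *) exit 0 ;;  # found a real language code
        esac
    done
    exit 1
)

# Hypothetical wrapper that prints a warning on stderr when no language is selected.
warn_if_no_linguas() {
    if ! linguas_has_language; then
        echo "WARNING: LINGUAS selects no languages; no language data will be installed and tesseract will not be able to OCR anything." >&2
    fi
}
```

With LINGUAS="-*" the wrapper prints the warning; with LINGUAS="en" it stays silent.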
Comment 5 Brent Baude (RETIRED) gentoo-dev 2009-03-15 13:44:54 UTC
TOC failure on ppc64:

ar cru libtesseract_main.a tessedit.o adaptions.o applybox.o baseapi.o blobcmp.o callnet.o charcut.o charsample.o control.o docqual.o expandblob.o fixspace.o fixxht.o imgscale.o matmatch.o output.o paircmp.o reject.o scaleimg.o tessbox.o tessvars.o pagewalk.o pgedit.o varabled.o tfacepp.o tstruct.o werdit.o 
powerpc64-unknown-linux-gnu-ranlib libtesseract_main.a
ld -r -o libtesseract_full.o tesseractfull.o \
    libtesseract_main.a \
    ../textord/libtesseract_textord.a \
    ../pageseg/libtesseract_pageseg.a \
    ../wordrec/libtesseract_wordrec.a \
    ../classify/libtesseract_classify.a \
    ../dict/libtesseract_dict.a \
    ../viewer/libtesseract_viewer.a \
    ../image/libtesseract_image.a \
    ../cutil/libtesseract_cutil.a \
    ../ccstruct/libtesseract_ccstruct.a \
    ../ccutil/libtesseract_ccutil.a
ld: TOC section size exceeds 64k
make[3]: *** [libtesseract_full.o] Error 1
make[3]: Leaving directory `/var/tmp/portage/app-text/tesseract-2.03/work/tesseract-2.03/ccmain'
make[2]: *** [all-recursive] Error 1
make[2]: Leaving directory `/var/tmp/portage/app-text/tesseract-2.03/work/tesseract-2.03/ccmain'
make[1]: *** [all-recursive] Error 1
make[1]: Leaving directory `/var/tmp/portage/app-text/tesseract-2.03/work/tesseract-2.03'
make: *** [all-recursive-am] Error 2
 * 
 * ERROR: app-text/tesseract-2.03 failed.
Comment 6 Brent Baude (RETIRED) gentoo-dev 2009-04-14 20:48:14 UTC
ppc64 done
Comment 7 Richard Scott 2009-05-21 11:09:27 UTC
Can this please not be marked stable? It doesn't install this file:

/usr/share/tessdata/eng.unicharset

Which stops OCR scanning in SpamAssassin from working correctly :-(

Apparently the language files have moved out of the main tarball into a new one that you have to also download and install.

Other distros have an RPM for it:

http://rpm.pbone.net/index.php3/stat/4/idpl/12500217/com/tesseract-lang-en-2.03-1.i586.rpm.html
Comment 8 SpanKY gentoo-dev 2009-05-21 18:22:26 UTC
fix your lang settings.  the ebuild only installs what you configure it to.  otherwise, that is not something to post here.
Comment 9 Richard Scott 2009-05-21 19:14:53 UTC
(In reply to comment #8)
> fix your lang settings.  the ebuild only installs what you configure it to. 
> otherwise, that is not something to post here.
> 

Oh, I see... 

Would it not have made sense to default to English rather than no language at all?
Comment 10 SpanKY gentoo-dev 2009-05-21 20:18:31 UTC
probably, but you should file a new bug to ask for that enhancement ;)