Bug 880011 - [guru] app-text/OCRmyPDF-14.0.1-r2 fails tests
Summary: [guru] app-text/OCRmyPDF-14.0.1-r2 fails tests
Status: RESOLVED FIXED
Alias: None
Product: GURU
Classification: Unclassified
Component: Package issues
Hardware: All Linux
Importance: Normal normal
Assignee: Alarig Le Lay
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-11-06 08:13 UTC by Agostino Sarubbo
Modified: 2022-11-16 19:33 UTC
CC List: 2 users

See Also:
Package list:
Runtime testing required: ---


Attachments
build.log.xz (build.log.xz,274.10 KB, application/x-xz)
2022-11-06 08:13 UTC, Agostino Sarubbo
1-test.log (1-test.log,72 bytes, text/plain)
2022-11-06 08:13 UTC, Agostino Sarubbo

Description Agostino Sarubbo gentoo-dev 2022-11-06 08:13:13 UTC
https://blogs.gentoo.org/ago/2020/07/04/gentoo-tinderbox/

Issue: app-text/OCRmyPDF-14.0.1-r2 fails tests.
Discovered on: amd64 (internal ref: guru_ci)
Comment 1 Agostino Sarubbo gentoo-dev 2022-11-06 08:13:18 UTC
Created attachment 827977 [details]
build.log.xz

build log and emerge --info (compressed because it exceeds attachment limit, use 'xzless' to read it)
Comment 2 Agostino Sarubbo gentoo-dev 2022-11-06 08:13:19 UTC
Created attachment 827979 [details]
1-test.log

1-test.log
Comment 3 Agostino Sarubbo gentoo-dev 2022-11-06 08:13:20 UTC
Error(s) that match a known pattern:


    executor(
    for info in _process_content_streams(
    from PIL import _imagingcms
    page = PageInfo(pdf, pageno, infile, check_pages, detailed_analysis)
    pdfinfo = get_pdfinfo(
    pim_icc = pim.icc
    raise self._exception
    raise self.ex
    result = future.result()
    result = self.fn(*self.args, **self.kwargs)
    return PdfInfo(
    return self.__get_result()
    self._execute(
    self._gather_pageinfo(pdf, pageno, infile, check_pages, detailed_analysis)
    self._icc = ImageCmsProfile(iccbytesio)
    self._pages = _pdf_pageinfo_concurrent(
    self._set(core.profile_frombytes(profile.read()))
    yield ImageInfo(name=draw.name, pdfimage=pdfimage, shorthand=draw.shorthand)
    yield from _find_regular_images(container, contentsinfo)
  File "/usr/lib/python3.8/concurrent/futures/_base.py", line 389, in __get_result
  File "/usr/lib/python3.8/concurrent/futures/_base.py", line 437, in result
  File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
  File "/usr/lib/python3.8/site-packages/PIL/ImageCms.py", line 190, in __init__
  File "/usr/lib/python3.8/site-packages/PIL/ImageCms.py", line 26, in <module>
  File "/usr/lib/python3.8/site-packages/PIL/_util.py", line 19, in __getattr__
  File "/usr/lib/python3.8/site-packages/pikepdf/models/image.py", line 458, in icc
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/_concurrent.py", line 87, in __call__
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/_pipeline.py", line 165, in get_pdfinfo
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/_sync.py", line 378, in run_pipeline
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/builtin_plugins/concurrency.py", line 141, in _execute
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 382, in __init__
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 513, in _find_regular_images
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 595, in _process_content_streams
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 668, in _pdf_pageinfo_sync
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 711, in _pdf_pageinfo_concurrent
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 748, in __init__
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 794, in _gather_pageinfo
  File "/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/lib/python3.8/site-packages/ocrmypdf/pdfinfo/info.py", line 934, in __init__
')
').returncode
An exception occurred while executing the pipeline
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '--psm', '2', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.255omp7f/000001_rasterize.png', 'stdout']' returned non-zero exit status 2.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '--psm', '2', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io._z255_z6/000001_rasterize.png', 'stdout']' returned non-zero exit status 2.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '--psm', '2', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.oy00_z7w/000001_rasterize.png', 'stdout']' returned non-zero exit status 2.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '-c', 'textonly_pdf=1', '--user-words', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_user_words_ocr0/wordlist.txt', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.lrwhrvud/000001_ocr.png', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.lrwhrvud/000001_ocr_tess', 'pdf', 'txt']' returned non-zero exit status 1.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '-c', 'textonly_pdf=1', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.i37lyte3/000001_ocr.png', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.i37lyte3/000001_ocr_tess', 'pdf', 'txt', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_tesseract_config_valid0/test.cfg']' returned non-zero exit status 1.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '-c', 'textonly_pdf=1', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.mh7nxerf/000002_ocr.png', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.mh7nxerf/000002_ocr_tess', 'pdf', 'txt']' returned non-zero exit status 1.
E               subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '-c', 'textonly_pdf=1', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.nmzgaz0_/000001_ocr.png', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.nmzgaz0_/000001_ocr_tess', 'pdf', 'txt']' returned non-zero exit status 1.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.3vsjn650/images/00000011.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.ltz_87oe/images/00000014.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.nm09j02e/images/00000012.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.oivvblw9/images/00000030.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.t1mbvt4l/images/00000010.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/ocrmypdf.io.zawvjfkc/images/00000013.png')]' returned non-zero exit status 3.
E           subprocess.CalledProcessError: Command '['jbig2', '-p', PosixPath('/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/tmpk2_qzwuq/images/00000006.png')]' returned non-zero exit status 3.
E         ImportError: cannot import name '_imagingcms' from 'PIL' (/usr/lib/python3.8/site-packages/PIL/__init__.py)
E        +  where 15 = CompletedProcess(args=['/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/bin/python3.8', '-m', 'ocrmypdf', '--title', 'Du siehst den Wald vor lauter Bäumen nicht.', '--author', '孔子', '--output-type', 'pdf', '--plugin', 'tests/plugins/tesseract_noop.py', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1/tests/resources/c02-22.pdf', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_override_metadata_pdf_0/out.pdf'], returncode=15, stdout='', stderr='Opened a file
E        +  where 15 = CompletedProcess(args=['/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/bin/python3.8', '-m', 'ocrmypdf', '--title', 'Du siehst den Wald vor lauter Bäumen nicht.', '--author', '孔子', '--output-type', 'pdfa', '--plugin', 'tests/plugins/tesseract_noop.py', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1/tests/resources/c02-22.pdf', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_override_metadata_pdfa_0/out.pdf'], returncode=15, stdout='', stderr='Opened a file
E       AssertionError: Opened a file
E       ImportError: cannot import name '_imagingcms' from 'PIL' (/usr/lib/python3.8/site-packages/PIL/__init__.py)
FAILED tests/test_graft.py::test_links - ocrmypdf.exceptions.SubprocessOutput...
FAILED tests/test_main.py::test_blank_input_pdf - assert <ExitCode.child_proc...
FAILED tests/test_main.py::test_corrupt_icc - assert <ExitCode.other_error: 1...
FAILED tests/test_main.py::test_force_ocr - ImportError: cannot import name '...
FAILED tests/test_main.py::test_maximum_options[pdf-hocr] - ImportError: cann...
FAILED tests/test_main.py::test_maximum_options[pdf-sandwich] - ImportError: ...
FAILED tests/test_main.py::test_maximum_options[pdfa-hocr] - ImportError: can...
FAILED tests/test_main.py::test_maximum_options[pdfa-sandwich] - ImportError:...
FAILED tests/test_main.py::test_oversample[hocr] - ImportError: cannot import...
FAILED tests/test_main.py::test_oversample[sandwich] - ImportError: cannot im...
FAILED tests/test_main.py::test_pagesize_consistency[hocr] - ocrmypdf.excepti...
FAILED tests/test_main.py::test_pagesize_consistency[sandwich] - ocrmypdf.exc...
FAILED tests/test_main.py::test_redo_ocr - ocrmypdf.exceptions.SubprocessOutp...
FAILED tests/test_main.py::test_sidecar_pagecount - subprocess.CalledProcessE...
FAILED tests/test_main.py::test_tesseract_config_valid - ocrmypdf.exceptions....
FAILED tests/test_main.py::test_text_curves_force - ImportError: cannot impor...
FAILED tests/test_main.py::test_user_words_ocr - ocrmypdf.exceptions.Subproce...
FAILED tests/test_metadata.py::test_override_metadata[pdf] - AssertionError: ...
FAILED tests/test_metadata.py::test_override_metadata[pdfa] - AssertionError:...
FAILED tests/test_optimize.py::test_flate_to_jbig2 - subprocess.CalledProcess...
FAILED tests/test_optimize.py::test_mono_not_inverted - subprocess.CalledProc...
FAILED tests/test_page_numbers.py::test_limited_pages - ImportError: cannot i...
FAILED tests/test_pdfinfo.py::test_jpeg - ImportError: cannot import name '_i...
FAILED tests/test_preprocessing.py::test_convert_to_square_resolution[hocr]
FAILED tests/test_preprocessing.py::test_convert_to_square_resolution[sandwich]
FAILED tests/test_preprocessing.py::test_deskew - ocrmypdf.exceptions.Subproc...
FAILED tests/test_preprocessing.py::test_exotic_image[pdf-hocr-ccitt.pdf] - s...
FAILED tests/test_preprocessing.py::test_exotic_image[pdf-hocr-jbig2.pdf] - s...
FAILED tests/test_preprocessing.py::test_exotic_image[pdf-sandwich-ccitt.pdf]
FAILED tests/test_preprocessing.py::test_exotic_image[pdf-sandwich-jbig2.pdf]
FAILED tests/test_rotation.py::test_rotated_skew_timeout - ImportError: canno...
FAILED tests/test_tesseract.py::test_skip_pages_does_not_replicate[cardinal.pdf]
FAILED tests/test_tesseract.py::test_skip_pages_does_not_replicate[graph_ocred.pdf]
ImportError: cannot import name '_imagingcms' from 'PIL' (/usr/lib/python3.8/site-packages/PIL/__init__.py)
ImportError: cannot import name \'_imagingcms\' from \'PIL\' (/usr/lib/python3.8/site-packages/PIL/__init__.py)
Traceback (most recent call last):
p          = CompletedProcess(args=['/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/bin/python3.8', '-m', 'ocrmypdf', '--title', 'Du siehst den Wald vor lauter Bäumen nicht.', '--author', '孔子', '--output-type', 'pdf', '--plugin', 'tests/plugins/tesseract_noop.py', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1/tests/resources/c02-22.pdf', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_override_metadata_pdf_0/out.pdf'], returncode=15, stdout='', stderr='Opened a file
p          = CompletedProcess(args=['/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1-python3_8/install/usr/bin/python3.8', '-m', 'ocrmypdf', '--title', 'Du siehst den Wald vor lauter Bäumen nicht.', '--author', '孔子', '--output-type', 'pdfa', '--plugin', 'tests/plugins/tesseract_noop.py', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/work/OCRmyPDF-14.0.1/tests/resources/c02-22.pdf', '/var/tmp/portage/app-text/OCRmyPDF-14.0.1-r2/temp/pytest-of-portage/pytest-0/test_override_metadata_pdfa_0/out.pdf'], returncode=15, stdout='', stderr='Opened a file
E       assert <ExitCode.child_process_error: 7> == <ExitCode.ok: 0>
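
The FAILED lines above mix several distinct causes (the ImportError on _imagingcms, tesseract and jbig2 subprocess errors, exit-code assertions). As a rough triage aid, the following sketch counts pytest short-summary lines per leading error name. The function name summarize_failures and the sample list are illustrative only and not part of the tinderbox tooling; reasons that pytest truncated with "..." are counted under their truncated text.

```python
import re
from collections import Counter

# Matches pytest short-summary lines such as
# "FAILED tests/test_main.py::test_force_ocr - ImportError: cannot import name '..."
# The " - reason" part is optional because pytest omits it for long test ids.
FAILED_RE = re.compile(r"^FAILED (?P<test>\S+)(?: - (?P<reason>.*))?$")


def summarize_failures(lines):
    """Count failed tests per leading error name (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        m = FAILED_RE.match(line.strip())
        if not m:
            continue  # not a FAILED summary line
        reason = m.group("reason")
        # Keep only the text before the first ':' or whitespace, e.g. "ImportError"
        kind = re.split(r"[:\s]", reason, maxsplit=1)[0] if reason else "(no reason shown)"
        counts[kind] += 1
    return counts


# Three lines copied from the summary above
sample = [
    "FAILED tests/test_main.py::test_force_ocr - ImportError: cannot import name '...",
    "FAILED tests/test_pdfinfo.py::test_jpeg - ImportError: cannot import name '_i...",
    "FAILED tests/test_preprocessing.py::test_convert_to_square_resolution[hocr]",
]
result = summarize_failures(sample)
for kind, count in result.most_common():
    print(count, kind)
```

Running this over the full list would show how many failures trace back to the missing _imagingcms module as opposed to the tesseract or jbig2 subprocess errors.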
Comment 4 Larry the Git Cow gentoo-dev 2022-11-16 19:33:06 UTC
The bug has been closed via the following commit(s):

https://gitweb.gentoo.org/repo/proj/guru.git/commit/?id=c8062dba27fdc4bf0be3b1eb94c72455a64a3d2e

commit c8062dba27fdc4bf0be3b1eb94c72455a64a3d2e
Author:     Anna (cybertailor) Vyalkova <cyber+gentoo@sysrq.in>
AuthorDate: 2022-11-16 14:54:51 +0000
Commit:     Anna (cybertailor) Vyalkova <cyber+gentoo@sysrq.in>
CommitDate: 2022-11-16 14:54:51 +0000

    app-text/OCRmyPDF: require all image formats
    
    Closes: https://bugs.gentoo.org/880011
    Signed-off-by: Anna (cybertailor) Vyalkova <cyber+gentoo@sysrq.in>

 .../OCRmyPDF/{OCRmyPDF-14.0.1-r2.ebuild => OCRmyPDF-14.0.1-r3.ebuild}  | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)
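
The commit title suggests the ebuild now requires Pillow to be built with all of its optional image-format features; the exact dependency change is not shown here. A minimal way to check a given system for the condition behind the ImportError in comment 3 is to try importing the compiled module directly. The helper name extension_available is hypothetical; only the module name PIL._imagingcms is taken from the log.

```python
import importlib


def extension_available(module_name):
    """Return True if the named module can be imported, False on ImportError."""
    try:
        importlib.import_module(module_name)
    except ImportError:
        # Also covers ModuleNotFoundError, e.g. when Pillow itself is absent
        return False
    return True


# "PIL._imagingcms" is the module named in the ImportError from the test log
name = "PIL._imagingcms"
print(name, "available" if extension_available(name) else "missing")
```

If this prints "missing" while Pillow itself is installed, the installed Pillow was most likely built without its colour-management extension, which matches the failure mode in the traceback.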