ocrmypdf_8.0.1+dfsg-1_all.deb


Advertisement

Description

ocrmypdf - add an OCR text layer to PDF files

Property Value
Distribution Debian 10 (Buster)
Repository Debian Main i386
Package filename ocrmypdf_8.0.1+dfsg-1_all.deb
Package name ocrmypdf
Package version 8.0.1+dfsg
Package release 1
Package architecture all
Package type deb
Category graphics
Homepage https://github.com/jbarlow83/OCRmyPDF
License -
Maintainer Sean Whitton <spwhitton@spwhitton.name>
Download size 109.48 KB
Installed size 431.00 KB
OCRmyPDF generates a searchable PDF/A file from a regular PDF
containing only images, allowing it to be searched.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
Some other main features:
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, a test suite and continuous
integration.

Alternatives

Package Version Architecture Repository
ocrmypdf_8.0.1+dfsg-1_all.deb 8.0.1+dfsg all Debian Main
ocrmypdf - - -

Requires

Name Value
ghostscript >= 9.18~dfsg~
icc-profiles-free -
liblept5 -
python3-cffi-backend-api-max >= 9729
python3-cffi-backend-api-min <= 9729
python3-chardet -
python3-img2pdf >= 0.3.0
python3-pdfminer >= 20181108+dfsg-3
python3-pikepdf -
python3-pil -
python3-pkg-resources -
python3-reportlab -
python3-ruffus >= 2.8
python3:any -
qpdf >= 8.0.2
tesseract-ocr >= 4.0.0
zlib1g -

Download

Type URL
Mirror ftp.br.debian.org
Binary Package ocrmypdf_8.0.1+dfsg-1_all.deb
Source Package ocrmypdf

Install Howto

  1. Update the package index:
    # sudo apt-get update
  2. Install ocrmypdf deb package:
    # sudo apt-get install ocrmypdf

Files

Path
/usr/bin/ocrmypdf
/usr/lib/python3/dist-packages/ocrmypdf/__init__.py
/usr/lib/python3/dist-packages/ocrmypdf/__main__.py
/usr/lib/python3/dist-packages/ocrmypdf/_jobcontext.py
/usr/lib/python3/dist-packages/ocrmypdf/_pipeline.py
/usr/lib/python3/dist-packages/ocrmypdf/_unicodefun.py
/usr/lib/python3/dist-packages/ocrmypdf/_weave.py
/usr/lib/python3/dist-packages/ocrmypdf/exceptions.py
/usr/lib/python3/dist-packages/ocrmypdf/helpers.py
/usr/lib/python3/dist-packages/ocrmypdf/hocrtransform.py
/usr/lib/python3/dist-packages/ocrmypdf/leptonica.py
/usr/lib/python3/dist-packages/ocrmypdf/optimize.py
/usr/lib/python3/dist-packages/ocrmypdf/pdfa.py
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/PKG-INFO
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/dependency_links.txt
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/entry_points.txt
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/not-zip-safe
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/requires.txt
/usr/lib/python3/dist-packages/ocrmypdf-8.0.1+dfsg.egg-info/top_level.txt
/usr/lib/python3/dist-packages/ocrmypdf/data/sRGB.icc
/usr/lib/python3/dist-packages/ocrmypdf/exec/__init__.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/ghostscript.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/jbig2enc.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/pngquant.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/qpdf.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/tesseract.py
/usr/lib/python3/dist-packages/ocrmypdf/exec/unpaper.py
/usr/lib/python3/dist-packages/ocrmypdf/lib/__init__.py
/usr/lib/python3/dist-packages/ocrmypdf/lib/_leptonica.py
/usr/lib/python3/dist-packages/ocrmypdf/lib/compile_leptonica.py
/usr/lib/python3/dist-packages/ocrmypdf/pdfinfo/__init__.py
/usr/lib/python3/dist-packages/ocrmypdf/pdfinfo/ghosttext.py
/usr/lib/python3/dist-packages/ocrmypdf/pdfinfo/layout.py
/usr/share/doc/ocrmypdf/NEWS.Debian.gz
/usr/share/doc/ocrmypdf/changelog.Debian.gz
/usr/share/doc/ocrmypdf/changelog.gz
/usr/share/doc/ocrmypdf/copyright
/usr/share/man/man1/ocrmypdf.1.gz

Changelog

2019-01-26 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (8.0.1+dfsg-1) unstable; urgency=medium
* New upstream release.
2019-01-14 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (8.0.0+dfsg-3) unstable; urgency=medium
* Require python3-pdfminer (>= 20181108+dfsg-3).
2019-01-14 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (8.0.0+dfsg-2) unstable; urgency=medium
* Revert changes in previous upload that disabled usage of pdfminer.six.
It turns out that the blocking problem was not #886291, but instead
the problem fixed by the 20181108+dfsg-3 upload of src:pdfminer.
Thanks to Daniele Tricoli for the fix.
2019-01-11 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (8.0.0+dfsg-1) unstable; urgency=medium
* New upstream release.
- Add tests/resources/enron1.pdf to Files-Excluded
See https://github.com/pikepdf/pikepdf/issues/21
- Patch out test_prevent_gs_invalid_xml
This test requires tests/resources/enron1.pdf
- Tighten dependency on tesseract-ocr.
- Tighten {build-,}dep on pikepdf.
* Drop dependencies on python3-pdfminer & patch pdfminer.six out of setup.py.
OCRmyPDF's usage of pdfminer is broken due to #886291.  The problem is
not likely to be fixed in time for the buster freeze, so disable
pdfminer functionality for now.
Also see https://github.com/jbarlow83/OCRmyPDF/issues/339
* Drop bogus Debian changes to upstream file tests/test_main.py by
checking out the file from tag v8.0.0+dfsg (Closes: #918891).
The changes were introduced in upstream releases 6.2.4 and 6.2.5 and
dropped by 7.4.0.  The merge of upstream version 7.4.0 into the Debian
packaging branch was not done correctly, such that the changes
remained.
2019-01-06 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.4.0-3) unstable; urgency=medium
* Upload to unstable.
2019-01-04 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.4.0-2) experimental; urgency=medium
* Regenerate manpage.
2019-01-04 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.4.0-1) experimental; urgency=medium
* New upstream release.
- Tighten {build-,}deps on python3-img2pdf, python3-pikepdf, python3-ruffus
- Drop python3-libxmp build-dep and autopkgtest dep
- Add python3-pdfminer versioned {build-,}dep.
- Add python3-cffi autopkgtest dep.
* In override_dh_auto_build, delete the line `from . import leptonica`
from debian/.debhelper/ocrmypdf/__init__.py.
The directory debian/.debhelper/ocrmypdf is just a hack so that
upstream's doc build can find the version number, and the cffi setup
does not work inside debian/.debhelper/ocrmypdf, so avoid the dlopen
attempt.
2018-10-20 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.2.1-1) experimental; urgency=medium
* New upstream release.
2018-10-10 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.2.0-2) experimental; urgency=medium
* Add pngquant to autopkgtest deps.
2018-10-10 - Sean Whitton <spwhitton@spwhitton.name>
ocrmypdf (7.2.0-1) experimental; urgency=medium
* New upstream release.
* Patch setup.py to not require setuptools_scm_git_archive.
Pending the resolution of #910742.

See Also

Package Description
ocrodjvu_0.10.4-1_all.deb tool to perform OCR on DjVu documents
ocserv_0.12.2-3_i386.deb OpenConnect VPN server compatible with Cisco AnyConnect VPN
ocsinventory-agent_2.4.2-3_i386.deb Hardware and software inventory tool (client)
ocsinventory-reports_2.5+dfsg1-1_all.deb Hardware and software inventory tool (Administration Console)
ocsinventory-server_2.5+dfsg1-1_all.deb Hardware and software inventory tool (Communication Server)
octave-arduino_0.3.0-2_all.deb Octave Arduino Toolkit
octave-bart_0.4.04-2_all.deb Octave bindings for BART
octave-bim_1.1.5-6_all.deb PDE solver using a finite element/volume approach in Octave
octave-biosig_1.9.3-2_i386.deb Octave bindings for BioSig library
octave-bsltl_1.1.1-2_all.deb biospeckle laser tool library for Octave
octave-cgi_0.1.2-2_all.deb Common Gateway Interface for Octave
octave-common_4.4.1-5_all.deb architecture-independent files for octave
octave-communications-common_1.2.1-7_all.deb communications package for Octave (arch-indep files)
octave-communications_1.2.1-7_i386.deb communications package for Octave
octave-control_3.1.0-3_i386.deb computer-aided control system design (CACSD) for Octave
Advertisement
Advertisement