python-pdfminer_20110515+dfsg-1_all.deb


Description

python-pdfminer - PDF parser and analyser

Distribution: Debian 7 (Wheezy)
Repository: Debian Main amd64
Package name: python-pdfminer
Package version: 20110515+dfsg
Package release: 1
Package architecture: all
Package type: deb
Installed size: 604 KB
Download size: 118.47 KB
Official Mirror: ftp.br.debian.org
PDFMiner is a tool for extracting information from PDF documents. It focuses entirely on getting and analyzing text data. It can report the exact location of text portions on a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML), and an extensible PDF parser that can be used for purposes other than text analysis. This package provides the Python module and the command-line tools pdf2txt and dumppdf.
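As a rough illustration of the converter mentioned above, the following sketch drives the `pdf2txt` command-line tool from Python. It assumes the tool is installed as `pdf2txt` (as in the file list of this package) and that it accepts `-t` for the output type and `-o` for the output file. The function names `build_pdf2txt_command` and `convert` are hypothetical helpers, not part of PDFMiner. The wrapper only launches the external tool, so it can run under a different Python version than the one the package itself targets.

```python
import subprocess

# Output types assumed to be accepted by pdf2txt's -t option.
OUTPUT_TYPES = ("text", "html", "xml", "tag")


def build_pdf2txt_command(pdf_path, out_path, output_type="html"):
    """Return the argument list for one pdf2txt conversion (hypothetical helper)."""
    if output_type not in OUTPUT_TYPES:
        raise ValueError("unsupported output type: %r" % (output_type,))
    # -t selects the output format, -o names the output file.
    return ["pdf2txt", "-t", output_type, "-o", out_path, pdf_path]


def convert(pdf_path, out_path, output_type="html"):
    """Run pdf2txt on pdf_path.

    Raises subprocess.CalledProcessError if the tool exits with a non-zero
    status, and OSError if the tool is not installed.
    """
    subprocess.check_call(build_pdf2txt_command(pdf_path, out_path, output_type))
```

Calling `convert("report.pdf", "report.html")` would then write an HTML rendering of a (hypothetical) `report.pdf`, provided the package is installed.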

    Download

    Source package: pdfminer

    Install Howto

    1. Update the package index:
      $ sudo apt-get update
    2. Install the python-pdfminer package:
      $ sudo apt-get install python-pdfminer
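After the two steps above, one way to confirm that the command-line tools landed on the search path is a short check like the one below. This is a minimal sketch: `find_tools` is a hypothetical helper, and the tool names are taken from the package description and file list.

```python
import shutil

# Command-line tools named in the package description.
EXPECTED_TOOLS = ("pdf2txt", "dumppdf")


def find_tools(names=EXPECTED_TOOLS):
    """Map each tool name to its full path on PATH, or None if it is missing."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for name, path in sorted(find_tools().items()):
        print("%s: %s" % (name, path or "not found"))
```

A `None` entry suggests the install step did not complete or that `/usr/bin` is not on the current `PATH`.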

    Files

    • /usr/bin/dumppdf
    • /usr/bin/pdf2txt
    • /usr/lib/python2.6/dist-packages/pdfminer-20110515.egg-info
    • /usr/lib/python2.6/dist-packages/pdfminer/__init__.py
    • /usr/lib/python2.6/dist-packages/pdfminer/arcfour.py
    • /usr/lib/python2.6/dist-packages/pdfminer/ascii85.py
    • /usr/lib/python2.6/dist-packages/pdfminer/cmapdb.py
    • /usr/lib/python2.6/dist-packages/pdfminer/converter.py
    • /usr/lib/python2.6/dist-packages/pdfminer/encodingdb.py
    • /usr/lib/python2.6/dist-packages/pdfminer/fontmetrics.py
    • /usr/lib/python2.6/dist-packages/pdfminer/glyphlist.py
    • /usr/lib/python2.6/dist-packages/pdfminer/latin_enc.py
    • /usr/lib/python2.6/dist-packages/pdfminer/layout.py
    • /usr/lib/python2.6/dist-packages/pdfminer/lzw.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdfcolor.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdfdevice.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdffont.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdfinterp.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdfparser.py
    • /usr/lib/python2.6/dist-packages/pdfminer/pdftypes.py
    • /usr/lib/python2.6/dist-packages/pdfminer/psparser.py
    • /usr/lib/python2.6/dist-packages/pdfminer/rijndael.py
    • /usr/lib/python2.6/dist-packages/pdfminer/runlength.py
    • /usr/lib/python2.6/dist-packages/pdfminer/utils.py
    • /usr/lib/python2.6/dist-packages/pdfminer/cmap/__init__.py
    • /usr/lib/python2.7/dist-packages/pdfminer-20110515.egg-info
    • /usr/lib/python2.7/dist-packages/pdfminer/__init__.py
    • /usr/lib/python2.7/dist-packages/pdfminer/arcfour.py
    • /usr/lib/python2.7/dist-packages/pdfminer/ascii85.py
    • /usr/lib/python2.7/dist-packages/pdfminer/cmapdb.py
    • /usr/lib/python2.7/dist-packages/pdfminer/converter.py
    • /usr/lib/python2.7/dist-packages/pdfminer/encodingdb.py
    • /usr/lib/python2.7/dist-packages/pdfminer/fontmetrics.py
    • /usr/lib/python2.7/dist-packages/pdfminer/glyphlist.py
    • /usr/lib/python2.7/dist-packages/pdfminer/latin_enc.py
    • /usr/lib/python2.7/dist-packages/pdfminer/layout.py
    • /usr/lib/python2.7/dist-packages/pdfminer/lzw.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdfcolor.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdfdevice.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdffont.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdfinterp.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdfparser.py
    • /usr/lib/python2.7/dist-packages/pdfminer/pdftypes.py
    • /usr/lib/python2.7/dist-packages/pdfminer/psparser.py
    • /usr/lib/python2.7/dist-packages/pdfminer/rijndael.py
    • /usr/lib/python2.7/dist-packages/pdfminer/runlength.py
    • /usr/lib/python2.7/dist-packages/pdfminer/utils.py
    • /usr/lib/python2.7/dist-packages/pdfminer/cmap/__init__.py
    • /usr/share/doc-base/pdfminer-documentation
    • /usr/share/doc/python-pdfminer/README.txt
    • /usr/share/doc/python-pdfminer/changelog.Debian.gz
    • /usr/share/doc/python-pdfminer/changelog.gz
    • /usr/share/doc/python-pdfminer/copyright
    • /usr/share/doc/python-pdfminer/index.html
    • /usr/share/doc/python-pdfminer/programming.html
    • /usr/share/doc/python-pdfminer/examples/pdf2html.cgi.gz
    • /usr/share/man/man1/dumppdf.1.gz
    • /usr/share/man/man1/pdf2txt.1.gz
    • /usr/share/pyshared/pdfminer-20110515.egg-info
    • /usr/share/pyshared/pdfminer/__init__.py
    • /usr/share/pyshared/pdfminer/arcfour.py
    • /usr/share/pyshared/pdfminer/ascii85.py
    • /usr/share/pyshared/pdfminer/cmapdb.py
    • /usr/share/pyshared/pdfminer/converter.py
    • /usr/share/pyshared/pdfminer/encodingdb.py
    • /usr/share/pyshared/pdfminer/fontmetrics.py
    • /usr/share/pyshared/pdfminer/glyphlist.py
    • /usr/share/pyshared/pdfminer/latin_enc.py
    • /usr/share/pyshared/pdfminer/layout.py
    • /usr/share/pyshared/pdfminer/lzw.py
    • /usr/share/pyshared/pdfminer/pdfcolor.py
    • /usr/share/pyshared/pdfminer/pdfdevice.py
    • /usr/share/pyshared/pdfminer/pdffont.py
    • /usr/share/pyshared/pdfminer/pdfinterp.py
    • /usr/share/pyshared/pdfminer/pdfparser.py
    • /usr/share/pyshared/pdfminer/pdftypes.py
    • /usr/share/pyshared/pdfminer/psparser.py
    • /usr/share/pyshared/pdfminer/rijndael.py
    • /usr/share/pyshared/pdfminer/runlength.py
    • /usr/share/pyshared/pdfminer/utils.py
    • /usr/share/pyshared/pdfminer/cmap/__init__.py

    Changelog

    2011-08-24 - Daniele Tricoli <eriol@mornie.org>
    pdfminer (20110515+dfsg-1) unstable; urgency=low
      * New upstream release
      * Upload to unstable
      * debian/control
        - Removed Jakub and added Debian Python Modules Team to Maintainer
        - Added myself to Uploaders (Closes: #629178)
        - Bumped Standards-Version to 3.9.2 (no changes needed)
      * debian/{control,rules}
        - Switched to dh_python2

    2011-03-05 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20110227+dfsg-1) experimental; urgency=low
      * New upstream release.
        + Document the -V option in pdf2txt manual page.
      * Correct a few grammatical errors in the manual pages and in the package description. Thanks to Stefano Rivera for help.
      * Remove byte-compiled files from (repackaged) upstream tarball.
      * Use $() constructs rather than backticks in shell scripts.
      * Rename some private variables in debian/rules to make them lowercase.

    2010-12-28 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20101226+dfsg-1) experimental; urgency=low
      * New upstream release.
        + Drop fix-test-psparser.diff, applied upstream.
        + Prevent upstream Makefile from using ‘python2’ binary. [python2.diff]

    2010-12-02 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20101017+dfsg-1) experimental; urgency=low
      * New upstream release.
      * Fix a typo in the pdf2txt manual page.
      * Backport an upstream patch to fix test failures. [fix-test-psparser.diff]
      * To fix FTBFS when built twice in a row:
        + force dh_auto_clean to use distutils build system;
        + add samples/{*.txt,*.html,*.xml} to debian/clean.

    2010-08-29 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20100829+dfsg-1) experimental; urgency=low
      * New upstream release.
      * Add mutual Breaks to ensure that if python-pdfminer and pdfminer-data are installed together, they have the same version.
      * Use pickle protocol 2 for serializing data. [pickle-protocol-2.diff]

    2010-08-26 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20100619p1+dfsg-1) experimental; urgency=low
      * New upstream release.
        + Drop all patches: either applied upstream or not needed anymore.
        + Recreate non-empty cmap/__init__.py in the build target and remove it in the clean target.
        + Update debian/get-orig-source and debian/rules to take into account new location of non-free samples.
        + Relax debian/watch and debian/rules to allow versions with pN suffix.
        + Explain copyright status of samples/jo.* in debian/copyright.
      * Bump standards version to 3.9.1 (no changes needed).

    2010-06-13 - Jakub Wilk <jwilk@debian.org>
    pdfminer (20100424+dfsg-1) experimental; urgency=low
      * Initial release (closes: #584555).
      * Strip non-DFSG-free test documents from the .orig.tar.gz.
        + Run tests only on those files that are actually available. [dfsg-testsuite.diff]
      * Disable test suite for psparser.py, as it is currently broken. [psparser-testsuite.diff]
      * Store encoding data in gzipped pickles rather than in Python modules. This way we can save lots of disk space. [encoding-data.diff]
      * Backport upstream patches:
        + to fix a bug in layout analysis [layout.diff];
        + to allow extraction of nested tags [nested-tags.diff].
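Two changelog entries above mention storing data in gzipped pickles and serializing with pickle protocol 2. The sketch below shows the general technique only. It is not the code from encoding-data.diff or pickle-protocol-2.diff, and the names `save_table`, `load_table`, and the sample data are hypothetical.

```python
import gzip
import os
import pickle
import tempfile


def save_table(table, path):
    """Write table to path as a gzip-compressed pickle using protocol 2."""
    with gzip.open(path, "wb") as fh:
        pickle.dump(table, fh, 2)


def load_table(path):
    """Read back a table written by save_table."""
    with gzip.open(path, "rb") as fh:
        return pickle.load(fh)


# Hypothetical sample data standing in for an encoding table.
sample = {"A": 65, "B": 66, "space": 32}
path = os.path.join(tempfile.mkdtemp(), "sample.pickle.gz")
save_table(sample, path)
restored = load_table(path)
```

Protocol 2 is a binary pickle format, which tends to be more compact than the older text protocol, and the gzip layer compresses the result further. Pickles should only be loaded from trusted sources, such as files shipped by the package itself.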
