catdoc - MS-Word to TeX or plain text converter

Property              Value
Distribution          Debian 8 (Jessie)
Repository            Debian Main i386
Package name          catdoc
Package version       0.94.4
Package release       1.1+deb8u1
Package architecture  i386
Package type          deb
Installed size        2.32 KB
Download size         290.96 KB
This program extracts text from MS-Word files, trying to preserve
as many special printable characters as possible. catdoc supports
everything up to Word-97. Also supported are MS Write documents and RTF files.
It doesn't even try to preserve fancy Word formatting, because
Word users usually don't care about document structure, and it is
this very thing which is important to LaTeX users.
Also provided are xls2csv, which extracts data from Excel spreadsheets
and outputs it in comma-separated-value format, and catppt, which extracts
data from PowerPoint presentations.
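
As a rough illustration of how the three tools divide the work, the sketch below picks a converter from the file extension. The function names pick_tool and extract_text and the extension list (including .wri for MS Write) are assumptions made for this example, not part of the package. It relies only on each tool accepting a file name and writing its result to standard output.

```shell
#!/bin/sh
# Hypothetical helper: choose a converter from the file extension.
# The extension list is an assumption for illustration only.
pick_tool() {
    case "$1" in
        *.doc|*.DOC|*.rtf|*.RTF|*.wri|*.WRI) echo catdoc ;;
        *.xls|*.XLS) echo xls2csv ;;
        *.ppt|*.PPT) echo catppt ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: convert one file and write the text to standard output.
extract_text() {
    tool=$(pick_tool "$1") || { echo "unsupported file: $1" >&2; return 1; }
    "$tool" "$1"
}
```

With the package installed, a call such as `extract_text report.doc > report.txt` would run catdoc on the file and save the extracted text.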
This package suggests tk because it also includes wordview, an
optional Tk-based GUI for catdoc.  The MIME config provided in this
package will use wordview if X is running, or catdoc directly if it
is not.
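
The fallback described above can be pictured as a check on the DISPLAY variable. This is a minimal sketch, not the MIME entry shipped in the package: the function name pick_viewer is hypothetical, and treating a non-empty DISPLAY as "X is running" is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: print the viewer to use for a Word document.
# Assumption: a non-empty DISPLAY means an X session is available.
pick_viewer() {
    if [ -n "${DISPLAY:-}" ]; then
        echo wordview
    else
        echo catdoc
    fi
}
```

A caller could then run `"$(pick_viewer)" document.doc` to open the file with whichever tool suits the current session.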


Name Value
libc6 >= 2.7


Type URL
Binary Package catdoc_0.94.4-1.1+deb8u1_i386.deb
Source Package catdoc

Install Howto

  1. Update the package index:
    $ sudo apt-get update
  2. Install catdoc deb package:
    $ sudo apt-get install catdoc
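
After the two steps above, it may be worth confirming that the tools ended up on the PATH. The sketch below is one way to do that; the function name have_tools is an assumption for this example, and it uses only the standard `command -v` lookup.

```shell
#!/bin/sh
# Hypothetical helper: return 0 if every named command is on PATH,
# otherwise report each missing command on stderr and return 1.
have_tools() {
    status=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool" >&2
            status=1
        fi
    done
    return "$status"
}
```

For this package, `have_tools catdoc xls2csv catppt && echo "catdoc tools found"` would check the three command-line converters named in the description.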




2017-07-21 - Salvatore Bonaccorso
catdoc (0.94.4-1.1+deb8u1) jessie-security; urgency=high
* Non-maintainer upload by the Security Team.
* CVE-2017-11110: Heap buffer overflow in ole_init (Closes: #867717)
2012-12-03 - Neil Williams
catdoc (0.94.4-1.1) unstable; urgency=low
* Non-maintainer upload.
* New upstream release to remove .pc subdirectory from
the orig tarball (Closes: #692073). Includes updating
version strings in generated manpages.
* Remove extra ';' in src/xlsparse.c which turned for loop in
xlsparse into a buffer overflow (Closes: #692076), applies
patch by Olly Betts.
2012-06-10 - Nick Bane
catdoc (0.94.3-1) unstable; urgency=low
* Declare new upstream release
* Fix codepage bugs (Closes: #648921)
* Fix charset bug (Closes: #648726)
* Handle negative numbers on 64bit architectures (Closes: #555622)
* Fix Macintosh MS1904 date bug in xlsparse, reported as Ubuntu #349016
2011-07-30 - Nick Bane
catdoc (0.94.2-2) unstable; urgency=low
* New maintainer (Closes: #631798)
* Add font table support to rtf parser
* Handle invalid unicode character return of -1 correctly in rtf parser
* Trap zero length docs
* Add cross building support
* Increase DBCS support for cp932 in rtf parser
* Add check for reloading current charset in rtf parser
* Implement rtf deflang/plain control word interaction
* Add rtf support for font charsets suggesting codepages and mbcs support
* Add lang+codepage support in rtf parser
* Add removal of escaped chars after unicode char
* And default to single char removal and implement sanely
* Skip index entries in rtf parser
* Clean up some whitespace/static defs in rtf parser
* Add gnome desktop entry under Office
* Updated copyright file to DEP-5 format
2010-08-13 - Bastian Venthur
catdoc (0.94.2-1.1) unstable; urgency=low
* Non-maintainer upload. Applied patch by Sergei Golovan:
* Replaced obsolete tk8.3 build-dependency by default tk package.
2006-03-29 - Pawel Wiecek
catdoc (0.94.2-1) unstable; urgency=low
* New upstream version, fixes a few OLE-parsing bugs (closes: #358707)
* Fixed mailcap (closes: #313616, #316122)
* Fixed some typos (closes: #327905, #327907)
* Updated copyright file (closes: #353819)
* Updated standards-version in debian/control
2005-05-16 - Pawel Wiecek
catdoc (0.94.0-1) unstable; urgency=low
* New upstream version
- fixes field type problems in xls2csv (closes: #292555)
- adds new utility: catppt
* Applied numerous patches from A Costa to fix typos in
manpages (closes: #304318, #305965, #305966, #305967)
* Added some asian charsets (closes: #278004)
* Re-added xlsview that disappeared from package some time ago
2005-01-08 - Pawel Wiecek
catdoc (0.93.4-2) unstable; urgency=low
* Fixed TeX charset conversion table for backslash and hash
(closes: #278257)
* Moved wordview from /usr/X11R6 to /usr (policy)
2004-09-30 - Pawel Wiecek
catdoc (0.93.4-1) unstable; urgency=low
* New upstream version (closes: #255625)
2004-03-24 - Pawel Wiecek
catdoc (0.93.3-3) unstable; urgency=low
* Fixed charset files location in catdoc.1 (closes: #238461)
* Fixed a typo in catdoc.1
* Added information about current and previous Debian maintainers to
copyright file (closes: #239126)
* Enhanced long description with info about supported file formats other
than .doc (closes: #239127)

