catdoc - MS-Word to TeX or plain text converter

Property              Value
Distribution          Debian 8 (Jessie)
Repository            Debian Main i386
Package name          catdoc
Package version       0.94.4
Package release       1.1+deb8u1
Package architecture  i386
Package type          deb
Installed size        2.32 KB
Download size         290.96 KB
This program extracts text from MS-Word files, trying to preserve
as many special printable characters as possible. catdoc supports
everything up to Word-97. Also supported are MS Write documents and RTF.
It doesn't even try to preserve fancy Word formatting, because
Word users usually don't care about document structure, and it is
this very thing which is important to LaTeX users.
Also provided are xls2csv, which extracts data from Excel spreadsheets
and outputs it in comma-separated-value format, and catppt, which extracts
data from PowerPoint presentations.
This package suggests tk because it also includes wordview, an
optional Tk-based GUI for catdoc.  The MIME config provided in this
package will use wordview if X is running, or catdoc directly if it
is not.
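
The division of labour described above (catdoc for Word, Write and RTF, xls2csv for Excel, catppt for PowerPoint) can be sketched as a small shell dispatcher. This is only an illustration: the function names converter_for and to_text are hypothetical and not part of the package, and the mapping from file extensions to tools (including .wri for MS Write) is an assumption based on the description, not something the package enforces.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with catdoc): map a file name to the
# converter that the package description associates with that format.
# The extension list is an assumption made for this sketch.
converter_for() {
    # Lower-case the name so that "REPORT.DOC" matches as well.
    name=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
    case "$name" in
        *.doc|*.rtf|*.wri) echo catdoc ;;
        *.xls)             echo xls2csv ;;
        *.ppt)             echo catppt ;;
        *)                 return 1 ;;
    esac
}

# Hypothetical wrapper: run the chosen converter on the file and let it
# write the extracted text to standard output.
to_text() {
    tool=$(converter_for "$1") || {
        echo "unsupported file type: $1" >&2
        return 1
    }
    "$tool" "$1"
}
```

With the package installed, a call such as `to_text report.doc > report.txt` would then hand the file to catdoc and save the extracted text.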


Package                             Version  Architecture  Repository
catdoc_0.94.4-1.1+deb8u1_amd64.deb  0.94.4   amd64         Debian Main
catdoc                              -        -             -


Name   Value
libc6  >= 2.7


Type            URL
Binary Package  catdoc_0.94.4-1.1+deb8u1_i386.deb
Source Package  catdoc

Install Howto

  1. Update the package index:
    $ sudo apt-get update
  2. Install the catdoc deb package:
    $ sudo apt-get install catdoc
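
After the two steps above, one way to confirm that the tools named in the description ended up on the PATH is a short check like the following. It is a minimal sketch: missing_tools is a hypothetical helper written for this example, and the list of tool names is taken from the package description rather than from the package's file list.

```shell
#!/bin/sh
# Hypothetical helper: print the name of every given tool that cannot be
# found on the PATH. Prints nothing when all of them are available.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Tool names as mentioned in the package description (assumed to be the
# installed command names).
missing_tools catdoc xls2csv catppt wordview
```

An empty result suggests the installation succeeded; any name that is printed was not found.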




2017-07-21 - Salvatore Bonaccorso
catdoc (0.94.4-1.1+deb8u1) jessie-security; urgency=high
* Non-maintainer upload by the Security Team.
* CVE-2017-11110: Heap buffer overflow in ole_init (Closes: #867717)
2012-12-03 - Neil Williams
catdoc (0.94.4-1.1) unstable; urgency=low
* Non-maintainer upload.
* New upstream release to remove .pc subdirectory from
the orig tarball (Closes: #692073). Includes updating
version strings in generated manpages.
* Remove extra ';' in src/xlsparse.c which turned for loop in
xlsparse into a buffer overflow (Closes: #692076), applies
patch by Olly Betts.
2012-06-10 - Nick Bane
catdoc (0.94.3-1) unstable; urgency=low
* Declare new upstream release
* Fix codepage bugs (Closes: #648921)
* Fix charset bug (Closes: #648726)
* Handle negative numbers on 64bit architectures (Closes: #555622)
* Fix Macintosh MS1904 date bug in xlsparse reported ubuntu #349016
2011-07-30 - Nick Bane
catdoc (0.94.2-2) unstable; urgency=low
* New maintainer (Closes: #631798)
* Add font table support to rtf parser
* Handle invalid unicode character return of -1 correctly in rtf parser
* Trap zero length docs
* Add cross building support
* Increase DBCS support for cp932 in rtf parser
* Add check for reloading current charset in rtf parser
* Implement rtf deflang/plain control word interaction
* Add rtf support for font charsets suggesting codepages and mbcs support
* Add lang+codepage support in rtf parser
* Add removal of escaped chars after unicode char
* And default to single char removal and implement sanely
* Skip index entries in rtf parser
* Cleanup some whitespace/static defs in rtf parser
* Add gnome desktop entry under Office
* Updated copyright file to DEP-5 format
2010-08-13 - Bastian Venthur
catdoc (0.94.2-1.1) unstable; urgency=low
* Non-maintainer upload. Applied patch by Sergei Golovan:
* Replaced obsolete tk8.3 build-dependency by default tk package.
2006-03-29 - Pawel Wiecek
catdoc (0.94.2-1) unstable; urgency=low
* New upstream version, fixes a few OLE-parsing bugs (closes: #358707)
* Fixed mailcap (closes: #313616, #316122)
* Fixed some typos (closes: #327905, #327907)
* Updated copyright file (closes: #353819)
* Updated standards-version in debian/control
2005-05-16 - Pawel Wiecek
catdoc (0.94.0-1) unstable; urgency=low
* New upstream version
- fixes field type problems in xls2csv (closes: #292555)
- adds new utility: catppt
* Applied numerous patches from A Costa to fix typos in
manpages (closes: #304318, #305965, #305966, #305967)
* Added some asian charsets (closes: #278004)
* Re-added xlsview that disappeared from package some time
2005-01-08 - Pawel Wiecek
catdoc (0.93.4-2) unstable; urgency=low
* Fixed TeX charset conversion table for backslash and hash
(closes: #278257)
* Moved wordview from /usr/X11R6 to /usr (policy)
2004-09-30 - Pawel Wiecek
catdoc (0.93.4-1) unstable; urgency=low
* New upstream version (closes: #255625)
2004-03-24 - Pawel Wiecek
catdoc (0.93.3-3) unstable; urgency=low
* Fixed charset files location in catdoc.1 (closes: #238461)
* Fixed a typo in catdoc.1
* Added information about current and previous Debian maintainers to
copyright file (closes: #239126)
* Enhanced long description with info about supported file formats other
than .doc (closes: #239127)

See Also

Package Description
catdvi_0.14-12.1_i386.deb DVI to plain text translator
catfish_1.2.2-1_all.deb File searching tool which is configurable via the command line
caveconverter_0~20131117-1_all.deb Cave survey data format converter
cavezofphear_0.5.1-1_i386.deb ASCII Boulder Dash clone
cb2bib_1.4.9-4_i386.deb extract bibliographic references from various sources
cba_0.3.6-4.1_i386.deb Continuous Beam Analysis
cbflib-bin_0.9.2.2-1_i386.deb utilities to manipulate CBF files
cbflib-doc_0.9.2.2-1_all.deb documentation for CBFlib
cbios_0.25-2_all.deb open source MSX BIOS roms
cbm_0.1-10_i386.deb display the current network traffic in colors
cbmc_4.9-4_i386.deb bounded model checker for C and C++ programs
cbootimage_1.4-1_i386.deb Tools to dump and generate boot config table on Tegra devices
cbp2make_147+dfsg-1_i386.deb Makefile generation tool for the Code::Blocks IDE
cbrpager_0.9.22-2_i386.deb viewer for CBR, CBZ and CB7 (comic book archive) files
cc1111_2.9.0-4_i386.deb C Compiler for TI/Chipcon 8051-based RF SOCs