unhtml - Remove the markup tags from an HTML file

Property Value
Distribution Debian 7 (Wheezy)
Repository Debian Main i386
Package name unhtml
Package version 2.3.9
Package release 3
Package architecture i386
Package type deb
Installed size 60 B
Download size 13.36 KB
Official Mirror ftp.br.debian.org
This program removes all HTML tags from an HTML file and directs its
output to stdout. It can be used as a filter for getting the text
content of an HTML file without the need of firing up a web browser.


Package Version Architecture Repository
unhtml_2.3.9-3_amd64.deb 2.3.9 amd64 Debian Main
unhtml - - -


Name Value
libc6 >= 2.4


Type URL
Binary Package unhtml_2.3.9-3_i386.deb
Source Package unhtml

Install Howto

  1. Update the package index:
    # sudo apt-get update
  2. Install unhtml deb package:
    # sudo apt-get install unhtml




2012-06-23 - Mònica Ramírez Arceda <monica@debian.org>
unhtml (2.3.9-3) unstable; urgency=low
* debian/compat: update to 9.
* debian/control:
- Update Maintainer field with Debian email.
- Bump to Standards-Version 3.9.3. No changes required.
- Update to debhelper 9.
* debian/copyright: switch to machine-readable format.
* Enable security hardening build flags:
- debian/rules: pass hardening build flags to Makefile.
- 40-add-flags-makefile.patch: add flags to unhtml target.
- 30-fix-format-security-error.patch: fix error: "format not a string 
literal and no format arguments [-Werror=format-security]". This error 
appears when applying hardening build flags.
2011-02-02 - Mònica Ramírez Arceda <monica@probeta.net>
unhtml (2.3.9-2) unstable; urgency=low
* Adopt the package (Closes: #576167).
* Add Vcs-Git,Vcs-Browser fields in debian/control.
2010-10-25 - Jari Aalto <jari.aalto@cante.net>
unhtml (2.3.9-1) unstable; urgency=low
* QA upload.
- Move to packaging format "3.0 (quilt)".
* debian/clean
- New file.
* debian/chnagelog
- Fix Debian version to *-1.
* debian/compat
- Update to 8.
* debian/control
- (Build-Depends): update to debhelper 8.
- (Depends): add ${misc:Depends}.
- (Maintainer): Set to Debian QA Group.
- (Standards-Version): update to 3.9.1.
* debian/copyright
- Update layout.
* debian/{docs,manpages}
- New file.
* debian/patches
- (10): New. Make 8-bit clean (Closes: #364236).
- (20): New. Off by one allocation (Closes: #534757).
* debian/rules
- (CC): Remove hard coded variable.
- Update to dh(1).
* debian/source/format
- New file.
2006-03-01 - Víctor Pérez Pereira <vperez@debianvenezuela.org>
unhtml (2.3.9) unstable; urgency=low
* New maintainer (closes: #309264).
2005-08-03 - Santiago Vila <sanvila@debian.org>
unhtml (2.3.8) unstable; urgency=low
* QA upload. Switch to debhelper.
2005-05-15 - Al Stone <ahs3@debian.org>
unhtml (2.3.7) unstable; urgency=low
* Orphaning the package.  Maintainer changed to Debian QA Group
2004-04-25 - Al Stone <ahs3@debian.org>
unhtml (2.3.6) unstable; urgency=low
* Forgot to change the Maintainer field in debian/control.  Doh.
* Closes: bug#134447 -- lintian warnings about standards level
* Added test cases and tests directory
* Closes: bug#58137 -- arbitrary limit on tag lengths removed
* Closes: bug#58135 -- too naive about <> in plain text; unhtml
now checks to see if a tag is a known html tag and only removes
the text if it is.
2004-04-23 - Al Stone <ahs3@debian.org>
unhtml (2.3.5) unstable; urgency=low
* New maintainer.
* Closes: bug#234419 -- ITA.
* Closes: bug#164613 -- typo in README.debian and long description
of the package
2004-03-10 - Paul Seelig <pseelig@debian.org>
unhtml (2.3.4) unstable; urgency=low
* Having already orphaned this package here is my last upload for it
setting the package maintainer to Debian QA Group.
2001-12-21 - Paul Seelig <pseelig@debian.org>
unhtml (2.3.3) unstable; urgency=low
* Fixed override disparity

See Also

Package Description
uni2ascii_4.18-2_i386.deb UTF-8 to 7-bit ASCII and vice versa converter
unicode-data_6.1.0-1_all.deb Property data for the Unicode character set
unicode-screensaver_0.4-1_i386.deb screensaver displaying unicode characters
unicode_0.9.5_all.deb display unicode character properties
unicon-imc2_3.0.4-13_i386.deb Chinese Input Method Library
uniconf-tools_4.6.1-5_i386.deb Tools to interface with UniConf
uniconfd_4.6.1-5_i386.deb Server that manages UniConf elements
unicorn_4.3.1-4_i386.deb Rack HTTP server for fast clients
unifdef_2.6-1_i386.deb Remove cpp '#ifdef' lines from files
unifont-bin_5.1.20080914-1.3_i386.deb utilities for manipulating the GNU Unifont
unifont_5.1.20080914-1.3_all.deb font with a glyph for each visible Unicode 5.1 Plane 0 character
unionfs-fuse_0.24-2.2_i386.deb Fuse implementation of unionfs
unison-all-gtk_2.40+1_all.deb file synchronization tool (all GTK+ versions)
unison-all_2.40+1_all.deb file synchronization tool (all console versions)
unison-gtk_2.40.65-2_i386.deb file-synchronization tool for Unix and Windows with GTK+ interface