NekoHTML is a simple HTML scanner and tag balancer that enables
application programmers to parse HTML documents and access the
information using standard XML interfaces. The parser can scan HTML
files and "fix up" many common mistakes that human (and computer)
authors make in writing HTML documents. NekoHTML adds missing parent
elements; automatically closes elements with optional end tags; and
can handle mismatched in-line element tags.


Install Howto

  1. Update the package index:
    $ sudo apt-get update
  2. Install the libnekohtml-java deb package:
    $ sudo apt-get install libnekohtml-java
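
To confirm that the installation succeeded, the package state can be queried with dpkg-query and the installed jar files listed with dpkg -L. The following is a minimal sketch, assuming a Debian-based system; the helper name pkg_state is hypothetical and is not part of the package.

```shell
#!/bin/sh
# Assumption: a Debian-based system where dpkg-query and dpkg are available.
PKG=libnekohtml-java

# pkg_state is a hypothetical helper, not something shipped by the package.
# It prints "installed" if dpkg reports the package as fully installed,
# and "missing" otherwise (including when dpkg-query itself is unavailable).
pkg_state() {
    if dpkg-query -W -f='${Status}' "$1" 2>/dev/null | grep -q "install ok installed"; then
        echo installed
    else
        echo missing
    fi
}

# Report the state of the package.
pkg_state "$PKG"

# If the package is installed, list the jar files it ships.
if [ "$(pkg_state "$PKG")" = installed ]; then
    dpkg -L "$PKG" | grep '\.jar$' || true
fi
```

The first line of output is either "installed" or "missing"; any jar paths that follow are the ones to add to a Java classpath.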




2016-01-19 - Emmanuel Bourg
nekohtml (1.9.22-1) unstable; urgency=medium
* Team upload
* New upstream release
* Removed the dependency on libjaxp1.3-java
* Set the locale when generating the javadoc to make the build reproducible
* Moved the package to Git
2014-11-12 - Miguel Landaeta
nekohtml (1.9.21-2) unstable; urgency=medium
* Team upload
* Allow upgrading from 'wheezy' by not overwriting documentation files
  (Closes: #768210).
* Bump Standards-Version to 3.9.6. No changes were required.
2014-06-23 - Emmanuel Bourg
nekohtml (1.9.21-1) unstable; urgency=medium
* Team upload
* New upstream release
* Standards-Version updated to 3.9.5 (no changes)
* Moved the HTML documentation into the libnekohtml-java-doc package
* Renamed README.Debian to README.source
2013-10-23 - Emmanuel Bourg
nekohtml (1.9.19-1) unstable; urgency=low
* Team upload.
* New upstream release
  - Refreshed the patch
* debian/control:
  - Use canonical URLs for the Vcs-* fields
  - Updated Standards-Version to 3.9.4 (no changes)
  - Removed Michael Koch from the uploaders (Closes: #654124)
  - Improved the package description
* Build depend on debhelper >= 9
* debian/rules: Improved the clean target
* debian/copyright: Updated the Format URI
* Use XZ compression for the upstream tarball
* Added a Lintian override for the codeless-jar warning on nekohtmlXni.jar
2011-09-26 - Torsten Werner
nekohtml (1.9.15-1) unstable; urgency=low
* Team upload.
* New upstream release
* Switch to debhelper level 7.
* Update Standards-Version: 3.9.2.
* Update debian/copyright.
2010-04-11 - Torsten Werner
nekohtml (1.9.14-1) unstable; urgency=low
* Team Upload
* New upstream release
* Let 'maintainers' start with uppercase M.
* Update Standards-Version: 3.8.4.
* Switch to source format 3.0.
2009-10-02 - Michael Koch
nekohtml (1.9.13-1) unstable; urgency=low
* New upstream release.
* Added debian/README.source.
* Added myself to Uploaders.
* Updated Standards-Version to 3.8.3.
2009-08-09 - Torsten Werner
nekohtml (1.9.12-2) unstable; urgency=low
* Upload to unstable.
2009-04-29 - Ludovic Claude
nekohtml (1.9.12-1) experimental; urgency=low
* New upstream version
* Remove the runtimes from Depends: as it's a library
* Change section to java, bump up Standards-Version to 3.8.1
* Add Homepage and Vcs-* properties
* Split the package into a pure binary and a documentation package,
  put the main docs in the binary package, and the api docs in the
  doc package
* Update the copyright to follow the new proposal format, and remove
  full text of Apache license to remove Lintian warnings
* Add the Maven POM to the package
* Add a Build-Depends-Indep dependency on maven-repo-helper
* Use mh_installpom and mh_installjar to install the POM and the jar to the
  Maven repository
* Change Build-Depends from libxalan-java to libjaxp3-java and
  add libjaxp3-java as it contains the xml-apis.jar needed for the build
  and runtime on Java < 6.
2006-11-16 - Steinar H. Gunderson
nekohtml (0.9.5+dfsg-1.1) unstable; urgency=medium
* Non-maintainer upload.
* Add stubs for two missing XMLLocator methods; fixes FTBFS.
  (Closes: #397697)

