libhtmlparser-java - java library to parse html

Distribution: Debian 8 (Jessie)
Repository: Debian Main amd64
Package name: libhtmlparser-java
Package version: 1.6.20060610.dfsg0
Package release: 5
Package architecture: all
Package type: deb
Installed size: 323 B
Download size: 270.29 KB
Official Mirror:
HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. Primarily used for transformation or extraction, it features filters, visitors, custom tags and easy to use JavaBeans. The two fundamental use-cases that are handled by the parser are extraction and transformation (the syntheses use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data). In general, to use the HTMLParser you will need to be able to write code in the Java programming language. Although some example programs are provided that may be useful as they stand, it's more than likely you will need (or want) to create your own programs or modify the ones provided to match your intended application.



    Source package: libhtmlparser-java

    Install Howto

    1. Update the package index:
      # sudo apt-get update
    2. Install libhtmlparser-java deb package:
      # sudo apt-get install libhtmlparser-java


    • /usr/share/doc/libhtmlparser-java/README.Debian-source
    • /usr/share/doc/libhtmlparser-java/changelog.Debian.gz
    • /usr/share/doc/libhtmlparser-java/changelog.gz
    • /usr/share/doc/libhtmlparser-java/copyright
    • /usr/share/java/libhtmlparser-1.6.20060610.dfsg0.jar
    • /usr/share/java/libhtmlparser.jar


    2014-09-10 - Emmanuel Bourg <> libhtmlparser-java (1.6.20060610.dfsg0-5) unstable; urgency=medium * Team upload. * Generate Java 5 compatible bytecode * Removed the SourceForge logo from the Javadoc * debian/control: - Maintenance transferred to the Java Team - Removed the dependency on the Java runtime - Standards-Version updated to 3.9.5 (no changes) - Use canonical URLs for the Vcs-* fields * Switch to debhelper level 9

    2010-02-21 - Tiago Saboga <> libhtmlparser-java (1.6.20060610.dfsg0-4) unstable; urgency=low * Bump debian version (no changes needed). * Change package section (thanks lintian). * Bugfix: debhelper but no misc:Depends (thanks lintian). * Switch to dpkg-source 3.0 (quilt) format * Use openjdk to build package. * Install changelog at the right place.

    2008-05-05 - Tiago Saboga <> libhtmlparser-java (1.6.20060610.dfsg0-3) unstable; urgency=low * Remove bashism in debian/rules (brace expansion). (Closes: #477610) * Add Vcs-Svn and Vcs-Browser fields to debian/control.

    2008-01-02 - Tiago Saboga <> libhtmlparser-java (1.6.20060610.dfsg0-2) unstable; urgency=low * Initial debian release (Closes: #448872). * Remove empty line in long descriptions. * Set priority to optional. * Bump standards-version to 3.7.3 (no changes needed). * Binary package depends on java-gcj-compat and not gij. * Homepage field now in source section of control file. * Do not repeat Section and Priority in binary packages when they are already in source package [control]. * Correct copyright file: license is LGPL 2.1 or later.

    2007-10-31 - Tiago Saboga <> libhtmlparser-java (1.6.20060610.dfsg0-1) unstable; urgency=low * Initial release.