webcheck - website link and structure checker

Property              Value
Distribution          Debian 7 (Wheezy)
Repository            Debian Main i386
Package name          webcheck
Package version       1.10.4
Package architecture  all
Package type          deb
Installed size        296 B
Download size         64.39 KB
Official Mirror       ftp.br.debian.org

webcheck is a website checking tool for webmasters. It crawls a given
website and generates a number of reports in the form of HTML pages.
It is easy to use and generates simple, clear and readable reports.
Features of webcheck include:
* support for http, https, ftp and file schemes
* view the structure of a site
* track down broken links
* find potentially outdated and new pages
* list links pointing to external sites
* can run without user intervention
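
The link-related features above (supported schemes, links pointing to
external sites) can be illustrated with a small sketch. This is not
webcheck's own code: it is a minimal example written for Python 3's standard
library, and the names LinkCollector, classify_links and SUPPORTED_SCHEMES
are hypothetical. It assumes a link is "internal" when its host matches the
host of the starting URL, and it does not fetch anything over the network.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit

# Schemes the package description lists as supported.
SUPPORTED_SCHEMES = {"http", "https", "ftp", "file"}


class LinkCollector(HTMLParser):
    """Collect href/src attribute values from a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def classify_links(base_url, html):
    """Return (internal, external, unsupported) lists of absolute URLs."""
    collector = LinkCollector()
    collector.feed(html)
    base_host = urlsplit(base_url).netloc
    internal, external, unsupported = [], [], []
    for raw in collector.links:
        url = urljoin(base_url, raw)  # resolve relative links against the page URL
        parts = urlsplit(url)
        if parts.scheme not in SUPPORTED_SCHEMES:
            unsupported.append(url)
        elif parts.netloc == base_host:
            internal.append(url)
        else:
            external.append(url)
    return internal, external, unsupported


page = """
<a href="/about.html">About</a>
<a href="http://example.org/">Elsewhere</a>
<img src="logo.png">
<a href="mailto:webmaster@example.com">Mail</a>
"""
internal, external, unsupported = classify_links(
    "http://example.com/index.html", page
)
print(internal)
print(external)
print(unsupported)
```

A real checker would additionally request each collected URL and record the
failures as broken links; that step is left out here so the sketch stays
self-contained.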


Package                  Version  Architecture  Repository
webcheck_1.10.4_all.deb  1.10.4   all           Debian Main
webcheck                 -        -             -


Name            Value
python          >= 2.3
python-support  >= 0.90.0


Type            URL
Binary Package  webcheck_1.10.4_all.deb
Source Package  webcheck

Install Howto

  1. Update the package index:
    $ sudo apt-get update
  2. Install the webcheck deb package:
    $ sudo apt-get install webcheck




2010-09-11 - Arthur de Jong <adejong@debian.org>
webcheck (1.10.4) unstable; urgency=low
* switch to source format 3.0 (native)
* remove some left-over debugging code (LP: #401050)
* remove old /etc/webcheck removal code
* upgrade to standards-version 3.9.1 (no changes needed)
* several small bugfixes which more or less drop support for Python 2.3
* limit list of "referenced from" to 10 items
* pass char_encoding option to tidy to fix some tidy-related errors
* add a Referer header if possible (thanks Devin Bayer)
2008-07-19 - Arthur de Jong <adejong@debian.org>
webcheck (1.10.3) unstable; urgency=low
* take a shot at making debian/copyright machine parseable
* support <iframe> and some common usages of <object>
* fix bug in command-line parsing of short -r option
* implement the --userpass option to pass username and password information
to specific sites based on a patch by Chris Shenton
* handle errors while parsing more gracefully (addresses: #483579)
* add parsing of <script> tag and background attributes, based on a patch by
Robert M. Jansen
* fix in parsing <style> tags and support style attributes
* call tidy (if available) on HTML content, based on a patch by Henning
* fix problem with port numbers in host headers
* upgrade to standards-version 3.8.0 (no changes needed)
2008-05-30 - Arthur de Jong <adejong@debian.org>
webcheck ( unstable; urgency=low
* Re-upload of version 1.10.2 because the previous 1.10.2 got lost somehow.
2007-07-20 - Arthur de Jong <adejong@debian.org>
webcheck (1.10.2) unstable; urgency=low
* changed the recommends to python-beautifulsoup to be more recent than
3.0.2 since that version fixes a bug that causes severe problems for
webcheck (beware backporters to etch) (closes: #433446)
* remove old linbot provides/conflicts/replaces stuff as linbot last shipped
in woody
* move Homepage pseudo header to control header and remove XS- prefix for
Vcs tags
* add checking for bug in BeautifulSoup and issue warning if bug is found
* added support for Python 2.3 (although more recent versions of Python
are recommended)
* small documentation improvements
2007-07-15 - Arthur de Jong <adejong@debian.org>
webcheck (1.10.1) unstable; urgency=low
* some extra Unicode handling precautions
* fix problem in reading webcheck.dat for non-ASCII text (closes: #431625)
* be more verbose about HTTP retrieval failures
* split out URL normalization code into own module and do some basic
protocol-specific normalizations (closes: #425004)
* a number of big performance improvements
* fix a bug in handling some zero-size pages
* parse http-equiv meta HTML header to parse refresh option
* webcheck now requires python 2.4 or more recent
* added XS-Vcs-Svn and XS-Vcs-Browser as specified in #391023
2007-05-09 - Arthur de Jong <adejong@debian.org>
webcheck (1.10.0) unstable; urgency=low
* switched HTML parsing to using BeautifulSoup with a fall-back mechanism to
the old HTMLParser based solution
* the new parser is much more error-tolerant but is reportedly somewhat
slower and does not include line numbers in errors
* new features will likely only be added to the new parser
* some small improvements to the output to make it XHTML 1.1 compliant
* internal improvements for handling Unicode strings
* better support for parsing <applet> tags and anchors using id attributes
* re-enable robots.txt parsing that was disabled in 1.9.8 and add an
--ignore-robots option
2007-01-15 - Arthur de Jong <adejong@debian.org>
webcheck (1.9.8) unstable; urgency=low
* some checks for properly handling unknown and wrong encodings have been
added
* added proper error handling for SSL related socket problems (exceptions
are not a subclass of regular socket exceptions)
* a bugfix for urls that contain a user name without a password or the other
way around
* miscellaneous small report improvements
* switch packaging files to using latest syntax of python-support
