libweb-scraper-perl - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions

Property Value
Distribution Debian 8 (Jessie)
Repository Debian Main amd64
Package name libweb-scraper-perl
Package version 0.38
Package release 1
Package architecture all
Package type deb
Installed size 85 B
Download size 22.73 KB
Official Mirror
Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent Scrapi.
It provides a DSL-ish interface for traversing HTML documents and returning a
neatly arranged Perl data strcuture.
The scraper and process blocks provide a method to define what segments of a
document to extract. It understands HTML and CSS Selectors as well as XPath


Package Version Architecture Repository
libweb-scraper-perl_0.38-1_all.deb 0.38 all Debian Main
libweb-scraper-perl - - -


Name Value
libhtml-parser-perl -
libhtml-selector-xpath-perl -
libhtml-tagset-perl -
libhtml-tree-perl -
libhtml-treebuilder-xpath-perl -
libuniversal-require-perl -
liburi-perl -
libwww-perl -
libxml-xpathengine-perl -
libyaml-perl -
perl -


Type URL
Binary Package libweb-scraper-perl_0.38-1_all.deb
Source Package libweb-scraper-perl

Install Howto

  1. Update the package index:
    # sudo apt-get update
  2. Install libweb-scraper-perl deb package:
    # sudo apt-get install libweb-scraper-perl




2014-10-22 - gregor herrmann <>
libweb-scraper-perl (0.38-1) unstable; urgency=medium
[ gregor herrmann ]
* Strip trailing slash from metacpan URLs.
[ Salvatore Bonaccorso ]
* Update Vcs-Browser URL to cgit web frontend
[ gregor herrmann ]
* Add debian/upstream/metadata
* Import upstream version 0.38
* Update debian/copyright:
+ Bump years of upstream and packaging copyright.
+ Drop comment about copyright, now there is a clear statement.
+ Drop section about removed third-party files.
* Build-depend on libmodule-build-tiny-perl.
Bump debhelper dependency accordingly.
* debian/rules: drop manual removal of script.
It doesn't get installed anymore.
* Add a spelling patch.
* Mark package as autopkgtest-able.
* Declare compliance with Debian Policy 3.9.6.
2013-03-02 - gregor herrmann <>
libweb-scraper-perl (0.37-1) unstable; urgency=low
[ gregor herrmann ]
* debian/control: update {versioned,alternative} (build) dependencies.
[ Salvatore Bonaccorso ]
* Change Vcs-Git to canonical URI (git://
* Change based URIs to based URIs
[ gregor herrmann ]
* New upstream release.
* debian/copyright: update copyright years, convert to Copyright Format
* Set Standards-Version to 3.9.4 (no changes).
* Add libhtml-treebuilder-libxml-perl to Build-Depends-Indep and
2011-11-20 - gregor herrmann <>
libweb-scraper-perl (0.36-1) unstable; urgency=low
* New upstream release.
2011-10-01 - gregor herrmann <>
libweb-scraper-perl (0.35-1) unstable; urgency=low
[ Ansgar Burchardt ]
* debian/control: Convert Vcs-* fields to Git.
[ Salvatore Bonaccorso ]
* debian/copyright: Replace DEP5 Format-Specification URL from to URL.
[ gregor herrmann ]
* New upstream release.
* Update years of copyright for inc/Module/*.
2011-04-09 - gregor herrmann <>
libweb-scraper-perl (0.34-1) unstable; urgency=low
* Initial release (closes: #530467).

See Also

Package Description
libweb-simple-perl_0.030-1_all.deb simple web framework
libwebauth-dev_4.6.1-1+b1_amd64.deb Development files for WebAuth authentication
libwebauth-perl_4.6.1-1+b1_amd64.deb Perl library for WebAuth authentication
libwebauth11_4.6.1-1+b1_amd64.deb Shared libraries for WebAuth authentication
libwebcam0-dev_0.2.4-1.1_amd64.deb Webcam Library - Development files
libwebcam0_0.2.4-1.1_amd64.deb Webcam Library
libwebinject-perl_1.86-1_all.deb Perl Module for testing web services
libwebkdc-perl_4.6.1-1_all.deb Perl libraries for WebAuth central login server
libwebkit-cil-dev_0.3-6_all.deb CLI binding for the WebKit library - development package
libwebkit-dev_2.4.9-1~deb8u1_all.deb Transitional package for the development files of WebKitGTK+
libwebkit1.1-cil_0.3-6_all.deb CLI binding for the WebKit library
libwebkit2gtk-3.0-25_2.4.9-1~deb8u1_amd64.deb WebKit2 API layer for WebKitGTK+
libwebkit2gtk-3.0-dev_2.4.9-1~deb8u1_amd64.deb WebKit2 API layer for WebKitGTK+ - development files
libwebkit2gtk-4.0-37_2.6.2+dfsg1-4_amd64.deb Web content engine library for GTK+
libwebkit2gtk-4.0-dev_2.6.2+dfsg1-4_amd64.deb Web content engine library for GTK+ - development files