libweb-scraper-perl - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions

Property Value
Distribution Debian 10 (Buster)
Repository Debian Main i386
Package filename libweb-scraper-perl_0.38-1_all.deb
Package name libweb-scraper-perl
Package version 0.38
Package release 1
Package architecture all
Package type deb
Category devel::lang:perl devel::library implemented-in::perl perl
License -
Maintainer Debian Perl Group <>
Download size 22.73 KB
Installed size 85.00 KB
Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent Scrapi.
It provides a DSL-ish interface for traversing HTML documents and returning a
neatly arranged Perl data structure.
The scraper and process blocks provide a method to define what segments of a
document to extract. It understands HTML and CSS Selectors as well as XPath
expressions.
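
As a rough illustration of the scraper and process blocks described here, the following is a minimal sketch rather than a definitive recipe. The inline HTML snippet, the `ul.sites` class, and the result keys `titles` and `urls` are assumptions made up for this example; only `scraper`, `process`, and `scrape` come from Web::Scraper itself. It selects the same links once with a CSS selector and once with an XPath expression.

```perl
use strict;
use warnings;
use Web::Scraper;

# Hypothetical sample document; a real script would usually pass a URI object.
my $html = <<'HTML';
<html><body>
<ul class="sites">
  <li><a href="http://example.com/one">First</a></li>
  <li><a href="http://example.com/two">Second</a></li>
</ul>
</body></html>
HTML

# Define what to extract. A key ending in [] collects every match into an array.
my $links = scraper {
    # CSS selector: the text content of each matching link
    process 'ul.sites > li > a', 'titles[]' => 'TEXT';
    # XPath expression: the href attribute of the same links
    process '//ul[@class="sites"]/li/a', 'urls[]' => '@href';
};

# scrape() accepts a reference to a string holding the HTML.
my $result = $links->scrape(\$html);

my @titles = @{ $result->{titles} || [] };
my @urls   = @{ $result->{urls}   || [] };

for my $i (0 .. $#titles) {
    print "$titles[$i]: $urls[$i]\n";
}
```

The result is an ordinary hash reference, so it can be passed straight to a serializer or walked with normal Perl loops.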


Package Version Architecture Repository
libweb-scraper-perl_0.38-1_all.deb 0.38 all Debian Main
libweb-scraper-perl - - -


Name Value
libhtml-parser-perl -
libhtml-selector-xpath-perl -
libhtml-tagset-perl -
libhtml-tree-perl -
libhtml-treebuilder-xpath-perl -
libuniversal-require-perl -
liburi-perl -
libwww-perl -
libxml-xpathengine-perl -
libyaml-perl -
perl -


Type URL
Binary Package libweb-scraper-perl_0.38-1_all.deb
Source Package libweb-scraper-perl

Install Howto

  1. Update the package index:
    $ sudo apt-get update
  2. Install the libweb-scraper-perl deb package:
    $ sudo apt-get install libweb-scraper-perl
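
After installation, one way to confirm that the module is visible to Perl is a short check like the sketch below. It is not part of the package; it only assumes that the package installs the `Web::Scraper` module into Perl's search path. The variable name `$loaded` is made up for the example.

```perl
use strict;
use warnings;

# Try to load the module; eval traps the failure if it is not installed.
my $loaded = eval { require Web::Scraper; 1 };

if ($loaded) {
    # VERSION is inherited from UNIVERSAL and reports the module's $VERSION.
    print "Web::Scraper ", Web::Scraper->VERSION, " is available\n";
} else {
    print "Web::Scraper could not be loaded: $@";
}
```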




2014-10-22 - gregor herrmann <>
libweb-scraper-perl (0.38-1) unstable; urgency=medium
[ gregor herrmann ]
* Strip trailing slash from metacpan URLs.
[ Salvatore Bonaccorso ]
* Update Vcs-Browser URL to cgit web frontend
[ gregor herrmann ]
* Add debian/upstream/metadata
* Import upstream version 0.38
* Update debian/copyright:
  + Bump years of upstream and packaging copyright.
  + Drop comment about copyright, now there is a clear statement.
  + Drop section about removed third-party files.
* Build-depend on libmodule-build-tiny-perl.
  Bump debhelper dependency accordingly.
* debian/rules: drop manual removal of script.
  It doesn't get installed anymore.
* Add a spelling patch.
* Mark package as autopkgtest-able.
* Declare compliance with Debian Policy 3.9.6.
2013-03-02 - gregor herrmann <>
libweb-scraper-perl (0.37-1) unstable; urgency=low
[ gregor herrmann ]
* debian/control: update {versioned,alternative} (build) dependencies.
[ Salvatore Bonaccorso ]
* Change Vcs-Git to canonical URI (git://
* Change based URIs to based URIs
[ gregor herrmann ]
* New upstream release.
* debian/copyright: update copyright years, convert to Copyright Format
* Set Standards-Version to 3.9.4 (no changes).
* Add libhtml-treebuilder-libxml-perl to Build-Depends-Indep and
2011-11-20 - gregor herrmann <>
libweb-scraper-perl (0.36-1) unstable; urgency=low
* New upstream release.
2011-10-01 - gregor herrmann <>
libweb-scraper-perl (0.35-1) unstable; urgency=low
[ Ansgar Burchardt ]
* debian/control: Convert Vcs-* fields to Git.
[ Salvatore Bonaccorso ]
* debian/copyright: Replace DEP5 Format-Specification URL from to URL.
[ gregor herrmann ]
* New upstream release.
* Update years of copyright for inc/Module/*.
2011-04-09 - gregor herrmann <>
libweb-scraper-perl (0.34-1) unstable; urgency=low
* Initial release (closes: #530467).

See Also

Package Description
libweb-simple-perl_0.033-1_all.deb simple web framework
libwebauth-dev_4.7.0-7_i386.deb Development files for WebAuth authentication
libwebauth-perl_4.7.0-7_i386.deb Perl library for WebAuth authentication
libwebauth12_4.7.0-7_i386.deb Shared libraries for WebAuth authentication
libwebcam0-dev_0.2.4-1.1+b2_i386.deb Webcam Library - Development files
libwebcam0_0.2.4-1.1+b2_i386.deb Webcam Library
libwebinject-perl_1.94-1_all.deb Perl Module for testing web services
libwebjars-locator-core-java_0.30-1_all.deb WebJars Locator Core
libwebjars-locator-java_0.32-1_all.deb WebJars Locator
libwebkdc-perl_4.7.0-7_all.deb Perl libraries for WebAuth central login server
libwebkit2-sharp-4.0-cil-dev_2.10.9+git20160917-1.1_i386.deb CLI bindings for WebKitGTK+ 4.0 using GObject Introspection - development
libwebkit2-sharp-4.0-cil_2.10.9+git20160917-1.1_i386.deb CLI bindings for WebKitGTK+ 4.0 using GObject Introspection
libwebkit2gtk-4.0-37-gtk2_2.24.3-1~deb10u1_i386.deb Web content engine library for GTK - GTK 2 plugin process
libwebkit2gtk-4.0-37_2.24.3-1~deb10u1_i386.deb Web content engine library for GTK
libwebkit2gtk-4.0-dev_2.24.3-1~deb10u1_i386.deb Web content engine library for GTK - development files