libweb-scraper-perl_0.38-1_all.deb


Description

libweb-scraper-perl - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions

Distribution: Debian 8 (Jessie)
Repository: Debian Main amd64
Package name: libweb-scraper-perl
Package version: 0.38
Package release: 1
Package architecture: all
Package type: deb
Installed size: 85 B
Download size: 22.73 KB
Official Mirror: ftp.br.debian.org
Web::Scraper is a web scraping toolkit inspired by Scrapi, its Ruby equivalent. It provides a DSL-ish interface for traversing HTML documents and returning a neatly arranged Perl data structure. The scraper and process blocks define which segments of a document to extract. It understands HTML and CSS selectors as well as XPath expressions.
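
As a rough illustration of the scraper and process blocks described above, the following minimal sketch extracts a heading and a list of items from an inline HTML string. The sample markup, the result keys (heading, packages) and the variable names are assumptions made up for this example; they are not part of the package. It uses one XPath rule and one CSS selector rule to show that both forms can be mixed.

```perl
use strict;
use warnings;
use Web::Scraper;

# Hypothetical sample document, used only for illustration.
my $html = <<'HTML';
<html>
  <body>
    <h1 id="title">Package list</h1>
    <ul>
      <li class="pkg">libweb-scraper-perl</li>
      <li class="pkg">libwww-perl</li>
    </ul>
  </body>
</html>
HTML

# The scraper block declares what to extract. Each process rule pairs a
# selector with a key in the result hash and the value to take from the
# matched node ('TEXT' means the node's text content).
my $scraper = scraper {
    # Expressions starting with "/" are treated as XPath.
    process '//h1[@id="title"]', heading => 'TEXT';
    # Anything else is treated as a CSS selector; the "[]" suffix
    # collects every match into an array reference.
    process 'li.pkg', 'packages[]' => 'TEXT';
};

# scrape() accepts an HTML string and returns a hash reference.
my $result = $scraper->scrape($html);

print "Heading: $result->{heading}\n";
print "Package: $_\n" for @{ $result->{packages} };
```

The same scraper object can also be given a URI object instead of a string, in which case the page is fetched first. The examples directory listed under Files contains fuller scripts shipped with the package.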

    Download

    Source package: libweb-scraper-perl

    Install Howto

    1. Update the package index:
      $ sudo apt-get update
    2. Install the libweb-scraper-perl deb package:
      $ sudo apt-get install libweb-scraper-perl

    Files

    • /usr/share/doc/libweb-scraper-perl/changelog.Debian.gz
    • /usr/share/doc/libweb-scraper-perl/changelog.gz
    • /usr/share/doc/libweb-scraper-perl/copyright
    • /usr/share/doc/libweb-scraper-perl/examples/dave-trailer-HD.pl
    • /usr/share/doc/libweb-scraper-perl/examples/ebay-auction.pl
    • /usr/share/doc/libweb-scraper-perl/examples/extract-links.pl
    • /usr/share/doc/libweb-scraper-perl/examples/hatena-keyword.pl
    • /usr/share/doc/libweb-scraper-perl/examples/jp-playstation-store.pl
    • /usr/share/doc/libweb-scraper-perl/examples/rel-tag.pl
    • /usr/share/doc/libweb-scraper-perl/examples/scraper
    • /usr/share/doc/libweb-scraper-perl/examples/twitter-friends.pl
    • /usr/share/man/man3/Web::Scraper.3pm.gz
    • /usr/share/man/man3/Web::Scraper::Filter.3pm.gz
    • /usr/share/man/man3/Web::Scraper::LibXML.3pm.gz
    • /usr/share/perl5/Web/Scraper.pm
    • /usr/share/perl5/Web/Scraper/Filter.pm
    • /usr/share/perl5/Web/Scraper/LibXML.pm

    Changelog

    2014-10-22 - gregor herrmann <gregoa@debian.org>
    libweb-scraper-perl (0.38-1) unstable; urgency=medium
    [ gregor herrmann ]
    * Strip trailing slash from metacpan URLs.
    [ Salvatore Bonaccorso ]
    * Update Vcs-Browser URL to cgit web frontend
    [ gregor herrmann ]
    * Add debian/upstream/metadata
    * Import upstream version 0.38
    * Update debian/copyright:
      + Bump years of upstream and packaging copyright.
      + Drop comment about copyright, now there is a clear statement.
      + Drop section about removed third-party files.
    * Build-depend on libmodule-build-tiny-perl. Bump debhelper dependency accordingly.
    * debian/rules: drop manual removal of script. It doesn't get installed anymore.
    * Add a spelling patch.
    * Mark package as autopkgtest-able.
    * Declare compliance with Debian Policy 3.9.6.

    2013-03-02 - gregor herrmann <gregoa@debian.org>
    libweb-scraper-perl (0.37-1) unstable; urgency=low
    [ gregor herrmann ]
    * debian/control: update {versioned,alternative} (build) dependencies.
    [ Salvatore Bonaccorso ]
    * Change Vcs-Git to canonical URI (git://anonscm.debian.org)
    * Change search.cpan.org based URIs to metacpan.org based URIs
    [ gregor herrmann ]
    * New upstream release.
    * debian/copyright: update copyright years, convert to Copyright Format 1.0.
    * Set Standards-Version to 3.9.4 (no changes).
    * Add libhtml-treebuilder-libxml-perl to Build-Depends-Indep and Recommends.

    2011-11-20 - gregor herrmann <gregoa@debian.org>
    libweb-scraper-perl (0.36-1) unstable; urgency=low
    * New upstream release.

    2011-10-01 - gregor herrmann <gregoa@debian.org>
    libweb-scraper-perl (0.35-1) unstable; urgency=low
    [ Ansgar Burchardt ]
    * debian/control: Convert Vcs-* fields to Git.
    [ Salvatore Bonaccorso ]
    * debian/copyright: Replace DEP5 Format-Specification URL from svn.debian.org to anonscm.debian.org URL.
    [ gregor herrmann ]
    * New upstream release.
    * Update years of copyright for inc/Module/*.

    2011-04-09 - gregor herrmann <gregoa@debian.org>
    libweb-scraper-perl (0.34-1) unstable; urgency=low
    * Initial release (closes: #530467).
