<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
<title>Snipplr - noah</title>
<link>http://snipplr.com/users/noah/tags/aggregator</link>
<description>Recent snippets posted on Snipplr.com</description>
<language>en-us</language>
<pubDate>Tue, 07 Oct 2008 03:35:15 GMT</pubDate>
<item>
<title>(Bash) Scrape Google from the command line</title>
<link>http://snipplr.com/view/4299/scrape-google-from-the-command-line/</link>
<description><![CDATA[ <p>This code is POC only -- actually using it would violate Google's TOS, which forbids scraping.  It is published here for educational value only.

Hypothetically, the following command should return a list of the top 500 or so hits in Google for mysite.com.

The results will be prepended with digits, followed by a dot and some whitespace (Lynx adds these).</p> ]]></description>
<pubDate>Sun, 09 Dec 2007 21:16:58 GMT</pubDate>
<guid>http://snipplr.com/view/4299/scrape-google-from-the-command-line/</guid>
</item>
<item>
<title>(Perl) scraper</title>
<link>http://snipplr.com/view/3131/scraper/</link>
<description><![CDATA[ <p>For a while I used this to scrape weather.com.  Then they changed their HTML and my script broke.</p> ]]></description>
<pubDate>Tue, 03 Jul 2007 22:48:36 GMT</pubDate>
<guid>http://snipplr.com/view/3131/scraper/</guid>
</item>
<item>
<title>(Perl) Aggregate RSS feeds</title>
<link>http://snipplr.com/view/3130/aggregate-rss-feeds/</link>
<description><![CDATA[ <p>Expects a newline-delimited list of RSS feed urls.  Prints an HTML 4 page containing a grid of modules.  The modules contain hyperlinked feed headlines.

Very fragile, any invalid feed will break the whole aggregator.</p> ]]></description>
<pubDate>Tue, 03 Jul 2007 22:45:27 GMT</pubDate>
<guid>http://snipplr.com/view/3130/aggregate-rss-feeds/</guid>
</item>
</channel>
</rss>