Published in: Bash
This code is POC only -- actually using it would violate Google's TOS, which forbids scraping. It is published here for educational value only.
Hypothetically, the following command should return a list of the top 500 or so hits in Google for mysite.com.
The results will be prepended with digits, followed by a dot and some whitespace (Lynx adds these).
perl -e "$i=0;while($i<1000){open(WGET,qq/|xargs lynx -dump/);printf WGET qq{http://www.google.com/search?q=site:mysite.com&hl=en&start=$i&sa=N},$i+=10}"|grep "\/\/mysite.com\/"
You need to login to post a comment.
