<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Pocket SEO &#187; Scripts</title>
	<atom:link href="http://pocketseo.com/category/scripts/feed" rel="self" type="application/rss+xml" />
	<link>http://pocketseo.com</link>
	<description>Practical SEO blog</description>
	<lastBuildDate>Wed, 28 Jul 2010 17:12:22 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How to Add a Dynamic Copyright to Your Web Site Footer</title>
		<link>http://pocketseo.com/scripts/197</link>
		<comments>http://pocketseo.com/scripts/197#comments</comments>
		<pubDate>Sun, 09 Dec 2007 20:45:04 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Scripts]]></category>

		<guid isPermaLink="false">http://pocketseo.com/scripts/197</guid>
		<description><![CDATA[In preparation for the new year, this is a simple but useful PHP snippet for keeping your copyright notice up to date on your blog or Web site.

&#60;?php
echo 'Copyright &#38;copy; 2006 &#38;ndash; ' . date&#40;&#34;Y&#34;&#41;;
?&#62;

That will output Copyright © 2006 &#8211; 2007, with the &#8220;2007&#8243; dynamically updating to the current year.
The HTML entity &#38;ndash; just [...]


Related posts:<ol><li><a href='http://pocketseo.com/scripts/116' rel='bookmark' title='Permanent Link: How to Turn Proxy Hijacking Into Inbound Links to Your Web Site'>How to Turn Proxy Hijacking Into Inbound Links to Your Web Site</a></li>
<li><a href='http://pocketseo.com/analytics/168' rel='bookmark' title='Permanent Link: How to Add Google Analytics To vBulletin (including archives)'>How to Add Google Analytics To vBulletin (including archives)</a></li>
<li><a href='http://pocketseo.com/scripts/21' rel='bookmark' title='Permanent Link: Getting More EDU Backlinks'>Getting More EDU Backlinks</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>In preparation for the new year, this is a simple but useful PHP snippet for keeping your copyright notice up to date on your blog or Web site.</p>

<div class="wp_syntax"><div class="code"><pre class="php" style="font-family:monospace;"><span style="color: #000000; font-weight: bold;">&lt;?php</span>
<span style="color: #b1b100;">echo</span> <span style="color: #0000ff;">'Copyright &amp;copy; 2006 &amp;ndash; '</span> <span style="color: #339933;">.</span> <span style="color: #990000;">date</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;Y&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">?&gt;</span></pre></div></div>

<p>That will output <strong>Copyright © 2006 &ndash; 2007</strong>, with the &#8220;2007&#8243; dynamically updating to the current year.</p>
<p>The HTML entity <strong>&amp;ndash;</strong> just means a dash.  You could also do just:</p>

<div class="wp_syntax"><div class="code"><pre class="php" style="font-family:monospace;">Copyright &amp;copy; <span style="color: #000000; font-weight: bold;">&lt;?php</span> <span style="color: #b1b100;">echo</span> <span style="color: #990000;">date</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;Y&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span> <span style="color: #000000; font-weight: bold;">?&gt;</span></pre></div></div>

<p>&#8230;which would output <strong>Copyright © 2007</strong> (with the date always being the current year).</p>
<p>It works on WordPress blogs also &mdash; just insert the code snippet in footer.php.</p>
<img src="http://pocketseo.com/?ak_action=api_record_view&id=197&type=feed" alt="" />

<p>Related posts:<ol><li><a href='http://pocketseo.com/scripts/116' rel='bookmark' title='Permanent Link: How to Turn Proxy Hijacking Into Inbound Links to Your Web Site'>How to Turn Proxy Hijacking Into Inbound Links to Your Web Site</a></li>
<li><a href='http://pocketseo.com/analytics/168' rel='bookmark' title='Permanent Link: How to Add Google Analytics To vBulletin (including archives)'>How to Add Google Analytics To vBulletin (including archives)</a></li>
<li><a href='http://pocketseo.com/scripts/21' rel='bookmark' title='Permanent Link: Getting More EDU Backlinks'>Getting More EDU Backlinks</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://pocketseo.com/scripts/197/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>How to Turn Proxy Hijacking Into Inbound Links to Your Web Site</title>
		<link>http://pocketseo.com/scripts/116</link>
		<comments>http://pocketseo.com/scripts/116#comments</comments>
		<pubDate>Sat, 13 Oct 2007 04:49:40 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Scripts]]></category>

		<guid isPermaLink="false">http://pocketseo.com/scripts/116</guid>
		<description><![CDATA[How to turn a dangerous proxy duplication into a beneficial inbound link.


Related posts:<ol><li><a href='http://pocketseo.com/site-analysis/142' rel='bookmark' title='Permanent Link: Introduction to Advanced SEO Site Analysis for Large Web Sites'>Introduction to Advanced SEO Site Analysis for Large Web Sites</a></li>
<li><a href='http://pocketseo.com/google/110' rel='bookmark' title='Permanent Link: How Google Detects Paid Text Links Based on Web Site Profiling'>How Google Detects Paid Text Links Based on Web Site Profiling</a></li>
<li><a href='http://pocketseo.com/tools/59' rel='bookmark' title='Permanent Link: 12 1/2 Essential Firefox Extensions for SEO'>12 1/2 Essential Firefox Extensions for SEO</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Proxy hijacking is a problem that has been in the SEO news lately.  Dan Thies has a good post about it on <a href="http://www.seofaststart.com/blog/google-proxy-hacking">SEO Fast Start</a>.</p>
<p>Basically, proxy hijacking occurs when search engines index your Web pages through someone else&#8217;s proxy.  Here is an example from a proxy site that has indexed a duplicate of wunderground.com&#8217;s about page:</p>
<p><img src="http://pocketseo.com/wp-content/uploads/2007/10/proxy-hijacking.png" alt="Proxy Hijacking oddproxy.com and wunderground.com" /></p>
<p>Dan Thies mentioned a few solutions to the problem.  One is</p>
<blockquote><p>&#8230;a PHP script that implements the &#8220;reverse cloaking&#8221; defense, putting a &#8220;nonindex, nofollow&#8221; robots meta tag into your pages unless it&#8217;s a spider that you have configured the script to recognize.</p></blockquote>
<p>That solution may remove your proxied page from Google index (unless the proxy strips your &lt;head&gt; element), but it doesn&#8217;t provide any benefit for your site.</p>
<p>Another alternative is to turn that indexed proxy page from a dangerous duplicate of your Web page into an inbound link.</p>
<h2>Getting the IBL</h2>
<p>To find the IP address of the proxy, use the <a href="https://addons.mozilla.org/en-US/firefox/addon/590" title="Download the Show IP Firefox Extension">Show IP Firefox Extension</a>.  It will show the IP address(es) of the Web page that is in your browser.</p>
<p><img src="http://pocketseo.com/wp-content/uploads/2007/10/show-ip-firefox.png" alt="Show IP Firefox Extension" /></p>
<p>The IP of the proxy is usually the same as the IP that grabs your content.  If not, it&#8217;s probably on the same C-block of IP addresses, and you can grep it out of your logs (see below).</p>
<p>The first example assumes that the proxy is grabbing your content from the same IP address that is seen with the Show IP Firefox Extension. This PHP code has not been fully tested, but is just intended to be an example of the technique.</p>

<div class="wp_syntax"><div class="code"><pre class="php" style="font-family:monospace;"><span style="color: #000000; font-weight: bold;">&lt;?php</span>
<span style="color: #000088;">$ip</span> <span style="color: #339933;">=</span> <span style="color: #000088;">$_SERVER</span><span style="color: #009900;">&#91;</span><span style="color: #0000ff;">'REMOTE_ADDR'</span><span style="color: #009900;">&#93;</span><span style="color: #339933;">;</span>
<span style="color: #666666; font-style: italic;">// change the following nn.nnn.nnn.nnn to IP addresses</span>
<span style="color: #666666; font-style: italic;">// and add others to the array as necessary</span>
<span style="color: #000088;">$cloaked_ips</span> <span style="color: #339933;">=</span> <span style="color: #990000;">array</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;nn.nnn.nnn.nnn&quot;</span><span style="color: #339933;">,</span><span style="color: #0000ff;">&quot;nnn.nnn.nnn.nn&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #b1b100;">if</span><span style="color: #009900;">&#40;</span><span style="color: #990000;">in_array</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;<span style="color: #006699; font-weight: bold;">$ip</span>&quot;</span><span style="color: #339933;">,</span><span style="color: #000088;">$cloaked_ips</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
    <span style="color: #b1b100;">echo</span> <span style="color: #0000cc; font-style: italic;">&lt;&lt;&lt;EOF
&lt;html&gt;
&lt;head&gt;
&lt;title&gt;Your Main Keyword&lt;/title&gt;
&lt;/head&gt;
&lt;body&gt;
&lt;h1&gt;Name of Site or Some Keywords&lt;/h1&gt;
&lt;p&gt;Please visit the site directly:
&lt;a href=&quot;http://example.com/&quot;&gt;name of site or keywords&lt;/a&gt;.&lt;/p&gt;
&lt;/body&gt;
&lt;/html&gt;
EOF</span><span style="color: #339933;">;</span>
    <span style="color: #990000;">exit</span><span style="color: #339933;">;</span>
<span style="color: #009900;">&#125;</span>
<span style="color: #666666; font-style: italic;">// everything above is for the proxy,</span>
<span style="color: #666666; font-style: italic;">// and everything below this code is for direct visitors</span>
<span style="color: #000000; font-weight: bold;">?&gt;</span></pre></div></div>

<p>If the above code works when you view your site in the browser, then you know that the proxy is using the same IP address that is showing in the Show IP Firefox Extension.</p>
<p>If it is not showing the alternate content to the proxy then you can test it by removing the last number from the IP address in the script.  If that doesn&#8217;t work, I recommend grepping (searching) your log files for IP addresses on the same <a href="http://samspade.org/d/ipdns.html">C-block</a>.</p>
<h2>IP Hunting With Grep</h2>
<p><acronym title="Globally search for the Regular Expression and Print">Grep</acronym> is a Unix command that you can use to search large files (such as logs) for information.  If you haven&#8217;t used grep before, I recommend a <a href="http://www.panix.com/~elflord/unix/grep.html">grep tutorial</a> or a full command line tutorial like <a href="http://www.linuxcommand.org/">this</a>, <a href="http://unixmages.com/">this</a>, or <a href="http://www.freesoftwaremagazine.com/articles/command_line_intro">this</a>.</p>
<p>To find IP addresses on the same C-block use a line similar to this in a Linux, MacOSX, or <a href="http://www.cygwin.com/">Cygwin</a> (Windows) terminal:</p>
<p><code>egrep "nnn\.nnn\.nnn\." access_log &gt; ip_addresses.txt</code></p>
<p>That will give you a file named ip_addresses.txt that contains a list of all the hits on your server from the same C-block as the proxy.  You will probably find the correct IP to block in that file.</p>
<h2>Tracking Proxy Indexing</h2>
<p>This is just an idea that I haven&#8217;t fully tested yet, but you may be able to keep track of duplicate content on proxies with <a href="http://www.google.com/alerts">Google Alerts</a>.</p>
<p>Setup a <em>comprehensive</em> search for something like <strong>intitle:&#8221;site name&#8221;</strong>, or even better, <strong>intitle:&#8221;site name&#8221; inurl:&#8221;cgi&#8221;</strong>.  Here is an example screenshot:</p>
<p><img src="http://pocketseo.com/wp-content/uploads/2007/10/google-alert-proxy.png" alt="Google Alerts for Proxy Content Hijacking" /></p>
<p>If you have additional tips for combating proxy duplication, please leave a comment below.</p>
<img src="http://pocketseo.com/?ak_action=api_record_view&id=116&type=feed" alt="" />

<p>Related posts:<ol><li><a href='http://pocketseo.com/site-analysis/142' rel='bookmark' title='Permanent Link: Introduction to Advanced SEO Site Analysis for Large Web Sites'>Introduction to Advanced SEO Site Analysis for Large Web Sites</a></li>
<li><a href='http://pocketseo.com/google/110' rel='bookmark' title='Permanent Link: How Google Detects Paid Text Links Based on Web Site Profiling'>How Google Detects Paid Text Links Based on Web Site Profiling</a></li>
<li><a href='http://pocketseo.com/tools/59' rel='bookmark' title='Permanent Link: 12 1/2 Essential Firefox Extensions for SEO'>12 1/2 Essential Firefox Extensions for SEO</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://pocketseo.com/scripts/116/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Install Linux on an External Hard Drive</title>
		<link>http://pocketseo.com/scripts/43</link>
		<comments>http://pocketseo.com/scripts/43#comments</comments>
		<pubDate>Thu, 21 Jun 2007 03:00:12 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Scripts]]></category>

		<guid isPermaLink="false">http://pocketseo.com/scripts/43</guid>
		<description><![CDATA[How to install Linux on an external hard drive for SEO scripting purposes.


Related posts:<ol><li><a href='http://pocketseo.com/tools/174' rel='bookmark' title='Permanent Link: Ubuntu Gutsy Gibbon and Google OS &#8211; Linux on the Desktop'>Ubuntu Gutsy Gibbon and Google OS &#8211; Linux on the Desktop</a></li>
<li><a href='http://pocketseo.com/google/123' rel='bookmark' title='Permanent Link: Google Spyware Errors? (in Linux)'>Google Spyware Errors? (in Linux)</a></li>
<li><a href='http://pocketseo.com/scripts/11' rel='bookmark' title='Permanent Link: How to Find Thousands of Potential EDU Backlinks with GNU/Linux'>How to Find Thousands of Potential EDU Backlinks with GNU/Linux</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Some of the SEO recipes that I will be posting on this site require a Unix-based operating system such as Linux.  Mac OS/X should also work for those examples, but I&#8217;m not as familiar with Mac so you may have to adapt the scripts somewhat.</p>
<p>This post shows a way to install Linux on an external hard drive so that you don&#8217;t have to modify your internal hard drive for a dual-boot Linux/Windows setup.</p>
<p><img src='http://pocketseo.com/wp-content/uploads/2007/06/linux-external-hard-drive.jpg' alt='Run Linux from External Hard Drive' /></p>
<p>(Sorry about the quality of the image above &mdash; it was taken with my cell phone.)</p>
<p>The reason that I recommend having Linux around for SEO is because of the scriptable shell and text processing abilities.  For example, if you wanted to find the number of outbound links on a page you could just type the following into a terminal:</p>
<p><code>lynx -dump "http://pocketseo.com/" | egrep -o "http:.*" | wc -l</code></p>
<p>That is just a very basic example.  There are many great possibilities with Linux, and I will cover more-interesting SEO scripting recipes in the future.</p>
<h2>Install Ubuntu on External Hard Drive</h2>
<p>I&#8217;m currently running my main operating system from a 120 Gb external hard drive that I got at Best Buy for $101 including tax.  Here are the steps:</p>
<ol>
<li>Download a copy of <a href="http://www.ubuntu.com/">Ubuntu Linux</a>.  I&#8217;m using version Ubuntu 7.04 &#8220;Feisty Fawn&#8221;.</li>
<li>Burn the CD image to a CD-R with <a href="https://help.ubuntu.com/community/BurningIsoHowto">these instructions</a>.  You will end up with a Linux live CD.</li>
<li>Turn off the computer and remove the internal drive(s).  Otherwise you risk getting the MBR overwritten when you install Linux.</li>
<li>Boot the computer from the live CD.  As soon as the Ubuntu start screen appears, plug in the external hard drive that you want to install to.  Start Ubuntu from the menu that is provided.</li>
<li>Once the Ubuntu desktop has loaded, double-click on the <em>install</em> icon and follow the installation instructions.  The installation process should take between 20 to 40 minutes.</li>
</ol>
<p>When the installation is done, remove the live CD and reboot the computer from the external hard drive.  You should then be running a full installation of Ubuntu.  You can put your regular hard drive back in the computer and just boot from the external hard drive when you want to use Linux.  You can take your Linux installation with you wherever you go.  Any computer that you plug the external hard drive into will have your entire operating system on it.</p>
<p>If you haven&#8217;t used Linux before, check out the <a href="https://help.ubuntu.com/">Ubuntu help docs</a> and the <a href="http://www.ubuntuforums.org/">Ubuntu Forums</a>.</p>
<p>If you want to experiment with the latest Linux desktop, try the <a href="http://www.sabayonlinux.org/">Sabayon Linux Live DVD</a>.  An example is below:</p>
<p><object width="425" height="350"><param name="movie" value="http://www.youtube.com/v/fJ8IIyikhkc"></param><param name="wmode" value="transparent"></param><embed src="http://www.youtube.com/v/fJ8IIyikhkc" type="application/x-shockwave-flash" wmode="transparent" width="425" height="350"></embed></object></p>
<p>More about Linux and SEO techniques is coming soon.</p>
<img src="http://pocketseo.com/?ak_action=api_record_view&id=43&type=feed" alt="" />

<p>Related posts:<ol><li><a href='http://pocketseo.com/tools/174' rel='bookmark' title='Permanent Link: Ubuntu Gutsy Gibbon and Google OS &#8211; Linux on the Desktop'>Ubuntu Gutsy Gibbon and Google OS &#8211; Linux on the Desktop</a></li>
<li><a href='http://pocketseo.com/google/123' rel='bookmark' title='Permanent Link: Google Spyware Errors? (in Linux)'>Google Spyware Errors? (in Linux)</a></li>
<li><a href='http://pocketseo.com/scripts/11' rel='bookmark' title='Permanent Link: How to Find Thousands of Potential EDU Backlinks with GNU/Linux'>How to Find Thousands of Potential EDU Backlinks with GNU/Linux</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://pocketseo.com/scripts/43/feed</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Getting More EDU Backlinks</title>
		<link>http://pocketseo.com/scripts/21</link>
		<comments>http://pocketseo.com/scripts/21#comments</comments>
		<pubDate>Fri, 25 May 2007 03:03:45 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Scripts]]></category>

		<guid isPermaLink="false">http://pocketseo.com/scripts/21</guid>
		<description><![CDATA[This is a shell script that will help you find more EDU backlinks for your Web sites.


Related posts:<ol><li><a href='http://pocketseo.com/scripts/11' rel='bookmark' title='Permanent Link: How to Find Thousands of Potential EDU Backlinks with GNU/Linux'>How to Find Thousands of Potential EDU Backlinks with GNU/Linux</a></li>
<li><a href='http://pocketseo.com/analytics/31' rel='bookmark' title='Permanent Link: How to Extract Bot Activity From Logfiles'>How to Extract Bot Activity From Logfiles</a></li>
<li><a href='http://pocketseo.com/scripts/116' rel='bookmark' title='Permanent Link: How to Turn Proxy Hijacking Into Inbound Links to Your Web Site'>How to Turn Proxy Hijacking Into Inbound Links to Your Web Site</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>If you already have <a href="/scripts/11">lists of hundreds or thousands of potential backlinks</a> contained in multiple files and want to extract the EDU links from them you can run a quick shell script on them.</p>
<p>Take the files that contain your lists of backlinks and put them in their own directory.  Then navigate to that directory in the terminal and type the following lines:</p>
<p><code>for f in *<br />
do<br />
cat $f | grep &apos;\.edu&apos; &gt;EDU-$f<br />
done</code></p>
<p>An explanation of the script:</p>
<ol>
<li><code>for f in *</code> means to loop over each file in the directory.</li>
<li><code>cat $f</code> outputs the contents of each file.</li>
<li><code>grep &apos;\.edu&apos; &gt;EDU-$f</code> searches through the current file in the loop and extracts each line that contains <em>.edu</em> and then writes it to a file that begins with <em>EDU-</em>.
</li>
</ol>
<img src="http://pocketseo.com/?ak_action=api_record_view&id=21&type=feed" alt="" />

<p>Related posts:<ol><li><a href='http://pocketseo.com/scripts/11' rel='bookmark' title='Permanent Link: How to Find Thousands of Potential EDU Backlinks with GNU/Linux'>How to Find Thousands of Potential EDU Backlinks with GNU/Linux</a></li>
<li><a href='http://pocketseo.com/analytics/31' rel='bookmark' title='Permanent Link: How to Extract Bot Activity From Logfiles'>How to Extract Bot Activity From Logfiles</a></li>
<li><a href='http://pocketseo.com/scripts/116' rel='bookmark' title='Permanent Link: How to Turn Proxy Hijacking Into Inbound Links to Your Web Site'>How to Turn Proxy Hijacking Into Inbound Links to Your Web Site</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://pocketseo.com/scripts/21/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Find Thousands of Potential EDU Backlinks with GNU/Linux</title>
		<link>http://pocketseo.com/scripts/11</link>
		<comments>http://pocketseo.com/scripts/11#comments</comments>
		<pubDate>Mon, 21 May 2007 20:18:57 +0000</pubDate>
		<dc:creator>Josh</dc:creator>
				<category><![CDATA[Scripts]]></category>

		<guid isPermaLink="false">http://pocketseo.com/scripts/11</guid>
		<description><![CDATA[It's easy to automate the process of finding backlinks when using GNU/Linux or Mac.  This script builds a formatted list of potential backlinks from EDU sites based on a keyword of your choice.


Related posts:<ol><li><a href='http://pocketseo.com/scripts/21' rel='bookmark' title='Permanent Link: Getting More EDU Backlinks'>Getting More EDU Backlinks</a></li>
<li><a href='http://pocketseo.com/google/33' rel='bookmark' title='Permanent Link: Google Search URL Parameters'>Google Search URL Parameters</a></li>
<li><a href='http://pocketseo.com/scripts/43' rel='bookmark' title='Permanent Link: How to Install Linux on an External Hard Drive'>How to Install Linux on an External Hard Drive</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>If you use GNU/Linux or Mac OS/X (or even Cygwin on Windows) you can use a little one-line script to quickly build lists of thousands of possible backlinks.  It can be used to build lists on any kind of Google query, but this one specifically finds pages on EDU sites.</p>
<p>The script below visits a page of Google SERPs with a text browser and then formats the data for you.  The example below uses the <a href="http://links.sourceforge.net/">Links Browser</a>, but you can also use <a href="http://lynx.browser.org/">Lynx</a>.</p>
<p><code>links -dump &apos;http://www.google.com/search?q=site%3Aedu+keyword&amp;num=100&amp;start=0&apos; | egrep -o &quot;http:.*&quot; | grep -v &quot;google.com&quot; | egrep -v &apos;[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}&apos; | awk &apos;{ print &quot;&lt;li&gt;&lt;a href=\&quot;&quot;$1&quot;\&quot;&gt;&quot;$1&quot;&lt;/a&gt;&lt;/li&gt;&quot; }&apos; &gt;&gt; backlink_hunter.html</code></p>
<p>The output is an HTML page with a bulleted list of links.  You can then use Firefox to middle-click on each link and quickly scan them to see if they might be a candidate for a backlink request.  Useful Firefox keyboard shortcuts for this are Ctrl-w (close tab), Ctrl-Tab and Ctrl-Shift-Tab (move between tabs).  Here is a screenshot of part of the output:</p>
<p><img src='http://pocketseo.com/wp-content/uploads/2007/05/edu-links.png' alt='EDU Backlinks' /></p>
<p>A quick explanation of the script:</p>
<ol>
<li><code>links -dump</code> (or <code>lynx -dump</code>) tells your text browser to dump the contents of the page into the terminal.  Both text browsers create a list of URLs from the page at the bottom of the output.  We are dumping a Google query for your desired keyword, with 100 results per page, starting at the beginning (&amp;start=0).  You can run the script multiple times, increasing the <em>start</em> parameter each time (for example: 101, 201, 301, up to 901).  100 results is probably enough to start with though.</li>
<li><code>egrep -o</code> searches each line for text starting with <em>http:</em>.  The <code>-o</code> option means to only return the matching portion of the line &mdash; in this case, just the URL.</li>
<li><code>grep</code> with the <code>-v</code> option removes lines that match the regular expression &mdash; in this case removing any URL that interally links to Google.</li>
<li>The next <code>grep -v</code> line removes all URLs that match an IP address because Google links back to itself in the SERPs with an IP address.  This will leave us with only outbound links from the SERPs.</li>
<li><code>awk</code> takes each remaining URL and encloses them in &lt;li&gt; tags to format them in a list.</li>
<li>The final command of the script &mdash; <em>&gt;&gt; backlink_hunter.html</em> &mdash; appends the output the the file backlink_hunter.html.  If the file doesn&apos;t exist, it will be created.  If it does exist, it will just append the output to the end of the file.  That way you can change the <em>&#038;start=</em> parameter in the Google query and append the next set of results to the file, creating list of up to 1000 URLs.</li>
</ol>
<p>I recommend experimenting with different Google operators and seeing what happens.  Using this one-liner is much faster than some other methods of hunting for backlinks.  The script can be expanded to do more &mdash; this is just a basic introduction to the concept.</p>
<img src="http://pocketseo.com/?ak_action=api_record_view&id=11&type=feed" alt="" />

<p>Related posts:<ol><li><a href='http://pocketseo.com/scripts/21' rel='bookmark' title='Permanent Link: Getting More EDU Backlinks'>Getting More EDU Backlinks</a></li>
<li><a href='http://pocketseo.com/google/33' rel='bookmark' title='Permanent Link: Google Search URL Parameters'>Google Search URL Parameters</a></li>
<li><a href='http://pocketseo.com/scripts/43' rel='bookmark' title='Permanent Link: How to Install Linux on an External Hard Drive'>How to Install Linux on an External Hard Drive</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://pocketseo.com/scripts/11/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
