<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Fetch and Parse HTML Web Page Content From Bash. Wow.</title>
	<atom:link href="http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/</link>
	<description>Matt Wynne taking it one tea at a time</description>
	<lastBuildDate>Fri, 27 Jan 2012 08:31:34 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Wim</title>
		<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/comment-page-1/#comment-1235</link>
		<dc:creator>Wim</dc:creator>
		<pubDate>Thu, 17 Feb 2011 18:59:20 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/#comment-1235</guid>
		<description>&lt;p&gt;Great stuff, it was exactly what I needed for a script I am working on right now!&lt;/p&gt;

&lt;p&gt;see: http://forum.nedlinux.nl/viewtopic.php?pid=354120#p354120&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Great stuff, it was exactly what I needed for a script I am working on right now!</p>

<p>see: <a href="http://forum.nedlinux.nl/viewtopic.php?pid=354120#p354120" rel="nofollow">http://forum.nedlinux.nl/viewtopic.php?pid=354120#p354120</a></p>]]></content:encoded>
	</item>
	<item>
		<title>By: Eoin McCarthy</title>
		<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/comment-page-1/#comment-834</link>
		<dc:creator>Eoin McCarthy</dc:creator>
		<pubDate>Mon, 19 Oct 2009 01:19:03 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/#comment-834</guid>
		<description>&lt;p&gt;Great stuff! I have to admit I wasn&#039;t aware of curl before. However, I couldn&#039;t track down w3m for the Mac and ended up downloading lynx instead.&lt;/p&gt;

&lt;p&gt;lynx --dump  gives you very similar results.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Great stuff! I have to admit I wasn&#8217;t aware of curl before. However, I couldn&#8217;t track down w3m for the Mac and ended up downloading lynx instead.</p>

<p>lynx --dump  gives you very similar results.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Pete</title>
		<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/comment-page-1/#comment-820</link>
		<dc:creator>Pete</dc:creator>
		<pubDate>Mon, 05 Oct 2009 06:39:34 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/#comment-820</guid>
		<description>&lt;p&gt;Geez... my formatting is horrible...&lt;/p&gt;

&lt;p&gt;On the *nix boxes... I got the scripts combined... Windows.... well... uhhh.... batch files lost their appeal many moons ago.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Geez&#8230; my formatting is horrible&#8230;</p>

<p>On the *nix boxes&#8230; I got the scripts combined&#8230; Windows&#8230;. well&#8230; uhhh&#8230;. batch files lost their appeal many moons ago.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Pete</title>
		<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/comment-page-1/#comment-819</link>
		<dc:creator>Pete</dc:creator>
		<pubDate>Mon, 05 Oct 2009 06:36:56 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/#comment-819</guid>
		<description>&lt;p&gt;Well, there are a few things you can do with curl, w3m, and html2text.&lt;/p&gt;

&lt;p&gt;Currently I have a few things in my bag o&#039; scripts:&lt;/p&gt;

&lt;p&gt;1) Command line wikipedia script
2) Zipcode lookup script
3) NPA (i.e. area code) lookup script (telco thing)
4) NPA-NXX area code and prefix lookup script (telco thing)
5) NPA-NXX-XXXX phone number lookup script (telco thing)
6) Dictionary lookup script using curl, since curl understands the dict protocol
7) Acronyms lookup script (curl again)
8) soundslike script that looks words up in the soundex database (helps with misspelled words that your word processor or text editor doesn&#039;t catch)&lt;/p&gt;

&lt;p&gt;Ahhhh what the heck; here is the Windows version of the dictionary script using curl:&lt;/p&gt;

&lt;p&gt;curl -s dict://dict.org/d:%1:gcide &amp;
curl -s dict://dict.org/d:%1:wn &amp;
curl -s dict://dict.org/d:%1:web1913 &amp;&lt;/p&gt;

&lt;p&gt;Replace the %1 on Windows with $1 on OS X, Linux, BSD, or Solaris and you&#039;re golden.&lt;/p&gt;

&lt;p&gt;Here are the scripts for the soundex database (i.e. words that sound alike):&lt;/p&gt;

&lt;p&gt;curl -s dict://dict.org/m:%1::soundex
curl -s dict://dict.org/m:%1::soundex:1&lt;/p&gt;

&lt;p&gt;Script for a thesaurus:&lt;/p&gt;

&lt;p&gt;curl -s dict://dict.org/d:%1:moby-thes&lt;/p&gt;

&lt;p&gt;Check out dict.org for the protocol and a list of the databases they make accessible to the public.&lt;/p&gt;

&lt;p&gt;Here is the Wiktionary lookup script.  Check out the database to see what other data formats can be output: text, HTML, etc.  Wiktionary is the dictionary side of the house of Wikipedia; they are making a free dictionary.&lt;/p&gt;

&lt;p&gt;curl -s dict://dict.hewgill.com/d:%1:en-brief &amp;&lt;/p&gt;

&lt;p&gt;curl --manual &#124; grep dict&lt;/p&gt;

&lt;p&gt;Don&#039;t forget to read the RFC on the dict protocol.  &lt;/p&gt;

&lt;p&gt;I would post the others but those scripts are quite long and they&#039;re on my *nix boxes.&lt;/p&gt;

&lt;p&gt;PS: You can use the dump option in w3m and look into the column option.  On a related note, check out &quot;html2text&quot;
Options: -ascii -style pretty&lt;/p&gt;

&lt;p&gt;html2text is part curl and part w3m.  More options exist when using curl and html2text than html2text alone.&lt;/p&gt;

&lt;p&gt;PPS: Good luck on your job search.
Regards.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Well, there are a few things you can do with curl, w3m, and html2text.</p>

<p>Currently I have a few things in my bag o&#8217; scripts:</p>

<p>1) Command line wikipedia script
2) Zipcode lookup script
3) NPA (i.e. area code) lookup script (telco thing)
4) NPA-NXX area code and prefix lookup script (telco thing)
5) NPA-NXX-XXXX phone number lookup script (telco thing)
6) Dictionary lookup script using curl, since curl understands the dict protocol
7) Acronyms lookup script (curl again)
8) soundslike script that looks words up in the soundex database (helps with misspelled words that your word processor or text editor doesn&#8217;t catch)</p>

<p>Ahhhh what the heck; here is the Windows version of the dictionary script using curl:</p>

<p>curl -s dict://dict.org/d:%1:gcide &amp;
curl -s dict://dict.org/d:%1:wn &amp;
curl -s dict://dict.org/d:%1:web1913 &amp;</p>

<p>Replace the %1 on Windows with $1 on OS X, Linux, BSD, or Solaris and you&#8217;re golden.</p>

<p>Here are the scripts for the soundex database (i.e. words that sound alike):</p>

<p>curl -s dict://dict.org/m:%1::soundex
curl -s dict://dict.org/m:%1::soundex:1</p>

<p>Script for a thesaurus:</p>

<p>curl -s dict://dict.org/d:%1:moby-thes</p>

<p>Check out dict.org for the protocol and a list of the databases they make accessible to the public.</p>

<p>Here is the Wiktionary lookup script.  Check out the database to see what other data formats can be output: text, HTML, etc.  Wiktionary is the dictionary side of the house of Wikipedia; they are making a free dictionary.</p>

<p>curl -s dict://dict.hewgill.com/d:%1:en-brief &amp;</p>

<p>curl --manual | grep dict</p>

<p>Don&#8217;t forget to read the RFC on the dict protocol.  </p>

<p>I would post the others but those scripts are quite long and they&#8217;re on my *nix boxes.</p>

<p>PS: You can use the dump option in w3m and look into the column option.  On a related note, check out &#8220;html2text&#8221;
Options: -ascii -style pretty</p>

<p>html2text is part curl and part w3m.  More options exist when using curl and html2text than html2text alone.</p>

<p>PPS: Good luck on your job search.
Regards.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: David</title>
		<link>http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/comment-page-1/#comment-476</link>
		<dc:creator>David</dc:creator>
		<pubDate>Sun, 15 Mar 2009 02:47:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mattwynne.net/2008/04/26/fetch-and-parse-html-web-page-content-from-bash-wow/#comment-476</guid>
		<description>&lt;p&gt;Awesome sauce thanks buddy :)&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Awesome sauce thanks buddy <img src='http://blog.mattwynne.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>]]></content:encoded>
	</item>
</channel>
</rss>

