<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Using Python to Grab Images From a Web Site</title>
	<atom:link href="http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/</link>
	<description>Programming, design, business and productivity content for tech entrepreneurs living in a 2.0 world.</description>
	<lastBuildDate>Thu, 04 Mar 2010 17:15:57 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: michaelangela</title>
		<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/comment-page-1/#comment-3831</link>
		<dc:creator>michaelangela</dc:creator>
		<pubDate>Thu, 16 Oct 2008 01:01:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/#comment-3831</guid>
		<description>Ah... forgot to add the

&#124; xargs wget -q

at the end of the curl/egrep line to do the actual downloading.</description>
		<content:encoded><![CDATA[<p>Ah&#8230; forgot to add the</p>
<p>| xargs wget -q</p>
<p>at the end of the curl/egrep line to do the actual downloading.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ramblings</title>
		<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/comment-page-1/#comment-3830</link>
		<dc:creator>Ramblings</dc:creator>
		<pubDate>Wed, 15 Oct 2008 17:29:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/#comment-3830</guid>
		<description>&lt;strong&gt;grep instead of python?...&lt;/strong&gt;

...</description>
		<content:encoded><![CDATA[<p><strong>grep instead of python?&#8230;</strong></p>
<p>&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: michaelangela</title>
		<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/comment-page-1/#comment-3829</link>
		<dc:creator>michaelangela</dc:creator>
		<pubDate>Wed, 15 Oct 2008 17:21:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/#comment-3829</guid>
		<description>Great tip using grep. I was going to use Python, too. :) My box doesn&#039;t have the -P option :( but I used egrep to similar effect. Also, the items I needed were in  tags so I had to remove those with a sed command. The images were also retrieved from a database and didn&#039;t have an extension so I did a one-liner loop to rename those.

curl -s http://domain.tld/feed &#124; egrep -o &quot;.*&quot; &#124; egrep -o &quot;(http.*)&quot; &#124; sed -e &#039;s/]*&gt;//g&#039;
for f in *; do mv ./&quot;$f&quot; &quot;${f}.jpg&quot;; done</description>
		<content:encoded><![CDATA[<p>Great tip using grep. I was going to use Python, too. <img src='http://www.thinkingserious.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  My box doesn&#8217;t have the -P option <img src='http://www.thinkingserious.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' />  but I used egrep to similar effect. Also, the items I needed were in  tags so I had to remove those with a sed command. The images were also retrieved from a database and didn&#8217;t have an extension so I did a one-liner loop to rename those.</p>
<p>curl -s <a href="http://domain.tld/feed" rel="nofollow">http://domain.tld/feed</a> | egrep -o ".*" | egrep -o "(http.*)" | sed -e 's/]*&gt;//g'<br />
for f in *; do mv ./"$f" "${f}.jpg"; done</p>
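<p>For anyone who would rather stay in Python for the rename step, here is a minimal sketch. The function name add_jpg_extension is made up for illustration, and unlike the shell loop above it only touches files that have no extension at all, which is an assumption about what is wanted here.</p>

```python
from pathlib import Path


def add_jpg_extension(directory="."):
    """Append .jpg to every extensionless regular file in directory.

    Hypothetical helper: hidden files (names starting with a dot) and
    files that already have an extension are left alone.
    """
    renamed = []
    # sorted() materializes the listing before any file is renamed.
    for path in sorted(Path(directory).iterdir()):
        if not path.is_file() or path.name.startswith("."):
            continue
        if path.suffix == "":
            target = path.with_name(path.name + ".jpg")
            path.rename(target)
            renamed.append(target.name)
    return renamed
```

<p>Calling add_jpg_extension("downloads") would return the new names of the files it renamed in that (hypothetical) folder.</p>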
]]></content:encoded>
	</item>
	<item>
		<title>By: ThinkingSerious</title>
		<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/comment-page-1/#comment-1360</link>
		<dc:creator>ThinkingSerious</dc:creator>
		<pubDate>Fri, 02 May 2008 22:06:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/#comment-1360</guid>
		<description>Great example. Thanks for your contribution.</description>
		<content:encoded><![CDATA[<p>Great example. Thanks for your contribution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: btruelove</title>
		<link>http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/comment-page-1/#comment-1359</link>
		<dc:creator>btruelove</dc:creator>
		<pubDate>Fri, 02 May 2008 17:42:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingserious.com/2008/04/30/using-python-to-grab-images-from-a-web-site/#comment-1359</guid>
		<description>Overkill IMO. A less trivial example is going to get large and messy quickly. What about when the XML file is on an FTP server, or requires authentication, needs some cookie, has to filter downloads by file size, wants to use a proxy, spawn multiple processes and so on? A more apt tool is curl or wget (I&#039;m lazy so I used both). Also, when you keep it at the shell it&#039;s more natural to pull in other shell commands when needed.

curl -s http://99designs.com/contests/6999/feed &#124; grep -Po &quot;src=\&quot;[^\&quot;]*\.(png&#124;jpg)&quot; &#124; grep -o &quot;http.*&quot; &#124; xargs wget -q</description>
		<content:encoded><![CDATA[<p>Overkill IMO. A less trivial example is going to get large and messy quickly. What about when the XML file is on an FTP server, or requires authentication, needs some cookie, has to filter downloads by file size, wants to use a proxy, spawn multiple processes and so on? A more apt tool is curl or wget (I&#8217;m lazy so I used both). Also, when you keep it at the shell it&#8217;s more natural to pull in other shell commands when needed.</p>
<p>curl -s <a href="http://99designs.com/contests/6999/feed" rel="nofollow">http://99designs.com/contests/6999/feed</a> | grep -Po "src=\"[^\"]*\.(png|jpg)" | grep -o "http.*" | xargs wget -q</p>
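<p>For comparison, the same pipeline can be roughly sketched in Python using only the standard library. This is an illustration, not the code from the original post: the names extract_image_urls and download_images and the regular expression are assumptions, and it only picks up absolute http(s) URLs ending in .png or .jpg.</p>

```python
import html
import os
import re
import urllib.request

# Stand-in for the grep pattern: an absolute http(s) URL inside a src
# attribute, ending in .png or .jpg (assumed to cover the feed's images).
IMG_SRC = re.compile(r'src=["\'](https?://[^"\']+?\.(?:png|jpg))["\']', re.IGNORECASE)


def extract_image_urls(feed_text):
    """Return the unique image URLs in feed_text, in order of appearance."""
    # Feeds often entity-escape embedded HTML (src=&quot;...), so unescape first.
    text = html.unescape(feed_text)
    urls = []
    for url in IMG_SRC.findall(text):
        if url not in urls:
            urls.append(url)
    return urls


def download_images(feed_url, dest_dir="."):
    """Fetch feed_url and save every matched image into dest_dir."""
    with urllib.request.urlopen(feed_url) as response:
        feed_text = response.read().decode("utf-8", errors="replace")
    for url in extract_image_urls(feed_text):
        # Name the local file after the last path segment of the URL.
        filename = os.path.join(dest_dir, url.rsplit("/", 1)[-1])
        urllib.request.urlretrieve(url, filename)
```

<p>Something like download_images("http://99designs.com/contests/6999/feed") would then play the role of the curl, grep and wget chain, though authentication, proxies and the other cases mentioned above would still need extra code.</p>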
]]></content:encoded>
	</item>
</channel>
</rss>
