<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: NewsNow vs the Times: Right to crawl vs right to link</title>
	<atom:link href="http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/</link>
	<description>Where to find Malcolm Coles, reviews, and tips on how to do things I couldn&#039;t do.</description>
	<lastBuildDate>Mon, 25 Mar 2013 16:22:31 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
	<item>
		<title>By: Julian Burgess</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-8404</link>
		<dc:creator>Julian Burgess</dc:creator>
		<pubDate>Tue, 09 Mar 2010 16:12:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779#comment-8404</guid>
		<description>Glad to see an article which has got this correct. In my spare time I work on a website which crawls a number of websites. I always obey robots.txt; however, it is perfectly possible to ignore it, or to move your crawler to a different IP address which isn&#039;t banned. When someone is crawling in a way which you object to for any reason, robots.txt is a good way to indicate they should stop.</description>
		<content:encoded><![CDATA[<p>Glad to see an article which has got this correct. In my spare time I work on a website which crawls a number of websites. I always obey robots.txt; however, it is perfectly possible to ignore it, or to move your crawler to a different IP address which isn't banned. When someone is crawling in a way which you object to for any reason, robots.txt is a good way to indicate they should stop.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Murdoch news block-out continues</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-7135</link>
		<dc:creator>Murdoch news block-out continues</dc:creator>
		<pubDate>Tue, 12 Jan 2010 13:54:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779#comment-7135</guid>
		<description>[...] like we missed the change going through, but blogger Malcolm Coles didn&#8217;t. As of this morning, NewsNow is locked out of Murdoch&#8217;s UK newspaper portfolio. [...]</description>
		<content:encoded><![CDATA[<p>[...] like we missed the change going through, but blogger Malcolm Coles didn&#8217;t. As of this morning, NewsNow is locked out of Murdoch&#8217;s UK newspaper portfolio. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Adam Newby</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-7104</link>
		<dc:creator>Adam Newby</dc:creator>
		<pubDate>Mon, 11 Jan 2010 15:20:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779#comment-7104</guid>
		<description>Indeed, the right to crawl and the right to link are not the same, but are they so different? In order to publish a link, one must first obtain it. So even if one has a right to link freely, if one doesn&#039;t have the right to obtain those links, that right to link alone wouldn&#039;t be worth much. How does one obtain links? By starting somewhere on a site - probably the homepage - then navigating through the links. Looked at this way, crawling is just systematic linking.

We recognise that with systematic linking, there is a risk of abusing a website&#039;s resources. Using robots.txt - a voluntary standard - to prevent this is fair. However, if sites start to use robots.txt in an arbitrary or discriminatory fashion, that&#039;s not so obviously fair. It&#039;s one thing for NewsNow&#039;s user base to be unable to reach Times Online content, but if, for instance, as has been suggested, News Corp were to use robots.txt to block Google but allow Bing, that would be bad for users of the Internet in general. One of the strengths of any search engine is its neutrality: that the results have not been skewed to the commercial advantage of anyone to which it links.

Also, although Times Online has imposed a crawler restriction via robots.txt, it should be emphasised that The Newspaper Licensing Agency Ltd&#039;s proposed &#039;licence&#039; is about linking, not just crawling. This is shown by the existence of the &quot;end user&quot; licence, which is intended to grant permission to, and impose charges on, the customers of news monitoring organisations for the apparent privilege of receiving or circulating links within their own organisations. Is this a slippery slope towards requiring all organisations to pay a news tax for permission to &quot;make commercial use&quot; of newspaper content?

On a separate note, the Times Online block applies only to NewsNow. Why would this be? If the argument is about paid-for services featuring links to The Times&#039; content, then why has NI not targeted other organisations with paid-for services?</description>
		<content:encoded><![CDATA[<p>Indeed, the right to crawl and the right to link are not the same, but are they so different? In order to publish a link, one must first obtain it. So even if one has a right to link freely, if one doesn't have the right to obtain those links, that right to link alone wouldn't be worth much. How does one obtain links? By starting somewhere on a site - probably the homepage - then navigating through the links. Looked at this way, crawling is just systematic linking.</p>
<p>We recognise that with systematic linking, there is a risk of abusing a website's resources. Using robots.txt - a voluntary standard - to prevent this is fair. However, if sites start to use robots.txt in an arbitrary or discriminatory fashion, that's not so obviously fair. It's one thing for NewsNow's user base to be unable to reach Times Online content, but if, for instance, as has been suggested, News Corp were to use robots.txt to block Google but allow Bing, that would be bad for users of the Internet in general. One of the strengths of any search engine is its neutrality: that the results have not been skewed to the commercial advantage of anyone to which it links.</p>
<p>Also, although Times Online has imposed a crawler restriction via robots.txt, it should be emphasised that The Newspaper Licensing Agency Ltd's proposed 'licence' is about linking, not just crawling. This is shown by the existence of the "end user" licence, which is intended to grant permission to, and impose charges on, the customers of news monitoring organisations for the apparent privilege of receiving or circulating links within their own organisations. Is this a slippery slope towards requiring all organisations to pay a news tax for permission to "make commercial use" of newspaper content?</p>
<p>On a separate note, the Times Online block applies only to NewsNow. Why would this be? If the argument is about paid-for services featuring links to The Times' content, then why has NI not targeted other organisations with paid-for services?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Recession, Decade of SEO, Big Media and Linking @ Webmaster Chronic Blog</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-7076</link>
		<dc:creator>Recession, Decade of SEO, Big Media and Linking @ Webmaster Chronic Blog</dc:creator>
		<pubDate>Sun, 10 Jan 2010 08:21:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779#comment-7076</guid>
		<description>[...] Right to crawl vs right to link » malcolm coles clueless newspapers say don&#8217;t link to us&#8230; [...]</description>
		<content:encoded><![CDATA[<p>[...] Right to crawl vs right to link » malcolm coles clueless newspapers say don&#8217;t link to us&#8230; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Your Mum</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-7045</link>
		<dc:creator>Your Mum</dc:creator>
		<pubDate>Fri, 08 Jan 2010 17:00:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779#comment-7045</guid>
		<description>Very nice article, Malcolm. I am so proud of you... have you cleaned your room? Oh, and one other thing: if all of these media sites keep preventing me and potentially thousands of others from linking to them, who is monitoring all the incoming links?</description>
		<content:encoded><![CDATA[<p>Very nice article, Malcolm. I am so proud of you... have you cleaned your room? Oh, and one other thing: if all of these media sites keep preventing me and potentially thousands of others from linking to them, who is monitoring all the incoming links?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
