<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>malcolm coles &#187; NewsNow</title>
	<atom:link href="http://www.malcolmcoles.co.uk/blog/tag/newsnow/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.malcolmcoles.co.uk/blog</link>
	<description>Where to find Malcolm Coles, reviews, and tips on how to do things I couldn't do.</description>
	<lastBuildDate>Thu, 29 Jul 2010 20:08:36 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Sun blocks NewsNow from crawling its site</title>
		<link>http://www.malcolmcoles.co.uk/blog/sun-blocks-newsnow/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/sun-blocks-newsnow/#comments</comments>
		<pubDate>Tue, 12 Jan 2010 13:01:00 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[NewsNow]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[Sun]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3819</guid>
		<description><![CDATA[<a href="http://www.malcolmcoles.co.uk/blog/sun-blocks-newsnow/"><img align="right" hspace="5" width="150" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/sun-robots-newsnow.png" class="alignright wp-post-image tfe" alt="The Sun" title="sun-robots-newsnow" /></a>NewsNow revealed this morning that it's heard that News International will soon be blocking NewsNow from crawling The Sun's site - just as it has done with the Times.

This has now happened, as the Sun's robots.txt file shows.]]></description>
			<content:encoded><![CDATA[
<p>NewsNow <a href="http://paidcontent.co.uk/article/419-news-int.-will-also-block-sun-online-notw.co.uk-from-newsnow/">revealed this morning</a> that it's heard that News International will soon be blocking NewsNow from crawling The Sun's site - <a href="http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/">just as it has done with the Times</a>.</p>
<p><strong>This has now happened, as the Sun's robots.txt file shows:</strong></p>
<p><div id="attachment_3822" class="wp-caption alignright" style="width: 491px"><img class="size-full wp-image-3822" title="sun-robots-newsnow" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/sun-robots-newsnow.png" alt="The Sun's robots.txt file disallows NewsNow's crawler from indexing The Sun's content" width="481" height="64" /><p class="wp-caption-text">The Sun&#39;s robots.txt file disallows NewsNow&#39;s crawler from indexing The Sun&#39;s content</p></div></p>
<p>The same is true of the News of the World site.</p>
<p>You can see NewsNow's views on the situation in comments on my previous posts <a href="http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/#comment-7100">here</a> and <a href="http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comment-7104">here</a>.</p>
<p>If you are going to report on this, please don't confuse linking and crawling (see previous posts) ...</p>
]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/sun-blocks-newsnow/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>NewsNow vs the Times: I&#039;ve decided I&#039;m not sympathetic</title>
		<link>http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/#comments</comments>
		<pubDate>Sat, 09 Jan 2010 13:19:23 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Bad reviews]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[NewsNow]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[Times]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3793</guid>
		<description><![CDATA[I wrote yesterday that I was sympathetic towards NewsNow and its ongoing row with newspapers, especially the Times. I've now decided that I'm not.

NewsNow crawls the Times site in order to gather information for its paid service to businesses.
]]></description>
			<content:encoded><![CDATA[
<p>I wrote yesterday that I was sympathetic towards <a href="http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/">NewsNow and its ongoing row with newspapers, especially the Times</a>.</p>
<p>I've now decided that I'm not.</p>
<p>NewsNow crawls the Times site in order to gather information for its <a href="http://www.newsnow.co.uk/services/">paid service to businesses</a> as well as its free service to consumers:</p>
<blockquote><p>Our unique filing engine is designed to match each new breaking news article, against any of thousands of complex search specifications, in seconds. Our subscription services harness the power of our filing engine for your organisation's specific purpose.</p></blockquote>
<p>It charges from £95 a month for tailored feeds and from £75 a month for generic (pre-built) feeds.</p>
<p>So it basically charges organisations to show them news stories about a given subject or keyword.</p>
<p>Put like that, there's no real reason why the Times should let NewsNow crawl its site, unless <a href="http://blogs.journalism.co.uk/editors/tag/newsnow/">it pays up to the Newspaper Licensing Agency</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>NewsNow vs the Times: Right to crawl vs right to link</title>
		<link>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/#comments</comments>
		<pubDate>Fri, 08 Jan 2010 16:41:11 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[NewsNow]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[Times]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=3779</guid>
		<description><![CDATA[<a href="http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/"><img align="right" hspace="5" width="150" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/newsnow-times.png" class="alignright wp-post-image tfe" alt="Newsnow" title="newsnow-times" /></a>I am sympathetic towards NewsNow over the newspapers' attempts to block it linking to them. The site has been blocked from showing Times Online stories. But I think its campaign is a little overblown:]]></description>
			<content:encoded><![CDATA[
<p><strong>Update</strong>: <a href="http://www.malcolmcoles.co.uk/blog/sun-blocks-newsnow/">The Sun and News of the World are now also blocking NewsNow</a>.</p>
<p><strong>Original post: </strong>I am <span style="text-decoration: line-through;">sympathetic</span> <a href="http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/">not sympathetic</a> towards NewsNow over the <a href="http://econsultancy.com/blog/5117-news-aggregator-newsnow-to-stop-linking-to-big-media">newspapers' attempts</a> to prevent it linking to them. The site has now been <a href="http://www.pressgazette.co.uk/story.asp?sectioncode=1&amp;storycode=44868&amp;c=1">blocked from showing Times Online</a> stories (<a href="http://www.journalism.co.uk/2/articles/537097.php">more background here</a>). But its campaign is a bit odd, as both it and most of the media reporting the story are confusing <strong>linking</strong> and <strong>crawling / indexing</strong>:</p>
<p><div id="attachment_3780" class="wp-caption alignright" style="width: 500px"><img class="size-full wp-image-3780" title="newsnow-times" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/newsnow-times.png" alt="Newsnow's response to the Times" width="490" height="146" /><p class="wp-caption-text">Newsnow&#39;s response to the Times</p></div></p>
<h3>The right to link</h3>
<p><a href="http://www.right2link.org/">Right2Link</a> (founding sponsor NewsNow) asserts that:</p>
<blockquote><p>We, the supporters of the right to link, declare:</p>
<ol>
<li>All should be free to create, forward and follow links — they are the signposts to content on the web.</li>
<li>Linking should require no permission nor charge.</li>
<li>The free circulation of publicly accessible information is threatened if individuals, organisations and search engines cannot continue to create, forward and follow links without undue restraint.</li>
</ol>
</blockquote>
<h3>Crawling vs linking</h3>
<p>What the Times has done, however, is block NewsNow from <strong>crawling</strong> its site using its robots.txt file. This is nothing to do with <strong>linking</strong> to its site.</p>
<p>In NewsNow's case, linking depends on crawling, as it's an automatic news aggregator: it needs to crawl / index the Times site to put its own site together.</p>
<p>But this afternoon's piece of pedantry from me is to point out that, for the rest of us, crawling and linking are not the same. And nothing the Times has done today stops anyone from linking to it, which is what the Right2Link campaign demands.</p>
<p>So a big red box about undermining access to public information seems irrelevant.</p>
<p><strong>Update: </strong>What's more, <a href="http://www.malcolmcoles.co.uk/blog/newsnow-times-not-sympathetic/">NewsNow charges for some of its services</a>, so this has even less to do with public information ...</p>
<h3>Oh wait, we are banned from linking, my mistake</h3>
<p>Of course, if you look through the T&amp;Cs of the Times site, it says:</p>
<blockquote><p>Illegal and/or unauthorized use of the Services, including ... unauthorised framing of or linking to the Website is prohibited.</p></blockquote>
<p>It's not very clear what unauthorised linking is, however. But I'd better not link to the T&amp;Cs just in case! (This follows on from my posts last year in which I pointed out that <a href="http://www.malcolmcoles.co.uk/blog/newspapers-no-linking-to-us/">most major newspapers</a> and <a href="http://www.malcolmcoles.co.uk/blog/newspapers-no-linking-to-us/">many other organisations</a> forbade you from linking to them. SEO anyone?).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/times-newsnow-crawling-linking/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
