<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Malcolm Coles &#187; seo</title>
	<atom:link href="http://www.malcolmcoles.co.uk/blog/tag/seo/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.malcolmcoles.co.uk/blog</link>
	<description>Where to find Malcolm Coles, reviews, and tips on how to do things I couldn&#039;t do.</description>
	<lastBuildDate>Tue, 20 Dec 2011 10:16:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
		<item>
		<title>Google gives big sites a free pass on author profile pictures</title>
		<link>http://www.malcolmcoles.co.uk/blog/google-automatic-author-pictures/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/google-automatic-author-pictures/#comments</comments>
		<pubDate>Fri, 16 Sep 2011 08:37:08 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Latest]]></category>
		<category><![CDATA[SEO advice]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google author information]]></category>
		<category><![CDATA[Guardian]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6434</guid>
		<description><![CDATA[Guardian journalists get author profile pictures in search results without applying the markup. I doubt Google will do that for your site ...]]></description>
			<content:encoded><![CDATA[<p><div id="attachment_6443" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-6443" title="google-author-information-profile-picture" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/google-author-information-profile-picture1.png" alt="Google results with author profile pictures" width="550" height="230" /><p class="wp-caption-text">These aren&#39;t what you think</p></div></p>
<p>Notice anything odd about these two web results? They've got <a href="http://www.malcolmcoles.co.uk/blog/google-author-profile-pics-prominent/">Google's new author profile pictures</a> in, right?</p>
<h3>How you normally make those photos appear</h3>
<p>Well, no. Normal websites have to go through a slightly complicated process to get photos of the author of a page showing up in Google's web results (<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=1229920">the old process</a>, <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=1408986">the new process</a>). This basically involves cross linking your Google profile and either the page in question or to an "author page" that the page in question also links to.</p>
<p><div id="attachment_6439" class="wp-caption alignright" style="width: 238px"><img class="size-full wp-image-6439" title="normal-google-author-result" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/normal-google-author-result.png" alt="Normal Google author result" width="228" height="93" /><p class="wp-caption-text">There&#39;s a &quot;by Malcolm Coles&quot; link next to the photo</p></div></p>
<p>But when you do that, Google's results show not just a photo - but also a link to your Google profile.</p>
<p>And this link to the Google profile is missing from the screenshot above (but you can see it in the picture here).</p>
<h3>So what's going on?</h3>
<p>It turns out that those two web results from the Guardian aren't showing photos because of Google's normal process for making them appear.</p>
<p>There are no links from the author profiles on the Guardian to the journalist's Google profiles. And their Google profiles don't link to their author pages. (Here is Charles Arthur's <a href="http://www.guardian.co.uk/profile/charlesarthur">author page on the Guardian</a> and <a href="https://plus.google.com/101391023051989281550/about">his Google profile</a>. Here is Jemima Kiss's <a href="http://www.guardian.co.uk/profile/jemimakiss">author page on the Guardian</a> and <a href="https://plus.google.com/113391812624898456365/about">her Google profile</a>.)</p>
<p>However, both those author pages link to the journalist's twitter accounts - as do their Google profiles. And it appears that Google has decided that on the basis of this, it can presume that the photo on their Twitter profile picture is a picture of the article's author - and is showing the photo.</p>
<p>I think we can assume that this isn't going to work for most sites ... Thanks to <a href="https://twitter.com/#!/yesiamben">@yesiamben</a>, <a href="https://twitter.com/#!/ysekand">@ysekand</a>, <a href="https://twitter.com/#!/danbarker">@danbarker</a> and <a href="https://twitter.com/#!/yoast">@yoast</a> for helping work all this out yesterday.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6434&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/google-automatic-author-pictures/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Brighton SEO: Winning at SEO with duplicate content</title>
		<link>http://www.malcolmcoles.co.uk/blog/brighton-seo-duplicate-content/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/brighton-seo-duplicate-content/#comments</comments>
		<pubDate>Fri, 09 Sep 2011 07:00:14 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[SEO advice]]></category>
		<category><![CDATA[Wordpress]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6395</guid>
		<description><![CDATA[It turns out that you can dominate Google's search results in the short term via duplicate content - and here's the explanation, based on a talk I'm giving at Brighton SEO. 
]]></description>
			<content:encoded><![CDATA[<p>Usually duplicate content is bad. Google filters it out and links and social signals are split over several different URLs even if the content is the same.</p>
<p>But it turns out that you can dominate Google's search results in the short term via duplicate content - and here's the explanation, based on a talk I'm giving at Brighton SEO.</p>
<p><strong>Look at all the traffic I got</strong></p>
<p>Here's a graph that showed how I got loads of traffic to this blog by exploiting duplicate content. Don't look any further if you're easily offended or love the royal family.</p>
<p><div id="attachment_6396" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6396" title="duplicate-content-traffic" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/duplicate-content-traffic-550x203.png" alt="Traffic graph" width="550" height="203" /><p class="wp-caption-text">WOW. Where did all that traffic come from? Where did it go?</p></div></p>
<p>I took this blog from a few hundred page views a day to 30,000 at its peak by exploiting the fact that Google was looking for new content when it comes to suddenly popular or brand new search terms. Then I stopped and I went back to normal levels.</p>
<p><span id="more-6395"></span>I'm sorry to report those search terms involved <a href="http://www.malcolmcoles.co.uk/blog/tag/pippa-middleton-arse/">Pippa Middleton's arse</a> and underwear and <a href="http://www.malcolmcoles.co.uk/blog/tag/twitter-super-injunction/">super injunctions and twitter</a>.</p>
<p>So I waited for something to happen that I knew would trigger lots of searches. After the Royal Wedding it was Pippa Middleton's arse. Then a few days later it was her underwear. Just as that died down, the whole Ryan Giggs / Twitter / superinjunction thing kicked off.</p>
<p>Each time something happened I would throw a short blog post together and publish as quickly as possible - my ambition each time was to beat the mainstream news sites to publishing something.</p>
<p>Then I pulled a trick - for a few stories, I republished the story shortly afterwards on a different URL and 301ed the first URL to the second one. Then I did it again.</p>
<p>(This trick only works on trending news topics - so search terms that are suddenly popular and which make Google think that it should throw away its usual search results and replace them with pages that have only just been published (to correspond with the new interest in the search term).)</p>
<h3>Look, two results for the same content ...</h3>
<p>Not sure what I'm on about? Well, look at this picture.</p>
<p><div id="attachment_6397" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-6397" title="brighton-seo-1" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/brighton-seo-1.png" alt="Google screenshot" width="550" height="486" /><p class="wp-caption-text">Step one: two results for the same content.</p></div></p>
<p>Hopefully at the bottom you can see that I have two URLs for my <a href="http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-autosuggests2/">Pippa Middleton underwear pictures</a> (NOTE: there are no pictures of this kind. She's in swimwear - blame the tabloids).</p>
<h3>What I did</h3>
<p>I took the first URL which was http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-<strong>autosuggest</strong> and I changed it to http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-<strong>autosuggests</strong> - the same URL but with an S on the end. I also tweaked the HTML title a bit (took the word its out).</p>
<h3>How WordPress helped</h3>
<p>Now a quirk of WordPress means that it tries to match partial URLs and will redirect accordingly. What this means is that if you tried to access the old -autosuggest URL you would get automatically redirected to the new -autosuggests one.</p>
<p>Try it with this URL: <a href="http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-auto">http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-auto</a> and see where you end up. That's not a manual redirect, it's WordPress doing it automatically.</p>
<h3>How this fooled Google</h3>
<p>What Google sees is as follows:</p>
<ul>
<li>It sees some content on a subject everyone is searching for but no on has published on lately.</li>
<li>So it shoves my result on page one.</li>
<li>Then it sees I've published another URL.</li>
<li>It's still desperate for new content so it gives me 2 results.</li>
</ul>
<p>Of course, the 2nd one is 99% the same as the first one but (1) it hasn't noticed that yet and (2) it hasn't revisited the first URL to see that it redirects to a new one.</p>
<h3>Now I've got three!</h3>
<p>But if I can pull that trick once ... Here's another screenshot.</p>
<p><div id="attachment_6398" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-6398" title="brighton-seo-2" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/brighton-seo-2.png" alt="Screenshot" width="550" height="487" /><p class="wp-caption-text">Hat trick</p></div></p>
<p>Yes, I've now got three results on page one. I'd changed the URL again. This time from <strong>-autosuggests</strong> to <strong>-autosuggests2</strong>. I didn't bother changing the title.</p>
<p>Again, Google is still desperate for new content. Everyone is searching for this term because American sites published the pictures. But very few UK sites have. So it gives me three results.</p>
<p>As a searcher, however, if you click on the old -autosuggest or -autosuggests URLs, which no longer exist, then WordPress will just match those to what it thinks you're looking for which is -autosuggests2. So you always end up at the same place.</p>
<h3>Boo. Back to two.</h3>
<p>Eventually, Google wised up to the fact that the first URL was 301ing (or was maybe the same, who knows). And it took me back to two results.</p>
<p><div id="attachment_6399" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-6399" title="brighton-seo-3" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/brighton-seo-3.png" alt="Results" width="550" height="448" /><p class="wp-caption-text">Back to two</p></div></p>
<p>As you can see, the first -autosuggest has now vanished.</p>
<p>Eventually, all the duplicates were filtered out and my current URL is now top of Google for a search on her undies. Oh good.</p>
<p>But for a crucial few hours, when loads of people were searching, I had the same content several times on page one of Google's results.</p>
<h3>Conclusion</h3>
<p>One side effect of this is that people think I'm a pervert. That aside, for spiking news terms you can insert the same piece of content into Google's results. It will eventually notice that the earlier URLs redirect or are duplicates of the later ones. But by that time everyone will have stopped searching.</p>
<p>To be honest, you probably shouldn't do this. It breaks Google.</p>
<p>You also need to be on the ball. You need to beat big news sites (if you're dedicated, this is possible). And it only works on spiking news terms (these <a href="http://www.malcolmcoles.co.uk/blog/news-search-tools/">news data tools</a> might be handy).</p>
<p>And don't cock it up. Duplicate content is bad. <a href="http://www.malcolmcoles.co.uk/blog/rel-canonical-infinite-express/">Ask the Express</a>. <a href="http://www.malcolmcoles.co.uk/blog/indy-jelly-bean/">Or the Indy</a>.</p>
<p>This is the last of my posts or conference talks on Pippa Middleton (here's my earlier talk on <a href="http://www.youtube.com/watch?v=PpW0S2vawcM">Karen Gillan's underwear</a>). I hope you enjoyed them. Don't blame me for the fact that nation is a bunch of perverts.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6395&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/brighton-seo-duplicate-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google makes author profile pics MUCH more prominent in its results</title>
		<link>http://www.malcolmcoles.co.uk/blog/google-author-profile-pics-prominent/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/google-author-profile-pics-prominent/#comments</comments>
		<pubDate>Tue, 30 Aug 2011 15:07:58 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Latest]]></category>
		<category><![CDATA[SEO advice]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google author information]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6372</guid>
		<description><![CDATA[Google+ links are also much more visible]]></description>
			<content:encoded><![CDATA[<p>Google appears to be testing making author profiles more prominent in its search results (or else it's changed it and no one told me).</p>
<p>In the past, they used to be shoved over on the right hand side (EG see the screenshot <a href="http://blog.louisgray.com/2011/06/search-results-on-google-now-highlight.html">here</a>). Now they are integrated into the search results - with a bit prominent link to my google+ profile. See this screenshot of a <a href="http://www.google.co.uk/search?q=site:malcolmcoles.co.uk+pippa">site: search for this site</a> (I see similar on other sites, too).</p>
<p><div id="attachment_6373" class="wp-caption alignleft" style="width: 559px"><img class="size-full wp-image-6373" title="Screen shot 2011-08-30 at 16.19.57" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Screen-shot-2011-08-30-at-16.19.57.png" alt="google results with author pictures" width="549" height="500" /><p class="wp-caption-text">Author pictures integrated</p></div></p>
<p>Anyone else see this? You can add author pictures by <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=1229920">following these instructions</a> (which were recently simplified).</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6372&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/google-author-profile-pics-prominent/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s improved search pages: no results</title>
		<link>http://www.malcolmcoles.co.uk/blog/no-results-location-searches/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/no-results-location-searches/#comments</comments>
		<pubDate>Mon, 06 Jun 2011 16:22:42 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6264</guid>
		<description><![CDATA[Two examples of searches for a location with no results. Maybe a very high proportion of people who search this generically go on to refine their search?]]></description>
			<content:encoded><![CDATA[<p>I'm guessing this is a test of some sort - but I've not come across two examples of search pages with no results in. They were both searches for locations - maybe Google sees a very high proportion of people who search this generically go on to refine their search. So instead of bothering with any results, it's just showing related searches?</p>
<p>Here's one example.</p>
<p><div id="attachment_6265" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6265" title="Screen shot 2011-06-06 at 17.34.36" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Screen-shot-2011-06-06-at-17.34.36-550x198.png" alt="No results - Kentish town" width="550" height="198" /><p class="wp-caption-text">No results, just refinements when I search for Kentish Town</p></div></p>
<p>Here's another.</p>
<p><div id="attachment_6266" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6266" title="Screen shot 2011-06-06 at 17.29.12" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Screen-shot-2011-06-06-at-17.29.12-550x176.png" alt="Hyde Park - no results" width="550" height="176" /><p class="wp-caption-text">No results, just refinements when I search for Hyde Park</p></div></p>
<p>Or maybe it's a bug!</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6264&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/no-results-location-searches/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Pippa Middleton&#8217;s arse: how newspapers optimise for the phrase without showing it to their readers</title>
		<link>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-cheeky/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-cheeky/#comments</comments>
		<pubDate>Thu, 12 May 2011 11:30:21 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[How to]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Mail]]></category>
		<category><![CDATA[Pippa Middleton arse]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6104</guid>
		<description><![CDATA[Interesting use of different headlines on the Daily Mail today to solve the problem of how to optimise for the high-volume search term "Pippa Middleton's arse".]]></description>
			<content:encoded><![CDATA[<p>Interesting use of different headlines on the Daily Mail today to solve the problem of how to optimise for the high-volume search term "<a href="http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-dress/">Pippa Middleton's arse</a>".</p>
<p>They have one version of their headline in their HTML title (what Google shows in its web results) and news sitemap - this includes "Pippa Middleton arse" at the front: "Pippa Middleton arse: Motorist paid price for bum comments to girlfriend". This is also the URL they use: <a href="http://www.dailymail.co.uk/news/article-1386280/Pippa-Middleton-arse-Motorist-paid-price-bum-comments-girlfriend.html?ito=feeds-newsxml">http://www.dailymail.co.uk/news/article-1386280/Pippa-Middleton-arse-Motorist-paid-price-bum-comments-girlfriend.html</a></p>
<p>But the version of the headline on their page just says "'Is Pippa's bum still better than mine???' How motorist with Middleton envy paid the price for comments to girlfriend".</p>
<p>Here's how their story looks in Google.</p>
<p><div id="attachment_6105" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6105" title="arse-mail-google" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/arse-mail-google-550x174.png" alt="The Pippa Middleton arse story in Google" width="550" height="174" /><p class="wp-caption-text">Saucy</p></div></p>
<p>And here's how it looks when you get to the Mail.</p>
<p><div id="attachment_6106" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6106" title="no-arse-mail" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/no-arse-mail-550x274.png" alt="No arse" width="550" height="274" /><p class="wp-caption-text">No arse in the on-page version</p></div></p>
<p>Cheeky, one might say. They haven't gone as far as optimising for <a href="http://www.malcolmcoles.co.uk/blog/pippa-middleton-underwear-pictures-google-updates-autosuggests2/">Pippa's underwear</a> yet (and they need to do a bit more as the caption on the last picture says "write caption here").</p>
<p>Update: They've removed the word arse! New screenshot:</p>
<p><div id="attachment_6119" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-6119" title="Screen shot 2011-05-12 at 16.20.32" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Screen-shot-2011-05-12-at-16.20.32.png" alt="Bottom not arse" width="550" height="184" /><p class="wp-caption-text">Booooo</p></div></p>
<p>Pah.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6104&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-cheeky/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Express newspaper creates an infinite number of URLs using rel = canonical</title>
		<link>http://www.malcolmcoles.co.uk/blog/rel-canonical-infinite-express/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/rel-canonical-infinite-express/#comments</comments>
		<pubDate>Fri, 06 May 2011 06:00:16 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Bad reviews]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[How to]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[SEO advice]]></category>
		<category><![CDATA[Express]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[ur]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=6016</guid>
		<description><![CDATA[The Express newspaper has cocked up its implementation of the rel=canonical command SO BADLY that it has created an infinite number of duplicate webpages ... many of which now have links from elsewhere on the internet.]]></description>
			<content:encoded><![CDATA[<p>The Express newspaper has cocked up its implementation of the rel=canonical command SO BADLY that it has created <strong>an infinite number </strong>of duplicate webpages ... many of which now have links from elsewhere on the internet.</p>
<p><div id="attachment_6018" class="wp-caption alignleft" style="width: 560px"><a href="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/express-urls.png"><img class="size-full wp-image-6018" title="express-urls" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/express-urls.png" alt="Buzz Lightyear" width="550" height="416" /></a><p class="wp-caption-text">To infinite URLs - and beyond</p></div></p>
<h3>Using rel = canonical properly</h3>
<p>You use the <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=139394">rel=canonical</a> command to tell Google that a given URL is actually a version of another URL - and that the search engine should treat the second version as if it was that main URL.</p>
<p>It's useful if you have multiple copies of a page in different directories, have lots of versions of the same page due to <a href="http://www.blindfiveyearold.com/wordpress-duplicate-content">EG WordPress making 2 versions of every page</a>, or <a href="http://www.malcolmcoles.co.uk/blog/indy-jelly-bean/">allow anyone to rewrite your URLs</a> so it looks like your insulting <a href="http://www.malcolmcoles.co.uk/blog/tag/pippa-middleton-arse/">Pippa Middleton's</a> sister.</p>
<p>Make a mistake with rel=canonical, however, and it can <a href="http://www.seomoz.org/blog/catastrophic-canonicalization">wipe your website off the face of the internet</a>.</p>
<h3>Using rel = canonical to make infinite URLs</h3>
<p>The Express site's CMS is creating a duplicate version of every single page via the rel=canonical tag. And then a 3rd version, and then a 4th ... and it's never stopping until it gets to infinity.</p>
<p>Take a sample page like this one: <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system</a></p>
<p>If you look at the HTML code, you can find:</p>
<p>&lt;link rel="canonical" href="<a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NO" target="_blank">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NO</a>"&gt;</p>
<p>The CMS has miscoded the canonical URL to include the first bit of the URL relating to the individual page (the <em>AV-referendum-Why-we-must-vote-NO</em> bit) twice.</p>
<p>If you visit that supposedly canonical URL, you see this, with the page-specific bit in there three times.</p>
<p>&lt;link rel="canonical" href="<a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO" target="_blank">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO</a>"&gt;</p>
<p>Go to that URL, and you find it there 4 times. Etc.</p>
<p>I got bored at <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO</a></p>
<p>but this will never stop. Each time you visit the canonical URL, a new canonical URL is created.</p>
<p>All these URLs are working pages because the Express only looks at the number in the URL to decide what content to show. So <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system</a> is the same as <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system">http://www.express.co.uk/features/view/244786/vote-YES</a> is the same as <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system">http://www.express.co.uk/features/view/244786/who-exactly-specced-this-CMS</a>.</p>
<h3>Dozens of URls for each Express story</h3>
<p>Sometimes these duplicate canonical URLs aren't in Google's index (I guess as each one is cancelled out by the next one). Although you can find them. <a href="http://www.google.co.uk/search?sourceid=chrome&amp;ie=UTF-8&amp;q=BRITAIN'S+benefits+culture+was+laid+bare+yesterday+as+%C2%ADfigures+showed+nearly+a+million+people+have+been+on+sickness+handouts+for+a+decade#hl=en&amp;q=+site:express.co.uk+BRITAIN'S+benefits+culture+was+laid+bare+yesterday+as+%C2%ADfigures+showed+nearly+a+million+people+have+been+on+sickness+handouts+for+a+decade&amp;sa=X&amp;ei=HRjDTcfuD46AhQeJutm3BQ&amp;ved=0CAIQqAQwAg&amp;fp=7b81afcb0f4f5e32">This search, for instance</a>, has this URL showing up: <a href="http://www.express.co.uk/posts/view/242092/DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-">http://www.express.co.uk/posts/view/242092/DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-DEBATE-Is-Britain-a-soft-touch-for-benefit-spongers-</a></p>
<p>Even worse, the first URL that appears for that search is <a href="http://www.express.co.uk/printer/view/242007/">the printable URL of the page with no adverts on</a>!</p>
<p><div id="attachment_6025" class="wp-caption alignleft" style="width: 560px"><img class="size-large wp-image-6025" title="express-results" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/express-results-550x449.png" alt="Google's results" width="550" height="449" /><p class="wp-caption-text">One paragraph, 55 results ...</p></div></p>
<p>And as that search, with 55 results, reveals, the Express has a massive problem with duplicate content.</p>
<h3>The Express then makes the problem even worse ...</h3>
<p>This is a problem it makes worse via its use of Tynt to add URLs when you copy and paste content. So if you copy and paste the first sentence from this URL: <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-system</a>, what you end up with is this:</p>
<p>"BY the time you read this you will have probably already voted No to AV in today’s referendum.</p>
<p>Read more: http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NO#ixzz1LW2s00ge".</p>
<p>The Express uses <a href="http://onlinejournalismblog.com/2009/08/20/an-interview-with-tyntcom-in-movie-form/">Tynt</a> to add the read more bit and the URL to what you've copied.</p>
<p>But, yes, the code they are adding contains the wrong URL with two versions of the page slug. Follow that link and copy a sentence and you end up with this:</p>
<p>"BY the time you read this you will have probably already voted No to AV in today’s referendum.</p>
<p>Read more: <a href="http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO#ixzz1LW31La3e">http://www.express.co.uk/features/view/244786/AV-referendum-Why-we-must-vote-NO-to-the-new-voting-systemAV-referendum-Why-we-must-vote-NOAV-referendum-Why-we-must-vote-NO#ixzz1LW31La3e</a>"</p>
<p>Yup, another new URL created by the system that's designed to channel links to the main story.</p>
<p>You can see this in action on <a href="http://www.dailymail.co.uk/debate/newsdebate/f/t-10232057/p-1/index.html?threadIndex=4">this page on the Daily Mail</a> where someone has copied the opening para from some other bat shit story, and the Tynt URL is to <a href="http://www.express.co.uk/posts/view/244206/EU-wants-to-merge-uk-with-franceEU-wants-to-merge-uk-with-franceEU-wants-to-merge-uk-with-france#ixzz1LCIcD5jI" target="_blank">http://www.express.co.uk/posts/view/244206/EU-wants-to-merge-uk-with-franceEU-wants-to-merge-uk-with-franceEU-wants-to-merge-uk-with-france#ixzz1LCIcD5jI</a>.</p>
<p>This might explain <a href="http://www.google.co.uk/search?sourceid=chrome&amp;ie=UTF-8&amp;q=FURY+erupted+last+night+after+a+European+Union+plot+to+%E2%80%9Ccarve+up+Britain%E2%80%9D+by+%C2%ADsetting+up+a+cross-Channel+region+was+exposed.Read+more%3A+http%3A%2F%2Fwww.express.co.uk%2Fposts%2Fview%2F244282%2FDEBATE-Should-parts-of-Britain-be-merged-with-France-DEBATE-Should-parts-of-Britain-be-merged-with-France-%23ixzz1LW3jTKFw#sclient=psy&amp;hl=en&amp;source=hp&amp;q=FURY+erupted+last+night+after+a+European+Union+plot+to+%E2%80%9Ccarve+up+Britain%E2%80%9D+by+%C2%ADsetting+up+a+cross-Channel+region+was+exposed&amp;aq=f&amp;aqi=&amp;aql=&amp;oq=&amp;pbx=1&amp;fp=7b81afcb0f4f5e32">why the Express can't rank in first place for a paragraph from its own story</a>.</p>
<h3>To sum up</h3>
<p>The Express isn't appearing top of Google's results for searches using their own content and Google is serving up versions of its pages with no adverts on - all because Google can't work out which page is the correct one because the Express constantly points to yet another URL for every single page - even the made up ones.</p>
<p>My head hurts.</p>
<p><a href="http://www.flickr.com/photos/drtjff/5321150585/">Image credit</a>.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=6016&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/rel-canonical-infinite-express/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Pippa Middleton&#8217;s arse closing the gap on her dress</title>
		<link>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-dress/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-dress/#comments</comments>
		<pubDate>Mon, 02 May 2011 11:31:51 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[Pippa Middleton arse]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=5999</guid>
		<description><![CDATA[Searches for "Pippa Middleton arse" running at 72% of the level of those for "Pippa Middleton dress"]]></description>
			<content:encoded><![CDATA[<p><div id="attachment_6006" class="wp-caption alignleft" style="width: 229px"><a href="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/arse-pippa-middleton.png"><img class="size-full wp-image-6006" title="arse-pippa-middleton" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/arse-pippa-middleton.png" alt="" width="219" height="216" /></a><p class="wp-caption-text">Pippa Middleton&#39;s arse</p></div></p>
<p>Google Insights gives you a unique insight into how people search the internet. An often depressing insight ...</p>
<p>I can reveal that by Saturday, the day after the Royal Wedding took place and the most recent day for which Google is currently showing data as I type this, searches for "Pippa Middleton arse" were running at 72% of the volume of those for "Pippa Middleton dress".</p>
<p>I'll keep the graph below, updated ... (there's another 3% relating to Pippa Middleton ass, not shown in the graph).</p>
<p><div id="attachment_6000" class="wp-caption alignleft" style="width: 560px"><a href="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/pippa-middleton-arse-dress.png"><img class="size-large wp-image-6000" title="pippa-middleton-arse-dress" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/pippa-middleton-arse-dress-550x290.png" alt="Philippa Middleton: arse v dress. A graph of search volumes over last 7 days" width="550" height="290" /></a><p class="wp-caption-text">Interest in the dress has peaked but is still rising in the arse.</p></div></p>
<p>The good old British people.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=5999&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-dress/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Pippa Middleton&#8217;s arse: classy, Google, classy &#8230;</title>
		<link>http://www.malcolmcoles.co.uk/blog/pippa-middletons/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/pippa-middletons/#comments</comments>
		<pubDate>Sun, 01 May 2011 21:20:48 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[autosuggest]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[Pippa Middleton arse]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=5979</guid>
		<description><![CDATA[In a new low for google autosuggest, it's suggesting "arse" as the top suggested search term when you type in Pippa Middleton's name.]]></description>
			<content:encoded><![CDATA[<p>In a new low for google autosuggest (remember <a href="http://www.malcolmcoles.co.uk/blog/cheryl-coles-fart/">Cheryl Cole's fart</a>?), it's suggesting "arse" as the top suggested search term when you type in <a href="http://pippa-middleton.co.uk/">Pippa Middleton's</a> name. Off with their heads ...</p>
<p><div id="attachment_5992" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-5992" title="pippa-middleton-arse" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/pippa-middleton-arse.png" alt="Searching for Pippa Middleton on Google" width="550" height="226" /><p class="wp-caption-text">Pippa Middleton&#39;s arse is suggested</p></div></p>
<p><a href="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/arse-pippa-middleton.png"><img class="alignleft size-full wp-image-6006" title="arse-pippa-middleton" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/arse-pippa-middleton.png" alt="" width="219" height="216" /></a>Interestingly, it's treating ass and arse as synonyms and showing results from pages using either.</p>
<p>(Update: A I now appear to be the top result for a search on "pippa middleton's arse". But not ass. Which is nice.)</p>
<p>I've written before about <a href="http://www.malcolmcoles.co.uk/blog/google-instant-filter/">the problems Google Instant has with deciding what to filter and not filter when it comes to rude words</a>. Maybe arse isn't as rude in the USA ... (Update: interest in <a href="http://www.malcolmcoles.co.uk/blog/pippa-middletons-arse-dress/">Pippe Middleton's arse vs dress</a>)</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=5979&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/pippa-middletons/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google’s indexed 64 fake Independent jelly-bean Kate-Middleton URLs</title>
		<link>http://www.malcolmcoles.co.uk/blog/indy-jelly-bean/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/indy-jelly-bean/#comments</comments>
		<pubDate>Tue, 19 Apr 2011 10:06:14 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[SEO advice]]></category>
		<category><![CDATA[Independent]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=5934</guid>
		<description><![CDATA[You can change Indy URLs to anything if you leave the unique ID number in. Google's started to index the fake ones people are circulating on Twitter ...]]></description>
			<content:encoded><![CDATA[<p>Someone spotted today that you can change the Independent's URLs to anything as long as you leave in the unique identifier number. (See updates at the bottom for how multiple URLs for this story are now in Google's index via the power of Twitter.)</p>
<p>So <a href="http://www.independent.co.uk/life-style/food-and-drink/kate-middleton-jelly-bean-expected-to-fetch-500-2269573.html">this story</a> (about a jelly bean with Kate Middleton's face in) can also be reached <a href="http://www.independent.co.uk/life-style/food-and-drink/utter-PR-fiction-but-people-love-this-shit-so-fuck-it-lets-just-print-it-2269573.html">here</a> with a URL of http://www.independent.co.uk/life-style/food-and-drink/utter-PR-fiction-but-people-love-this-shit-so-fuck-it-lets-just-print-it-2269573.html (<strong>Update</strong> - this rude version has had 1,476 retweets compared with 33 for the original version!)</p>
<p>People are busy changing this to anything they like on Twitter. Google has already indexed the two versions:<span id="more-5934"></span></p>
<p><div id="attachment_5935" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-5935" title="Picture 206" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Picture-206.png" alt="Two URLs indexed via Twitter" width="550" height="250" /><p class="wp-caption-text">Sweary URLs ...</p></div></p>
<p>How many more?!?</p>
<p><strong> Update</strong>: While writing this post, a third one had been indexed <a href="http://www.independent.co.uk/life-style/food-and-drink/you-can-say-anything-you-like-in-this-bit-including-read-the-guardian-instead-of-the-indie-2269573.html">at</a> http://www.independent.co.uk/life-style/food-and-drink/you-can-say-anything-you-like-in-this-bit-including-read-the-guardian-instead-of-the-indie-2269573.html.</p>
<p>More to come I'm sure ... You can test for yourself how many Google has indexed by <a href="http://www.google.co.uk/search?q=site:independent.co.uk+inurl%3A2269573#q=site:independent.co.uk+inurl:2269573&amp;hl=en&amp;prmd=ivns&amp;filter=0&amp;fp=d23d833ccb0bda61">running this search</a> (Google is at least recognising them as duplicate URLs).</p>
<p><strong>Update 2</strong>: Google has the sweary version on page one of its results when you <a href="http://www.google.co.uk/search?q=kate+middleton+jelly+bean">search for kate middleton jelly bean</a>. (And the sweary version is first for a search on Kate Middleton bean)</p>
<p><strong>Update 3</strong>: There are now 12 URLs indexed for this story, including the following:</p>
<ul>
<li>http://www.independent.co.uk/life-style/food-and-drink/benjamin-gray-is-the-true-heir-to-the-throne-2269573.html</li>
<li>http://www.independent.co.uk/life-style/food-and-drink/OMFG-i-can-write-ANYTHING-I-WANT-in-here-and-create-FAKE-TWITTER-MEMES-2269573.html</li>
<li>http://www.independent.co.uk/life-style/food-and-drink/Load-of-old-wank-2269573.html</li>
<li>http://www.independent.co.uk/life-style/food-and-drink/unfortunately-you-can-change-the-url-on-any-indy-story-to-whatever-you-like-2269573.html</li>
</ul>
<p><strong>Update 4</strong>: 17 indexed now. And counting ...</p>
<p><strong>Update 5</strong>: .The Head of Digital Audience &amp; Content Development at this Indy has tweeted to <a href="https://twitter.com/#!/_JackRiley/status/60308668364767232">blame the Guardian</a> for the whole fiasco.</p>
<p><div id="attachment_5951" class="wp-caption alignleft" style="width: 546px"><img class="size-full wp-image-5951" title="Picture 207" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/Picture-207.png" alt="Tweet" width="536" height="91" /><p class="wp-caption-text">Jack Riley blames a Guardian sub</p></div></p>
<p>Maybe try the IT department too?</p>
<p><strong>Update 6</strong>: By the end of the day, Google had indexed 62 different URLs for this story, nearly all via being made up and spread on twitter.</p>
<p>New ones included:</p>
<ul>
<li>AllAboutTheGames-is-the-best-gaming-site-on-planet-earth-2269573.html</li>
<li>middleton-flicks-bean-as-she-waits-for-wedding-night-2269573.html</li>
<li>what-first-attracted-you-to-royal-heir-prince-william-2269573.html</li>
</ul>
<p>And a load of others I'm too tired to check.</p>
<p><strong>Update 7</strong>: the next morning. Google has indexed 64 URLs (not all via Twitter. Reddit was having fun too ...).</p>
<p>It's no longer treating them all as duplicates.</p>
<p>Take a sentence from the story (which was not originally produced by the Independent) and add the word Independent and <a href="http://www.google.co.uk/search?client=safari&amp;rls=en&amp;q=The+bean+has+yet+to+appear+on+eBay+but+royal+watchers+are+sure+to+be+keeping+an+eye+out.+Hosie,+a+trainee+accountant,+said+he+plans+to+sell+it+for+£500+(€567).&amp;ie=UTF-8&amp;oe=UTF-8&amp;redir_esc=&amp;ei=FpCuTZTcEY2zhAeZooDdAw#sclient=psy&amp;hl=en&amp;client=safari&amp;rls=en&amp;source=hp&amp;q=Independent+The+bean+has+yet+to+appear+on+eBay+but+royal+watchers+are+sure+to+be+keeping+an+eye+out.+Hosie%2C+a+trainee+accountant%2C+said+he+plans+to+sell+it+for+£500+(€567).&amp;aq=f&amp;aqi=&amp;aql=&amp;oq=&amp;pbx=1&amp;fp=ecf601ac4a37849c">you are shown  5 versions of the story</a>. Even without the word Independent, <a href="http://www.google.co.uk/search?client=safari&amp;rls=en&amp;q=The+bean+has+yet+to+appear+on+eBay+but+royal+watchers+are+sure+to+be+keeping+an+eye+out.+Hosie,+a+trainee+accountant,+said+he+plans+to+sell+it+for+£500+(€567).&amp;ie=UTF-8&amp;oe=UTF-8&amp;redir_esc=&amp;ei=FpCuTZTcEY2zhAeZooDdAw#sclient=psy&amp;hl=en&amp;client=safari&amp;rls=en&amp;source=hp&amp;q=The+bean+has+yet+to+appear+on+eBay+but+royal+watchers+are+sure+to+be+keeping+an+eye+out.+Hosie%2C+a+trainee+accountant%2C+said+he+plans+to+sell+it+for+£500+(€567).&amp;aq=f&amp;aqi=&amp;aql=&amp;oq=&amp;pbx=1&amp;fp=ecf601ac4a37849c">you see three versions on page one of the results</a> and the option to see lots more.</p>
<p>The Indy's written a blog post <a href="http://blogs.independent.co.uk/2011/04/19/fakes-abuse-and-a-social-media-storm/">apologising</a> (for some reason).</p>
<blockquote><p>We are however acting urgently (as are now some of our major competitors) to find a way to avoid such abuse and hope that will be in place shortly.</p></blockquote>
<p>The solution's fairly simple. Do what other sites do with the same CMS and redirect to the canonical URL when someone types in gibberish in the part of the URL (the keywords) that isn't actually parsed.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=5934&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/indy-jelly-bean/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Online news isn&#8217;t turning journalists into sheep</title>
		<link>http://www.malcolmcoles.co.uk/blog/chasing-ratings-news/</link>
		<comments>http://www.malcolmcoles.co.uk/blog/chasing-ratings-news/#comments</comments>
		<pubDate>Tue, 19 Apr 2011 06:30:36 +0000</pubDate>
		<dc:creator>Malcolm Coles</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Newspapers]]></category>
		<category><![CDATA[Guardian]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.malcolmcoles.co.uk/blog/?p=5922</guid>
		<description><![CDATA[John Naughton wrote an Observer article at the weekend complaining that news websites such as Gawker "allow ratings to dictate content". What's wrong with that, I say?]]></description>
			<content:encoded><![CDATA[<p>John Naughton wrote an Observer article at the weekend complaining that news websites such as Gawker "allow ratings to dictate content". And although he didn't say so in so many words, his targets are clearly <a href="http://www.malcolmcoles.co.uk/blog/tag/seo/">SEO</a> and site-analytics and <a href="http://www.malcolmcoles.co.uk/blog/news-search-tools/">real-time trend data</a>.</p>
<p>I say he wrote it. It's mostly a restirring of an original source, <a href="http://www.theatlantic.com/magazine/archive/2011/04/learning-to-love-the-shallow-divisive-unreliable-new-media/8415/">an article in the Atlantic</a>. But his main complaints are that journalists are turning into sheep, with the herd all charging in the same direction. I can't see anything different about how online news works compared to how newspapers work. Here's why.</p>
<p><div id="attachment_5924" class="wp-caption alignleft" style="width: 560px"><img class="size-full wp-image-5924" title="sheep" src="http://www.malcolmcoles.co.uk/blog/wp-content/uploads/2009/04/sheep.png" alt="Sheep" width="550" height="230" /><p class="wp-caption-text">Ewe don&#39;t have to follow the herd</p></div></p>
<p>His <a href="http://www.guardian.co.uk/technology/2011/apr/17/news-websites-driven-by-ratings">issues are</a> as follows.</p>
<h3>News organisations are writing about what their readers are interested in</h3>
<blockquote><p>"Market-driven news [means] news agendas that are driven not by some professional assessment of what's important and relevant, but by research into what viewers like and respond to."</p></blockquote>
<p>There are five sorts of stories that news organisations can report on:</p>
<ol>
<li>News they uncover themselves (scandals like MPs' expenses)</li>
<li>News that they and everyone else knows is going to happen (a sports event or a court hearing)</li>
<li>News that's sent to them (press releases)</li>
<li>News that someone else has uncovered (copying someone else's scoop)</li>
<li>News that wouldn't be news but for the fact that it's related to things people are interested in and actively searching for.</li>
</ol>
<p>With any of these, I fail to see what's wrong with taking account of what viewers like and respond to. If you can't get interested anyone in your story, what's the point of writing it?</p>
<p>There are sites that only do (4) and (5). We tend to call them content farms and dislike them. A couple have made a success out of it. But as a long-term strategy, I wouldn't bet your business on it - you're at the mercy of Google's next algorithm update.</p>
<p>What's more, newspapers have always done this. They have an editorial line - it's based on what their readers expect. The fact that there are tools that make feedback more precise online, doesn't mean it's any different to what they get up to in print.</p>
<h3>News organisations are writing about what their readers are interested in</h3>
<blockquote><p>"Somewhere in [online newsrooms'] fancy open-plan offices is a guy who is watching the second-by-second audience for every page on the site. He's the chap who knows what's "trending" now and if your stories aren't figuring then – depending on the editorial culture – you may eventually feel the heat of managerial disapproval.</p></blockquote>
<p>Again, this surely applies in print as well. If you can't uncover or write about stories that are of interest to people, or make them sound interesting or compelling, you're not going to last long. There are tools that make it more obvious with online - but good.</p>
<p>Also, what's trending doesn't have to be a passive thing. You can shape the news agenda, you can drive interest in a topic by the quality of your journalism.</p>
<h3>News organisations are writing about what their readers are interested in</h3>
<blockquote><p>AOL asks its journalists to consider "Traffic Potential ("How many page views will this content generate?"); Revenue/Profit ("What CPM [cost per thousand impressions, a measure of online advertising effectiveness] will this content earn?"); Turnaround Time ("How long will it take to produce?"); and finally Editorial Integrity ("Will this content conform to AOL's editorial standards?"). In that order."</p></blockquote>
<p>I'm not really sure the order matters does it? But I'm struggling to see anything wrong with this as a set of questions. Newspapers are not a public good - they survive on the basis of the audience they deliver to advertisers (<a href="http://www.malcolmcoles.co.uk/blog/guardian-future/">or not</a>). Maybe journalists were never asked this as explicitly before - but is this really a bad thing?</p>
<h3>To sum up</h3>
<p>I'm not denying the pressure on journalists, the toll on news rooms caused by cost cutting or the dumbing down of the news agenda.</p>
<p>But none of this is caused by the measurement of online news per se - the new tools just give news organisations a more accurate way to measure what they've always done.</p>
<p><a href="http://www.flickr.com/photos/foxypar4/474250992/">Photo credit</a>.</p>
<img src="http://www.malcolmcoles.co.uk/blog/?ak_action=api_record_view&id=5922&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.malcolmcoles.co.uk/blog/chasing-ratings-news/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

