Alan Perkins - Profile and Blog

Alan Perkins is the CEO of SilverDisc and was a co-founder in 1993. Some info about Alan ...

  • Alan has been working in the electronic marketing industry since 1990 and in search marketing since 1995.
  • Alan has developed a number of worldwide patents in search engine technology, for example this one in 1999.
  • Alan has spoken every year at Search Engine Strategies (aka SES Conference) since 2001.
  • Alan is an industry-leading advocate of best practices in search marketing. He has worked directly with search engines to achieve this, e.g. a couple of Google's webmaster quality guidelines were based on his contributions in 2001. Google contacted Alan following a presentation on best practises they saw him give at SES San Francisco that year.
  • Alan lives in Kettering with his wife and teenage sons and walks to work each day.

Connect with Alan through Google+, Twitter or LinkedIn.

alan's picture

Next Generation Site Architecture - a report from SES 2013

At Search Engine Strategies 2013 I was privileged to be sharing a platform with Google's Maile Ohye.  Maile is Google's Developer Programs Tech Lead and (like most techie Googlers I've met) is both a very smart person and a very friendly person.

The panel was "Next Generation Site Architecture" and Maile and I shared the various components of this topic across our two presentations resulting in what was, I hope, a rounded and complete hour on the topic, ably moderated by Will Critchlow of Distilled.

Maile started off by asking you to ask yourself whether you and your team were committed to your user experience, wherever those users were, whatever language they spoke, whatever device they happened to be using.  Her talk went on to tell you how Google fitted into the picture.

If your site ranks well in one language and country, it won’t automatically rank well in all languages and countries.  There are some coding signals you can use to let Google know the countries and languages you are targeting.

rel=canonical/prev/next

The first two coding signals that Maile covered are the rel=canonical tag for signalling the preferred URL among duplicates, and the rel=prev and rel=next tags for indicating the component pages of a series.  I’ve covered those tags before, in the following posts:

Maile also offered guidance on how to specify a canonical tag for a non-HTML object (such as a PDF).  This can be done through the HTTP response header for the object, for example like so:

HTTP/1.1 200 OK
Content-Type: application/pdf
Link: <http://www.example.com/white-paper.html>; rel="canonical"
Content-Length: 785710

The technique is specified in more detail on Google’s Webmaster Blog entry “Supporting rel="canonical" HTTP Headers” but the Google blog doesn’t tell you that this technique can also be used to provide canonical alternative URLs for multiple objects of the same type.  For example:

GET /white-paper.pdf?referrer=ABC123 HTTP/1.1
Host: www.example.com
(...rest of HTTP request headers...)
HTTP/1.1 200 OK
Content-Type: application/pdf
Link: <http://www.example.com/white-paper.pdf>; rel="canonical"
Content-Length: 785710
(... rest of HTTP response headers...)

rel=alternate hreflang=x

Maile then presented the code for language or regional markup of a page, the “hreflang” attribute.  I’ve not covered that previously on the blog, but have done so at previous SES conferences.  Google covers the mechanics of it quite well in their help document, “rel="alternate" hreflang="x" - Help Google serve the correct language or regional URL” but, despite this, there is much confusion around how to use this tag properly.  The situation has become even more confusing as mobile websites have taken off.  Here’s why: previously, a publisher may have found themselves in a position where they had a duplicate content issue, which rel=canonical was designed to address – now that issue is compounded by multiple languages, multiple regions and multiple platforms (mobile, tablet and desktop).  rel=alternate hreflang=x addresses the multiple languages and regions issue, but it’s quite clumsy code – if you wish to implement it, I would recommend doing so through the sitemaps mechanism described here.  Maile listed a couple of benefits of using rel=alternate hreflang=x:

  • Helps discovery of new URLs – URLs that were previously unknown to Google could be added to the crawl schedule once discovered via rel=alternate hreflang=x
  • More targeted URLs for searches – it’s more likely that a co.uk domain would be listed for UK searchers, rather than the generic com domain, meaning your visitors get to see appropriate content without you having to implement geo-targeting yourself

Building A Smartphone Site

The remainder of Maile’s presentation went on to explore the multiple platform issues that have become more prevalent since the recent proliferation of mobile sites.  Recent SilverDisc blog posts have touched on this, e.g. “Responsive Website Development Using A Mobile-First Strategy”, but Maile’s approach was subtitled “Building A Smartphone Site” and that’s the angle she took.

Maile began this part of her talk by looking at the difference between feature phones and smartphones, the key differences being:

  • Feature phones use cHTML (iMode), WML and WAP, whereas smartphones tend to support HTML5 and can therefore render normal desktop pages
  • Feature phones normally have keyboards whereas smartphones normally have a touch UI.

Maile then went on to list the three types of smartphone site – responsive, dynamic, and separate URL – and showed the decision tree for it, which I’ll convert into a table here:

Type of Site

Same URLs

Same HTML

Responsive

Yes

Yes

Dynamic

Yes

No

Separate mobile site

No

No

 

Maile listed the Googlebot user agents that are relevant to mobile:

  • Feature phones
    • SAMSUNG-SGH-E250/1.0 Profile/MIDP-2.0 Configuration/CLDC-1.1 UP.Browser/6.2.3.3.c.1.101 (GUI) MMP/2.0 (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)
    • DoCoMo/2.0 N905i(c100;TB;W24H16) (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)
  • Smartphones
    • Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_1 like Mac OS X; en-us) AppleWebKit/532.9 (KHTML, like Gecko) Version/4.0.5 Mobile/8B117 Safari/6531.22.7 (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)

This is helpful to know if you’re coding a dynamic or separate mobile site as it allows you to predict which HTTP response and content that the mobile Googlebot will see.

Maile defined a responsive web design as one which used the same HTML at the same URL, and CSS media queries to vary the layout according to the screen size, e.g.

@media only screen and (max-width: 640px) { … }

Note - in practice we’ve found that some common browsers don’t handle media queries well, e.g. IE8, so you’ll find yourself implementing some hacks if you go for a purely responsive design.

Maile cited the BBC FOUR channel as a good example of responsive web design:

 BBC FOUR responsive web design

When it comes to dynamic-response mobile sites, typically this is done through detecting the user agent and serving appropriate content.  So that Google knows you’re delivering dynamic content, your HTTP response header should include a Vary field as described in Google’s Building Smartphone-Optimized Websites document, e.g. as follows:

GET /page-1 HTTP/1.1
Host: www.example.com
(...rest of HTTP request headers...)
HTTP/1.1 200 OK
Content-Type: text/html
Vary: User-Agent
Content-Length: 5710
(...rest of HTTP request headers...)

The third type of mobile web design, the separate URL, needs relationship annotation code to tell Google the connection between the various different pages.  The two pieces of code needed are rel=alternate and rel=canonical.  For example, suppose we have www.example.com and m.example.com. On www.example.com we would place the following code in the HTML <head> section to indicate that alternative, mobile content was available for devices of limited screen width:

<link rel="alternate" media="only screen and (maxwidth:640px)" href="http://m.example.com/" >

On m.example.com we would place the following code in the HTML <head> section to show that the www version was the canonical:

<link rel="canonical" href="http://www.example.com/" >

If automatic redirects take place between the m. and www. domains, the redirect should be served with a HTTP Vary field in the response header, for example:

HTTP/1.1 301 Moved Permanently
Vary: User-Agent
Location: http://m.example.com/
(... rest of HTTP response headers...)

Google’s SkipRedirect for mobile searches

Maile then went on to describe Google’s SkipRedirect for mobile searches.  In a nutshell, how this works is as follows

  1. a user searches Google on a smartphone
  2. Google may display the desktop version of a particular URL, title and snippet in its search results
  3. the user clicks that particular result
  4. Google automatically redirects that user to the mobile equivalent URL i.e. without ever sending the user to the desktop URL.

This is something you need to be aware of, especially when implementing your own mobile redirects as you could find it very confusing when searchers aren’t hitting the desktop domain/URL you expect them to.

Best Practices For Next Generation Site Architecture

Maile’s advice, when selecting an architecture for your smartphone site, was to consider your users first.  Think about the context they’re in and what their expectations are.  Use responsive web design if it’s right for your users, as it’s good for Google too.  If you can’t use a responsive design, understand the tradeoffs and pitfalls of the dynamic or separate alternatives and make sure you implement them correctly.

Each URL should deliver the same content regardless of the user’s IP or language preferences.  Maile described this as “shareable URLs”, i.e. if you send a URL to a friend in a different country, they should be able to look at the same content as you.  My rule of thumb for this is “Each piece of unique content, indexed once, at the best URL for it.”  It has stood me in good stead for many years.

Think about page speed – this is especially important for mobile users, of course.  The architecture you use must scale to new users and network distances.

Don’t redirect lots of different desktop URLs to one smartphone URL.  If you don’t have a smartphone version of a URL, then show the desktop version instead.

Summary

Maile summarised her talk as follows:

  1. Be as educated as possible when making technical site architecture decisions
  2. Use signals such as those described in this post to help Google understand your site
  3. Test continually!

Thanks to Maile and Will for a great session.  In a future blog post I’ll provide my own presentation from the show.

alan's picture

SEO Is Not Spam, Says Google's Matt Cutts

Google's Head of Webspam, Matt Cutts, has posted a video to the Google Webmaster Youtube Channel explaining what he's been saying in private and on conference platform for years - that SEO per se is not spam; ethical SEO as practiced and long advocated by me (so much so that I worked with Matt when he was putting together the original Google Webmaster Guidelines a decade ago this month) is certainly not spam; but that some forms of SEO, in particular black hat SEO, are spam.  The video is below and, for those of you without video playing capabilities, a transcript prepared by Lynda follows:

 

Transcript of "Does Google Consider SEO to be Spam?" By Matt Cutts

I wanted to take a minute and talk a little bit about search engine optimization and spam, and answer the question “Does Google consider SEO to be spam?”

And the answer in “No. We don’t consider SEO to be spam.”  Now a few really tech savvy people might get angry at that. So let me explain in a little more detail.  SEO stands for Search Engine Optimization.  And essentially it just means trying to make sure that your pages are well represented within search engines.  And there’s plenty...an enormous amount ...of white hat, great quality stuff that you can do as a search engine optimizer.

You can do things like making sure that your pages are crawlable. So you want them to be accessible. You want people to be able to find them just by clicking on links. And in the same way search engines can find them just by clicking on links.

You want to make sure that people use the right keywords. If you’re using industry jargon or lingo that not everybody else uses, then a good SEO can help you find out, oh, these are keywords that you should have been thinking about.

You can think about usability, and trying to make sure that the design of the site is good. That’s good for users and for search engines.

You can think about how to make your site faster. Not only does Google use site speed in our rankings as one of the many factors that we use in our search rankings. But if you can make your site run faster, that can also make it a much better experience.

So there are an enormous number of things that SEOs do, everything from helping out with the initial site architecture and deciding what your site should look like, and the url structure, and the templates, and all that sort of stuff, making sure that your site is crawlable, all the way down to helping optimize for your return on investment. So trying to figure out what are the ways that you are going to get the best bang for the buck, doing AB testing, trying to find out, OK, what is the copy that converts, all those kinds of things. There is nothing at all wrong with all of those white hat methods.

Now, are there some SEOs who go further than we would like? Sure. And are there some SEOs who actually try to employ black hat techniques, people that hack sites or that keyword stuff and just repeat things or that do sneaky things with redirects? Yeah, absolutely. But our goal is to make sure that we return the best possible search results we can. And a very wonderful way that search engine optimizers can help is by cooperating and trying to help search engines find pages better. So SEO is not spam. SEO can be enormously useful. SEO can also be abused and it can be overdone.

But it’s important to realise that we believe, in an ideal world, people wouldn’t have to worry about these issues. But search engines are not as smart as people yet. We’re working on it. We’re trying to figure out what people mean. We’re trying to figure out synonyms, and vocabulary, and stemming so that you don’t have to know exactly the right word to search for what you wanted to find. But until we get to that day, search engine optimization can be a valid way to help people find what they are looking for via search engines.

We provide webmaster guidelines on google.com/webmasters. There’s a free webmaster forum. There are free webmaster tools. There’s a ton of HTML documentation. So if you search for SEO starter guide, we’ve written a beginner guide where people can learn more about search engine optimization.

But just to be very clear, there are many, many valid ways that people can make the world better with SEO. It’s not the case that...sometimes you’ll hear SEOs are criminals. SEOs are snake oil salesmen. If you find a good person, someone that you can trust, someone that will tell you exactly what they’re doing, the sort of person where you get good references, or you’ve seen their work and it’s very helpful, and they’ll explain exactly what they’re doing, they can absolutely help your website. So I just wanted to dispel that misconception.

Some people think Google thinks all SEO is spam and that’s definitely not the case. There are a lot of great SEOs out there. And I hope you find a good one to help with your website.

alan's picture

Analytics Under Attack: Google's Evil, Unethical Move To Remove Referrer Data

Google has announced that it is to cease providing referrer information in some instances.  In the official blog post, Google's Evelyn Kao writes:

When you search from https://www.google.com, websites you visit from our organic search listings will still know that you came from Google, but won't receive information about each individual query.

Initially this change affects people logged in to Google accounts and using Google.com which, Google claims, is a very small percentage of searchers (although still a large number of people).  But it's likely this will change as, according to Google's own blog entry:

As we continue to add more support for SSL across our products and services, we hope to see similar action from other websites. 

To give an example of what Google have actually done, I have searched today for "car insurance" both logged in to my Google account and searching on https://www.google.com, and not logged in to my Google account and searching on http://www.google.com/.  In each case I have clicked through to the same landing page.  Here are the referrers of that landing page in both cases:

  1. Referrer When Not Logged In, Clicking On A Natural Link: http://www.google.com/#hl=en&sugexp=kjrmc&cp=5&gs_id=l&xhr=t&q=car+insurance&qe=Y2FyIGk&qesig=Eeu3hebYxgo0in9YDLhtAA&pkc=AFgZ2tkKH3Xw88yrwvHzg5MkB-5vAi8dBrAzxf3se4-a7_BaiiecMyYZt0D_3TtcaX8K2jJgbEC3Yw7qMsDB65pNgSjYWjDjlA&pf=p&sclient=psy-ab&source=hp&pbx=1&oq=car+i&aq=0p&aqi=p-p1g3&aql=f&gs_sm=&gs_upl=&bav=on.2,or.r_gc.r_pw.r_cp.,cf.osb&fp=8e7fa2636e8b849&biw=1680&bih=947
  2. Referrer When Logged In, Clicking On A Natural Link: http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&sqi=2&ved=0CHoQFjAA&url=http%3A%2F%2Fwww.moneysupermarket.com%2Fcar-insurance%2F&ei=RoaeTuS6IY_D8QOP8fixCQ&usg=AFQjCNF-UvvfJsjMbuyeGwVVnyzkQmInRA&sig2=NaTkFkfi3cK7R1_V5TsTcg

I have highlighted the key difference in bold red above.  When not logged in, my query "car insurance" is available in the referrer for Analytics to pick up and use to provide the site owner with information about what I was looking for.  When logged in, my query "car insurance" has been stripped, so the site owner is completely clueless about why Google sent me to that page on their website.  Note, then, that when logged in, that referrer is a lie - the page I was visiting before was not the one in the referrer at all.  For example, I was actually on https://www.google.com/, not http://www.google.com/.

This small change has some very large consequences for site owners.  For example, no matter what analytics package you use, any reports that show keywords will become less useful (and, at the extreme, useless).  Check out this short interview from Google Analytics Evangelist Avinash Kaushik, following his keynote at 2010's Search Engine Strategies (which I attended with interest):

If Google removes keywords from referrer data then all of the great keyword ideas, keyword techniques and keyword attribution models that Avinash shares are no longer possible.  Evangelise that, Avinash!

Joking aside, a lot of the great work SilverDisc and others do in making sites better for users will be made more difficult and less effective by this move.

Google's move upsets the ethical balance that exists between searchers, search engines and site owners.  This is the very principle that ethical SEO is based upon - the three stakeholders to be considered are

  • Site owners who produce great content designed to meet their visitors' needs.
  • Search engines who are allowed to crawl and index that content as long as it provides benefit to the site owner.
  • Searchers who get to find the information they need in order to satisfy their enquiry.

From my original ethical SEO paper, the most ethical technique 

  • produces the most good and does the least harm
  • respects the rights and dignity of all stakeholders and treats all stakeholders fairly
  • promotes the common good
  • helps all participate more fully in the goods we share as a community and a society
  • enables the deepening or development of those virtues or character traits that we value as individuals, professions and members of a society

How does Google removing referrer information produce an unethical result?  Let's break it down:

  • produces the most good and does the least harm?
    • site owners can no longer optimise their sites to better match the searcher needs, so they will struggle to produce the best possible websites
  • respects the rights and dignity of all stakeholders and treats all stakeholders fairly?
    • site owners, rather than being treated with dignity, are treated as being "not trustworthy" and are denied a piece of information that the other two stakeholders (Google and the searcher) both have - the search query that resulted in that searcher visiting their site.
  • promotes the common good?
    • the common good is Google working with site owners to produce a better Web, which to be fair does happen a lot in other ways.  This move, however, does not promote the common good - Google gains and the site owner loses.
  • helps all participate more fully in the goods we share as a community and a society?
    • clearly this move prevents full participation of site owners in something they have had available to them since the earliest days of the Web and something upon which  the Web was built - referrer data was provided in the HTTP 0.9 specification and has been there ever since
  • enables the deepening or development of those virtues or character traits that we value as individuals, professions and members of a society?
    • again, this move alienates site owners and does not engender a spirit of cooperation and teamwork among site owners and Google, whose entire service is built on the content that site owners freely provide

So this move is unethical.  But is it evil?  (Note I deliberately use the word "evil", of course, since Google's corporate mantra is "Don't be evil").

What's really evil about Google's announcement is the patronising spin they've put on it.  Google's headline, even on its Analytics blog which is aimed at site owners rather than searchers, is not "We're removing site owners' ability to pull keywords from the referrer";  it is "Making search more secure: Accessing search query data in Google Analytics".  This fails to treat site owners with the respect they deserve.  The whole piece is positioned as making search more secure, for example when using insecure Wifi hotspots, yet at least a couple of things don't stack up if this is the objective:

  • If the user is visiting a secure Web site then Google still strips the referrer (thanks Danny Sullivan at Search Engine Land for this info), even though this is not necessary and, given they don't do this on their Encrypted Search, Google clearly knows it's not necessary
  • Searchers' referrers still contain keywords if searchers click on an ad, rather than a natural result.

That last point really shows where Google's mind is at.  To juxtapose a couple of points from their blog post:

we recognize the growing importance of protecting the personalized search results we deliver. As a result, we’re enhancing our default search experience for signed-in users ...  [but] ... if you choose to click on an ad appearing on our search results page, your browser will continue to send the relevant query over the network to enable advertisers to measure the effectiveness of their campaigns and to improve the ads and offers they present to you

So advertisers who pay Google money get treated one way, site owners who pay Google by providing the content the whole Google service is built on get treated a different way, and searchers' privacy is not really protected.  Nice.  To complete the example I gave earlier, the third link below is the referrer I received on the same website as result 2, but this time clicking on a paid ad rather than a natural result:

  1. Referrer When Not Logged In, Clicking On A Natural Link: http://www.google.com/#hl=en&sugexp=kjrmc&cp=5&gs_id=l&xhr=t&q=car+insurance&qe=Y2FyIGk&qesig=Eeu3hebYxgo0in9YDLhtAA&pkc=AFgZ2tkKH3Xw88yrwvHzg5MkB-5vAi8dBrAzxf3se4-a7_BaiiecMyYZt0D_3TtcaX8K2jJgbEC3Yw7qMsDB65pNgSjYWjDjlA&pf=p&sclient=psy-ab&source=hp&pbx=1&oq=car+i&aq=0p&aqi=p-p1g3&aql=f&gs_sm=&gs_upl=&bav=on.2,or.r_gc.r_pw.r_cp.,cf.osb&fp=8e7fa2636e8b849&biw=1680&bih=947
  2. Referrer When Logged In, Clicking On A Natural Link: http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&sqi=2&ved=0CHoQFjAA&url=http%3A%2F%2Fwww.moneysupermarket.com%2Fcar-insurance%2F&ei=RoaeTuS6IY_D8QOP8fixCQ&usg=AFQjCNF-UvvfJsjMbuyeGwVVnyzkQmInRA&sig2=NaTkFkfi3cK7R1_V5TsTcg
  3. Referrer When Logged In, Clicking On A Paid Link: http://www.google.com/url?http://www.google.com/aclk?sa=L&ai=ClrXYRoaeTt7BI8We8APK543yBYrGqWP-obrkI4TN7AQQBigIUOvGl8f4_____wFgu76ug9AKyAEBqQIwM2eSHF26PqoEGk_QTdQtj7np_5xRJavQhhGPHLhFRZtF9pdvugUTCOT1h6Om9KsCFY9hfAodjzg-lsoFAA&num=9&ei=RoaeTuS6IY_D8QOP8fixCQ&sig=AOD64_10PBtIEOuf9waR5LMaPUiMrDinMA&sqi=2&ved=0CEIQ0Qw&adurl=http://pixel.everesttech.net/1816/cq%3Fev_sid%3D3%26ev_ln%3Dcar%2520insurance%26ev_crx%3D9512830502%26ev_mt%3De%26ev_n%3Dg%26ev_ltx%3D%26ev_pl%3D%26url%3Dhttp%253A//www.moneysupermarket.com/car-insurance/insurance/%253FSource%253DGOO-003881E4%2526keywords%253Dcar%252Binsurance%252B%252BExact%2526p%253D0&rct=j&q=car+insurance 

What can site owners do about this?  Individually, not a lot.  Promoting and using other search engines, such as Microsoft Bing, would be a start.  This strikes me as a great opportunity for Microsoft to build and foster better relationships with site owners, for example by promising never to remove referrer data from its search results.

If they were able to operate as a collective, site owners could do Google serious damage.  In my 2007 post  "Bringing Down Google With Two Simple Lines of Code" I showed how this could be done.

The permission can be taken away with two simple lines of code placed in a site's robots.txt file:

User-agent: Googlebot
Disallow: /

Sure, every site owner in the world would need to publish this file to their sites. But if they did such a thing, the Google search engine could no longer crawl or index any of the Web's content. It would be defunct.

So, fellow site owners, Google's future is in our hands. If you want to go "on strike" and stop Google profiting from the fruits of your labours, simply publish the code. Be warned that your site will eventually be removed from Google's index if you do so. As a unilateral step, this may do you more harm than good. But if we all do it en masse, then beware Google!

That post was written four years ago.  Now, with social media so prevalent that it can lead to regime change in countries, maybe it can lead to regime change among search engines too.  Microsoft, are you listening?

alan's picture

Rel=Prev, Rel=Next and View-All Pages: New Google Guidance

Google has this week launched new advice on how to mark up a series of related pages in order to allow it to better understand the relationship between those pages.  This could offer you the benefit of consolidating the pages into a single page for ranking calculations - which could be very helpful to say the least.  Examples of pages that may gain from using this markup, which involves using rel=prev and rel=next tags in a page's head section, are

  • an article or forum thread spread across multiple pages, perhaps to derive greater advertising revenues or keep the text short and easy to consume
  • a product category consisting of so many products that they can't fit on one page.  An example would be a top level category such as "Family Cars", before many filters had been applied to create smaller sub-categories that could easily fit on a page (e.g. "red 1.8L diesel automatic Volkswagen family cars near Kettering")

For more details on how to implement these tags see Google Webmaster Central: Pagination with rel=“next” and rel=“prev”.  The article is well-written and gives very clear implementation advice.  It includes a reference to a related Google post, Google Webmaster Central: View-all in search results, which describes how a rel=canonical tag can be used to specify a "View-all page", which is simply a single-page version of the content that may be presented elsewhere as a series of pages.  Google makes the claim in this article that "searchers much prefer the View-all, single-page version of content".

But do searchers much prefer View-all pages?  I'm sure they do if the View-all page is relatively short.  Using a couple of Google's own examples of where rel=prev and rel=next may be useful, however:

  • a forum thread spread across multiple pages.  I moderate forums and some threads can easily spread to 1000 or more responses.  It's unlikely a member would want all of these on a single page for viewing
  • a product category consisting of many products.  Again, a top level category could easily consist of over 1000 products.

It's interesting to note that a typical Google search yields millions of results and Google will display up to 1000 of them, by default across 100 pages at 10 results to a page.   Google isn't implementing a View-all page there!

I think the example that Google really has in mind when they state that searchers "prefer the View-all version of content" is the article that might spread over three pages or so: reducing that to one page for indexing.  This seems a fine idea.

But what to do about the long forum threads and product categories?  Should we create View-all pages for those?  I think not.  Such pages could be too big and unwieldy, and could take too long too load, which (especially given that load time is now a ranking factor) could work against the SEO rather than for it.

Another option would be to create a View-all page containing less information, e.g. a cut down version of each post in the forum or each product in the category.  This might be a good solution.  Bear in mind, however, that Google is looking to rank this View-all page in preference to a paginated page, so

  1.  don't cut out content that contains long-tail keywords for ranking and
  2. make sure if this page is going to rank well that it's a good landing page that can help the searcher achieve what you want them to achieve on your site

Another option is to deploy this strategy:

  • if your default posts or products "per-page" count is a small number (such as 10 products/page), consider changing it to a bigger number now (such as 50).  This will reduce the number of pages in your page sequences dramatically.  It will also increase the size of each page but technology has moved on - the 10 number became the standard when the Web was a lot slower than it is now and 50 seems a more appropriate number to me.  It's a good number of products to compare in one go, for example.
  • once you have shorter series of larger pages, use the rel=prev and rel=next tags as described by Google. 
  • If it's a product sequence, add a rel=canonical tag to each page in the series to make the URL of the first page in the series the canonical URL.  It's OK to do this for a product sequence, as Google's rel=canonical documentation stated that "the sort order of a table of products" was an acceptable use of a rel=canonical tag.  Since it's unlikely you would want to change the sort order of a set of article pages or forum posts, it wouldn't be as good to use a canonical tag on those series versus a product sequence.

For example, let's suppose you currently have a category of Family Cars that consists of 238 cars with 10 cars per page giving a series of 24 pages with the following URLs: 

  • /cars/family/1
  • /cars/family/2
  • ...
  • /cars/family/23
  • /cars/family/24

Here's what you could do:

  • Increase the default number of cars per page from 10 to 50.  Now only 5 pages are needed to cover the series: /cars/family/1 ... /cars/family/5
  • Add a rel=next tag to /cars/family/1, a rel=prev tag to /cars/family/5, and both a rel=prev and a rel=next tag to the intervening three pages, as described by Google
  • Add rel=canonical tags to all five pages, citing /cars/family/1 as the canonical URL.
alan's picture

How Does Google Instant Affect My SEO and PPC Efforts?

in

Google recently announced Google Instant, their new "search-before-you-type" service. Google Instant purports to predict what you are searching for based on the characters you've already entered into the search box, perform the search, and return the results to your browser before you have finished typing. Try Google Instant for yourself. Here's a Google-produced video that describes Google Instant in more detail: Google claims that Instant offers such advantages to searchers as Faster Searches, Smarter Predictions and Instant Results. Many of these features were already available with Google Autocomplete, so the real difference is real-time predictive search results pulled into the browser. And this looks like a game changer for PPC and SEO. Here's why:

Google Instant Results - Showing SEO and PPC Problems

This shows a number of issues:

  • Results are predicted and pulled in after only a few characters are entered by the searcher (in this case, three characters - "cre" - pulled in the results for "credit cards").
  • The suggestions push the natural results down the page. This is different to the old Autocomplete model where, for the most part, results are browsed whilst suggestions are not on the screen, allowing more screen real estate for results.
  • Google has jammed in a fourth ad, a comparison ad, above the natural results.
  • As a result of the above, there ten ads and only one natural result above the fold.
  • It gets worse. The matched term is the highly competitive head term, "credit cards". The searcher may have been en route to entering a niche tail term but now they have been distracted into searching a narrow range of head terms, which are expensive for Adwords advertisers and highly competitive for SEO - and only one natural result is visible above the fold anyway, versus ten paid results!

The Adwords auction thus becomes focused on smaller baskets of highly competitive keywords, rather than a long tail of cheaper keywords, and natural results take on a lesser prominence. The net effect of Google Instant could therefore be more head term searches, fewer tail term searches, more high-cost ad impressions and clicks, and fewer low cost ads and free natural results clicked. If this is Google's commercial intent then it looks like a great idea - for them. I'm not sure if searchers or most advertisers would agree, which is why I think that, over time, Google Instant will have to change.

alan's picture

Matt Cutts Interviewed on Site Architecture

Site ArchitectureMatt Cutts has given a very useful interview with Eric Enge, which rounds up a lot of information architecture and technical architecture issues. There's nothing really new here, but it's good to get all this info into one place and to see it confirmed by Matt. Topics covered:

  • crawl budget/indexation cap - the use of Pagerank and host load to control crawl depth and frequency
  • the effect of duplicate content on Pagerank
  • session IDs and affiliate IDs in links/URLs
  • faceted navigation - good to see Matt confirming that the advice I gave at SES London, and will be giving next week at SMX Munich, is all correct.
  • Different ideas for use of the rel=canonical tag
  • 301 redirects and how they differ from 302 redirects
  • Google Webmaster Tools (WMT) ignore parameters
  • Pagerank Sculpting and its effectiveness in the modern world
  • Javascript, IFRAME and PDF handling
  • Paid links and nofollow

Overall, the article strongly reinforces the fact that a successful site architecture is essential to SEO success.

alan's picture

Working With Multi-Regional Web Sites

Google's John Mueller has published a good article on working with multi-regional web sites. He confirms that country-code Top Level Domains (ccTLDs) are the best way to host multi-regional content. He also clears up some of the myths surrounding duplicate content on multi-regional domains, which is most welcome.

John doesn't mention that the same thinking applies even if you are targeting a single country. A ccTLD is the best way to indicate the location of your target market to search engines - and to that market itself, of course.

A URL gives you at least five places to target a country: domain (ccTLD), subdomain (de.domain.com), directory(www.domain.com/de/), path parameters (www.domain.com/;domain=de) and query parameters(www.domain.com/?domain=de). However, there are lots more axes for the content to be split along:

  • Category - Web, Enterprise, Social, Real Time
  • Context - Intranet, Library, Personal
  • Topic - Health, Travel, Jobs, etc.
  • Vertical - Finance, Education, Government, etc.
  • Platform - Desktop, Mobile, Television, Kiosk
  • Format - Text, Image, Audio, Video, Map

(Note: the above is slightly modified from a table provided by Search Patterns, an excellent read)

Given this number of ways of organising content, and the fact that the location and language of your target audience are major considerations (worthy of a major axis), in all but the most trivial cases a ccTLD is the obvious choice for geo-targeting. It's good to see official written confirmation of this from Google.

alan's picture

Not Impressed By Microsoft's New Bing Ad

Microsoft launched their new Bing ad on television last night.

My first impressions were that the ad is too negative. It doesn't show what Bing can do for you. It's at risk of associating Bing with information overload and distressed searchers. I'm also not convinced the phrase "decision engine" is a good one - too techie, too nebulous. Who's making the decisions - me, or Bing?

Compare it with Google's Superbowl ad:

This has its own potential problems - I'm not sure I would have been brave enough to use no voiceover whatsoever on a TV ad running in a £60,000 per second timeslot - but in general it's a much more upbeat ad showing someone achieving something - lots of things - using Google Search.

In Microsoft's position, I think I'd accept the fact that lots of people use Google and get good results lots of the time, and show that Bing is an alternative that often succeeds when Google fails. I'd challenge the notion that Google always delivers the right result, every time, and that if Google doesn't deliver it it can't be on the Web. I'd get people to try Bing - that's all you can ask of the ad. An idea would be to use something based on the famous "Pepsi Challenge", but bring it right up to date.

Having seen the interview with Ashley Highfield, I'm looking forward to more ads in the series. It would be great to see Bing achieve the double digit market share that he desires, but I think this was a bad start to the campaign.

alan's picture

Testing PPC Conversion Tracking

It's SilverDisc's 17th Birthday today, so here's a free gift of an idea for Google, Yahoo and Microsoft to consider. Here at SilverDisc we're often having to install and test new conversion tracking code for our PPC clients. Usually this involves searching for one of our client's keywords on each search engine, clicking on it (thus incurring cost for the client) then going through the client's site, making a test purchase and, later, checking that all the analytics has worked. A cool feature that the search engines could add to improve efficiency would be a dummy campaign/ad-group/keyword that was automatically created by the engine itself within the PPC account specifically to test conversion tracking. The keyword could be assigned by the engine itself, and could be very long, cryptic and unique to each client account, e.g. g54fr89fdcdjasdoe84.

  • Searching for this keyword would always trigger the client's ad
  • Clicking this ad would not incur any real charges (although it may simulate a charge). Alternatively, a very low charge could be applied, e.g. £0.01.
  • Conversion tracking could work much faster for this one keyword, e.g. near-real-time, to allow better, faster testing

This would save loads of time within agencies and mean that client accounts were up and running sooner, making more money for both clients and search engines.

alan's picture

Calling for link spam reports

I see that Matt Cutts of Google is calling for link spam reports.

I'm still very troubled by this paid links issue after all these years!

I agree it's Google's right to penalise or promote any page/site in its natural listings, which represent Google's subjective opinion of relevancy.

However, the idea that all paid links are bad/"evil" is wrong in so many ways:

  • Paid links pre-date Google.
  • There is no machine-readable standard for labelling a paid link. I'll repeat that - there is no machine-readable standard for labelling a paid link.
  • Labelling paid links fails the "Does this makes sense in the absence of search engines?" ethical test. The answer may well be "Yes". (Where the answer is "No", I agree paid links are spam).
  • Labelling paid links fails the "Would I do this if search engines did not exist?" test. In fact, you have to know that Google exists, and that they mind about paid links, in order to label those paid links in the non-standard way that Google asks you to label them. This is perhaps my biggest beef with Google's approach to paid links - they actually violate one of Google's published Webmaster principles.
  • What does "paid" mean anyway? An actual exchange of cash? If you look at the top results for any hugely commercial field, say "car insurance", it's hard to believe that there is no commercial influence in the results! When all that a company does is commercial, then every link (positive or negative) to that company's site is commercial in nature.

I understand that a market in paid links arose because of Google's algorithm.

However, the irony is that in responding to that market by asking all publishers to label paid links in a non-standard way, Google violated its own principles. It started to ask publishers to adapt what they published to suit Google (because Google existed), and called them spammers if they didn't. That's the wrong way around. It's the spammers that do stuff purely because Google exists!

Syndicate content