SEARCH MARKETING BLOG

What Googlebot sees – the 100k file size limit

As mentioned in my blog  earlier this week there is now a Labs section in Google Webmaster Tools which allows you to see your website as the Googlebot sees it.

This is a great new function, however when we were looking at this earlier in the week we noticed that there seems to be a 100k file size limit on the HTML files that Google is reading.

From an SEO point of view this is interesting information as it implies that Google will only read the first 100k of the code on any page on your website.

This could become a useful tool for you in ensuring that all of the pages on your site are visible (fully) to Google, as ensuring that all of your content can be seen will ensure that your website has a better chance of appearing higher in the SERPs

The cut of point of 100k is more important if your HTML code contains lots of commented out sections.  Although using the commenting out HTML code on your website can enable you to keep old code on the page whilst not displaying it on the site this could lead to your webpages not being seen property by the Googlebot.

With this in mind I’d recommend removing all commented out code from your site so the content on your pages that you want Google to see is visible to Google and not a lot of old code you no longer wish Google, or your visitors to see.

This entry was posted in Search Marketing Blog by Emily Mace. Bookmark the permalink.

About Emily Mace

Emily joined Vertical Leap as an SEO Campaign Delivery Manager in 2008, having gained wide search marketing experience as a web developer, SEO specialist and trainer for local Government departments and Tourism South East. Emily gained Google Analytics Individual Qualification in 2011, and regularly blogs on the technical aspects of SEO, sharing her expertise with our readers.