SEARCH MARKETING NEWS

Google explains recrawling for news content

A post on the Google news blog has explained how the search engine’s news aggregator finds up to date content and eliminates dead links.

The post says that “If you’ve read news online, you’ve probably noticed that articles aren’t static” which “presents a unique set of challenges [for] a news search engine that crawls hundreds of articles at thousands of sites every minute”.

Google’s following explanation of their methods provide an insight into how news content marketing is introduced to the Google News service. This information could be useful for SEO companies who have made this part of their search engine marketing services.

“To deal with these issues, Google news has implemented a re-crawl feature that allows us to focus on getting the newest articles around while still ensuring that we’re displaying the most up-to-date information”.

Google explains that “From the moment we discover a new article, we’ll keep revisiting it looking for changes”. The re-crawl algorithm revisits articles most frequently in the first day after they have been originally located, then visits the page less often as time passes. In some cases, the crawler will also revisit pages that it had “trouble crawling the first time round”.

“For readers, this feature is intended to reduce the number of out-dated headlines and dead links you might find” says Google. “For publishers, rest assured that we’ll be back to find your latest stories and updates as soon as we can”.