Google seemingly leaked a treasure trove of technical search algorithm details by accident and now SEO people are getting real aggro

Around 2,500 technical documents detailing the nuts and bolts of Google's ranking algorithms have apparently leaked. If the documents are real, it's an unprecedented look into the workings of the utterly dominant internet search engine. And one hell of an error, because Google itself reportedly published the documents to GitHub before taking them down. But nothing published to the web disappears overnight, and the documents have been kept for posterity elsewhere.

This leak provides an interesting opportunity to compare the reality of how Google ranks its search results with the various claims the company has made about what has hitherto been largely a mysterious black box. The inner workings of Google Search have long been speculated upon but never really known outside of the company itself—or indeed inside the company by most Google employees.

The documents were shared with long-time SEO specialist Rand Fishkin by Erfan Azimi, an SEO advisor at EA Eagle Digital. Azimi says he shared the documents in the hope that they would reveal the "lies" propagated by Google in relation to its search platform.

That is obviously a very, very bold claim. Frankly, the documentation is incredibly dense and technical and covers a huge array of topics and systems. In really broad-brush terms, it covers the type and character of data Google collects and uses, which sites Google elevates for sensitive topics like elections, how Google handles small websites, and much, much more.

There are various areas where it's claimed that analysis of the documents throws up clear contradictions with Google's claims. For instance, in 2016 Google Search engineer Paul Haahr said that "using clicks directly in rankings would be a mistake."

But it's claimed the documents prove that Google uses a system known as NavBoost that directly incorporates various click count metrics into the page rankings and search results.

Other areas said to contradict previous Google claims include the use of Domain Authority, the sandboxing of new websites while more data is collected, the use of user data collected from the Chrome web browser, and more.

If these claims are all true, it's hard to know how much of this comes down to Google simply wanting to protect its search IP from potential competitors and how much can be chalked up to more cynical or even sinister motives.

Moreover, as far as we can tell the documents do not actually reveal exactly how Google currently ranks pages. In other words, it does not appear that this leak will make it straightforward to optimise a web page to improve its Google search ranking, which is what a lot of observers would presumably have been praying for.

But if the documents are real, and the claims being made about the implications contained therein are broadly accurate, at minimum Google has a pretty major scandal on its hands in terms of the statements it has made in the past and its corporate credibility and ethics.

For now, that's a pretty big "if". This is a story that won't be resolved overnight. As far as we are aware, Google has yet to comment on whether the documents are real, let alone provide a riposte to the main critiques that have followed.

No doubt Google is formulating a detailed response as we write these very words. But we have a feeling that won't be the end of it, and the full fallout from this alleged scandal will be measured in months if not years.

Jeremy Laird
Hardware writer

Jeremy has been writing about technology and PCs since the 90nm Netburst era (Google it!) and enjoys nothing more than a serious dissertation on the finer points of monitor input lag and overshoot followed by a forensic examination of advanced lithography. Or maybe he just likes machines that go “ping!” He also has a thing for tennis and cars.
