DeepMind's Chinchilla AI toasts FLAC and PNG at lossless data compression despite essentially being just a large language model

If you think FLAC is the audiophile's friend when it comes to lossless music files, a large language model (LLM) has news for you: AI is now laying claim to data compression, too.

A study titled "Language Modeling Is Compression" (via Ars Technica) reports that Chinchilla 70B, an LLM from DeepMind, can perform lossless data compression better than FLAC does for audio and PNG does for images.

Chinchilla 70B could significantly shrink the size of image patches from the ImageNet database, reducing them to only 43.4% of their original size without losing any detail. This performance is better than the PNG algorithm, which could only reduce the image sizes to 58.5%.

Additionally, Chinchilla compresses audio samples from the LibriSpeech dataset to just 16.4% of their original size. This is impressive, especially compared to FLAC compression, which could only reduce the same audio to 30.3%.
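To put those percentages side by side, here is a quick back-of-the-envelope sketch. It uses only the four figures quoted above; the `relative_saving` helper is a made-up name for illustration and does not come from the study. By this arithmetic, Chinchilla's output is roughly 26% smaller than PNG's and roughly 46% smaller than FLAC's.

```python
# Compressed size as a percentage of the raw size, as quoted in the article.
results = {
    "ImageNet patches": {"Chinchilla 70B": 43.4, "PNG": 58.5},
    "LibriSpeech audio": {"Chinchilla 70B": 16.4, "FLAC": 30.3},
}


def relative_saving(model_pct: float, baseline_pct: float) -> float:
    """Percent by which the model's output is smaller than the baseline's.

    Hypothetical helper for illustration only.
    """
    return (1 - model_pct / baseline_pct) * 100


for dataset, rates in results.items():
    # Each entry lists the model first and the conventional codec second.
    (model, model_pct), (baseline, baseline_pct) = rates.items()
    saving = relative_saving(model_pct, baseline_pct)
    print(f"{dataset}: {model} output is {saving:.1f}% smaller than {baseline}")
```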

Lossless compression means nothing is lost or left out when data is squeezed into smaller packages: decompressing the file gives back exactly the original data. This differs from lossy compression, which is what the image format JPEG uses. Lossy compression throws away some data and approximates what the original should look like when you open the file again, all to make the file size that much smaller.
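For a hands-on feel of what "lossless" means, here is a minimal sketch using Python's built-in `zlib` module (the same DEFLATE family that gzip and PNG rely on). The sample text is an arbitrary, highly repetitive string chosen for illustration, so the ratio it prints says nothing about real-world files.

```python
import zlib

# Arbitrary, very repetitive sample data (an assumption for illustration).
original = b"the quick brown fox jumps over the lazy dog. " * 200

compressed = zlib.compress(original, level=9)
restored = zlib.decompress(compressed)

# Lossless: the restored bytes are identical to the input, bit for bit.
assert restored == original

rate = len(compressed) / len(original) * 100
print(f"{len(original)} bytes -> {len(compressed)} bytes ({rate:.1f}% of original)")
```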

The study's findings show that even though Chinchilla 70B was mostly made to work with text, it is also surprisingly adept at making other types of data much smaller, and is often better at it than programs specifically made to do so.

The study's authors suggest that prediction and compression are two sides of the same coin: a model that predicts data well can be used to compress it, and the reverse also holds. This means if you have a good tool for making data smaller, like gzip, you can also use it to generate new data based on the patterns it picked up during the whole making-data-smaller process.
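The link between predicting and compressing can be sketched in a few lines. If a model assigns probability p to the symbol that actually comes next, an ideal coder needs about -log2(p) bits to store it, so better predictions mean fewer bits. The toy below is not the study's setup: it swaps the LLM for a simple byte-frequency counter, and `uniform_bits` and `adaptive_bits` are hypothetical names invented for this illustration.

```python
import math
from collections import Counter


def uniform_bits(data: bytes) -> float:
    """Cost with no prediction at all: every byte value is equally likely."""
    return len(data) * math.log2(256)


def adaptive_bits(data: bytes) -> float:
    """Cost under a simple adaptive byte-frequency predictor.

    Before seeing each byte, the model predicts it from the counts of the
    bytes seen so far (with add-one smoothing), then pays -log2(p) bits.
    """
    counts = Counter()
    total_bits = 0.0
    for seen, byte in enumerate(data):
        p = (counts[byte] + 1) / (seen + 256)
        total_bits += -math.log2(p)
        counts[byte] += 1
    return total_bits


# Arbitrary repetitive sample (an assumption for illustration).
sample = b"abracadabra " * 50
print(f"uniform model:  {uniform_bits(sample):.0f} bits")
print(f"adaptive model: {adaptive_bits(sample):.0f} bits")
```

The adaptive model needs fewer bits because it learns which bytes are common. A far stronger predictor, such as an LLM, pushes the total down further by the same logic.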

In one part of their research, the authors tested this idea by having both gzip and Chinchilla generate new text, images, and audio after being given a sample of data. As expected, gzip didn’t do great and generated mostly nonsense.

This shows that, while gzip can technically generate data, that data is not very meaningful. On the other hand, Chinchilla, which is specifically made for processing language, did much better at creating new, meaningful results.
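How do you get a compressor to "write" anything? One way to picture it, as a rough sketch and not necessarily the study's exact procedure: try every possible next byte and keep whichever one makes the whole sequence compress smallest. The example below does this greedily with `zlib` standing in for gzip; the `generate` and `compressed_len` functions and the prompt are invented for illustration.

```python
import zlib


def compressed_len(data: bytes) -> int:
    """Size in bytes of the zlib-compressed data (hypothetical helper)."""
    return len(zlib.compress(data, level=9))


def generate(prompt: bytes, n_bytes: int) -> bytes:
    """Greedily extend the prompt with the bytes that compress best."""
    out = bytearray(prompt)
    for _ in range(n_bytes):
        # Try all 256 candidates for the next byte and keep the cheapest.
        # Ties go to the lowest byte value, since min() returns the first.
        best = min(
            range(256),
            key=lambda b: compressed_len(bytes(out) + bytes([b])),
        )
        out.append(best)
    return bytes(out)


# Arbitrary prompt (an assumption for illustration).
prompt = b"the cat sat on the mat. the cat sat on the "
print(generate(prompt, 16))
```

Because compressed sizes only come in whole bytes, many candidates tie, and the output tends toward repetition or gibberish. That fits the article's point that a general-purpose compressor makes a poor generator.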

Almost 20 years ago, researchers argued that compression was a form of general intelligence, saying that "ideal text compression, if it were possible, would be equivalent to passing the Turing test for artificial intelligence."

However, as Ars Technica points out, this paper has yet to be peer-reviewed. The idea that making data smaller is related to intelligence is one we will probably keep hearing about. We are still just scratching the surface of what these LLMs can do.

Jorge Jimenez
Hardware writer, Human Pop-Tart

Jorge is a hardware writer from the enchanted lands of New Jersey. When he's not filling the office with the smell of Pop-Tarts, he's reviewing all sorts of gaming hardware, from laptops with the latest mobile GPUs to gaming chairs with built-in back massagers. He's been covering games and tech for over ten years and has written for Dualshockers, WCCFtech, Tom's Guide, and a bunch of other places on the world wide web. 
