Data Compression Method

TurboQuant: Reducing LLM Memory Usage With Vector Quantization

Large language models (LLMs) aren’t actually giant computer brains. Instead, they are massive vector spaces in which the ...

Tech Xplore

New software may nearly double pooled SSD performance in data centers

To improve data center efficiency, multiple storage devices are often pooled together over a network so many applications can share them. But even with pooling, significant device capacity remains ...

‘Neural texture compression’ might save gamers in a RAM-starved world

Intel and Nvidia show off how textures -- which take up a large chunk of PC games -- could be compressed to save you money ...

Intel TSNC Promises Up to 18x Texture Compression With Neural Tech

Intel TSNC brings neural texture compression with up to 18x reduction, faster decoding, and flexible SDK support for modern ...

Windows Report

NVIDIA NTC Slashes VRAM Usage by 85% With AI Compression

NVIDIA showcases Neural Texture Compression at GTC 2026, cutting VRAM usage by up to 85% with real-time AI reconstruction.

TweakTown

NVIDIA's Neural Texture Compression cuts VRAM usage from 6.5GB down to 970MB

Neural Texture Compression (NTC) could be a game-changer on par with DLSS if it can reduce the VRAM requirement for textures ...

Nvidia shows neural compression can cut VRAM usage from 6.5GB to 970MB

In its "Tuscan Wheels" demo, the company showed VRAM usage dropping from roughly 6.5GB with traditional BCN-compressed ...

Morning Overview on MSN

Google’s new AI compression could cut demand for NAND, pressuring Micron

A new compression technique from Google Research threatens to shrink the memory footprint of large AI models so dramatically ...

TechCrunch

Google unveils TurboQuant, a new AI memory compression algorithm — and yes, the internet is calling it ‘Pied Piper’

If Google’s AI researchers had a sense of humor, they would have called TurboQuant, the new, ultra-efficient AI memory compression algorithm announced Tuesday, “Pied Piper” — or, at least that’s what ...

Morning Overview on MSN

New detector chip compresses X-ray data up to 200x in real time

Researchers at Argonne National Laboratory and SLAC have designed a detector chip that compresses X-ray data by factors of 100 to 250 in real time, directly on the silicon that captures each frame.

Stark Insider

Google’s TurboQuant: The Unsexy AI Breakthrough Worth Watching

TurboQuant compresses AI model vectors from 32 bits down to as few as 3 bits by mapping high-dimensional data onto an efficient quantized grid. (Image: Google Research) The AI industry loves a big ...

VentureBeat

Nvidia says it can shrink LLM memory 20x without changing model weights

Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results