Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
The research introduces a novel memory architecture called MSA (Memory Sparse Attention). Through a combination of the Memory Sparse Attention mechanism, Document-wise RoPE for extreme context ...
MIT researchers developed Attention Matching, a KV cache compaction technique that compresses LLM memory by 50x in seconds — ...
This article outlines the design strategies currently used to address these bottlenecks, ranging from data center systolic ...
Samsung Electronics debuted its seventh-generation high bandwidth memory, HBM4E, at the Nvidia GTC 2026 developer conference ...
MacBook Neo needs just 8GB of RAM, and that's great news for all Mac users, even users of high-end computers. Here's why.
The upcoming GTC 2026, Nvidia’s artificial intelligence (AI) and advanced computing conference, is set to become a stage for ...
Another broad decline in markets as attacks on Gulf energy sites sent energy prices soaring to three-year highs ...
DataDome reports that a single scalping operation has been hammering memory listings with requests every 6.5 seconds, ...
XDA Developers on MSN
The RAM stick is dying, and the replacement is something most PC builders have never seen
There may be a new memory standard coming to town soon ...
Instead of using more and more concrete and steel, a European research team including Empa is focusing on intelligent shapes, digital manufacturing, ...