The research introduces a novel memory architecture called MSA (Memory Sparse Attention). Through a combination of the Memory Sparse Attention mechanism, Document-wise RoPE for extreme context ...
This article outlines the design strategies currently used to address these bottlenecks, ranging from data center systolic ...
Tencent showcased its three core AI solutions to the world: ‘MagicDawn,’ ‘VISVISE,’ and ‘ACE.’ According to Tencent, the most decisive shift in its AI technology this year, compared to last year, is ...
Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.
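For context on why a 20x KV-cache compression ratio matters, here is a back-of-envelope sketch of transformer KV-cache memory. The formula is the standard one for grouped-query attention; the specific model dimensions below are illustrative assumptions, not details of Nvidia's KVTC.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    # Each layer stores a K tensor and a V tensor (hence the factor of 2),
    # each holding n_kv_heads * head_dim values per cached token.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

# Illustrative 70B-class model: 80 layers, 8 KV heads (GQA), head_dim 128,
# fp16 (2 bytes/element), with a 32K-token context.
full = kv_cache_bytes(80, 8, 128, seq_len=32_768)
compressed = full / 20  # applying the 20x ratio reported for KVTC

print(f"uncompressed: {full / 2**30:.1f} GiB")      # 10.0 GiB
print(f"20x compressed: {compressed / 2**30:.2f} GiB")  # 0.50 GiB
```

At these assumed dimensions, a single 32K-token conversation holds about 10 GiB of fp16 KV state per user, which is why per-cache compression translates directly into GPU memory cost for multi-turn serving.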
Samsung Electronics Co., Ltd., a global leader in advanced semiconductor technology, today announced the comprehensive AI computing technologies it will showcase at NVIDIA GTC 2026 in San Jose, ...
Samsung Electronics debuted its seventh-generation high bandwidth memory, HBM4E, at the Nvidia GTC 2026 developer conference ...
DirectX, the Microsoft technology that allows PC games to talk to your gaming hardware, is going to build in support for the ...
MIT researchers developed Attention Matching, a KV cache compaction technique that compresses LLM memory by 50x in seconds — ...
Joint lab validation shows more than 40 percent downlink throughput gain versus standardized channel feedback in four-layer (rank-4) operation. SANTA ROSA, Calif.--(BUSINESS WIRE)--Keysight ...
ASML (ASML) plans to expand its chipmaking equipment portfolio with new products to capture more of the growing market for AI chips, Reuters reported, citing the company's Chief Technology Officer ...
ASML plans to expand into advanced packaging for AI chips. The company is to use AI to enhance tool performance and production speed, and is exploring larger chip sizes and new scanner systems. SAN JOSE, ...