Memory Compression Algorithms

10h

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

21h

Tackling The Demo-to-Production Gap In AI Agents: How Raunak Bhandari Built KiwiQ For Enterprise Reliability

Raunak Bhandari, a former Google AI lead, designed KiwiQ, a multi-agent orchestration platform built to support complex ...

13d

Nota AI Reduces Memory Usage of Upstage's Solar LLM by 72%, Demonstrating Proprietary Quantization Technology

Nota AI, an AI optimization technology company behind the Nota AI brand, announced that it has developed a next-generation ...

12d

Databricks built a RAG agent it says can handle every kind of enterprise search

Databricks' KARL agent uses reinforcement learning to generalize across six enterprise search behaviors — the problem that breaks most RAG pipelines.

Rock Paper Shotgun

Don’t know your FSR Diamonds from your DirectStorages? Here’s what all that Project Helix tech-speak actually means

We jargon-bust the impenetrable wall of techy marketing words that was Microsoft's Project Helix hardware tease.

Tencent Debuts MagicDawn at GDC Showcasing AI-Driven Global Illumination and Spatial Audio for Next-Gen Game Experiences

SAN FRANCISCO, CA, UNITED STATES, March 13, 2026 /EINPresswire.com/ -- During this year’s GDC Festival of Gaming, ...

2d on MSN

Nvidia says we'll see the future of real-time rendering at its GTC event

But it's most likely to be more stuff about path tracing ...

8d on MSN

Resident Evil Requiem appears to be randomly choosing whether to use your GPU to decompress data

It's a puzzle worthy of the game's heritage ...

BW Businessworld

Inside the Engineering Behind Samsung's Galaxy S26 & Buds 4 Pro

Samsung Galaxy S26 Ultra and Buds 4 Pro engineering brief: f/1.4 aperture, AI-ISP selfie, APV 8K codec, Super Wide Woofer, ...
