OpenAI inference cost reduction cut ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, using software optimization alone. Engineers achieved more than 50% savings ...
Claude creator Anthropic launches an early in-house AI chip project, holding talks with Samsung to manufacture custom 2nm ...