Hello and welcome to Eye on AI. In this edition: DeepSeek defies AI convention (again)…Meta’s AI layoffs…More legal trouble for OpenAI…and what AI gets wrong about the news. Hi, Beatrice Nolan here, ...
DeepSeek has already shown that it prefers clever engineering over sheer size, and the new training method extends that philosophy. With DeepSeek R. 1.0, the company leaned heavily on reinforcement ...
Excitement is growing among researchers about another powerful artificial intelligence (AI) model to emerge from China, after DeepSeek shocked the world with its launch of R1 in January. The ...
DeepSeek, the Chinese AI startup spun off of Hong Kong high-frequency trading firm High Flyer Capital Management (and which uses a whale icon for its logo), is back today with a new large language ...
DeepSeek hinted that China will have homegrown "next generation" chips to support its AI models. Its mention of China's coming next-generation chips may signal plans to work more closely with China's ...
In the lead-up to China's Labor Day Golden Week, the country's AI sector is experiencing a flurry of large language model (LLM) upgrades. Baidu and Alibaba have rolled out new flagship models, while ...
Remember when DeepSeek briefly shook up the entire artificial intelligence industry by launching its large language model, R1, that was trained for a fraction of the money that OpenAI and other big ...
On Thursday, Chinese AI startup DeepSeek (DEEPSEEK) officially launched its updated DeepSeek-V3.1 AI model, which surpasses its R1 model on key benchmarks. The company unveiled V3.1 earlier this week.
Even as Meta fends off questions and criticisms of its new Llama 4 model family, graphics processing unit (GPU) master Nvidia has released a new, fully open source large language model (LLM) based on ...