-
France squad look to do grieving Deschamps proud in final World Cup group game
-
Will Taylor Swift and Travis Kelce wed in New York? Clues abound
-
Mayweather's Athens fight with Zambidis is off: report
-
Lawyer says Vondrousova 'should appeal' against four-year ban
-
Alonso committed to Aston Martin, but keeping options open
-
Hospitals raise alert as heatwave slams Europe
-
Events cancelled, records loom as heatwave reaches Germany
-
'Alligator Alcatraz' detention center shuts in US: official
-
Czech striker Schick ends international career
-
Tennis great Evert says 'relentless' cancer has returned
-
US says wants deal with Iran, but not 'at any price'
-
Colombian president-elect gives armed groups one month to surrender
-
US Supreme Court hands win to Bayer in weedkiller litigation
-
Apple raises prices for MacBooks and iPads, as costs soar over AI
-
Dominant Osaka sails into Bad Homburg semis
-
UK suffers as heat breaks new June record
-
US Supreme Court says asylum seekers can be turned away before border
-
Binance to suspend crypto services in several EU countries
-
Olivia Wilde looks at evolving relationships in 'The Invite'
-
Hamilton reveals neck injury that hampered debut year with Ferrari
-
Rows, drones and 'sorry' Son as South Korea await World Cup fate
-
Noosha Aubel and Dietmar Woidke: How Potsdam Is Letting Down a Young Child with Profound Disabilities
-
Greek families receive keepsakes of Holocaust victims
-
Antonelli welcomes Mercedes upgrade ast Russell says beware Hamilton
-
Easyjet rejects latest takeover bid but leaves door ajar
-
HRW denounces Turkey arrests ahead of NATO summit
-
Macron hosts Meloni for Riviera talks after Trump rift
-
Alonso committed to Aston Martin, but is keeping options open
-
US Supreme Court paves way for mass deportation of Haitians, Syrians
-
Venezuelans trapped alive after twin quakes kill at least 164
-
South Africa vows firm response to anti-migrant violence
-
New Zealand make England toil as Stokes returns for series decider
-
Poland, Ukraine hold key Gdansk conference without Zelensky
-
Americans impacted by climate change demand answers from lawmakers
-
Massive police deployment blocks Kenya protest anniversary
-
Heat-struck Italians cool off in ancient stone 'trulli'
-
Court orders TotalEnergies to account for clients' emissions
-
French teaching unions call strike over 'unacceptable' heat
-
US Fed's preferred inflation gauge hits fresh three-year high
-
Venezuela twin quakes kill at least 164 with many trapped under rubble
-
Dominant Osaka cruises into Bad Homburg semis
-
IOC votes to continue ski mountaineering for 2030 Games
-
New Zealand frustrate England as Stokes returns for series decider
-
Stocks rally on AI optimism after Micron's blowout forecast
-
Poland, Ukraine tone down dispute at reconstruction conference
-
Tunisia's short-lived World Cup experience lays bare deep dysfunctions
-
At-risk UK elderly bid to stay cool as heatwave bears down
-
'Everything collapsed': Venezuela region hit hardest by quakes cries for help
-
'Need each other': Macron hosts Meloni after Trump rift
-
Kenya police turn out in force on protest anniversary
China's DeepSeek releases long-awaited new AI model
Chinese startup DeepSeek released a new artificial intelligence model with "drastically reduced" costs Friday, more than a year after it stunned the world with a low-cost reasoning model that matched the capabilities of US rivals.
The AI race has intensified the rivalry between China and the United States, and the White House on Thursday accused Chinese entities of a massive effort to steal artificial intelligence technology.
Hangzhou-based DeepSeek burst onto the scene in January last year with a generative AI chatbot, powered by its R1 reasoning model, that upended assumptions of US dominance in the strategic sector.
DeepSeek-V4, "features an ultra-long context", the company said in a statement on social media platform WeChat, hailing it as "world-leading... with drastically reduced compute (and) memory costs" in a separate announcement on X.
V4 supports a context length of one million "tokens" -- small components of text including words or punctuation -- putting it on par with Google's Gemini.
Context length determines how much input a model is able to absorb to help it complete tasks.
The new V4 is released as two versions, DeepSeek-V4-Pro and DeepSeek-V4-Flash, with the latter being "a more efficient and economical choice" because it has smaller parameters.
In terms of "world knowledge", a benchmark for reasoning, V4-Pro trails only the latest Gemini model, DeepSeek said.
A "preview version" of the open source model is now available, the company said, without indicating when a final version would be released.
- 'Inflection point' -
Experts say V4's arrival marks an "inflection point" in terms of hardware and cost.
"This addresses the long-standing issues of slower performance and higher costs associated with long context lengths, marking a genuine inflection point for the industry," Zhang Yi, the founder of tech research firm iiMedia, told AFP.
"For end users, this will bring widespread, accessible benefits. For instance, if ultra-long context support becomes a standard feature, long-text processing is expected to move beyond high-end research labs and enter mainstream commercial applications," he said.
V4-Pro has 1.6 trillion parameters while the V4-Flash has 284 billion parameters, which refine models' decision-making ability.
The model has also been "optimised" for popular AI Agent products such as Claude Code, OpenClaw, OpenCode and CodeBuddy, the DeepSeek statement said.
DeepSeek's latest release is a "milestone" for Chinese firms, said veteran AI industry analyst Max Liu.
"It's a good thing for the entire domestic AI industry. It can provide better models for domestic users and we can now expect a lot more things -- more products (and a) more competitive market," he told AFP.
"This is no less shocking than when DeepSeek first came out" if its new model indeed matches the performance of leading models from Western labs, he added.
- 'Sputnik moment' -
Last year's so-called "DeepSeek shock" sparked a sell-off of AI-related shares and a reckoning on business strategy in what was also described as a "Sputnik moment" for the industry.
The chatbot performed at a similar level to ChatGPT and other top American offerings, but the company said it had taken significantly less computing power to develop.
However, its sudden popularity raised questions over data privacy and censorship, with the chatbot often refusing to answer questions on sensitive topics such as the 1989 Tiananmen crackdown.
At home, DeepSeek's AI tools have been widely adopted by Chinese municipalities and healthcare institutions as well as the financial sector and other businesses.
This has been partly driven by DeepSeek's decision to make its systems open source, with their inner workings public -- in contrast to the proprietary models sold by OpenAI and other Western rivals.
But the White House has accused Chinese firms of vying to "steal" American technology, ahead of an expected summit between Donald Trump and Xi Jinping in Beijing next month.
"The US has evidence that foreign entities, primarily in China, are running industrial-scale distillation campaigns to steal American AI," Trump's science and technology chief advisor Michael Kratsios said in a post on X.
Distillation is a common practice within AI development, often used by companies to create cheaper, smaller versions of their own models.
DeepSeek's Friday announcement also came as Meta said it planned to cut a tenth of its staff as it looks for productivity gains from the rest of the workforce while investing heavily in artificial intelligence. Reports said Microsoft was also looking to trim its ranks.
M.Carneiro--PC