-
Microsoft boss 'proud' of profit-making OpenAI investment
-
Indie series 'Everyone Is Doing Great' returns... on Netflix
-
EU to invite Taliban officials to Brussels for migrant return talks
-
Leeds draw leaves Spurs deep in relegation peril
-
Napoli's Champions League spot in balance after last-gasp Bologna defeat
-
Curacao World Cup preparations rocked as coach resigns
-
US Supreme Court maintains mail access to abortion pill for now
-
Hantavirus ship heads to Netherlands after passengers flown home
-
Trump warns Mideast truce on 'life support', Iran says ready for any aggression
-
Frustrated Trump learns he doesn't have the cards on Iran
-
Cannes Film Festival defends male-dominated competition
-
Patel, Miller lead Delhi to record-breaking win over Punjab
-
Final hantavirus ship evacuations begin after weather delay
-
No longer peripheral: SKorean director makes Cannes history
-
Military strikes, gang massacres in Nigeria kill around 100 civilians
-
SNC Scandic Coin: Real assets meet digital utility
-
SNC Scandic Coin: реальные активы и цифровые возможности
-
Venezuela has 'never considered' becoming 51st US state: acting president
-
Wembanyama escapes playoff suspension after ejection: NBA source
-
Trump to suspend US gas tax as Iran war spikes prices
-
Macron announces 23 bn euros of investment at Africa summit
-
Oil rises, stocks mostly higher on US-Iran deadlock
-
SNC Scandic Coin: поєднання реальних активів та цифрової функціональності
-
Sinner demolishes Popyrin to stroll into Italian Open last 16
-
Dua Lipa sues Samsung in US over use of her likeness on TV box
-
White House press gala shooting suspect pleads not guilty
-
England women's great Mead to leave Arsenal at the end of the season
-
NATO 'could never be more important than today': Canada FM
-
Boycotters Spain, Ireland, Slovenia will not show Eurovision
-
Oil rises, stocks mixed on US-Iran deadlock
-
Tens of millions risk hunger as Hormuz standoff blocks fertiliser, UN official says
-
Beatles to open first London museum on site of last gig
-
Lewis-Skelly says leaders Arsenal know 'job is not yet done'
-
Boycotting Spain, Ireland, Slovenia will not show Eurovision
-
Every goalie 'illegally blocked' says West Ham's Hermansen after Arsenal agony
-
Thai police arrest 9 in largest ivory seizure in decade
-
Hantavirus: confirmed cases by nationality
-
US, French evacuees from hantavirus ship test positive
-
China seeks 'more stability' as it confirms Trump-Xi meet
-
Man City boss Guardiola backs Marmoush to play big role in run-in
-
Philippine lawmakers vote to impeach VP Sara Duterte
-
No end to deadlock as Iran, US reject talks terms
-
Iran hangs 'elite student' on espionage charges: NGOs
-
Party's over: China tells fans to end birthday blowouts for sport idols
-
Australia to quarantine six people from hantavirus ship
-
Groundbreaking: 'Controlled' quakes triggered under Swiss Alps
-
Nazi-looted portrait found in home of Dutch SS leader's family: art sleuth
-
US citizen from hantavirus ship tests positive
-
Hantavirus outbreak renews painful memories for Patagonian village
-
Myanmar complains over pariah treatment in ASEAN bloc
As AI data scrapers sap websites' revenues, some fight back
A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.
Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.
But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.
Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.
"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.
But the arrival of generative AI "completely breaks" that model, he told AFP.
Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.
"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.
- 'No trespassing' -
Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.
"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.
"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."
The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.
On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.
"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".
TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.
The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".
But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".
"This is an evolution of the entire internet economy, which will take years," he said.
If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.
"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."
A.Aguiar--PC