AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argue in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
In the near term, the paper's authors see a risk that AI could commit fraud or tamper with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose whether users are interacting with a human or an AI, digital watermarks for AI-generated content, and techniques to detect AI deception by comparing systems' internal "thought processes" with their external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
M.Gameiro--PC