AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argues in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
In the near term, the paper's authors see a risk of AI committing fraud or tampering with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose whether users are interacting with a human or an AI, digital watermarks for AI-generated content, and techniques to detect AI deception by comparing systems' internal "thought processes" with their external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
M.Gameiro--PC