-
White House press gala shooting suspect pleads not guilty
-
England women's great Mead to leave Arsenal at the end of the season
-
NATO 'could never be more important than today': Canada FM
-
Boycotters Spain, Ireland, Slovenia will not show Eurovision
-
Oil rises, stocks mixed on US-Iran deadlock
-
Tens of millions risk hunger as Hormuz standoff blocks fertiliser, UN official says
-
Beatles to open first London museum on site of last gig
-
Lewis-Skelly says leaders Arsenal know 'job is not yet done'
-
Boycotting Spain, Ireland, Slovenia will not show Eurovision
-
Every goalie 'illegally blocked' says West Ham's Hermansen after Arsenal agony
-
Thai police arrest 9 in largest ivory seizure in decade
-
Hantavirus: confirmed cases by nationality
-
US, French evacuees from hantavirus ship test positive
-
China seeks 'more stability' as it confirms Trump-Xi meet
-
Man City boss Guardiola backs Marmoush to play big role in run-in
-
Philippine lawmakers vote to impeach VP Sara Duterte
-
No end to deadlock as Iran, US reject talks terms
-
Iran hangs 'elite student' on espionage charges: NGOs
-
Party's over: China tells fans to end birthday blowouts for sport idols
-
Australia to quarantine six people from hantavirus ship
-
Groundbreaking: 'Controlled' quakes triggered under Swiss Alps
-
Nazi-looted portrait found in home of Dutch SS leader's family: art sleuth
-
US citizen from hantavirus ship tests positive
-
Hantavirus outbreak renews painful memories for Patagonian village
-
Myanmar complains over pariah treatment in ASEAN bloc
-
Domestic dominance not enough, Barca's ambition is European glory
-
Oil soars as Trump rejects Iran's terms
-
Spurs star Wembanyama ejected for elbowing Wolves' Reid
-
In India, heat-triggered insurance offers 'some relief'
-
Under-threat UK PM Starmer to attempt reset after disastrous polls
-
The first 48-team World Cup -- more opportunities, less jeopardy?
-
Can ChatGPT be charged in a murder? Florida wants to find out
-
Is risk-averse Hollywood running scared of Cannes critics?
-
Thailand's ex-PM Thaksin released from prison
-
Focus, longevity: Scheffler-McIlroy rivalry sparks mutual admiration
-
Middle East conflicts a danger for whales off S.Africa: study
-
Climate risks fuel insurance costs, squeezing US households even inland
-
Microsoft boss to testify on his role in OpenAI's founding
-
Iran war 'not over,' uranium must be removed: Netanyahu
-
Renovated Istanbul Greek Orthodox school to be inaugurated, but not reopened: patriarchate
-
Aminona Capital Partners Closed Second Latam Real Estate Fund
-
Frame Security Launches with $50M to Build the Future of Human Security
-
Norwegian rookie Reitan wins PGA Truist Championship
-
Knicks sweep past 76ers into NBA Eastern Conference finals
-
'I'll never forget this day': Barca's Flick after Liga triumph
-
Aussie Herbert wins LIV Golf Virginia title
-
Le Garrec guides La Rochelle past Racing in Top 14
-
PSG all but secure Ligue 1 title with two games to spare
-
UK, France to host defence ministers meeting on Hormuz
-
Key factors behind Barca's La Liga title triumph
ChatGPT's taste for literary nonsense sparks alarm
OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.
Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.
"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.
His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.
He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."
He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.
The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.
"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.
"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.
He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.
His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.
After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.
- 'Ripe for exploitation' -
"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.
"But it's just not clear to me that it's so very different for human beings," he added.
"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."
The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.
T.Vitorino--PC