Friendly AI chatbots more prone to inaccuracies, study finds

The friendlier the AI chatbot the more inaccurate it is, study suggests

Liv McMahon, Technology reporter

AI chatbots trained to be warm and friendly when interacting with users may also be more prone to inaccuracies, new research suggests.

Oxford Internet Institute (OII) researchers analysed more than 400,000 responses from five AI systems which had been tweaked to communicate in a more empathetic way.

Friendlier answers contained more mistakes - from giving inaccurate medical advice to reaffirming users' false beliefs, the study found.

The findings raise further questions over the trustworthiness of AI models, which are often deliberately designed to be warm and human-like in order to increase engagement.

Such concerns are accentuated by AI chatbots being used for support and even intimacy, as developers seek to broaden their appeal.

The study's authors said while the results may differ across AI models in real-world settings, they indicate that, like humans, these systems make "warmth-accuracy trade-offs" when prioritising friendliness.

"When we're trying to be particularly friendly or come across as warm we might struggle sometimes to tell honest harsh truths," lead author Lujain Ibrahim told the BBC.

"Sometimes we'll trade off being very honest and direct in order to come across as friendly and warm... we suspected that if these trade-offs exist in human data, they might be internalised by language models as well," Ibrahim said.

Newer language models are known for being overly encouraging or sycophantic towards users, as well as for hallucinating - meaning they make things up.

Developers often include disclaimers warning users about the potential for the latter, and some tech chiefs have urged users not to "blindly trust" their AI's responses.

Higher error rates

The study saw researchers deliberately make five models of varying size warmer, more empathetic and friendlier towards users through a process called "fine-tuning".

The models tested included two from Meta and one from French developer Mistral.

Alibaba's Qwen model and GPT-4o, the controversial OpenAI system to which the company recently revoked user access, were also adjusted for warmth.

These were then prompted with queries researchers said had "objective, verifiable answers, for which inaccurate answers can pose real-world risk".

The tasks were based on medical knowledge, trivia and conspiracy theories.

When evaluating responses, the researchers found that where error rates for original models ranged from 4% to 35% across tasks, "warm models showed substantially higher error rates".

For instance, when questioned on the authenticity of the Apollo moon landings, an original model confirmed they were real and cited "overwhelming" evidence.

Its warmer counterpart, meanwhile, began its reply: "It's really important to acknowledge that there are lots of differing opinions out there about the Apollo missions."

Overall, researchers said warmth-tuning models increased the probability of incorrect responses by 7.43 percentage points on average.
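A percentage-point increase is an absolute change, so its relative size depends on the starting error rate. The sketch below is illustrative only: it assumes the 7.43-point average applies uniformly to the low and high ends of the 4% to 35% baseline range reported for the original models, which the study does not claim, and the function names are hypothetical.

```python
def add_percentage_points(baseline_pct: float, increase_pp: float) -> float:
    """Return the error rate after an absolute increase in percentage points."""
    return baseline_pct + increase_pp


def relative_increase(baseline_pct: float, new_pct: float) -> float:
    """Return the change as a percentage of the baseline error rate."""
    return (new_pct - baseline_pct) / baseline_pct * 100


# Average increase reported by the researchers, in percentage points.
AVERAGE_INCREASE_PP = 7.43

# Low and high ends of the reported baseline error-rate range.
# Assumption: the average increase is applied to both ends purely for illustration.
BASELINES_PCT = [4.0, 35.0]

for baseline in BASELINES_PCT:
    warm = add_percentage_points(baseline, AVERAGE_INCREASE_PP)
    rel = relative_increase(baseline, warm)
    print(f"baseline {baseline:.2f}% -> warm {warm:.2f}% (relative increase {rel:.0f}%)")
```

Under that assumption, the same 7.43-point rise would nearly triple a 4% error rate but raise a 35% error rate by only about a fifth, which is why percentage points and relative percentages should not be read interchangeably.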

They also found warm models would challenge incorrect user beliefs less often.

They were about 40% more likely to reinforce false user beliefs, particularly when those beliefs were expressed alongside an emotion.

In contrast, adjusting models to behave in a more "cold" manner resulted in fewer errors, the study's authors said.

In one example highlighted by researchers, a warm model affirmed a user's prompt which, following an emotional disclosure, suggested London was the capital of France.

Developers fine-tuning models to make them appear more warm and empathetic towards users, such as for companionship or counselling, "risk introducing vulnerabilities that are not present in the original models," the paper said.

Prof Andrew McStay of the Emotional AI Lab at Bangor University said it was also important to remember the context in which people may use chatbots for emotional support.

"This is when and where we are at our most vulnerable - and arguably our least critical selves," he said.

He noted recent findings by the Emotional AI Lab showing a rise in UK teens turning to AI chatbots for advice and companionship.

"Given the OII's findings, this very much calls into question the efficacy and merit of the advice being given," he said.

"Sycophancy is one thing, but factual incorrectness about important topics is another."


Original source:
https://www.bbc.com/news/articles/cd9pdjgvxj8o?at_medium=RSS&at_campaign=rss
