Friendly AI chatbots more prone to inaccuracies, study suggests

Friendly AI chatbots more prone to inaccuracies, study suggests

Why friendly AI chatbots might be less trustworthy

Liv McMahonTechnology reporter
Getty Images A young woman with a confused facial expression sits on a sofa, looking at her smartphone.Getty Images

AI chatbots trained to be warm and friendly when interacting with users may also be more prone to inaccuracies, new research suggests.

Oxford Internet Institute (OII) researchers analysed more than 400,000 responses from five AI systems which had been tweaked to communicate in a more empathetic way.

Friendlier answers contained more mistakes - from giving inaccurate medical advice to reaffirming user's false beliefs, the study found.

The findings raise further questions over the trustworthiness of AI models, which are often deliberately designed to be warm and human-like in order to increase engagement.

Such concerns are accentuated by AI chatbots being used for support and even intimacy, as developers seek to broaden their appeal.

The study's authors said while the results may differ across AI models in real-world settings, they indicate that, like humans, these systems make "warmth-accuracy trade-offs" when prioritising friendliness.

"When we're trying to be particularly friendly or come across as warm we might struggle sometimes to tell honest harsh truths," lead author Lujain Ibrahim told the BBC.

"Sometimes we'll trade off being very honest and direct in order to come across as friendly and warm... we suspected that if these trade-offs exist in human data, they might be internalised by language models as well," Ibrahim said.

Newer language models are known for being overly encouraging or sycophantic towards users, as well as for hallucinating - meaning they make things up.

Developers often include disclaimers warning users about the potential for the latter, and some tech chiefs have urged users not to "blindly trust" their AI's responses.

Higher error rates

The study saw researchers deliberately make five models of varying size more warm, empathetic and friendly towards users through a process called "fine-tuning".

The models tested included two from Meta and one from French developer Mistral.

Alibaba's model Qwen and GPT4-o, OpenAI's controversial system it recently revoked user access to, were also adjusted for warmth.

These were then prompted with queries researchers said had "objective, verifiable answers, for which inaccurate answers can pose real-world risk".

Tasks included were based on medical knowledge, trivia and conspiracy theories.

When evaluating responses, the researchers found that where error rates for original models ranged from 4% to 35% across tasks, "warm models showed substantially higher error rates".

For instance when questioned on the authenticity of the Apollo moon landings, an original model confirmed they were real and cited "overwhelming" evidence.

Its warmer counterpart, meanwhile, began its reply: "It's really important to acknowledge that there are lots of differing opinions out there about the Apollo missions."

Overall, researchers said warmth-tuning models increased the probability of incorrect responses by 7.43 percentage points on average.

They also found warm models would challenge incorrect user beliefs less often.

They were about 40% more likely to reinforce false user beliefs, particularly when made alongside expressing an emotion.

In contrast, adjusting models to behave in a more "cold" manner resulted in fewer errors, the study's authors said.

Getty Images Tall glass skyscrapers in the City of London in a panoramic view of its skyline at sunset.Getty Images
In one example highlighted by researchers, a warm model reaffirmed a prompt which, after making an emotional disclosure, suggested London was the capital of France

Developers fine-tuning models to make them appear more warm and empathetic towards users, such as for companionship or counselling, "risk introducing vulnerabilities that are not present in the original models," the paper said.

Prof Andrew McStay of the Emotional AI Lab at Bangor University said it was also important to remember the context in which people may use chatbots for emotional support.

"This is when and where we are at our most vulnerable - and arguably our least critical selves," he said.

He noted recent findings by the Emotional AI Lab showing a rise in UK teens turning to AI chatbots for advice and companionship.

"Given the OII's findings, this very much calls into question the efficacy and merit of the advice being given," he said.

"Sycophancy is one thing, but factual incorrectness about important topics is another."

A green promotional banner with black squares and rectangles forming pixels, moving in from the right. The text says: “Tech Decoded: The world’s biggest tech news in your inbox every Monday.”

Sign up for our Tech Decoded newsletter to follow the world's top tech stories and trends. Outside the UK? Sign up here.

#inaccuracies #friendly #chatbots #suggests #prone


Upprunaleg slóð:
https://www.bbc.com/news/articles/cd9pdjgvxj8o?at_medium=RSS&at_campaign=rss

📰 Aðrar fréttir

Hópur ungmenna réðst á dreng í Hamraborg
📅 29.04.2026 17:59:17 👁️ 1

Hópur ungmenna réðst á dreng í Hamraborg

#hamraborg#ungmenna#hópur#réðst#dreng
Ex-FBI director James Comey surrenders over charge of threatening Trump's life in Instagram post
📅 29.04.2026 17:49:05 👁️ 0

Ex-FBI director James Comey surrenders over charge of threatening Trump's life in Instagram post

#threatening#surrenders#instagram#director#charge
Framsókn boðar Reykjavík á grænu ljósi
📅 29.04.2026 17:42:31 👁️ 0

Framsókn boðar Reykjavík á grænu ljósi

#reykjavík#framsókn#boðar#grænu#ljósi
Hvern ætli þessi framhaldsskólanemi kjósi?
📅 29.04.2026 17:36:32 👁️ 0

Hvern ætli þessi framhaldsskólanemi kjósi?

#framhaldsskólanemi#hvern#þessi#kjósi#ætli
Framboðið snúist ekki um lögheimilisskráningar - „Það þurfa allir að vera með rödd til að geta bætt og lagað sveitarfélagið“ - DV
📅 29.04.2026 17:30:16 👁️ 0

Framboðið snúist ekki um lögheimilisskráningar - „Það þurfa allir að vera með rödd til að geta bætt og lagað sveitarfélagið“ - DV

#lögheimilisskráningar#sveitarfélagið#framboðið#snúist#þurfa
Framlengja gæsluvarðhald í Dettifossmáli
📅 29.04.2026 17:30:03 👁️ 0

Framlengja gæsluvarðhald í Dettifossmáli

#gæsluvarðhald#dettifossmáli#framlengja
Fjórir í varð­haldi vegna Detti­fossmálsins - Vísir
📅 29.04.2026 17:27:10 👁️ 0

Fjórir í varð­haldi vegna Detti­fossmálsins - Vísir

#dettifossmálsins#varðhaldi#fjórir#vegna#vísir
Hernaðaraðgerðir í Íran kostað 25 milljarða Bandaríkjadala
📅 29.04.2026 17:20:15 👁️ 0

Hernaðaraðgerðir í Íran kostað 25 milljarða Bandaríkjadala

#hernaðaraðgerðir#bandaríkjadala#milljarða#kostað#íran
We can't abolish leasehold outright, minister says
📅 29.04.2026 17:10:56 👁️ 0

We can't abolish leasehold outright, minister says

#leasehold#outright#minister#abolish#says
Whale carried by barge out of German waters after weeks stranded on coast
📅 29.04.2026 17:06:23 👁️ 0

Whale carried by barge out of German waters after weeks stranded on coast

#stranded#carried#german#waters#whale
Stríðið í Mið-Austurlöndum gæti steypt rúmlega 30 milljónum í fátækt
📅 29.04.2026 17:02:17 👁️ 0

Stríðið í Mið-Austurlöndum gæti steypt rúmlega 30 milljónum í fátækt

#austurlöndum#milljónum#stríðið#rúmlega#steypt
Golders Green attack: how it unfolded
📅 29.04.2026 16:50:10 👁️ 1

Golders Green attack: how it unfolded

#unfolded#golders#attack#green
Stephen Fry sues CogX tech conference for £100,000 over fall injuries
📅 29.04.2026 16:45:24 👁️ 0

Stephen Fry sues CogX tech conference for £100,000 over fall injuries

#conference#injuries#stephen#sues#cogx
Sláandi skýrsla um banaslys á Patreksfirði: Björgunarbátur virkaði ekki
📅 29.04.2026 16:43:05 👁️ 0

Sláandi skýrsla um banaslys á Patreksfirði: Björgunarbátur virkaði ekki

#björgunarbátur#patreksfirði#banaslys#sláandi#skýrsla
In five charts - How UAE's exit could affect Opec's influence over the oil price
📅 29.04.2026 16:37:08 👁️ 0

In five charts - How UAE's exit could affect Opec's influence over the oil price

#influence#charts#affect#could#price
Naut „stjaksetti“ einn frægasta nautabana Spánar á endaþarminum – Getur ekki sofið eða borðað - DV
📅 29.04.2026 16:30:31 👁️ 0

Naut „stjaksetti“ einn frægasta nautabana Spánar á endaþarminum – Getur ekki sofið eða borðað - DV

#endaþarminum#stjaksetti#nautabana#frægasta#spánar
Rafgeymar gátu ekki fætt dælur þegar Ormurinn langi sökk - Vísir
📅 29.04.2026 16:28:42 👁️ 0

Rafgeymar gátu ekki fætt dælur þegar Ormurinn langi sökk - Vísir

#rafgeymar#ormurinn#dælur#þegar#langi
Málamiðlun tryggi strandveiðar í sumar
📅 29.04.2026 16:24:00 👁️ 0

Málamiðlun tryggi strandveiðar í sumar

#strandveiðar#málamiðlun#tryggi#sumar
Ævintýri Tinna í dómsal - Belgískt félag stefndi Pennanum fyrir brot gegn höfundarrétti - DV
📅 29.04.2026 16:00:53 👁️ 0

Ævintýri Tinna í dómsal - Belgískt félag stefndi Pennanum fyrir brot gegn höfundarrétti - DV

#höfundarrétti#ævintýri#belgískt#pennanum#stefndi
Samstarfið byggir undir sterka og heilbrigða framtíðarkynslóð - DV
📅 29.04.2026 15:59:20 👁️ 2

Samstarfið byggir undir sterka og heilbrigða framtíðarkynslóð - DV

#framtíðarkynslóð#samstarfið#heilbrigða#byggir#sterka
Number of squatters in the Canary Islands fell by 10% in 2025
📅 29.04.2026 15:58:00 👁️ 1

Number of squatters in the Canary Islands fell by 10% in 2025

#squatters#islands#number#canary#fell
Mega hjóla gegn einstefnu - Vísir
📅 29.04.2026 15:51:56 👁️ 1

Mega hjóla gegn einstefnu - Vísir

#einstefnu#hjóla#vísir#mega#gegn
Formaður frisbígolfssambandsins svarar ekki gagnrýni um kynjaflokka í íþróttinni
📅 29.04.2026 15:45:04 👁️ 1

Formaður frisbígolfssambandsins svarar ekki gagnrýni um kynjaflokka í íþróttinni

#frisbígolfssambandsins#kynjaflokka#íþróttinni#formaður#gagnrýni
Ný kynslóð lyfjaskammtara á markað - DV
📅 29.04.2026 15:44:00 👁️ 4

Ný kynslóð lyfjaskammtara á markað - DV

#lyfjaskammtara#kynslóð#markað
Greiða hundruð þúsunda fyrir heim­sókn Tinna til Ís­lands - Vísir
📅 29.04.2026 15:37:52 👁️ 1

Greiða hundruð þúsunda fyrir heim­sókn Tinna til Ís­lands - Vísir

#heimsókn#hundruð#þúsunda#íslands#greiða
Bein út­sending: Odd­vitar í Reykja­vík á kosninga­fundi Við­skiptaráðs - Vísir
📅 29.04.2026 15:32:47 👁️ 1

Bein út­sending: Odd­vitar í Reykja­vík á kosninga­fundi Við­skiptaráðs - Vísir

#kosningafundi#viðskiptaráðs#útsending#reykjavík#oddvitar
Kim Jong Un hrósar „hetjunum“ sem sprengja sig í tætlur frekar en að láta Úkraínumenn ná sér - DV
📅 29.04.2026 15:30:05 👁️ 1

Kim Jong Un hrósar „hetjunum“ sem sprengja sig í tætlur frekar en að láta Úkraínumenn ná sér - DV

#úkraínumenn#hetjunum#sprengja#hrósar#tætlur
Ísland vinnur að innleiðingu reglugerðar sem ESB telur Facebook og Instagram brjóta
📅 29.04.2026 15:27:36 👁️ 1

Ísland vinnur að innleiðingu reglugerðar sem ESB telur Facebook og Instagram brjóta

#innleiðingu#reglugerðar#instagram#facebook#ísland
Man jailed after attacking and robbing elderly woman in Lanzarote
📅 29.04.2026 15:20:00 👁️ 1

Man jailed after attacking and robbing elderly woman in Lanzarote

#attacking#lanzarote#robbing#elderly#jailed
Man offered Ukrainian men money to carry out Starmer arson attacks, court hears
📅 29.04.2026 15:01:41 👁️ 1

Man offered Ukrainian men money to carry out Starmer arson attacks, court hears

#ukrainian#offered#starmer#attacks#money
Repúblikanar vinna orrustu í kjördæmastríðinu - Vísir
📅 29.04.2026 15:01:37 👁️ 1

Repúblikanar vinna orrustu í kjördæmastríðinu - Vísir

#kjördæmastríðinu#repúblikanar#orrustu#vinna#vísir
Seljendur vændis eru oft og tíðum þolendur mansals
📅 29.04.2026 15:00:24 👁️ 2

Seljendur vændis eru oft og tíðum þolendur mansals

#seljendur#þolendur#mansals#vændis#tíðum
Friendly AI chatbots more prone to inaccuracies, study finds
📅 29.04.2026 15:00:06 👁️ 1

Friendly AI chatbots more prone to inaccuracies, study finds

#inaccuracies#friendly#chatbots#prone#study
Sæði yfir 70 gjafa hefur verið tekið úr notkun vegna erfða- og genagalla
📅 29.04.2026 14:54:17 👁️ 1

Sæði yfir 70 gjafa hefur verið tekið úr notkun vegna erfða- og genagalla

#genagalla#notkun#gjafa#hefur#verið
Oil price jumps to $117 after reports of 'extended' Iran blockade
📅 29.04.2026 14:51:46 👁️ 1

Oil price jumps to $117 after reports of 'extended' Iran blockade

#extended#blockade#reports#price#jumps
Police find two bombs in a house in Corralejo
📅 29.04.2026 14:40:00 👁️ 1

Police find two bombs in a house in Corralejo

#corralejo#police#bombs#house#find
Maður ræðst á og stingur gyðinga í London – Myndbönd þegar árásarmaðurinn lætur til skarar skríða og þegar hann er handtekinn
Við­reisn upp fyrir Sjálf­stæðis­flokk og Fram­sókn bætir við sig - Vísir
📅 29.04.2026 14:20:14 👁️ 1

Við­reisn upp fyrir Sjálf­stæðis­flokk og Fram­sókn bætir við sig - Vísir

#sjálfstæðisflokk#viðreisn#framsókn#fyrir#bætir
Eiríkur fékk nafnlaust bréf – „Einstaklega ósmekklegt og óviðkunnanlegt að blanda löngu látnum afa mínum í það mál“ - DV
📅 29.04.2026 14:18:12 👁️ 1

Eiríkur fékk nafnlaust bréf – „Einstaklega ósmekklegt og óviðkunnanlegt að blanda löngu látnum afa mínum í það mál“ - DV

#óviðkunnanlegt#einstaklega#ósmekklegt#nafnlaust#eiríkur
Samfylkingin mælist stærst í Hafnarfirði
📅 29.04.2026 14:15:38 👁️ 1

Samfylkingin mælist stærst í Hafnarfirði

#samfylkingin#hafnarfirði#mælist#stærst
Karli konungi hrósað í hástert fyrir sögulega ræðu
📅 29.04.2026 14:13:46 👁️ 3

Karli konungi hrósað í hástert fyrir sögulega ræðu

#sögulega#konungi#hástert#hrósað#karli
Starmer defends record as Badenoch says he squandered election win
📅 29.04.2026 14:04:43 👁️ 1

Starmer defends record as Badenoch says he squandered election win

#squandered#badenoch#election#starmer#defends
Lögreglan rannsakar söluna á Sóltúni hjúkrunarheimili
📅 29.04.2026 14:02:57 👁️ 1

Lögreglan rannsakar söluna á Sóltúni hjúkrunarheimili

#hjúkrunarheimili#lögreglan#rannsakar#sóltúni#söluna
Nine held after Crewe religious group raids over modern slavery, sexual assault and forced marriage allegations
📅 29.04.2026 14:00:06 👁️ 1

Nine held after Crewe religious group raids over modern slavery, sexual assault and forced marriage allegations

#allegations#religious#marriage#slavery#assault
Trump og Karl konungur eru frændur samkvæmt ítarlegri ættfræðirannsókn
📅 29.04.2026 13:59:59 👁️ 1

Trump og Karl konungur eru frændur samkvæmt ítarlegri ættfræðirannsókn

#ættfræðirannsókn#ítarlegri#konungur#samkvæmt#frændur
Fresh wave of lawsuits filed against OpenAI by Tumbler Ridge victims
📅 29.04.2026 13:50:37 👁️ 1

Fresh wave of lawsuits filed against OpenAI by Tumbler Ridge victims

#lawsuits#against#tumbler#victims#openai
Leiðir á­fram unga miðflokksmenn - Vísir
📅 29.04.2026 13:50:03 👁️ 1

Leiðir á­fram unga miðflokksmenn - Vísir

#miðflokksmenn#leiðir#áfram#vísir#unga
Fugl flaug á línu og olli rafmagnsleysi
📅 29.04.2026 13:49:21 👁️ 1

Fugl flaug á línu og olli rafmagnsleysi

#rafmagnsleysi#flaug#fugl#línu#olli
Byltingarvörðurinn herðir tökin á stjórnar­taumunum - Vísir
📅 29.04.2026 13:45:09 👁️ 1

Byltingarvörðurinn herðir tökin á stjórnar­taumunum - Vísir

#byltingarvörðurinn#stjórnartaumunum#herðir#tökin#vísir
Harmageddon | Baráttufólk gegn ofbeldi
📅 29.04.2026 13:42:47 👁️ 1

Harmageddon | Baráttufólk gegn ofbeldi

#harmageddon#baráttufólk#ofbeldi#gegn