By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
IndebtaIndebta
  • Home
  • News
  • Banking
  • Credit Cards
  • Loans
  • Mortgage
  • Investing
  • Markets
    • Stocks
    • Commodities
    • Crypto
    • Forex
  • Videos
  • More
    • Finance
    • Dept Management
    • Small Business
9
Notification Show More
News
How much money does Ukraine need?
25 minutes ago
News
Donald Trump and Volodymyr Zelenskyy to discuss pause to US weapons deliveries in call
1 hour ago
News
House of Representatives approves ‘big beautiful bill’ in victory for Donald Trump
2 hours ago
News
Signs of a pick-up in venture capital exits are finally emerging
3 hours ago
News
Germany seeks deal with Taliban to deport Afghan migrants
4 hours ago
News
French air traffic control strikes cause flight delays across Europe
6 hours ago
News
Full EU-US trade deal ‘impossible’ before deadline, says Ursula von der Leyen
7 hours ago
News
Chris Hohn’s hedge fund TCI beats stock markets with 21% gain
8 hours ago
News
US economy surpasses expectations to add 147,000 jobs
9 hours ago
Aa
IndebtaIndebta
Aa
  • Banking
  • Credit Cards
  • Loans
  • Dept Management
  • Mortgage
  • Markets
  • Investing
  • Small Business
  • Videos
  • Home
  • News
  • Banking
  • Credit Cards
  • Loans
  • Mortgage
  • Investing
  • Markets
    • Stocks
    • Commodities
    • Crypto
    • Forex
  • Videos
  • More
    • Finance
    • Dept Management
    • Small Business
Follow US
Indebta > News > We are the new gremlins in the AI machine
News

We are the new gremlins in the AI machine

News Room
Last updated: 2025/06/26 at 9:23 AM
By News Room
Share
6 Min Read
SHARE

Stay informed with free updates

Simply sign up to the Artificial intelligence myFT Digest — delivered directly to your inbox.

One of my relatives heard some strange stories when working on a healthcare helpline during the Covid pandemic. Her job was to help callers complete the rapid lateral flow tests used millions of times during lockdown. But some callers were clearly confused by the procedure. “So, I’ve drunk the fluid in the tube. What do I do now?” asked one.

That user confusion may be an extreme example of a common technological problem: how ordinary people use a product or service in the real world may diverge wildly from the designers’ intentions in the lab. 

Sometimes that misuse can be deliberate, for better or worse. For example, the campaigning organisation Reporters Without Borders has tried to protect free speech in several authoritarian countries by hiding banned content on the Minecraft video game server. Criminals, meanwhile, have been using home 3D printers to manufacture untraceable guns. More often, though, misuse is unintentional, as with the Covid tests. Call it the inadvertent misuse problem, or “imp” for short. The new gremlins in the machines might well be the imps in the chatbots.

Take the general purpose chatbots, such as ChatGPT, that are being used by 17 per cent of Americans at least once a month to self-diagnose health concerns. These chatbots have amazing technological capabilities that would have seemed like magic a few years ago. In terms of clinical knowledge, triage, text summarisation and responses to patient questions, the best models can now match human doctors, according to various tests. Two years ago, for example, a mother in Britain successfully used ChatGPT to identify tethered cord syndrome (related to spina bifida) in her son that had been missed by 17 doctors.

That raises the prospect that these chatbots could one day become the new “front door” to healthcare delivery, improving access at lower cost. This week, Wes Streeting, the UK’s health minister, promised to upgrade the NHS app using artificial intelligence to provide a “doctor in your pocket to guide you through your care”. But the ways in which they can best be used are not the same as how they are most commonly used. A recent study led by the Oxford Internet Institute has highlighted some troubling flaws, with users struggling to use them effectively. 

The researchers enrolled 1,298 participants in a randomised, controlled trial to test how well they could use chatbots to respond to 10 medical scenarios, including acute headaches, broken bones and pneumonia. The participants were asked to identify the health condition and find a recommended course of action. Three chatbots were used: OpenAI’s GPT-4o, Meta’s Llama 3 and Cohere’s Command R+, which all have slightly different characteristics. 

When the test scenarios were entered directly into the AI models, the chatbots correctly identified the conditions in 94.9 per cent of cases. However, the participants did far worse: they provided incomplete information and the chatbots often misinterpreted their prompts, resulting in the success rate dropping to just 34.5 per cent. The technological capabilities of these models did not change but the human inputs did, leading to very different outputs. What’s worse, the test participants were also outperformed by a control group, who had no access to chatbots but consulted regular search engines instead.

The results of such studies do not mean we should stop using chatbots for health advice. But it does suggest that designers should pay far more attention to how ordinary people might use their services. “Engineers tend to think that people use the technology wrongly. Any user malfunction is therefore the user’s fault. But thinking about a user’s technological skills is fundamental to design,” one AI company founder tells me. That is particularly true with users seeking medical advice, many of whom may be desperate, sick or elderly people showing signs of mental deterioration.

More specialist healthcare chatbots may help. However, a recent Stanford University study found that some widely used therapy chatbots, helping address mental health challenges, can also “introduce biases and failures that could result in dangerous consequences”. Researchers suggest that more guardrails should be included to refine user prompts, proactively request information to guide the interaction and communicate more clearly.

Tech companies and healthcare providers should also do far more user testing in real-world conditions to ensure their models are used appropriately. Developing powerful technologies is one thing; learning how to deploy them effectively is quite another. Beware the imps.

john.thornhill@ft.com

 

Read the full article here

News Room June 26, 2025 June 26, 2025
Share this Article
Facebook Twitter Copy Link Print
Leave a comment Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Finance Weekly Newsletter

Join now for the latest news, tips, and analysis about personal finance, credit cards, dept management, and many more from our experts.
Join Now
How much money does Ukraine need?

How much money does Ukraine need? I ask the question because the…

Donald Trump and Volodymyr Zelenskyy to discuss pause to US weapons deliveries in call

Donald Trump and Volodymyr Zelenskyy will discuss the abrupt halt in some…

House of Representatives approves ‘big beautiful bill’ in victory for Donald Trump

Unlock the White House Watch newsletter for freeYour guide to what Trump’s…

Signs of a pick-up in venture capital exits are finally emerging

Unlock the Editor’s Digest for freeRoula Khalaf, Editor of the FT, selects…

Germany seeks deal with Taliban to deport Afghan migrants

Stay informed with free updatesSimply sign up to the German politics myFT…

- Advertisement -
Ad imageAd image

You Might Also Like

News

How much money does Ukraine need?

By News Room
News

Donald Trump and Volodymyr Zelenskyy to discuss pause to US weapons deliveries in call

By News Room
News

House of Representatives approves ‘big beautiful bill’ in victory for Donald Trump

By News Room
News

Signs of a pick-up in venture capital exits are finally emerging

By News Room
News

Germany seeks deal with Taliban to deport Afghan migrants

By News Room
News

French air traffic control strikes cause flight delays across Europe

By News Room
News

Full EU-US trade deal ‘impossible’ before deadline, says Ursula von der Leyen

By News Room
News

Chris Hohn’s hedge fund TCI beats stock markets with 21% gain

By News Room
Facebook Twitter Pinterest Youtube Instagram
Company
  • Privacy Policy
  • Terms & Conditions
  • Press Release
  • Contact
  • Advertisement
More Info
  • Newsletter
  • Market Data
  • Credit Cards
  • Videos

Sign Up For Free

Subscribe to our newsletter and don't miss out on our programs, webinars and trainings.

I have read and agree to the terms & conditions
Join Community

2023 © Indepta.com. All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?