Buy Crypto Markets Spot FuturesUNH Earn Event Center

ChatGPT may dominate the AI chatbot market, but a new report suggests popularity does not equal trustworthiness. A…ChatGPT may dominate the AI chatbot market, but a new report suggests popularity does not equal trustworthiness. A…

ChatGPT named least reliable work chatbot in new AI reliability report

Author: Technext

Source: Technext

2025/12/11 02:38

3 min read

SLEEPLESSAI$0.02016--%

Trade

NOT$0.0003777-0.34%

Trade

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

ChatGPT may dominate the AI chatbot market, but a new report suggests popularity does not equal trustworthiness. A December 2025 study examining how leading AI chatbots perform in everyday work scenarios has ranked ChatGPT as the least reliable option for professional tasks. The findings raise fresh concerns for businesses that increasingly depend on AI tools for daily operations.

The study, conducted by Relum, didn’t just look at specs on paper; they stress-tested ten major AI chatbots in real-world professional scenarios. The results? A massive disconnect between hype and reality.

The study assessed each chatbot across four key criteria. These were hallucination rate, customer product ratings, response consistency across tasks, and downtime frequency. Each factor contributed to a composite reliability risk score, with higher scores indicating greater potential workplace issues.

Here is the stat that should keep business leaders up at night: Despite controlling a massive 81% of the market and boasting high user ratings, ChatGPT recorded a hallucination rate of 35%.

In plain English, that means more than one out of every three answers it gives contains fabricated or incorrect information. If you are using it to draft a fantasy novel, that’s fine, but if you are using it for compliance reports or financial decision-making, that is a recipe for disaster. Consequently, the study slapped ChatGPT with a reliability risk score of 99 out of 99, the worst in the group.

ChatGPT

Google didn’t fare any better. While Gemini had better uptime, it actually performed worse on pure accuracy, registering the highest hallucination rate of the entire group at 38%. It highlights a weird paradox in the current AI market: the tools we use the most are often the ones struggling the hardest to keep their facts straight.

Claude and Meta AI occupy a murky middle ground. Claude, despite being a favourite for its writing style, ranked as the second least reliable due to frequent downtime and a 17% hallucination rate. Meta AI was more accurate (15% hallucination), but users seem not to like the experience, giving it the lowest satisfaction rating of the bunch (3.4 out of 5).

The “underdogs” – Grok and DeepSeek steal the show from ChatGPT

If the big names are dropping the ball, who is actually doing the work? Surprisingly, the study points to Grok and DeepSeek as the most reliable tools for professional use. They don’t have the massive marketing budgets or brand recognition of OpenAI, but they simply worked better. DeepSeek recorded zero service outages and kept hallucinations to a minimum.

Kimi also scored well, finding a sweet spot between consistency and uptime. Meanwhile, paid options like Perplexity AI were solid but raised questions about whether the subscription cost is worth it when cheaper, lesser-known alternatives are outperforming them.

Relum’s Chief Product Officer, Razvan-Lucian Haiduc, warned that reliability should be a central factor in AI adoption decisions. He noted that around 65% of US companies now use AI chatbots in daily workflows. Nearly 45% of employees admit to sharing sensitive company information with these tools.

As AI becomes more embedded in routine work, the risks of misinformation multiply. Haiduc emphasised that the most widely used chatbot is not always the best fit for every industry. Accuracy, uptime and task-specific performance should outweigh brand familiarity.

The report serves as a reality check for the industry. Trust shouldn’t be given just because a chatbot is famous; it should be earned through consistent, verifiable truth. Right now, it looks like the market leaders have some serious catching up to do.

Market Opportunity

Sleepless AI Price(SLEEPLESSAI)

$0.02017

$0.02017$0.02017

-1.80%

USD

Sleepless AI (SLEEPLESSAI) Live Price Chart

World Cup Combo: Aim for 200x

Combine up to 20 World Cup matches in one order

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

Tags:

#THAT #KEY #Major #JUST #AI

24/7 Live News

CRO receives four hundred million dollar strategic investment from Citadel Securities

Author: 方程式新闻 BWEnews 🏎️02:01

Morgan Stanley enables spot Bitcoin trading for all ETrade clients expanding institutional access

Author: BlockNews01:21

BlackRock spot Bitcoin ETF holds forty seven billion dollars showing continued institutional exposure

Author: BlockNews2026/07/16 23:24

Polygon Labs layoffs mark transition from blockchain foundation to payments company operations

Author: DEGEN NEWS2026/07/16 21:40

BTC fell below Rainbow Chart, a historically rare technical event occurring only twice previously

Author: Vivek Sen2026/07/16 20:36

Crypto Prices

Bitcoin

BTC

$64,184.48

$64,184.48$64,184.48

-0.78%

Ethereum

ETH

$1,874.16

$1,874.16$1,874.16

-0.41%

USDCoin

USDC

$1.00061

$1.00061$1.00061

0.00%

Solana

SOL

$75.87

$75.87$75.87

-0.99%

XRP

$1.0979

$1.0979$1.0979

-1.49%

Activate to Enjoy Special Perks

Access 0 fees, premium support, and loss coverage.

ChatGPT named least reliable work chatbot in new AI reliability report

The “underdogs” – Grok and DeepSeek steal the show from ChatGPT

You May Also Like

Not a loophole: Singapore AI export controls let China tap US AI legally

CoreWeave (CRWV) Stock Surges 12% on $8.5B GPU-Backed Financing Deal — Here’s the Full Picture

Bitcoin, Gold, and U.S. Stocks Dive as Trump Pledges to Hit Iran ‘Extremely Hard’

Trending News

NordFX Morning Update — July 10, 2026

Germany Trade Balance Surges to €19.1 Billion in May, Handily Beating Forecasts

Arbitrum Announces Ten Innovative Teams — And Why It’s Not Just Hype

Cathie Wood’s ARK Invest Buys $13.7M in Circle Shares While Selling Robinhood Stock

The changing face of elder care in Malaysia — Sayed Mohammad Reza Yamani Sayed Umar

24/7 Live News

Quick Reads

World Cup Coin Tokenomics Explained: WORLDCUP Supply, Creator Fees and Token Burns

What Happens to World Cup Coin After the 2026 World Cup? Post-Tournament Risks Explained

World Cup Coin Price History: WORLDCUP Volatility, Liquidity and Delisting Risks

France vs England Head-to-Head Record: World Cup History and Previous Meetings

France vs England Tactical Preview: Midfield Battle, Transitions and Key Matchups

Crypto Prices