AIRLINK 191.84 Decreased By ▼ -1.66 (-0.86%)
BOP 9.87 Increased By ▲ 0.23 (2.39%)
CNERGY 7.67 Increased By ▲ 0.14 (1.86%)
FCCL 37.86 Increased By ▲ 0.16 (0.42%)
FFL 15.76 Increased By ▲ 0.16 (1.03%)
FLYNG 25.31 Decreased By ▼ -0.28 (-1.09%)
HUBC 130.17 Increased By ▲ 3.10 (2.44%)
HUMNL 13.59 Increased By ▲ 0.09 (0.67%)
KEL 4.67 Increased By ▲ 0.09 (1.97%)
KOSM 6.21 Increased By ▲ 0.11 (1.8%)
MLCF 44.29 Increased By ▲ 0.33 (0.75%)
OGDC 206.87 Increased By ▲ 3.63 (1.79%)
PACE 6.56 Increased By ▲ 0.16 (2.5%)
PAEL 40.55 Decreased By ▼ -0.43 (-1.05%)
PIAHCLA 17.59 Increased By ▲ 0.10 (0.57%)
PIBTL 8.07 Increased By ▲ 0.41 (5.35%)
POWER 9.24 Increased By ▲ 0.16 (1.76%)
PPL 178.56 Increased By ▲ 4.31 (2.47%)
PRL 39.08 Increased By ▲ 1.01 (2.65%)
PTC 24.14 Increased By ▲ 0.07 (0.29%)
SEARL 107.85 Increased By ▲ 0.61 (0.57%)
SILK 0.97 No Change ▼ 0.00 (0%)
SSGC 39.11 Increased By ▲ 2.71 (7.45%)
SYM 19.12 Increased By ▲ 0.08 (0.42%)
TELE 8.60 Increased By ▲ 0.36 (4.37%)
TPLP 12.37 Increased By ▲ 0.59 (5.01%)
TRG 66.01 Increased By ▲ 1.13 (1.74%)
WAVESAPP 12.78 Increased By ▲ 1.15 (9.89%)
WTL 1.70 Increased By ▲ 0.02 (1.19%)
YOUW 3.95 Increased By ▲ 0.10 (2.6%)
BR100 11,930 Increased By 162.4 (1.38%)
BR30 35,660 Increased By 695.9 (1.99%)
KSE100 113,206 Increased By 1719 (1.54%)
KSE30 35,565 Increased By 630.8 (1.81%)

Chinese AI startup DeepSeek’s chatbot achieved only 17% accuracy in delivering news and information in a NewsGuard audit that ranked it tenth out of eleven in a comparison with its Western competitors including OpenAI’s ChatGPT and Google Gemini.

The chatbot repeated false claims 30% of the time and gave vague or not useful answers 53% of the time in response to news-related prompts, resulting in an 83% fail rate, according to a report published by trustworthiness rating service NewsGuard on Wednesday.

That was worse than an average fail rate of 62% for its Western rivals and raises doubts about AI technology that DeepSeek has claimed performs on par or better than Microsoft-backed OpenAI at a fraction of the cost.

Within days of its roll-out, DeepSeek’s chatbot became the most downloaded app in Apple’s App Store, stirring concerns about United States’ lead in AI and sparking a market rout that wiped around $1 trillion off U.S. technology stocks.

Alibaba releases AI model it claims surpasses DeepSeek-V3

The Chinese startup did not immediately respond to a request for comment.

NewsGuard said it applied the same 300 prompts to DeepSeek that it had used to evaluate its Western counterparts, which included 30 prompts based on 10 false claims spreading online.

Topics for the claims included last month’s killing of UnitedHealthcare executive Brian Thompson and the downing of Azerbaijan Airlines flight 8243.

NewsGuard’s audit also showed that in three of the ten prompts, DeepSeek reiterated the Chinese government’s position on the topic without being asked anything relating to China.

On prompts related to the Azerbaijan Airlines crash — questions unrelated to China — DeepSeek responded with Beijing’s position on the topic, NewsGuard said.

“The importance of the DeepSeek breakthrough is not in answering Chinese news-related question accurately, it is in the fact that it can answer any question at 1/30th of the cost of comparable AI models,” D.A. Davidson analyst Gil Luria said.

Like other AI models, DeepSeek was most vulnerable to repeating false claims when responding to prompts used by people seeking to use AI models to create and spread false claims, NewsGuard added.

Comments

200 characters