This event is archived. Final snapshot from when the story concluded. View on Dashboard
Tech AI research findings

ChatGPT's Reasoning Limitations Revealed in WSU Study

Analysis based on 7 articles · First reported Mar 16, 2026 · Last updated Mar 18, 2026

Sentiment
-20
Attention
4
Articles
7
Market Impact
Direct
Live prominence charts, article sentiment distribution, and event development timeline available on the NewsDesk Dashboard

The study's findings on ChatGPT's limitations in reasoning and consistency could temper investor enthusiasm for AI companies, particularly those focused on large language models. It suggests that while AI is powerful, its current capabilities may not meet the high expectations for advanced conceptual understanding, potentially leading to more cautious investment in the short term.

Artificial intelligence Technology Business

A study led by Mesut Cicek of Washington State University evaluated ChatGPT's ability to determine the truthfulness of scientific hypotheses. The AI, tested on over 700 hypotheses, achieved 76.5% accuracy in 2024 and 80% in 2025. However, after adjusting for random guessing, its performance was only about 60% better than chance. ChatGPT struggled significantly with identifying false statements (16.4% accuracy) and showed notable inconsistency, providing different answers to identical prompts 27% of the time. The researchers concluded that current AI tools lack true conceptual understanding and merely memorize, suggesting that artificial general intelligence is still distant. The study, published in the Rutgers Business Review, recommends caution for business leaders relying on AI for complex decisions.

95 ChatGPT demonstrated inconsistent and inaccurate reasoning
90 Mesut Cicek led research study on AI performance ChatGPT
priv
ChatGPT's performance in a study by Washington State University researchers showed significant limitations in accuracy and consistency when evaluating scientific hypotheses, particularly in identifying false statements. This suggests that while it can generate fluent language, its conceptual understanding and reasoning capabilities are still developing.
Importance 100 Sentiment -30
per
Mesut Cicek, an associate professor at Washington State University, led the research team that conducted the study on ChatGPT's performance. His findings suggest that artificial general intelligence is further away than many expect due to AI's current limitations in understanding and consistency.
Importance 80 Sentiment 10
Mesut Cicek related ChatGPT
NEWSDESK
Track this event live

Set up alerts, explore entity relationships, search across thousands of events, and build custom intelligence feeds.

Open Dashboard

About NewsDesk

NewsDesk is a news intelligence platform that converts raw news articles into structured data. It tracks events, entities, and the relationships between them, with sentiment and attention metrics derived from thousands of articles. Pages on this site are daily static snapshots from the platform's live database. For real-time tracking, search, and alerts, the full dashboard is at app.newsdesk.dev.