Practice Arithmetic Reasoning Problems

Mathematicians devised novel problems to challenge advanced AIs' reasoning skills — and they failed almost every test

Current AI models struggle to solve research-level math problems, with the most advanced AI systems we have today solving just 2% of the hundreds of challenges faced. Mathematicians have stumped the ...

TechCrunch

Researchers question AI’s ‘reasoning’ ability as models stumble on math problems with trivial changes

How do machine learning models do what they do? And are they really “thinking” or “reasoning” the way we understand those things? This is a philosophical question as much as a practical one, but a new ...

Business Insider

This DeepSeek demo shows how good the Chinese AI model is at math and reasoning

DeepSeek's AI models rival top Silicon Valley offerings, excelling in some complex tasks. The models use inference-time compute, breaking queries into smaller, manageable tasks. DeepSeek's DeepThink ...

Psychology Today

5 Mathematical Reasoning Tricks for Everyday Problem-Solving

Mathematicians excel at handling complexity and uncertainty. Mathematical reasoning strategies aren't just useful for dilemmas involving numbers. We can apply math mindsets to improve our approach to ...

VentureBeat

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

Artificial intelligence systems may be good at generating text, recognizing images, and even solving basic math problems—but when it comes to advanced mathematical reasoning, they are hitting a wall.

InfoQ

Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Popular Mechanics

Scientists Found AI’s Fatal Flaw—The Most Advanced Models Are Failing Basic Logic Tests

Here’s what you’ll learn when you read this story: Large language models (LLMs) like ChatGPT show reasoning errors across many domains. Identifying vulnerabilities is good for public safety, industry, ...

VentureBeat

When AI reasoning goes wrong: Microsoft Research shows more tokens can mean more problems

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Large language models (LLMs) are ...

Hosted on MSN

7 reasons why logical reasoning is your ultimate math superpower

Ever stared at a math problem feeling completely lost, even when you've memorised all the formulas? Or maybe you've wondered why certain math rules even exist? The true secret weapon that unlocks ...

Ars Technica

Researchers isolate memorization from problem-solving in AI neural networks

When engineers build AI language models like GPT-5 from training data, at least two major processing features emerge: memorization (reciting exact text they’ve seen before, like famous quotes or ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results