×
Photo via The IndependentNext

Apple’s New Research Shows AI Might Be Dumber Than We Thought

100% reliable
12 mins
35.49K views
Apple
  • AI’s “intelligence” might just be smoke and mirrors, Apple researchers reveal.
  • Apple’s new research just exposed AI’s Achilles’ heel, and it’s a big one.
  • Even simple math problems are tripping up the smartest AI models—here’s why.
  • Think AI can think? Apple’s study says: Not so fast.

Apple, a global leader in tech innovation, has recently highlighted the limitations of AI in a comprehensive study, shining a spotlight on a critical flaw in AI-driven reasoning capabilities. This revelation comes at a time when AI is being marketed as a transformative force in consumer tech, raising questions about the reliability of these advanced models.

The fact that Apple did this has gotten a lot of attention, but nobody should be surprised at the results.— Gary Marcus, AI critic

Craig Federighi announces Apple Intelligence from Apple Park, California in June 2024.Photo via Apple // Craig Federighi announces Apple Intelligence from Apple Park, California in June 2024.

The Problem: A Simple Math Test

To illustrate, consider this elementary arithmetic challenge:

Oliver picks 44 kiwis on Friday. Then he picks 58 on Saturday. On Sunday, he collects double the amount he did on Friday, but five of those are slightly smaller than average. How many kiwis does Oliver have?

If you arrived at 190 (44 + 58 + 88), you matched the performance of most schoolchildren. However, when posed to over 20 state-of-the-art AI models, the results were startlingly different. Many bots failed to identify that the detail about smaller kiwis was irrelevant, leading to erroneous answers such as 185.

This outcome supports a broader pattern observed in the research: AI systems can be tripped up by seemingly inconsequential details.

Deep Dive into Apple’s Study

Released in October, Apple’s technical paper details how these AI systems struggled with arithmetic questions embedded in complex word problems. The inclusion of distracting yet irrelevant information caused “catastrophic performance drops,” a term the researchers used to describe how the models floundered when challenged by nuanced prompts.

This study not only drew widespread interest due to its rigorous documentation but also because it came from Apple, which is actively integrating AI features into its iPhones and other products. The paper's lead author, Mehrdad Farajtabar, emphasized their core inquiry: “Do these models truly understand mathematical concepts?”

The conclusion? A resounding no.Apple Intelligence is compatible with all iPhone 16 models, as well as Pro model iPhone 15's. Apple Intelligence introduces a suite of new AI-powered tools, mostly launching in the next few months and into 2025.Photo via Apple // Apple Intelligence is compatible with all iPhone 16 models, as well as Pro model iPhone 15's. Apple Intelligence introduces a suite of new AI-powered tools, mostly launching in the next few months and into 2025.

A Familiar Pattern

AI experts are not new to these findings. Similar conclusions have emerged from other research efforts, pointing out that large language models (LLMs), like OpenAI’s GPT, are adept at pattern recognition but lack genuine reasoning abilities. As Melanie Mitchell from the Santa Fe Institute noted, “A large gap in basic abstract reasoning still remains between humans and state-of-the-art AI systems.”

Apple’s research further underscores that these models, while impressive in mimicking human language, are fundamentally constrained. They function by matching learned patterns from extensive training datasets, rather than reasoning through problems. Farajtabar elaborated on this flaw, stating, “They memorized what is out there on the web and do pattern matching... [it’s] not real reasoning.”Photo via Ars Technica // "Simply changing specific names and numbers found in GSM8K tests led to significant decreases in performance in many models." Quote from Ars Technica.

Red Herrings and AI Hallucinations

The study’s results are a stark reminder of AI’s limitations, especially when models produce plausible yet incorrect responses with unwarranted confidence—a phenomenon known as “hallucination.” These issues are more than just curiosities; they have real-world implications. In fields like healthcare or legal documentation, even a minuscule error rate can result in significant consequences.

For instance, a recent analysis of Whisper, an AI-powered transcription tool developed by OpenAI, found that about 1.4% of its transcriptions included fabricated content. These hallucinations could misrepresent statements in sensitive environments, such as court proceedings or monitored phone calls, potentially influencing outcomes based on errors.

Implications for AI’s Future

The broader takeaway is clear: while AI excels in pattern recognition, its failure to engage in true abstract reasoning means it will remain reliant on human oversight, particularly in mission-critical applications. Gary Marcus, an outspoken AI critic, highlighted this limitation: “The ways in which they approach reasoning are an approximation and not the real thing... until we have some new technology.”

Apple’s research, though shedding light on the limits of current AI, suggests Apple is aware of these challenges as it navigates the evolving landscape of AI integration with iPhones, iPads, Macs and more. This transparency could signal a more measured approach in marketing their AI products, contrasting with the often unqualified assurances offered by competitors.Apple Intelligence was marketed as the standout feature of Apple's iPhone 16 line, but many customers are still waiting to try Apple's most advanced AI features.Photo via LifeHacker // Apple Intelligence was marketed as the standout feature of Apple's iPhone 16 line, but many customers are still waiting to try Apple's most advanced AI features.

A Call for Balanced Expectations

The findings should temper the hype surrounding AI capabilities. While AI holds immense potential as a tool to augment human tasks—whether in software development, automated processes, or content generation—it remains essential to apply skepticism when AI steps into roles demanding rigorous logic or high-stakes decisions. Mitchell’s perspective sums it up: even “very young children” can often outperform state-of-the-art AI when it comes to abstract reasoning.

In conclusion, the allure of AI as an all-powerful entity capable of replicating human thought is far from reality. For those invested in the future of AI, Apple’s study is a critical reminder of the technology's current boundaries and a call for innovation that goes beyond mere pattern recognition.

Recommended by the editors:

Thank you for visiting Apple Scoop! As a dedicated independent news organization, we strive to deliver the latest updates and in-depth journalism on everything Apple. Have insights or thoughts to share? Drop a comment below—our team actively engages with and responds to our community. Return to the home page.

Published to Apple Scoop on 2nd November, 2024.
Luke Everett

Luke Everett

Lead Technology Journalist

Luke Everett is Apple Scoop’s Lead Technology Journalist with 7 years of experience reporting on Apple hardware, software, and breaking news. Known for his investigative insights and in-depth analysis, Luke covers everything from major Apple keynotes to the latest rumors in the industry, helping readers stay ahead of the curve.

Luke's journalism More about Apple
Stories related to Apple

Apple’s Lighter Vision Headset Might Be Called “Air”

59% reliable 14 mins

Thinner M6 MacBook Pro Coming in 2026 With OLED, Report Claims

61% reliable 9 mins

Inside Apple’s Supply Chain: Who is Foxconn?

100% reliable 12 mins

New CarPlay Features in iOS 18.4: More Icons, Smarter Routes

100% reliable 7 mins

iPhone 17 Series: Rumored Launch Date, Pricing, Design, Display, Camera, More

73% reliable 16 mins

iPhone 16e vs. iPhone 16: Comparing Price, Design, Battery, Performance & More

100% reliable 12 mins

Apple Inc: The 11 Biggest Scandals of All Time

100% reliable 15 mins

Apple’s 2nm A20 Chip: Latest Rumors, Tech Specs, and More

57% reliable 10 mins

iPhone 17 Pro Leak: 12GB RAM, Triple 48MP Cameras, and 3nm A19 Pro Chip

62% reliable 11 mins

Apple’s iPhone 16e Is Selling Faster Than the Cheaper iPhone SE

100% reliable 9 mins

Trump Tariffs: Could Your Next iPhone Suddenly Cost More?

100% reliable 15 mins

Explained: What Is Skeuomorphism?

100% reliable 12 mins

The End of Skeuomorphism: How iOS 7 Changed UI Design

100% reliable 10 mins

Apple Drops Plans for Larger iPhone 17 Air Due to 'Bendgate' Concerns

72% reliable 8 mins

Apple’s Foldable iPhone May Cost Twice as Much as the iPhone 16 Pro Max

60% reliable 11 mins

The Apple Intelligence Scandal: Why Siri’s Future is in Jeopardy

64% reliable 12 mins

Apple Considered Removing the USB-C Port from 'iPhone 17 Air'

73% reliable 11 mins

Apple WWDC 2025 Rumors: Expect a Radical Software Overhaul for iPhone, iPad & Mac

66% reliable 9 mins

Apple R1 Chip Explained: What Is It, Why Does It Matter, and What's Next?

100% reliable 9 mins

'iPhone 17 Air' Rumored to Include These 8 New Features

70% reliable 13 mins
Recommended

Apple’s Lighter Vision Headset Might Be Called “Air”

59% reliable 14 mins

Thinner M6 MacBook Pro Coming in 2026 With OLED, Report Claims

61% reliable 9 mins

Inside Apple’s Supply Chain: Who is Foxconn?

100% reliable 12 mins

New CarPlay Features in iOS 18.4: More Icons, Smarter Routes

100% reliable 7 mins

iPhone 16e vs. iPhone 16: Comparing Price, Design, Battery, Performance & More

100% reliable 12 mins

Apple Inc: The 11 Biggest Scandals of All Time

100% reliable 15 mins

Apple’s 2nm A20 Chip: Latest Rumors, Tech Specs, and More

57% reliable 10 mins

iPhone 17 Pro Leak: 12GB RAM, Triple 48MP Cameras, and 3nm A19 Pro Chip

62% reliable 11 mins

Apple’s iPhone 16e Is Selling Faster Than the Cheaper iPhone SE

100% reliable 9 mins

Trump Tariffs: Could Your Next iPhone Suddenly Cost More?

100% reliable 15 mins

Explained: What Is Skeuomorphism?

100% reliable 12 mins

The End of Skeuomorphism: How iOS 7 Changed UI Design

100% reliable 10 mins
Apple

Apple

Microsoft

Microsoft

Google

Google

Samsung

Samsung

Meta

Meta

Wallpaper categories


iOS 18

iPhone 15 Pro

MacBook Pro

Black

Dark

Anime

Christmas

Car

Blue

Gradient

Pink

Apple

iPhone

BMW

Red

4K

Abstract

Purple

iPhone 16

Cars

Green

Apple Logo

Art

Orange

iOS

iPhone 15

Dynamic Island

Nature

Space

Gold