Apple’s New Research Shows AI Might Be Dumber Than We Thought

  • AI’s “intelligence” might just be smoke and mirrors, Apple researchers reveal.
  • Apple’s new research just exposed AI’s Achilles’ heel, and it’s a big one.
  • Even simple math problems are tripping up the smartest AI models—here’s why.
  • Think AI can think? Apple’s study says: Not so fast.

Apple, a global leader in tech innovation, has recently highlighted the limitations of AI in a comprehensive study, shining a spotlight on a critical flaw in AI-driven reasoning capabilities. This revelation comes at a time when AI is being marketed as a transformative force in consumer tech, raising questions about the reliability of these advanced models.

“The fact that Apple did this has gotten a lot of attention, but nobody should be surprised at the results.” - Gary Marcus, AI critic

Photo via Apple // Craig Federighi announces Apple Intelligence from Apple Park, California in June 2024.

The Problem: A Simple Math Test

To illustrate, consider this elementary arithmetic challenge:

Oliver picks 44 kiwis on Friday. Then he picks 58 on Saturday. On Sunday, he collects double the amount he did on Friday, but five of those are slightly smaller than average. How many kiwis does Oliver have?

If you arrived at 190 (44 + 58 + 88), you matched the performance of most schoolchildren. However, when posed to over 20 state-of-the-art AI models, the results were startlingly different. Many bots failed to identify that the detail about smaller kiwis was irrelevant, leading to erroneous answers such as 185.
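The arithmetic behind the two answers can be sketched in a few lines of Python. This is only an illustration: the variable names are assumptions, and the "distracted" calculation is one plausible way to arrive at 185, not a trace of what any particular model did.

```python
# Illustrative sketch; names are hypothetical, numbers come from the word problem.
friday = 44
saturday = 58
sunday = 2 * friday          # "double the amount he did on Friday"
smaller_than_average = 5     # irrelevant detail: smaller kiwis still count

correct_total = friday + saturday + sunday

# One plausible route to the wrong answer: subtracting the smaller kiwis.
distracted_total = correct_total - smaller_than_average

print(correct_total)     # 190
print(distracted_total)  # 185
```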

This outcome supports a broader pattern observed in the research: AI systems can be tripped up by seemingly inconsequential details.

Deep Dive into Apple’s Study

Released in October, Apple’s technical paper details how these AI systems struggled with arithmetic questions embedded in complex word problems. The inclusion of distracting yet irrelevant information caused “catastrophic performance drops,” a term the researchers used to describe how the models floundered when challenged by nuanced prompts.
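A rough sketch of what "distracting yet irrelevant information" means in practice, using the kiwi problem as a template. The helper `make_problem` and its parameters are hypothetical and written for this article; they are not taken from Apple's paper or its test suite.

```python
# Hypothetical sketch of adding a distracting clause to a word problem.
# The template and helper names are illustrative assumptions, not Apple's code.

def make_problem(name, friday, saturday, distractor=""):
    """Return (question text, ground-truth answer) for one kiwi-style problem."""
    sunday = 2 * friday
    text = (
        f"{name} picks {friday} kiwis on Friday. "
        f"Then he picks {saturday} on Saturday. "
        f"On Sunday, he collects double the amount he did on Friday"
        f"{distractor}. How many kiwis does {name} have?"
    )
    return text, friday + saturday + sunday


plain_text, plain_answer = make_problem("Oliver", 44, 58)
noisy_text, noisy_answer = make_problem(
    "Oliver", 44, 58,
    distractor=", but five of those are slightly smaller than average",
)

# The irrelevant clause changes the wording, never the correct answer.
print(plain_answer, noisy_answer)
```

Both versions share the same ground-truth answer, so any drop in accuracy on the second version would come from the added clause alone.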

The study drew widespread interest not only for its rigorous documentation but also because it came from Apple, which is actively integrating AI features into its iPhones and other products. Mehrdad Farajtabar, one of the paper's authors, summarized their core inquiry: “Do these models truly understand mathematical concepts?”

The conclusion? A resounding no.

Photo via Apple // Apple Intelligence is compatible with all iPhone 16 models, as well as iPhone 15 Pro models. Apple Intelligence introduces a suite of new AI-powered tools, mostly launching in the next few months and into 2025.

A Familiar Pattern

These findings are not new to AI experts. Other research efforts have reached similar conclusions: large language models (LLMs), like OpenAI’s GPT, are adept at pattern recognition but lack genuine reasoning abilities. As Melanie Mitchell from the Santa Fe Institute noted, “A large gap in basic abstract reasoning still remains between humans and state-of-the-art AI systems.”

Apple’s research further underscores that these models, while impressive in mimicking human language, are fundamentally constrained. They function by matching learned patterns from extensive training datasets, rather than reasoning through problems. Farajtabar elaborated on this flaw, stating, “They memorized what is out there on the web and do pattern matching... [it’s] not real reasoning.”

Photo via Ars Technica // "Simply changing specific names and numbers found in GSM8K tests led to significant decreases in performance in many models." Quote from Ars Technica.

Red Herrings and AI Hallucinations

The study’s results are a stark reminder of AI’s limitations, especially when models produce plausible yet incorrect responses with unwarranted confidence—a phenomenon known as “hallucination.” These issues are more than just curiosities; they have real-world implications. In fields like healthcare or legal documentation, even a minuscule error rate can result in significant consequences.

For instance, a recent analysis of Whisper, an AI-powered transcription tool developed by OpenAI, found that about 1.4% of its transcriptions included fabricated content. These hallucinations could misrepresent statements in sensitive environments, such as court proceedings or monitored phone calls, potentially influencing outcomes based on errors.
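To put that rate in perspective, a small calculation shows how a 1.4% rate scales with volume. The workload of 10,000 transcriptions is a made-up figure for illustration, not a number from the analysis.

```python
# Illustrative arithmetic; the volume below is a hypothetical assumption.
hallucination_rate = 0.014      # about 1.4%, the rate cited above
transcriptions = 10_000         # hypothetical workload

# Expected number of transcriptions containing fabricated content.
expected_fabricated = round(hallucination_rate * transcriptions)
print(expected_fabricated)      # 140
```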

Implications for AI’s Future

The broader takeaway is clear: while AI excels in pattern recognition, its failure to engage in true abstract reasoning means it will remain reliant on human oversight, particularly in mission-critical applications. Gary Marcus, an outspoken AI critic, highlighted this limitation: “The ways in which they approach reasoning are an approximation and not the real thing... until we have some new technology.”

Apple’s research sheds light on the limits of current AI and suggests the company is aware of these challenges as it integrates AI into iPhones, iPads, Macs and more. This transparency could signal a more measured approach to marketing its AI products, in contrast with the often unqualified assurances offered by competitors.

Photo via LifeHacker // Apple Intelligence was marketed as the standout feature of Apple's iPhone 16 line, but many customers are still waiting to try Apple's most advanced AI features.

A Call for Balanced Expectations

The findings should temper the hype surrounding AI capabilities. While AI holds immense potential as a tool to augment human tasks—whether in software development, automated processes, or content generation—it remains essential to apply skepticism when AI steps into roles demanding rigorous logic or high-stakes decisions. Mitchell’s perspective sums it up: even “very young children” can often outperform state-of-the-art AI when it comes to abstract reasoning.

In conclusion, the allure of AI as an all-powerful entity capable of replicating human thought is far from reality. For those invested in the future of AI, Apple’s study is a critical reminder of the technology's current boundaries and a call for innovation that goes beyond mere pattern recognition.

Published to Apple Scoop on 2nd November, 2024.
Luke Everett

Lead Technology Journalist

Luke Everett is Apple Scoop’s Lead Technology Journalist with 7 years of experience reporting on Apple hardware, software, and breaking news. Known for his investigative insights and in-depth analysis, Luke covers everything from major Apple keynotes to the latest rumors in the industry, helping readers stay ahead of the curve.
