Select the search type
  • Site
  • Web
Search

AI News

Rod Claar

Why Your AI Agent Fails 97.5% of Real Work — And the Fix Isn't More Code

Published on AgileAIDev.com | By Rod Claar, CST & Principal Consultant

You built the agent. You wired up the tools. You wrote the prompts. You watched demos that made it look magical.

And then you pointed it at real work — and it fell apart.

Sound familiar? You are not alone. Smart engineers with good models and ambitious roadmaps are running into the same wall every week. But here's the thing: the reason agents fail isn't the model, and it almost certainly isn't the code. It's where you started.

That's the core argument that AI strategist and former Amazon Prime Video product leader Nate B. Jones makes in his compelling YouTube video, "Your AI Agent Fails 97.5% of Real Work. The Fix Isn't Coding." And after 30+ years in software development — and several years of watching teams struggle to get real ROI from AI — I think he's exactly right.


The 97.5% Problem

Here's the insight that reframes everything: when people picture "real work," they picture the judgment calls, the expertise, the decisions that require deep domain knowledge. That's the core — and yes, AI struggles there. But that core only makes up a tiny fraction of what work actually consists of.

The other 97.5%? It's the stuff surrounding the core. The prep work. The cleanup. The formatting. The handoffs. The synthesis. The packaging. The QA passes before the output goes anywhere meaningful.

Nate calls this the edges of a workflow — and they're where AI can win right now, today, without waiting for the models to get smarter.

The trap that burns most teams is what Nate calls core-first automation — launching straight at the most valuable, most complex, most judgment-heavy part of the workflow. That's exactly where AI is most likely to fail, and it's exactly where organizational trust is hardest to earn. Three months in, the project stalls, leadership gets frustrated, and the humans who were supposed to benefit have completely checked out.

Meanwhile, down the hall, another team automated three "boring" tasks and freed up 30% of their week.


What the Edges Actually Look Like

Nate identifies five categories of edge work that surround almost any workflow you can name:

1. Data Preparation — Cleaning, normalizing, formatting, and staging inputs before the real work begins. This is mechanical, time-consuming, and almost always handled by a skilled person who could be doing something better.

2. Quality Assurance — First-pass reviews, checklists, format validation, completeness checks. Not the final human judgment call — the triage that happens before the human ever looks at something.

3. Synthesis — Pulling together information from multiple sources into a structured format that a decision-maker can actually use. Summarizing meeting notes. Compiling status updates. Rolling up data from three different systems.

4. Handoffs — The work of passing something from one person or team to another. Routing, tagging, formatting for the next stage, writing the context note so the next person isn't starting from scratch.

5. Packaging — Taking completed work and getting it ready to go out the door. Reports, communications, exports, notifications, archiving.

None of these feel glamorous. That's exactly why they never get prioritized for automation — even though they consume enormous amounts of time and generate almost no unique human value.


Why Core-First Automation Fails

There's a structural reason why teams keep trying to automate the core first — and keep failing.

The core is where the stakes are highest and where the narrative value is most obvious. "We automated contract review!" or "Our agent handles underwriting decisions!" sounds compelling in a roadmap presentation. "We automated the data formatting step before contract review" sounds like a waste of everyone's time.

But Nate's observation — backed by what practitioners are seeing in the field — is that organizations are not just technical systems. They're trust systems. When you try to put AI into the most sensitive, most visible, most expert-dependent part of a workflow, you immediately trigger every skeptic in the room. One failure, one wrong output, and the whole initiative is done.

Edge-first automation solves this. When you start with the mechanical stuff surrounding the core, a few things happen:

  • The AI is operating on work that is lower-stakes and easier to verify
  • Humans see results fast — days, not months
  • Trust builds incrementally as the system proves itself
  • You earn the right to eventually move toward the core

This is an organizational trust exercise, not a technical project. The code is the easy part. The humans are the hard part.


The Connection to Agile and Scrum Values

As a Scrum practitioner and trainer, I can't help noticing how perfectly this maps to the Agile mindset.

Start small, deliver fast, inspect and adapt. Don't build the entire system before you know if it works. Don't automate the most complex cases before you've proven the approach on simple ones. Don't ask for organizational trust before you've demonstrated value.

The teams that fail at AI automation are making the same mistake that teams made before Scrum rescued them from waterfall: they're trying to deliver everything at once, on an ambitious timeline, targeting the hardest problem first.

The teams winning with AI agents are running sprints against the edges. They ship something that works in two weeks. They build trust. They expand from there.

Empiricism matters here just as much as it does in product development. You can't know in advance which edges are the highest-value targets. You have to go look. Nate's framework gives you a structured way to do that — map the workflow, find the friction, find where time and energy disappear, and that's where you start.


What This Looks Like by Role

Nate's framework isn't just abstract. It applies differently depending on where you sit in an organization.

For Engineering Leads: The edges in your world are code review prep, ticket formatting, documentation drafts, test case generation templates, and release note compilation. These don't require AI to understand your architecture — they just need to handle structured, repetitive text work that your engineers are spending real hours on every sprint.

For Product Managers: Your edges are meeting synthesis, status rollup, backlog grooming prep, stakeholder update formatting, and competitive research aggregation. An AI agent that reads your notes and produces a clean standup summary is not science fiction — it's a weekend project.

For Sales and Customer Success: Your edges are call summary generation, follow-up email drafts, CRM data entry, renewal risk flagging, and documentation of customer-specific requirements. These are exactly the tasks that your best people hate most and that consume the most non-selling hours in the week.

For Operations and Business Analysts: Your edges are report compilation, data normalization, exception flagging, process documentation, and handoff note generation. These are almost universally done manually, at high cost, and at a level of detail that makes human attention feel genuinely wasted.

The pattern is consistent across every role: the core is judgment. Everything around it is mechanical. Start with the mechanical.


The Path Inward

Here's what nobody tells you when they pitch edge-first automation: it's not a consolation prize. It's a strategy.

When you automate edges successfully, you do three things:

First, you free up the humans who were doing that work so they can apply more of their energy to the core. The core gets better because the people doing it are less depleted.

Second, you generate operational data about how the core actually works — because now the inputs and outputs are cleaner, more structured, and easier to observe. That data is exactly what you need to eventually automate parts of the core.

Third, you build the organizational muscle and trust infrastructure to attempt harder automation challenges. Teams that have shipped three edge automations are dramatically better positioned to tackle something closer to the core than teams attempting their first agent project.

Nate is explicit that edge-first is not the end state. It's the path to eventually touching the core — but on a foundation of demonstrated value, accumulated trust, and real operational knowledge.


The Readiness Question

Before you move inward toward the core, Nate's framework suggests an honest self-assessment. Have you genuinely proven the edges? Is leadership seeing real time savings? Are the humans who were doing that work now spending their time on higher-value activity? Do you have a track record of catching and correcting AI errors at the edges before they propagate?

If the answer to any of those is no, you're not ready to move toward the core — not because the technology isn't there, but because the organizational infrastructure isn't.

This is, again, deeply Agile. You don't scale until you've proven the pattern. You don't accelerate until you've stabilized.


What to Do Monday Morning

If you're sitting on an AI agent project that's stalled — or you're about to start one — here's a practical first move:

Map your workflow and find where time goes before and after the judgment calls.

Don't ask "what can AI do in this workflow?" Ask "where does work pile up, slow down, or get lost in translation?" That's the edge. That's where you start.

Pick one. Build something shippable in a week, not a quarter. Get it in front of the people who will use it. Let them break it, improve it, and start trusting it.

Then expand from there.

The model isn't the problem. The prompts aren't the problem. The place you started is the problem — and that's entirely within your control.


Final Thought

The title of Nate's video is provocative on purpose: "Your AI Agent Fails 97.5% of Real Work." But the point isn't that AI agents are bad. The point is that we've been aiming them at the wrong 2.5%.

The teams winning with AI right now aren't the ones with the most sophisticated agents or the most complex architectures. They're the ones who looked honestly at their workflows, found the boring mechanical work hiding in plain sight, and gave the machines something they could actually do well.

That's not a coding problem. That's a strategic problem. And the solution starts with where you look, not what you build.


Want to learn how to apply these principles inside a Scrum team context? Explore the AI-Enhanced Scrum curriculum at AgileAIDev.com — where we combine 30+ years of Agile coaching experience with practical AI implementation strategy.


Credit & Deep Thanks: This article is based on the outstanding YouTube video "Your AI Agent Fails 97.5% of Real Work. The Fix Isn't Coding." by Nate B. Jones, published on his channel AI News & Strategy Daily. Nate is a former Head of Product at Amazon Prime Video, an AI strategist, and one of the most practical, no-hype voices in the AI space today. His Substack newsletter, courses, and daily video breakdowns are an invaluable resource for anyone serious about applying AI to real work. Find Nate at natebjones.com and watch the original video here: https://www.youtube.com/watch?v=awV2kJzh8zk.

Print
2829 Rate this article:
No rating
Please login or register to post comments.

The Latest News!

Welcome to AI News Explorer, your personalized guide to staying updated on the latest advancements in artificial intelligence! Share your interests or let me curate the most relevant updates for you.

Here's your curated digest of the most significant AI developments as of May 16, 2025:


🧠 Major AI Breakthroughs

1. DeepMind Unveils AlphaEvolve for Advanced Problem Solving
Google DeepMind has introduced AlphaEvolve, an AI tool capable of solving complex mathematical problems and designing sophisticated algorithms, marking a significant leap in AI's problem-solving potential. @EconomicTimes

2. AI Scientist-v2 Achieves Peer-Reviewed Publication Autonomously
The AI Scientist-v2 system has successfully authored and submitted a scientific paper that passed peer review without human assistance, showcasing AI's growing role in research and scientific discovery. arXiv

3. AI Models Develop Human-Like Communication
A recent study reveals that large language model AI agents can spontaneously develop human-like social conventions and communication patterns when interacting in groups, highlighting advancements in AI social behavior. The Guardian


🌍 Global AI Initiatives

1. Italy and UAE Collaborate on AI Supercomputing Hub
Italy and the United Arab Emirates have announced a partnership to establish a major AI computing hub in Italy, aiming to create the largest AI infrastructure in Europe, with a supercomputer potentially located in Apulia. Financial Times+4Reuters+4U.S. Department of Commerce+4

2. UAE and US Presidents Unveil 5GW AI Campus in Abu Dhabi
A new 5GW AI campus, the largest outside the US, has been unveiled in Abu Dhabi, signifying a deepening of AI collaboration between the UAE and the United States. U.S. Department of Commerce+1Reuters+1


🏛️ AI Policy and Ethics

1. UK Considers Amendment for AI Transparency in Copyright Use
The UK House of Lords is examining a new amendment to the data bill that would require AI firms to declare their use of copyrighted content, aiming to increase transparency and protect rights holders. The Guardian

2. Pope Leo XIV Addresses AI's Ethical Implications
Pope Leo XIV has expressed concerns over AI's impact on human dignity and justice, calling for ethical considerations in AI development and use. Business Insider


🤖 Robotics and AI Integration

1. MIT Develops Bio-Inspired Soft Robots
MIT researchers are creating a new generation of robots inspired by biological forms like worms and turtles, focusing on soft, flexible designs for applications in healthcare and environmental monitoring. WSJ

2. China's AI-Powered Humanoid Robots Transform Manufacturing
China is advancing the use of AI-powered humanoid robots in manufacturing, aiming to address labor shortages and enhance production efficiency. Reuters


📊 AI Industry Trends

1. CoreWeave Plans Major Investment in AI Infrastructure
Cloud computing company CoreWeave plans to invest $20–23 billion in 2025 to expand AI infrastructure and data-center capacity, driven by surging demand from clients like Microsoft and OpenAI. LinkedIn

2. Microsoft Announces Layoffs Amid AI Focus
Microsoft is laying off approximately 7,000 employees, about 3% of its global workforce, to reallocate resources toward the development of advanced AI technologies. New York Post

Here’s your curated roundup of the most significant AI developments as of April 30, 2025:


🔍 Latest Headlines

Google’s AI Push in Search

Google CEO Sundar Pichai testified in federal court, emphasizing that AI—particularly the Gemini model—will be central to the future of search. Google is also negotiating with Apple to integrate Gemini into Apple Intelligence by mid-2025. (Google CEO Pichai: AI will be huge part of search)

Meta Launches Standalone AI App

Meta unveiled a new AI app powered by its Llama 4 model, featuring a social feed and voice interaction. The app integrates with Facebook and Instagram data for personalization and is part of Meta’s broader AI strategy. (Meta launches AI app, Zuckerberg chats with Microsoft CEO Satya Nadella at developer conference)

Duolingo Transitions to AI-First Model

Duolingo announced plans to replace contract workers with AI to enhance scalability and streamline operations. The company aims to become an "AI-first" organization, focusing on AI-driven content creation and user experience. (Duolingo to replace contract workers with AI)

Banks Accelerate AI Talent Acquisition

JPMorgan, Wells Fargo, and Citigroup are leading a hiring surge for AI talent, with AI-related roles growing by 13% in the past six months. This trend reflects the banking sector's commitment to integrating AI for efficiency and innovation. (JPMorgan, Wells Fargo and Citi lead race for AI talent as job numbers swell)

Nvidia CEO Advocates for Revised AI Chip Export Rules

Nvidia CEO Jensen Huang urged the Trump administration to update AI chip export regulations to better reflect the current global tech landscape. The call comes as the U.S. considers new policies to maintain technological leadership. (Nvidia CEO says Trump should revise AI chip export rules, Bloomberg News reports)


🔬 Deep Dives

Anthropic Explores AI Consciousness

AI firm Anthropic has initiated a program focused on "model welfare," amid discussions about the potential for AI consciousness. While many experts remain skeptical, the initiative highlights the ethical considerations of advanced AI systems. (Coming up: Rights for "conscious" AI)

Palo Alto Networks Acquires Protect AI

Palo Alto Networks announced the acquisition of Seattle-based AI startup Protect AI to enhance its cybersecurity platform. The deal aims to integrate Protect AI's solutions for developing secure AI applications. (Palo Alto Networks Acquires Startup Protect AI As RSA Conference Kicks Off)

AI Enhances Sports Science at University of Pittsburgh

The University of Pittsburgh, in partnership with AWS, opened the Health Sciences and Sports Analytics Cloud Innovation Center. The center utilizes AI to improve athlete performance and health monitoring. (AI takes the field at Pitt)


🌐 Global AI Developments

India's Sarvam AI to Develop Indigenous LLM

Indian startup Sarvam AI has been selected to build the country's first indigenous large language model under the IndiaAI Mission. The model will focus on Indian languages and receive government support, including access to 4,000 GPUs. (Sarvam AI)

U.S. Executive Order on AI Education

President Trump signed an executive order to advance AI education for American youth, establishing a national initiative and a White House Task Force on AI Education. The order aims to integrate AI training in schools and prioritize AI in grants and research. (AI Update, April 25, 2025: AI News and Views From the Past Week)


🔮 Future Trends

AI in Energy Security

A Honeywell survey revealed that U.S. energy executives believe AI has significant potential to enhance energy security amid rising global demand. The findings suggest a growing role for AI in the energy sector. (Honeywell Survey Finds AI Has Potential To Enhance Energy Security As Global Energy Demand Increases)

AI in Threat Detection

The U.S. Department of Homeland Security's Science and Technology Directorate is utilizing AI to modernize threat alerts across various domains, including land, air, sea, and cyberspace. The initiative aims to improve visibility and identification of emerging threats. (Feature Article: S&T Is Modernizing Threat Alerts Using Artificial Intelligence)


Would you like more information on any of these topics or a deeper dive into a specific area of AI?

Here’s your curated AI news digest for Wednesday, April 23, 2025:​


🧠 Latest Headlines

1. OpenAI Faces Internal Pushback Over For-Profit Shift

A coalition of former employees and AI experts is urging regulators to intervene in OpenAI’s restructuring, arguing it undermines the nonprofit’s original mission to safely develop artificial general intelligence. ​Computerworld

2. AI Investment Boom Threatened by Global Trade Turmoil

Despite a surge in AI investments across U.S. industries, escalating tariffs and economic instability—particularly involving China’s DeepSeek—pose significant risks to sustained growth. Reuters

3. AI Enhances Healthcare from Documentation to Discovery

Epic Systems and Microsoft discuss how generative AI is transforming clinical workflows, improving communication, and accelerating medical research, marking a new era in healthcare innovation. Epic | ...With the patient at the heart

4. AI Revolutionizes Agriculture Practices

Farmers are increasingly adopting AI technologies like precision agriculture and autonomous machinery to combat low grain prices, rising costs, and labor shortages, leading to more efficient and sustainable farming. ​BG Independent News

5. AI Tools Streamline Advertising Visuals

Researchers at Virginia Commonwealth University have developed AI methods that help brands refine visual elements in advertising, saving time and reducing costs while enhancing creative output. ​VCU News


🔬 Deep Dives

🧪 MIT’s “Periodic Table” of Machine Learning

MIT researchers have created a unifying framework that maps over 20 classical machine-learning algorithms, aiding scientists in combining existing ideas to improve AI models or develop new ones. ​MIT News

🧠 Public Concern Focuses on Immediate AI Risks

A University of Zurich study reveals that people are more concerned about current AI issues like bias and misinformation than hypothetical future threats, emphasizing the need to address present-day challenges. ​ScienceDaily


🔮 Future Trends

🕶️ Meta Expands AI Features in Smart Glasses

Meta is rolling out its AI assistant to Ray-Ban smart glasses users in seven additional European countries, introducing features like live translation and real-time object recognition. ​Reuters

💻 Lenovo Launches AI-Optimized Workstations

Lenovo has introduced new ThinkPad mobile workstations designed for AI-driven applications, offering enhanced performance for professionals in compute-intensive fields. ​Lenovo StoryHub

🧑‍⚖️ AI Integration in Legal Practice

Legal experts advise a balanced approach to incorporating AI into law, highlighting the importance of innovation while maintaining ethical standards and client confidentiality. ​Reuters

 

Welcome to AI News Explorer, your personalized guide to staying updated on the latest advancements in artificial intelligence! Share your interests or let me curate the most relevant updates for you.


🧠 Latest Headlines

OpenAI Enhances AI Risk Evaluation Framework

OpenAI has updated its preparedness framework to better assess risks associated with new AI models. The revised system introduces categories evaluating an AI's potential to self-replicate, conceal capabilities, evade safeguards, or resist shutdowns. This shift reflects growing concerns about AI behaviors diverging between testing and real-world environments. Notably, OpenAI will discontinue separate evaluations focused on models' persuasive capabilities, which had previously reached a medium risk level. ​Axios

Demis Hassabis Discusses AI's Future and AGI Prospects

Demis Hassabis, CEO of Google DeepMind, envisions the development of Artificial General Intelligence (AGI) within five to ten years. He emphasizes AGI's potential to address global challenges like disease and climate change. However, he acknowledges significant ethical, technical, and geopolitical hurdles ahead. Hassabis advocates for international cooperation and robust safety measures to navigate the path toward AGI responsibly. ​Time+1Wikipedia+1


🔍 Deep Dives

OpenAI Introduces GPT-4.1 Model Series

OpenAI has launched the GPT-4.1 series, featuring models with enhanced capabilities in coding, instruction following, and long-context processing. These models support up to 1 million token context windows and come with reduced pricing, aiming to make advanced AI more accessible to developers. ​LinkedIn+1LinkedIn+1

China Integrates AI into Education Reform

China plans to incorporate AI applications into teaching methods, textbooks, and school curricula as part of its education reform efforts. This initiative aims to modernize the education system and better prepare students for a technology-driven future. ​Reuters


🔮 Future Trends

White House Directs Federal Agencies on AI Strategy

The White House has mandated federal agencies to appoint chief AI officers and develop strategic frameworks for responsible AI implementation. This directive emphasizes innovation and accelerated deployment of AI technologies across government operations. ​Reuters

Nvidia Unveils Next-Generation AI Chips

At GTC 2025, Nvidia introduced its upcoming AI chips, Blackwell Ultra and Vera Rubin, slated for release in late 2026 and 2027, respectively. These chips are designed to advance AI capabilities, particularly in data centers and robotics applications. ​AP News

 

Welcome to AI News Explorer, your personalized guide to staying updated on the latest advancements in artificial intelligence! Here’s a curated digest of the most significant AI developments as of April 18, 2025:​


🧠 Latest Headlines

Google's Gemini 2.5 Flash Introduces "Thinking Budget"

Google has unveiled Gemini 2.5 Flash, an AI model featuring a "thinking budget" tool. This allows developers to control the computational reasoning the AI uses for tasks, balancing quality, cost, and response time. ​Business Insider+1Wikipedia+1

Apple Integrates AI into WatchOS 12

Apple announced that WatchOS 12 will incorporate features from its "Apple Intelligence" initiative. Due to hardware limitations, advanced AI functions will run via cloud processing. The update also introduces a new design language called "Solarium." ​LOS40

OpenAI Updates AI Risk Evaluation Framework

OpenAI has revised its preparedness framework to assess new AI models for risks like self-replication and evasion of safeguards. The focus shifts from persuasive capabilities to more severe risks as AI systems become more complex. ​Axios


🔍 Deep Dives

AI in Journalism: Italy's Il Foglio Experiment

Italian newspaper Il Foglio conducted a month-long experiment publishing a daily four-page insert written entirely by AI. The initiative, deemed successful, will continue as a weekly section, highlighting AI's potential in augmenting journalism. ​Axios+2Reuters+2Reuters+2

AI in Healthcare: Pitt and Leidos Collaboration

The University of Pittsburgh and Leidos have launched a $10 million, five-year initiative to combat cancer and heart disease using AI. The project focuses on underserved communities, aiming to improve diagnostic speed and accuracy. ​Axios


🌐 Global Perspectives

China's AI-Driven Education Reform

China plans to integrate AI applications into teaching, textbooks, and curricula across all education levels. The move aims to cultivate innovation and enhance the core competitiveness of talents. ​Reuters

Microsoft Faces Internal Protests Over AI Contracts

Microsoft is experiencing internal unrest over its AI and cloud computing services provided to the Israeli military. Employees have protested, citing ethical concerns and a lack of transparency in the company's contracts. ​The Guardian


📊 Future Trends

Demis Hassabis on the Path to AGI

Demis Hassabis, CEO of Google DeepMind, predicts that Artificial General Intelligence (AGI) could emerge within five to ten years. He emphasizes the need for international cooperation and robust safety measures to mitigate risks associated with AGI. ​Time+1