AI Agents for PMs in 69 Minutes — Masterclass with IBM VP

Armand Ruiz, VP of AI Platform at IBM, reveals why most enterprise AI implementations fail and what Fortune 500 companies are actually building that works. He breaks down the difference between chatbots and agents, the 4-step framework powering real AI systems, and why RAG dominates 90% of enterprise use cases.

----
Transcript: https://www.news.aakashg.com/p/armand-ruiz-podcast

----
⏰ Timestamps:
00:00 Intro
02:39 What Makes AI Agents Special
04:40 The Four Steps of AI Agents
07:14 AI Agent Development Frameworks
12:59 RAG Explained
16:55 Ads
18:46 Common RAG Mistakes
26:48 Managing Multiple AI Agents
31:39 Ads
33:57 How AI Changes Product Management
37:43 Problem Investigation vs Feature Factory
41:22 Roadmap to Build AI Agents
43:30 Can Open Source AI Win?
51:39 IBM's AI Strategy
59:32 Career Journey: Intern to VP
1:02:36 Building 200K LinkedIn Followers
1:08:18 Outro

----
🏆 Thanks to our sponsors:
1. Kameleoon: Prompt-based experimentation platform - kameleoon.com/prompt
2. AI Evals Course for PMs & Engineers: Get $800 off https://maven.com/parlance-labs/evals?promoCode=ag-product-growth
3. Vanta: Security and compliance for fast-moving teams - https://www.vanta.com/lp/demo-1k
4. Amplitude: Mobile user engagement analytics - https://amplitude.com/digital-maturity-model
5. Product Faculty: Product Strategy Certificate for Leaders (Get $550 off) https://maven.com/product-faculty/ai-product-management-certification?promoCode=AAKASH25

----
Key Takeaways:
1. AI Agents vs Chatbots: Chatbots respond to queries while agents execute complete workflows. It is the difference between getting suggestions and getting finished work.
2. Four-Step Agent Framework: Every agent needs Thinking (reasoning), Planning (task breakdown), Action (system execution), and Reflection (learning from outcomes).
3. RAG Dominates Enterprise: 90% of enterprise AI uses RAG to connect LLMs to proprietary data. Success requires 95%+ accuracy, backed by sophisticated evaluation.
4. Vision RAG Unlocks Value: Most business data lives in charts and tables that traditional text-only RAG completely misses.
5. Framework Selection Matters: Use coding frameworks (LangGraph, CrewAI) for complex systems. Use no-code tools (Lindy, n8n) for rapid prototyping.
6. PM Ratios Transform: Traditional 1:6-10 PM-to-developer ratios become 1:20-30 when agents handle research and documentation.
7. Prototypes Beat PRDs: Show working systems instead of 20-page documents that teams misinterpret. AI enables functional demos.
8. Open Source Wins: Despite closed-source capabilities, enterprises choose open source for licensing control and infrastructure flexibility.
9. Technical Literacy Essential: Understanding agents, RAG, and frameworks becomes baseline knowledge for everyone, not just developers.
10. Implementation Reality: Enterprise RAG needs heavy data engineering. Teams underestimate accuracy requirements and engineering complexity.

----
👨‍💻 Where to find Armand:
LinkedIn: linkedin.com/in/armandruiz
IBM AI Platform: ibm.com/ai

----
👨‍💻 Where to find Aakash:
Twitter: twitter.com/aakashg0
LinkedIn: linkedin.com/in/aagupta/

#AIAgents #EnterpriseAI #RAGSystems #ProductManagement

----
🧠 About Product Growth: The world's largest podcast focused solely on product + growth, with over 185K listeners. Hosted by Aakash Gupta, who spent 16 years in PM, rising to VP of product, this 2x/week show covers product and growth topics in depth.

🔔 Subscribe and turn on notifications to master AI agent implementation!

Aakash Gupta (host), Armand Ruiz (guest)

Sep 5, 2025 · 1h 9m

CHAPTERS

Why AI agents are “the wall of automation” beyond chatbots
Aakash and Armand frame AI agents as the next step after predictive analytics and chatbots—systems that can automate real work end-to-end. Armand shares why enterprises (and CIOs) now prioritize agents, but also why safe, secure production deployment is still the hard part.
The 4-step mental model: Think → Plan → Act → Reflect
Armand walks through his simple four-step diagram that explains what an agent does internally. The model clarifies how agents reason, decompose tasks, take actions in real systems, and improve via reflection loops over time.
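As a rough illustration of the Think → Plan → Act → Reflect loop, here is a minimal sketch in Python. It is not Armand's diagram or any framework's API: the function names, the `AgentState` class, and the two stub tools are all hypothetical, and a real agent would call an LLM inside `think` and `plan` instead of returning fixed strings.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentState:
    goal: str
    history: List[str] = field(default_factory=list)  # trace of every step taken
    lessons: List[str] = field(default_factory=list)  # output of the Reflect step


def think(state: AgentState) -> str:
    """Think: reason about the goal (stub; a real agent would call an LLM)."""
    return f"To achieve '{state.goal}' I need data, then a summary."


def plan(thought: str, tools: Dict[str, Callable[[str], str]]) -> List[str]:
    """Plan: break the task into ordered steps, one per available tool."""
    return [name for name in ("search", "summarize") if name in tools]


def act(step: str, payload: str, tools: Dict[str, Callable[[str], str]]) -> str:
    """Act: execute one step against a system (stubbed tools here)."""
    return tools[step](payload)


def reflect(state: AgentState, result: str) -> None:
    """Reflect: record what to keep or change for the next run."""
    if result:
        state.lessons.append("result produced: keep this plan")
    else:
        state.lessons.append("empty result: broaden the search next run")


def run_agent(goal: str, tools: Dict[str, Callable[[str], str]]) -> AgentState:
    state = AgentState(goal)
    thought = think(state)
    state.history.append(f"think: {thought}")
    steps = plan(thought, tools)
    state.history.append(f"plan: {steps}")
    payload = goal
    for step in steps:  # each step consumes the previous step's output
        payload = act(step, payload, tools)
        state.history.append(f"act[{step}]: {payload}")
    reflect(state, payload)
    state.history.append(f"reflect: {state.lessons[-1]}")
    return state


# Hypothetical tools standing in for real enterprise systems.
TOOLS = {
    "search": lambda q: f"3 documents about {q}",
    "summarize": lambda text: f"summary of {text}",
}

state = run_agent("Q3 churn drivers", TOOLS)
for line in state.history:
    print(line)
```

The point of the sketch is the control flow: the four steps are separate functions, so each can be logged, tested, or swapped out independently.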
Choosing an agent-building approach: code frameworks vs no-code builders
They categorize agent development tooling into two camps: programming frameworks that provide maximum control and low/no-code tools that speed up experimentation. The discussion highlights popular options and when to use each.
RAG demystified: adding fresh enterprise context to LLMs
Armand explains Retrieval-Augmented Generation (RAG) as the dominant method for injecting up-to-date knowledge into LLM outputs. He contrasts RAG with fine-tuning and shares why RAG became the default enterprise pattern post-ChatGPT.
RAG inside agent workflows: enterprise search becomes ‘answer + action’
They position RAG as a core component of agentic systems, especially during planning where agents fetch needed data. Examples show how RAG turns traditional enterprise search into direct, usable intelligence for decisions and downstream work.
RAG architecture building blocks (and why it’s mostly data engineering)
Armand outlines the real components behind RAG pipelines—embeddings, vector databases, filtering/ranking, and orchestration. The key message: most RAG failures and successes are driven by data engineering complexity, not just the LLM choice.
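To make those building blocks concrete, the sketch below wires together a toy version of each one in plain Python: an embedding function, a vector store, metadata filtering plus similarity ranking, and a prompt-assembly step. Everything here is an assumption for illustration. The bag-of-words `embed` stands in for a real embedding model, `VectorStore` stands in for a real vector database, and the sample documents are made up.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: word counts. A real pipeline would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class VectorStore:
    """In-memory stand-in for a vector database."""

    def __init__(self):
        self.rows = []  # (embedding, text, metadata)

    def add(self, text, **metadata):
        self.rows.append((embed(text), text, metadata))

    def search(self, query, k=2, **filters):
        q = embed(query)
        # Filter on metadata first, then rank the survivors by similarity.
        scored = [
            (cosine(q, vec), text)
            for vec, text, meta in self.rows
            if all(meta.get(key) == value for key, value in filters.items())
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [text for score, text in scored[:k] if score > 0]


def build_prompt(question, store, **filters):
    """Orchestration: retrieve context, then assemble the prompt for the LLM."""
    context = store.search(question, **filters)
    lines = ["Answer using only this context:"]
    lines += [f"- {chunk}" for chunk in context]
    lines.append(f"Question: {question}")
    return "\n".join(lines)


DOCS = [
    "Refund policy: refunds are issued within 30 days",
    "Travel policy: book flights through the portal",
    "Refund exceptions require manager approval",
]
store = VectorStore()
store.add(DOCS[0], dept="support")
store.add(DOCS[1], dept="hr")
store.add(DOCS[2], dept="support")

question = "What is the refund policy?"
hits = store.search(question, dept="support")
prompt = build_prompt(question, store, dept="support")
print(prompt)
```

In this toy setup the travel policy document outranks the refund document when the `dept` filter is dropped, because it shares common words with the question. That is a small-scale version of why filtering, ranking, and data preparation tend to matter as much as the model.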
Vision RAG: extracting value from charts, tables, and rich PDFs
The conversation expands RAG from text-only to multimodal information retrieval. Vision RAG enables agents to understand charts/tables and visually dense documents, unlocking industries where critical data lives in non-text formats.
Common RAG mistakes: accuracy expectations, ‘vanilla’ pipelines, and weak evals
Armand focuses on the gap between consumer tolerance for imperfect answers and enterprise requirements for accuracy and trust. Teams often deploy generic templates without rigorous evaluation, leading to frustration and unreliable systems.
Evals everywhere: how to test agent/RAG systems like real software
They argue evaluation must happen at multiple steps in an agentic workflow, not only at the final answer. Armand explains evals as a way to inject human expertise, scale SME input, and continuously improve systems in production.
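One way to picture step-level evals is to score the retrieval step and the final answer separately against expectations written by a subject-matter expert. The sketch below assumes a hypothetical `EvalCase` structure and a stubbed `fake_pipeline`; the two metrics (retrieval recall and required-fact coverage) are simple illustrative choices, not the evaluation method described in the episode.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class EvalCase:
    question: str
    expected_doc_ids: List[str]  # sources an SME says must be retrieved
    required_facts: List[str]    # phrases an SME says the answer must contain


def retrieval_recall(retrieved_ids: List[str], case: EvalCase) -> float:
    """Share of expected documents that the retrieval step actually returned."""
    hits = sum(1 for doc_id in case.expected_doc_ids if doc_id in retrieved_ids)
    return hits / len(case.expected_doc_ids)


def answer_coverage(answer: str, case: EvalCase) -> float:
    """Share of required facts that appear in the final answer."""
    text = answer.lower()
    hits = sum(1 for fact in case.required_facts if fact.lower() in text)
    return hits / len(case.required_facts)


def run_evals(
    cases: List[EvalCase],
    pipeline: Callable[[str], Tuple[List[str], str]],
) -> List[Dict[str, object]]:
    """Score each step separately so a failure can be traced to its stage."""
    report = []
    for case in cases:
        retrieved_ids, answer = pipeline(case.question)
        report.append({
            "question": case.question,
            "retrieval_recall": retrieval_recall(retrieved_ids, case),
            "answer_coverage": answer_coverage(answer, case),
        })
    return report


def fake_pipeline(question: str) -> Tuple[List[str], str]:
    """Stub for a real RAG pipeline: returns (retrieved doc ids, answer)."""
    return ["doc-refund", "doc-travel"], "Refunds are issued within 30 days."


cases = [
    EvalCase(
        question="What is the refund window?",
        expected_doc_ids=["doc-refund", "doc-exceptions"],
        required_facts=["30 days"],
    )
]
report = run_evals(cases, fake_pipeline)
print(report)
```

Here the answer looks fine while retrieval missed one expected source, which is the kind of gap that only shows up when intermediate steps are evaluated, not just the final output.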
Managing 10–20 agents: orchestration as a new knowledge-worker skill
Armand describes a near-future where employees supervise fleets of specialized agents. The challenge becomes orchestration—assigning tasks, setting approvals, and judging outputs—especially in traditional companies where adoption takes longer.
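The orchestration idea (assign tasks, set approvals, judge outputs) can be sketched as a small dispatcher with a human approval gate. This is a hypothetical illustration: the `orchestrate` function, the agent roles, and the always-reject reviewer are assumptions, and real agents would be LLM-backed workflows, not lambdas.

```python
from typing import Callable, Dict, List, Set, Tuple

Agent = Callable[[str], str]


def orchestrate(
    tasks: List[Tuple[str, str]],
    agents: Dict[str, Agent],
    needs_approval: Set[str],
    approve: Callable[[str, str], bool],
) -> List[Tuple[str, str]]:
    """Route each task to its specialized agent; gate risky roles on human approval."""
    results = []
    for role, task in tasks:
        output = agents[role](task)
        if role in needs_approval and not approve(role, output):
            results.append((role, "REJECTED"))  # the human supervisor said no
        else:
            results.append((role, output))
    return results


# Hypothetical specialized agents.
agents = {
    "research": lambda task: f"notes on {task}",
    "email": lambda task: f"draft email about {task}",
}
tasks = [("research", "competitor pricing"), ("email", "pricing update")]

# Stand-in for a human reviewer; this one rejects everything it is shown.
results = orchestrate(
    tasks,
    agents,
    needs_approval={"email"},
    approve=lambda role, output: False,
)
print(results)
```

Only roles listed in `needs_approval` are held for review, so low-risk work such as research runs unattended while outward-facing actions wait for a person.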
How AI reshapes product management: fewer PMs, broader scope, more leverage
They explore how agents can change PM-to-engineer ratios and expand a PM’s coverage area. Armand maps agents across the PM lifecycle—from competitive research to feedback synthesis to PRD drafting and prototyping.
Prototype-first vs write-first: avoiding ‘feature factory’ while moving faster
Armand shares a career story where a prototype beat slides and PRDs in an exec meeting, illustrating why prototypes communicate better. They also address the risk of rushing into solutions without deep problem investigation and customer understanding.
Roadmap for learning and building agents: concepts → one agent → deeper tooling
Armand gives a practical learning sequence: start with fundamentals, build a single useful agent, then progress toward more advanced tools as needed. He emphasizes hands-on exploration as the only way to learn the ‘art of the possible.’
Can open source AI win? Why enterprises default to open ecosystems
Armand argues open source tends to win in enterprise contexts due to deployability, control, and ecosystem momentum. He also acknowledges the reality that closed-source labs may stay ahead temporarily, but open source catches up over time.
IBM’s AI strategy: flexibility, Granite models, scaling inference, and governance
Armand describes IBM’s positioning around deployment flexibility (any cloud/on-prem), a family of models (Granite), and enterprise-grade governance. He emphasizes that compliance and policy management must be designed in from the start.
Career + creator playbook: intern-to-VP journey and building 200k followers
Armand closes by sharing how intentional moves, consistency in AI through ‘winters,’ and customer proximity accelerated his career. He also breaks down his daily LinkedIn system, why he now uses less AI in writing, and how targeting the right audience beats chasing virality.

