How I AIHow Microsoft's AI VP automates everything with Warp | Marco Casalaina
CHAPTERS
Meet Marco Casalaina: Microsoft AI VP and “micro-agent” mindset
Claire Vo introduces Marco Casalaina (VP of Core AI Products at Microsoft) and frames the episode around “micro-agents” that reduce friction in everyday work. The focus is less on coding and more on automating administrative and file-based tasks using agentic tools.
- •Marco’s role at Microsoft and interest in practical AI automation
- •Episode theme: micro-agents for small, high-friction tasks
- •Warp positioned as an automation interface beyond coding
- •Goal: speed-run multiple real-world automation use cases
Why Warp became Marco’s go-to tool for admin work
Marco explains he adopted Warp after the Microsoft PowerShell team recommended it. He quickly found it especially valuable for repetitive administrative workflows where a CLI exists but the GUI is slow and tedious.
- •Warp recommendation came internally from the PowerShell team
- •“Hooked” after using it for Azure subscription/resource administration
- •Warp excels when there’s an existing CLI (PowerShell/AZ/etc.)
- •Primary benefit: compressing multi-step admin actions into simple requests
Demo: AI-assisted Azure role assignments via AZ CLI
Marco shows how he grants a colleague multiple Azure roles quickly using Warp to drive the Azure CLI. The tool iterates through commands, recovers from an error, and completes multi-role permission changes far faster than the Azure portal.
- •Assigning granular Azure roles via the portal is time-consuming
- •Warp invokes the AZ CLI repeatedly to apply roles
- •Agent recovers after a command mistake and continues
- •Adds subscription-wide Contributor access as a follow-up
- •Works similarly with other clouds (e.g., GCP via gcloud)
Why CLI-first automation beats complex GUIs for IAM and configuration
Claire highlights that GUIs for permissions/configuration are inherently hard to design and painful to use. Letting AI operate through CLIs/APIs abstracts the complexity and enables a simpler natural-language interface for users and builders alike.
- •GUI design for IAM/config is a hard product problem
- •AI + CLI/API access can replace many brittle GUI flows
- •Agents replicate the old “search → paste command → debug” loop in one place
- •Framing: AI as an interface layer over programmatic capabilities
Improving Warp reliability with MCP servers and “rules”
Marco explains that Warp becomes significantly more dependable when you provide it with the right context. He uses MCP servers (e.g., Microsoft Docs) for accurate lookups and adds rules to prevent common failures (like missing prerequisite access activation).
- •Connect Warp to Microsoft Docs MCP server for role/permission lookups
- •Use rules to remind about prerequisites (e.g., activate Owner access)
- •Rules guide behavior without needing complex prompt formatting
- •Same approach applies to coding and non-coding workflows
Demo: Scan documents from the terminal (NAPS2 + Warp)
Marco demonstrates scanning his daughter’s two-sided practice test by asking Warp to run the scanner directly—no manual scanner UI. Claire reacts to the “hands-off” workflow: load the feeder, run a command, and let automation handle the rest.
- •Use case: scan homework/practice tests for generating variants later
- •Warp can trigger scanning directly (no button-clicking in scanner software)
- •Workflow reduces dependence on clunky native scanner apps
- •Personal archiving: scanning kids’ cards and keepsakes for safekeeping
Bimodal workflow: re-run generated commands and merge PDFs with ephemeral Python
After the first scan, Marco reuses the generated command (without re-invoking the LLM) to scan the even pages. Then Warp writes a short throwaway Python script (PyPDF2) to interleave/merge odd-even scans into one PDF and deletes the script afterward.
- •Warp produces a reusable CLI command for repeat runs (up-arrow replay)
- •Odd/even scanning workflow for duplex documents using a feeder
- •Agent writes and runs a temporary Python script to combine PDFs
- •Ephemeral automation: create what you need, run it, discard it
The non-magic behind the magic: installing CLIs and encoding them as rules
Marco clarifies that effective automation often requires prep: finding/installing the right CLI tool and teaching the agent how to use it. He installed NAPS2 because Windows lacks a built-in scanner CLI workflow and then added Warp rules pointing to its path and switches.
- •Automation depends on having CLI hooks (e.g., NAPS2 for scanning)
- •Rules include tool path and flags (feeder vs flatbed)
- •Once rules exist, the workflow becomes repeatably reliable
- •Reframing: AI is powerful, but good tool plumbing matters
Demo: Compressing huge screen recordings with FFmpeg via Warp
Marco shows how Warp analyzes a massive Xbox Game Bar recording and re-encodes it while preserving 1080p. Warp identifies bitrate/resolution as the cause and runs FFmpeg to shrink the file dramatically for practical sharing and storage.
- •Problem: 10-minute recording becomes a 1.7GB file
- •Warp inspects media properties to diagnose the cause
- •Uses FFmpeg switches to re-encode while keeping 1080p
- •Outcome: compression down to a normal-sized file (e.g., ~13MB)
- •Broader use: targeted audio fixes (boost volume within a time range)
Ad hoc agents and the case for ephemeral solutions over “productionizing”
Marco names his approach “ad hoc agents”—mini, on-the-fly agents created to solve a specific problem immediately. Claire argues most of these shouldn’t become permanent tools; it’s often better to redo quickly later, saving only lightweight rules for recurring pitfalls.
- •Definition: ad hoc agents = unnamed, task-specific agents created on demand
- •Trend: general-purpose agents increasingly support this style
- •Ephemeral scripts are fine; don’t over-invest in ‘making a product’
- •Save durable ‘rules’ when you repeatedly hit the same failure mode
Triggered workflows in Microsoft 365 Copilot: email → calendar check → invite
Marco demonstrates M365 Copilot’s Workflows building an agent that reacts to specific emails. The workflow extracts a requested meeting time, checks calendar availability, and automatically sends a 30-minute invite if the slot is free.
- •Shift: business copilots becoming ‘agent builders’
- •Trigger: email from Claire requesting a meeting
- •Steps: extract time (ISO 8601) → check Outlook calendar → send invite
- •Value: fast responsiveness without being manually in the loop
Scheduled automation in ChatGPT: daily checks and notifications (cron-like agents)
Marco shows a consumer-side analogue: setting ChatGPT to check daily for new episodes of Claire’s podcast and notify him. This illustrates recurring agents that run on a schedule and can push desktop notifications.
- •Recurring trigger: daily run (ChatGPT chooses a default time)
- •Task: monitor for new podcast episodes and alert the user
- •Optional desktop notifications for proactive awareness
- •Pattern: cron-style automation is spreading to mainstream assistants
Impact on time management, teaching the next generation, and prompt-control hacks
In the lightning round, Marco explains these micro-agents save minutes daily by removing him from low-leverage steps (e.g., scanning while helping with math). He discusses how his daughter learns tools differently, and he shares how he enforces consistency using rules plus AutoHotkey prompt snippets (e.g., character limits, formatting constraints, avoiding .env check-ins).
- •Time savings: parallelize tasks (agent works while you do higher-value work)
- •Kids: different adoption styles—tinkerers vs mainstream users
- •Reliability: rules to prevent repeated mistakes (e.g., never commit .env)
- •AutoHotkey shortcuts for reusable prompt templates (e.g., ‘500 chars, no bullets’)
- •General tactic: create a personal library of proven constraints and instructions