Skip to content
YC Root AccessYC Root Access

Artie: Real Time Data Streaming For The AI Age

In this episode of Founder Firesides, YC Managing Partner Jared Friedman talks to the founders of Artie (S23), Jacqueline Cheong and Robin Tang, who have just announced their Series A. Artie is a real-time data streaming platform for cutting edge companies, streaming up-to-date and reliable data between systems in real time.

Jared FriedmanhostJacqueline CheongguestRobin Tangguest
Jan 26, 202626mWatch on YouTube ↗

CHAPTERS

  1. 0:05 – 1:00

    Artie overview and Series A announcement

    Jared opens by introducing Artie founders Robin Tang and Jacqueline Cheong and congratulating them on their newly announced Series A. Jacqueline explains Artie’s core mission: moving production data between systems in real time as changes happen.

  2. 1:00 – 2:00

    Origin story: the persistent pain of “data isn’t fresh enough”

    Robin traces the company’s roots to repeated frustrations across multiple roles where teams wanted fresher, faster data for experimentation and operations. He found existing tools didn’t fit production-database needs and internal builds were costly and slow.

  3. 2:00 – 3:24

    Why building CDC connectors in-house is a trap (and why Artie had to exist)

    The founders argue it’s irrational that companies spend 1–2 years building basic CDC pipelines like Postgres-to-Snowflake. Jared reinforces that this work isn’t a company’s core competency, motivating Robin to build a productized solution.

  4. 3:24 – 4:17

    Early build timeline and YC-era “fake it till self-serve” onboarding

    Robin explains it took ~6 months to build early infrastructure (faster now with better AI tooling), and much longer to make the product truly self-serve. They describe scrappy onboarding tactics, including manual workflows behind a self-serve facade.

  5. 4:17 – 5:23

    What existed at YC application time: infrastructure first, sales skills later

    Jacqueline describes that by the start of YC the core infrastructure already existed, but usability and UI were still developing. Much of YC was spent learning customer conversations, selling, and iterating on what the market needed.

  6. 5:23 – 8:47

    Landing Substack: mission-critical first customer via cold email + POC

    They recount how Substack became their first major customer, despite the high stakes of deploying unproven infrastructure. Confidence came from a rigorous POC pushing huge volumes under strict constraints, and the speed of the customer’s need.

  7. 8:47 – 10:18

    Why growth is lumpy for infra: long gaps between ‘big bets’

    Jared highlights that infrastructure adoption doesn’t produce easy early hockey-stick growth because each deployment is a major bet. The founders confirm it took many months to land another Substack-scale customer and compare to Substack’s own early trajectory.

  8. 10:18 – 13:12

    Reaching $1M ARR with a tiny team: disciplined hiring and founder-led sales

    Jared notes Artie hit $1M ARR with only four people, far smaller than typical pre-AI SaaS benchmarks. Jacqueline attributes this to strict adherence to YC hiring discipline and keeping customer-facing sales tightly founder-driven to speed iteration.

  9. 13:12 – 16:49

    Married co-founders: deciding to jump in and how it changes the work dynamic

    The conversation shifts to their personal partnership: they are married and co-founded Artie together after evaluating whether their conflict style would work in business. Jacqueline explains how working together increased closeness and reduced communication friction.

  10. 16:49 – 17:27

    Work-life boundaries (or lack thereof) when building high-stakes infra

    They discuss whether they maintain boundaries between work and life, concluding that during this phase they largely don’t. The intensity of Artie’s mission-critical operations dominates day-to-day life.

  11. 17:27 – 20:53

    Battle scars of real-time data: backfills, edge cases, and undocumented ‘right ways’

    Robin and Jacqueline describe the technical reality: production data pipelines fail in countless messy ways that don’t show up locally. They cover online backfills while streaming, performance constraints at massive scale, and hard connector work like SQL Server CDC.

  12. 20:53 – 22:19

    Owning the whole stack: Kafka SDK ordering bugs and customer expectations

    They explain how reliability demands extend beyond their code—third-party library bugs become Artie’s responsibility. A Kafka client rebalancing/ordering issue caused out-of-order reads, forcing deep debugging and a vendor switch, plus stronger customer guardrails.

  13. 22:19 – 23:16

    Where Artie is now: 700B+ rows processed and scaling toward trillions

    Jacqueline shares current scale metrics and how demand is rising with real-time AI workloads and agentic use cases. They plan to increase reliability and scalability as volume grows by an order of magnitude.

  14. 23:16 – 24:23

    Team growth plan and founder advice: move fast, don’t overthink

    They outline aggressive hiring plans—tripling the team—and the roles needed to support scale. Jacqueline closes with tactical advice: execute, observe, and iterate rather than getting stuck in analysis.

  15. 24:23 – 26:33

    Roadmap: beyond CDC into events APIs and new real-time destinations

    They describe expanding from database CDC into an events API and more sources/destinations where low-latency matters. Examples include sub-second warehouse queryability and future destinations like Elasticsearch for real-time indexing/search.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.