At a glance
WHAT IT’S REALLY ABOUT
Sourcegraph CTO Beyang Liu on AI’s Real Impact on Coding Workflows
- Sourcegraph CTO and co-founder Beyang Liu explains how their AI assistant Cody builds on a decade of work in code search and code understanding to make developers dramatically more productive. He details how retrieval-augmented generation (RAG), graph-based code context, and search-style pipelines are as critical as the underlying large language models. The conversation explores near-term “inner loop” tools like completions and targeted commands versus the longer-term goal of AI engineers that can go from issue description to production-ready pull requests. Liu also shares a forward-looking view of software development where AI compresses boilerplate work, magnifies the importance of CS fundamentals and domain expertise, and helps teams operate with far greater cohesion and visibility.
IDEAS WORTH REMEMBERING
5 ideas

Context and retrieval quality are at least as important as the base model.
Cody’s performance jumps “night and day” when its context engine pulls the right code and docs into the model’s window, showing that smart search, indexing, and graph traversal can rival or surpass benefits from only upgrading to bigger models.
Treat AI coding systems as search pipelines attached to LLMs.
Sourcegraph uses a two-stage search architecture—multiple retrievers (keyword, regex, embeddings, symbol-aware indexing) feeding a re-ranking layer—so the first stage maximizes recall and the second maximizes precision, mirroring classic search engines and generalizing well to other domains.
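A minimal sketch of such a two-stage pipeline. Everything here is hypothetical—the toy corpus, retriever functions, and overlap-based re-ranker are illustrative stand-ins, not Sourcegraph's implementation:

```python
import re
from collections import Counter

# Toy corpus standing in for an indexed codebase (hypothetical file contents).
CORPUS = {
    "auth.py": "def verify_token(token): return token in ACTIVE_TOKENS",
    "search.py": "def keyword_search(query, docs): return [d for d in docs if query in d]",
    "rank.py": "def rerank(candidates, query): return sorted(candidates)",
}

def keyword_retriever(query, corpus):
    """Stage 1a: cheap keyword matching, tuned for high recall."""
    terms = query.lower().split()
    return [doc for doc, text in corpus.items()
            if any(t in text.lower() for t in terms)]

def regex_retriever(pattern, corpus):
    """Stage 1b: regex matching for symbol-style queries."""
    rx = re.compile(pattern)
    return [doc for doc, text in corpus.items() if rx.search(text)]

def rerank(query, candidates, corpus):
    """Stage 2: re-score candidates by exact query-term overlap (precision)."""
    terms = Counter(query.lower().split())
    def score(doc):
        words = Counter(corpus[doc].lower().split())
        return sum(min(terms[t], words[t]) for t in terms)
    return sorted(set(candidates), key=score, reverse=True)

# Union the retrievers for recall, then re-rank for precision.
query = "keyword search query"
candidates = keyword_retriever(query, CORPUS) + regex_retriever(r"def \w+_search", CORPUS)
ranked = rerank(query, candidates, CORPUS)
```

The design point is the same one Liu describes: the first stage is allowed to over-fetch from several cheap sources, and only the second stage pays for careful scoring.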
AI can be steered toward better code by curating what it sees.
Enterprises want Cody to ignore legacy or “anti-pattern” areas and focus on “golden” code regions, and RAG makes it straightforward to filter or weight context sources so generated code reflects desired standards and architectures.
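One way this curation could look in a RAG pipeline—a sketch under assumed conventions (the `golden/` and `legacy/` path prefixes and the boost factor are invented for illustration, not Cody settings):

```python
# Hypothetical policy: code under "golden/" is preferred context,
# code under "legacy/" is excluded from retrieval entirely.
GOLDEN_PREFIX = "golden/"
LEGACY_PREFIX = "legacy/"
GOLDEN_BOOST = 2.0  # assumed weighting factor

def filter_and_weight(candidates):
    """candidates: list of (path, relevance_score) pairs from a retriever.
    Drops anti-pattern regions and boosts approved ones before ranking."""
    kept = []
    for path, score in candidates:
        if path.startswith(LEGACY_PREFIX):
            continue  # never show anti-pattern code to the model
        if path.startswith(GOLDEN_PREFIX):
            score *= GOLDEN_BOOST  # bias generation toward approved patterns
        kept.append((path, score))
    return sorted(kept, key=lambda ps: ps[1], reverse=True)

ranked = filter_and_weight([
    ("legacy/db_helpers.py", 0.9),
    ("golden/db/repository.py", 0.6),
    ("src/util.py", 0.8),
])
```

Because the filtering happens at retrieval time rather than inside the model, teams can change what "good code" means by editing a policy, not by retraining anything.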
Near-term value is in accelerating the developer inner loop, not full autonomy.
Current AI tools shine at in-editor completions, chat, and targeted commands like generate-tests or explain-code that reduce toil, while fully automatic issue-to-PR workflows remain limited by compounding failure rates and model reliability.
There are two main strategies toward autonomous AI engineers, both with trade-offs.
You can decompose tasks into many smaller chained steps (suffering from compounding error) or attempt large, context-heavy one-shot diffs with sampling and validation (suffering from cost and search space explosion), so better context and validation are crucial.
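The second strategy can be sketched as a sample-and-validate loop. The patch proposer below is a deterministic stub standing in for an LLM call, and the validator stands in for compiling the diff and running the test suite—all names are hypothetical:

```python
# Candidate patch bodies the stub "LLM" cycles through (hypothetical).
CANDIDATE_BODIES = ["return 41", "return 43", "return 42"]

def propose_patch(issue: str, attempt: int) -> str:
    """Stub LLM: returns a different candidate diff on each attempt."""
    return CANDIDATE_BODIES[attempt % len(CANDIDATE_BODIES)]

def validate(patch: str) -> bool:
    """Stand-in for applying the patch and running the test suite."""
    return patch == "return 42"

def sample_and_validate(issue: str, n_samples: int = 8):
    """One-shot strategy: draw up to n_samples candidate diffs and keep the
    first that passes validation. Cost scales with n_samples, which is the
    search-space trade-off this approach has to manage."""
    for attempt in range(n_samples):
        patch = propose_patch(issue, attempt)
        if validate(patch):
            return patch
    return None  # no candidate survived validation

best = sample_and_validate("fix the computed answer")
```

The loop makes the trade-off Liu describes concrete: better context tightens the proposer's distribution so fewer samples are needed, and better validation makes each sample cheaper to reject.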
WORDS WORTH SAVING
5 quotes

The bread and butter really is, you have code search that lets you pinpoint the needle in the haystack, and then you walk the reference graph of code.
— Beyang Liu
The dirty secret is that keyword search can probably get you more than 90% of the way there.
— Beyang Liu
When our context fetching engine works, the quality of code generated by Cody is like night and day.
— Beyang Liu
We always want to do the simplest thing first, because oftentimes someone comes along with a much dumber, cheaper baseline that works as well or better than the fancy model.
— Beyang Liu
My maybe contrarian hot take is that CS fundamentals, if anything, are going to grow in importance.
— Beyang Liu
High quality AI-generated summary created from speaker-labeled transcript.