Seminar • Data Systems • Building More Reliable and Scalable AI Systems with Language Model Programming | Cheriton School of Computer Science

Wednesday, April 24, 2024 10:30 am - 11:30 am EDT (GMT -04:00)

Please note: This seminar will take place in DC 1304.

Omar Khattab, PhD candidate
Stanford University

It is now easy to build impressive demos with language models (LMs) but turning these into reliable information access systems currently requires brittle combinations of prompting, chaining, and finetuning LMs. I will present LM programming, a systematic way to address this by defining and improving three layers of the stack around retrieval and language models. I start with how to adapt LMs to search for information most effectively (ColBERT) and how to scale such search to billions of tokens (PLAID). I then discuss the right architectures and supervision strategies (e.g., ColBERT-QA, Baleen, Hindsight) for allowing LMs to retrieve and cite verifiable sources in their responses. This leads to DSPy, a programming model that introduces composable modules for building and automatically supervising controllable programs built with LMs and retrieval models. Even simple systems expressed in DSPy routinely outperform large standalone LMs and standard hand-crafted prompting pipelines for knowledge-intensive tasks, in some cases while using only small models. I highlight how ColBERT and DSPy have sparked applications at dozens of leading tech companies and academic labs. I then conclude by discussing how DSPy enables a new degree of research modularity around LMs, one that stands to allow open research to again lead the development of AI systems.

Bio: Omar Khattab is a fifth-year CS Ph.D. candidate at Stanford, whose work spans Information Retrieval (IR), Machine Learning (ML) Systems, and Natural Language Processing (NLP). His research creates models, algorithms, supervision strategies, and programming abstractions for building reliable, transparent, and scalable NLP systems. Omar is the author of the ColBERT retrieval model, which has helped shape the modern landscape of neural information retrieval. His lines of work on ColBERT and DSPy form the basis of influential open-source projects, exceeding 600,000 downloads per month, and have sparked applications at Google, Amazon, IBM, VMware, Databricks, Baidu, AliExpress, and numerous startups. Omar’s Ph.D. has been supported by the Eltoukhy Family Graduate Fellowship and the Apple Scholars in AI/ML PhD Fellowship.

Location Information

Location Address: DC - William G. Davis Computer Research Centre
200 University Avenue West
DC 1304
Waterloo, ON, CA N2L 3G1

Location coordinates: