---
layout: default
title: "Towards Autonomous Language Model Systems"
body-class: announcement
background-class: announcement-background
permalink: /autonomous-language-model-systems
---

<div class="container">
  <div class="row hero-content">
    <div class="col-md-10">
      <h1>PyTorch Webinars</h1>
    </div>
  </div>
</div>

<div class="container-fluid light-background-section">
  <div class="container">
    <div class="row content">
      <div class="col-md-10 body-side-text">
        <img style="width:100%; max-width:600px; margin-bottom: 40px; display: block; margin-left: auto; margin-right: auto;" src="/assets/images/autonomous-language-model-systems.png" alt="Towards Autonomous Language Model Systems">            
        <h2>Towards Autonomous Language Model Systems</h2>
        <p class="lead">
          <strong>Date</strong>: May 21, 2025, 11AM PT / 2PM ET
          <br/>
          <strong>Speaker</strong>: Ofir Press
          <br/>
          <br/>
          Language models (LMs) are increasingly used to assist users in day-to-day tasks such as programming (GitHub Copilot) or search (Google's AI Overviews). But can we build language model systems that autonomously complete entire tasks end-to-end?
          <br/><br/>

In this talk, Ofir Press will discuss efforts to build autonomous LM systems, focusing on the software engineering domain. Ofir will present SWE-bench, a novel method for measuring how well AI systems can fix real issues in popular software libraries, and will then discuss SWE-agent, a system for solving SWE-bench tasks.
<br/><br/>

SWE-bench and SWE-agent are used by many leading AI organizations in academia and industry, including OpenAI, Anthropic, Meta, and Google, and SWE-bench has been downloaded over 2 million times. These projects show that academics on tight budgets can have a substantial impact in steering the research community toward building autonomous systems that can complete challenging tasks. 
<br/><br/>

Ofir is a postdoc at Princeton University, where they mainly work with Karthik Narasimhan's lab. Ofir previously completed their PhD at the University of Washington in Seattle, where Ofir was advised by Noah Smith. During their PhD, Ofir spent two years at Facebook AI Research Labs on Luke Zettlemoyer's team.
          <br/><br/>
        </p>
        <h3 class="mb-4">Register now to attend this event</h3>
        <div style="width:100%;position:relative;padding-bottom:56.25%;min-height:550px;"><iframe src="https://streamyard.com/watch/hzZ8FxVqTazw?embed=true"  width="100%" height="100%" frameborder="0" allow="autoplay; fullscreen" style="width:100%;height:100%;position:absolute;left:0px;top:0px;overflow:hidden;"></iframe></div>
      </div>
    </div>
  </div>
</div>