I design reinforcement learning (RL) algorithms: AI methods that learn how to make intelligent decisions from trial and error. I am especially interested in self-supervised methods, which enable agents to learn intelligent behaviors without labels or human supervision. Our group has developed some of the foremost algorithms and analyses for such self-supervised RL methods. Here are a few examples. I run the Princeton Reinforcement Learning Lab.
Bio: Before joining Princeton, I did my PhD in machine learning at CMU, advised by Ruslan Salakhutdinov and Sergey Levine and supported by the NSF GRFP and the Hertz Fellowship. I spent a number of years at Google Brain/Research before and during my PhD. My undergraduate studies were in math at MIT.
Join us! Please read this page before emailing me about joining the lab.
news
Aug 1, 2025
I’ll be giving a few lectures on RL at the Machine Learning Summer School. Looking forward to spending time with the students and faculty there!
Apr 24, 2025
Princeton RL @ ICLR 2025! Come say hi in Singapore!
I’m giving an invited talk in the “Control, Optimization, and Reinforcement Learning Session” at the Coordinated Science Laboratory Student Conference at UIUC on February 24–26, 2025. Email me to meet up if you’ll be there!
Awarded a grant from the Princeton AI Lab to study “Do brains perceive, act, and plan using temporal contrast?” together with Nathaniel Daw.
Jan 2, 2025
We’re launching an undergraduate research program (REU) together with state and community colleges in NJ. This is a paid program, and no research experience is required. Apply by Feb. 1.
Jan 2, 2025
I’m teaching Introduction to Reinforcement Learning this Spring, together with a fantastic team of TAs. I created this course to give students a strong foundation in RL and to highlight its unifying themes (RL isn’t just a bag of tricks). All course notes and assignments will be posted publicly, so you can follow along!
JaxGCRL: A new benchmark for goal-conditioned RL that is blazing fast, allowing you to train at 1 million steps per minute on a single GPU. Experiments run so fast that the algorithm design process becomes interactive. Tools like this not only make research much more accessible (e.g., you can now run a bunch of interesting experiments in a free Colab notebook before the 90 min timeout), but will also change how RL is taught (less fighting with dependencies, more experiments on complex tasks, less waiting for experiments to queue and finish); stay tuned for COS 435 this Spring! A minimal sketch of where this speed comes from follows below.
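To give a rough sense of why JAX-based environments are so fast, here is a minimal, hypothetical sketch (this is not JaxGCRL's actual API; the toy environment, names, and constants are made up for illustration): `jax.vmap` vectorizes a single-environment step over thousands of environments, and `jax.jit` compiles the batched step into fused GPU code, so simulation and learning stay on-device.

```python
import jax
import jax.numpy as jnp

# Toy point-mass environment: state is (position, velocity); actions nudge velocity.
# Hypothetical stand-in for illustration only, not the JaxGCRL API.
def step(state, action):
    pos, vel = state
    vel = vel + 0.1 * action
    pos = pos + 0.1 * vel
    reward = -jnp.sum(pos ** 2)  # reward for staying near the origin
    return (pos, vel), reward

# vmap runs the step across a batch of environments in parallel;
# jit compiles the whole batched step for the GPU.
batched_step = jax.jit(jax.vmap(step))

num_envs = 4096
key = jax.random.PRNGKey(0)
pos = jnp.zeros((num_envs, 2))
vel = jnp.zeros((num_envs, 2))
actions = jax.random.normal(key, (num_envs, 2))

(pos, vel), rewards = batched_step((pos, vel), actions)
print(rewards.shape)  # (4096,): one simulated step across 4096 environments at once
```

Because every environment step is just array arithmetic, thousands of rollouts advance per GPU kernel launch, which is what makes throughput on the order of a million steps per minute plausible on one device.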
selected publications
The aim is to highlight a small subset of the work done in the group, and to give a sense of the sorts of problems we're working on. Please see Google Scholar for a complete and up-to-date list of publications.
2024
Learning to Assist Humans without Inferring Rewards
Vivek Myers, Evan Ellis, Sergey Levine, Benjamin Eysenbach, and Anca Dragan
In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024