OpenEnv: Agentic Execution Environments

Team

community

https://github.com/meta-pytorch/OpenEnv

meta-pytorch

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

burtenshaw updated a Space 3 days ago

openenv/sudoku

burtenshaw updated a Space 3 days ago

openenv/browsergym_env

burtenshaw updated a Space 3 days ago

openenv/repl

View all activity

burtenshaw

updated 6 Spaces 3 days ago

TextArena Environment Server

🎮

Control and interact with AI environments through a web interface

BrowserGym Environment Server

🌐

Control and monitor AI agents in simulated environments

REPL Environment Server

🎮

Control and monitor AI agent interactions in real-time

Echo Environment Server

🔊

Control and monitor environment interactions through web interface

TB2 Environment Server

🧪

Control and monitor AI agent environments through web interface

OpenSpiel Environment Server

🎮

Interact with AI gaming environments and control game actions

burtenshaw

updated a Space 9 days ago

README

🚀

sergiopaniego

posted an update 10 days ago

Post

397

Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally ( @microsoft ):

🔍 Detects training issues early
🛠 Lets you intervene safely
📊 Keeps long training runs stable, auditable & efficient

Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/

Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration

Code: https://github.com/microsoft/post-training-toolkit

sergiopaniego

posted an update 11 days ago

Post

2507

New TRL + OpenEnv example! 💥

Fine tune an LLM for playing Sudoku using an RL env via OpenEnv

Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.

Enjoy!

Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb

Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py

1 reply

sergiopaniego

posted an update 13 days ago

Post

2123

Date idea: read the entire Transformers v5.0.0 release notes

Officially stable now: https://github.com/huggingface/transformers/releases/tag/v5.0.0

1 reply

sergiopaniego

updated a collection 18 days ago

Environment Hub

Collection

A collection of OpenEnv-spec Environments • 11 items • Updated 18 days ago • 23

sergiopaniego

posted an update 20 days ago

Post

1596

FunctionGemma Tuning Lab is a new no-code tool by @google that lets you fine-tune a model directly from the browser, with no coding knowledge required, using TRL behind the scenes.

blog: https://developers.googleblog.com/a-guide-to-fine-tuning-functiongemma/

try it out: google/functiongemma-tuning-lab

This example builds on a more advanced one for learning fine-tuning with SFT using TRL: https://ai.google.dev/gemma/docs/functiongemma/finetuning-with-functiongemma

1 reply

sergiopaniego

posted an update 23 days ago

Post

802

TRL v0.27.0 is out!! 🥳

It includes GDPO, the latest variant of GRPO for multi-reward RL ✨
GDPO decouples reward normalization to avoid reward collapse and improve per-reward convergence — developed by
@sliuau @SimonX et al.

Explore the paper: GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization (2601.05242)

Explore the full set of changes here:
https://github.com/huggingface/trl/releases/tag/v0.27.0

sergiopaniego

updated a collection 26 days ago

Environment Hub

Collection

A collection of OpenEnv-spec Environments • 11 items • Updated 18 days ago • 23

sergiopaniego

posted an update 26 days ago

Post

2999

New REPL environment in OpenEnv available! ✨
Used in the Recursive Language Models (RLM) paper by Alex Zhang.

Ready for inference & post-training using trajectories. Handles long contexts:

> Run Python code in a sandbox
> Make recursive calls to LMs
> Explore data programmatically
> Return final result

Docs: https://meta-pytorch.org/OpenEnv/environments/repl/
Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py

sergiopaniego

posted an update 27 days ago

Post

486

Recursive Language Models (RLM) is a new interface for LLMs with cool ideas by Alex Zhang!

⚠️ LLMs struggle with long prompts → attention overload & lost info
🔄 RLMs inspect, split & call themselves on chunks, then aggregate results
✅ Handles millions of tokens, reduces noise, improves reasoning
💡 System prompt guides recursion
🎯 RLM trajectories can be used for RL training or distillation (OpenEnv+TRL!!)

We're adding it to OpenEnv (with Kashif Rasul): https://github.com/meta-pytorch/OpenEnv/pull/282

More resources:

> Paper: Recursive Language Models (2512.24601)
> Paper blog: https://alexzhang13.github.io/blog/2025/rlm/
> RLM repo: https://github.com/alexzhang13/rlm

2 replies

sergiopaniego

posted an update about 1 month ago

Post

2266

New GRPO + TRL free Colab notebook out! 🔥

Fine-tune 7B+ models on T4 GPUs thanks to a ton of memory optimizations for GRPO

7B model uses only 9.2 GB VRAM (~7× reduction) 🤯

Try the notebook here 👉 https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_trl_lora_qlora.ipynb

AI & ML interests

Recent Activity

Team members 11

openenv's activity

TextArena Environment Server

BrowserGym Environment Server

REPL Environment Server

Echo Environment Server

TB2 Environment Server

OpenSpiel Environment Server

README