NiceWebRL: a Python library for human subject experiments with reinforcement learning environments

📅 2025-08-21

📈 Citations: 0

✨ Influential: 0

career value

231K/year

🤖 AI Summary

This study addresses the bottleneck of adapting JAX-based reinforcement learning (RL) environments for direct use in online human behavioral experiments. To bridge this gap, we introduce NiceWebRL—a novel toolkit enabling seamless, low-latency, high-fidelity deployment of JAX RL environments (e.g., Craftax, Overcooked, XLand-Minigrid) as web-based platforms for human participants. It supports both single- and multi-agent experimental paradigms, integrates WebSocket-based real-time interaction, and provides Flax-compatible interfaces. Our core contributions are threefold: (1) establishing a standardized, cognitively grounded experimental infrastructure to rigorously evaluate and compare computational cognitive RL models against human behavioral data; (2) facilitating generalization assessment of human–AI collaborative algorithms under ecologically valid conditions; and (3) enabling empirical studies of large language model (LLM)-augmented human decision-making. Three cutting-edge applications demonstrate NiceWebRL’s pivotal role as an interdisciplinary bridge between AI and cognitive science.

Technology Category

Application Category

📝 Abstract

We present NiceWebRL, a research tool that enables researchers to use machine reinforcement learning (RL) environments for online human subject experiments. NiceWebRL is a Python library that allows any Jax-based environment to be transformed into an online interface, supporting both single-agent and multi-agent environments. As such, NiceWebRL enables AI researchers to compare their algorithms to human performance, cognitive scientists to test ML algorithms as theories for human cognition, and multi-agent researchers to develop algorithms for human-AI collaboration. We showcase NiceWebRL with 3 case studies that demonstrate its potential to help develop Human-like AI, Human-compatible AI, and Human-assistive AI. In the first case study (Human-like AI), NiceWebRL enables the development of a novel RL model of cognition. Here, NiceWebRL facilitates testing this model against human participants in both a grid world and Craftax, a 2D Minecraft domain. In our second case study (Human-compatible AI), NiceWebRL enables the development of a novel multi-agent RL algorithm that can generalize to human partners in the Overcooked domain. Finally, in our third case study (Human-assistive AI), we show how NiceWebRL can allow researchers to study how an LLM can assist humans on complex tasks in XLand-Minigrid, an environment with millions of hierarchical tasks. The library is available at https://github.com/KempnerInstitute/nicewebrl.

Problem

Research questions and friction points this paper is trying to address.

Develops a Python library for online human experiments with RL environments

Enables comparison of AI algorithms to human performance in cognition

Facilitates multi-agent research for human-AI collaboration across domains

Innovation

Methods, ideas, or system contributions that make the work stand out.

Python library for human-RL experiments

Converts Jax environments to web interface

Supports single and multi-agent environments

🔎 Similar Papers

CREW: Facilitating Human-AI Teaming Research

2024-07-31arXiv.orgCitations: 2

Anthropic

$500,000—$850,000 USD

San Francisco, CA, USA

AI Research Scientist, Reinforcement Learning