Jumanji¶

JAX-based hardware-accelerated environments from Google DeepMind. Organised into logic puzzles, packing problems, and routing tasks.

Install:: pip install -e ".[jumanji]"
Paradigm:: Single-agent
Stepping:: SINGLE_AGENT
Note:: Requires JAX with compatible hardware backend (CPU/GPU/TPU)

Category	Environments
Logic Puzzles	Game2048, Minesweeper, RubiksCube, SlidingTilePuzzle, Sudoku, GraphColoring
Packing	BinPack, FlatPack, JobShop, Knapsack, Tetris
Routing	Cleaner, Connector, CVRP, Maze, MMST, MultiCVRP, PacMan, RobotWarehouse, Snake, Sokoban, TSP

Citation¶

@inproceedings{bonnet2024jumanji,
  author       = {Cl{\'e}ment Bonnet and Daniel Luo and Donal Byrne and Shikha Surana and Sa{\"i}d Chadly and Sasha Abramowitz and Victor Le and Paul Breuil and Thomas Barrett and Arnu Pretorius and Alexandre Laterre},
  title        = {Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX},
  booktitle    = {International Conference on Learning Representations (ICLR)},
  year         = {2024},
}