Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX

May 22, 2026 · 4:00 AM UTC ·3 min read · 0 reactions · 0 comments · 17 views

#artificial intelligence #machine learning #mahjong

⚡ TL;DR · AI summary

Mahjax is a new GPU-accelerated Mahjong simulator designed for reinforcement learning using JAX. It allows for large-scale parallelization and offers a high-quality visualization tool for debugging. Experimental results indicate that Mahjax can achieve impressive training throughputs, demonstrating its effectiveness for training agents in the game.

Key facts

▪Mahjax is implemented in JAX to facilitate reinforcement learning research.
▪The simulator achieves throughputs of up to 2 million steps per second on NVIDIA A100 GPUs.
▪Agents trained in Mahjax can effectively improve their rank against baseline policies.

Original article

arXiv cs.AI

Read full at arXiv cs.AI →

Opening excerpt (first ~120 words) tap to expand

Computer Science > Artificial Intelligence arXiv:2605.20577 (cs) [Submitted on 20 May 2026] Title:Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX Authors:Soichiro Nishimori, Shinri Okano, Keigo Habara, Sotetsu Koyamada, Eason Yu, Masashi Sugiyama View a PDF of the paper titled Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX, by Soichiro Nishimori and 5 other authors View PDF HTML (experimental) Abstract:Riichi Mahjong is a multi-player, imperfect-information game characterized by stochasticity and high-dimensional state spaces. These attributes present a unique combination of challenges that mirror complex real-world decision-making problems in reinforcement learning.

…

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed

Discussion

0 comments

Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX

Discussion

More from arXiv cs.AI