Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

May 25, 2026 · 4:00 AM UTC ·3 min read · 0 reactions · 0 comments · 12 views

#machine learning #reinforcement learning #artificial intelligence

⚡ TL;DR · AI summary

The paper introduces Reflex, a new approach to reinforcement learning that utilizes reflection symmetry in state-based continuous control tasks. This method aims to improve sample efficiency by integrating reflection symmetry into policy learning. The authors demonstrate that Reflex outperforms standard baselines when evaluated on various benchmarks.

Key facts

▪Reflex leverages group-invariant Markov Decision Processes to enhance sample efficiency in reinforcement learning.
▪The paper focuses on state-based continuous control tasks and introduces two types of reflection: axial and bilateral.
▪Reflex integrates with both on-policy and off-policy reinforcement learning algorithms, showing superior performance in tests.

Original article

arXiv cs.AI

Read full at arXiv cs.AI →

Opening excerpt (first ~120 words) tap to expand

Computer Science > Machine Learning arXiv:2605.23415 (cs) [Submitted on 22 May 2026] Title:Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control Authors:Shuai Zhen, Yifan Zhang, Yuling Wang, Yanhua Yu View a PDF of the paper titled Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control, by Shuai Zhen and 3 other authors View PDF HTML (experimental) Abstract:Reinforcement learning has long struggled with poor sample efficiency. One promising approach to mitigate this problem is leveraging group-invariant Markov Decision Processes ($G$-invariant MDPs).

…

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed

Discussion

0 comments

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

Discussion

More from arXiv cs.AI