All Environments
navigation hard
VariableMorphingComplexityEnvironment
A 12x12 morphing maze with parametric volatility (10-30% wall toggle), variable goal relocation (50-200 steps), and dynamic resource counts (2-5). Features a 75-dimensional observation space combining external state, experience storage, and enhanced self-observation metrics for studying adaptation to continuous environmental changes. Tests whether self-observing agents outperform external-only agents under varying morphing intensities.
Observation Space
Box(shape=[75])
Action Space
Discrete(shape=[4])
Reward
dense_with_difficulty_scaling