navigation hard

VariableMorphingComplexityEnvironment

A 12x12 morphing maze with parametric volatility (10-30% wall toggle rate), variable goal relocation (every 50-200 steps), and dynamic resource counts (2-5). The 75-dimensional observation space combines external state, experience storage, and enhanced self-observation metrics, and is intended for studying adaptation to continuous environmental change. The environment tests whether self-observing agents outperform external-only agents under varying morphing intensities.
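
The parameter ranges in the description can be illustrated with a minimal sketch. This is not the environment's actual implementation: the names `MorphParams`, `sample_morph_params`, and `toggle_walls` are hypothetical, uniform sampling is an assumption, and "wall toggle" is read here as flipping a fraction of grid cells between wall and open.

```python
import random
from dataclasses import dataclass

GRID_SIZE = 12  # 12x12 maze, from the description


@dataclass
class MorphParams:
    """Hypothetical container for one morphing configuration."""
    wall_toggle_rate: float      # fraction of cells flipped per morph event
    goal_relocation_steps: int   # steps between goal relocations
    resource_count: int          # number of resources in the maze


def sample_morph_params(rng: random.Random) -> MorphParams:
    # Ranges come from the description; uniform sampling is an assumption.
    return MorphParams(
        wall_toggle_rate=rng.uniform(0.10, 0.30),
        goal_relocation_steps=rng.randint(50, 200),
        resource_count=rng.randint(2, 5),
    )


def toggle_walls(walls, rate, rng):
    """Return a copy of the grid with round(rate * cell count) cells flipped."""
    cells = [(r, c) for r in range(GRID_SIZE) for c in range(GRID_SIZE)]
    n_toggle = round(rate * len(cells))
    new_walls = [row[:] for row in walls]
    for r, c in rng.sample(cells, n_toggle):
        new_walls[r][c] = not new_walls[r][c]
    return new_walls
```

A caller would sample one `MorphParams` per episode (or per morphing phase) and call `toggle_walls` at each morph event; a real maze would also need to keep the goal reachable, which this sketch does not attempt.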

Observation Space

Box(shape=[75])

Action Space

Discrete(shape=[4])
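
The entry does not say what the four actions mean. As a sketch only, assuming they are the four cardinal moves on the grid (the mapping `ACTION_DELTAS` and the helper `apply_action` are hypothetical), a step could be resolved like this:

```python
GRID_SIZE = 12  # 12x12 maze, from the description

# Hypothetical mapping from action index to (row, column) offset.
ACTION_DELTAS = {
    0: (-1, 0),  # up
    1: (1, 0),   # down
    2: (0, -1),  # left
    3: (0, 1),   # right
}


def apply_action(pos, action, walls):
    """Return the new position, or the old one if the move is blocked."""
    dr, dc = ACTION_DELTAS[action]
    r, c = pos[0] + dr, pos[1] + dc
    # Moves off the grid leave the agent in place.
    if not (0 <= r < GRID_SIZE and 0 <= c < GRID_SIZE):
        return pos
    # Moves into a wall cell leave the agent in place.
    if walls[r][c]:
        return pos
    return (r, c)
```

Because walls toggle during an episode, the same action from the same cell can succeed at one step and be blocked at a later one.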

Reward

dense_with_difficulty_scaling