Algorithms typically try to minimize errors and failures. However, this research seeks to challenge this paradigm by investigating the potential advantages of failure. We explore failure-preserving evolutionary algorithms in the context of the Lunar Lander game environment, where the purple lander is the agent. Variants of FI-2Pop in the Lunar Lander environment are tested whether preserving "failed" individuals improves evolutionary search, revealing reward-hacking behaviors.
multiprocessing, batched rollouts (≈6.7× output speedup).

We ran these algorithms with 100,000 agents and selected the highest fitness agents:
(0.591, −0.215).multiprocessing, achieving approximately 6.7× faster trial throughput.Presented a research poster at the Lida Orzeck '68 Poster Session at Barnard College on July 31, 2024 and at the Barnard Computer Science Senior Research Symposium on April 25, 2025.