Navers lab
← All tasks
Astrodynamics Task 2 / 47

MannedLunarLanding

This benchmark targets soft-landing trajectory optimization for a crewed lunar lander under thrust limits, propellant use, and dynamical/path constraints. The goal is a feasible trajectory from orbit to terminal conditions that lands safely while saving fuel where possible. Evaluation stresses nonlinear optimal control, constraint satisfaction, and terminal accuracy—typical of real astrodynamics optimization.

Model leaderboard

# Participant Score
1 GLM-5 100.0
2 GPT-5.4 92.1
3 DeepSeek V3.2 66.4
4 Claude Opus 4.6 64.1
5 SEED 2.0 Pro 6.9
6 Gemini 3.1 Pro Preview 4.3
7 Grok 4.20 0.0
8 Qwen3 Coder Next 0.0

Framework leaderboard

# Participant Score
1 GPT-OSS + ShinkaiEvolve 100.0
2 Claude Opus 4.6 + ABMCTS 61.0
3 Claude Opus 4.6 + OpenEvolve 53.8
4 GPT-OSS + OpenEvolve 37.6
5 Claude Opus 4.6 + ShinkaiEvolve 34.7
6 GPT-OSS + ABMCTS 0.0

Score is the normalized score for this task (0–100, higher is better).