Navers lab
InventoryOptimization Task 11 / 47

finite_horizon_dp

Finite-horizon stochastic inventory control: the goal is a time-varying policy that minimizes discounted or total expected cost over a known horizon. The state may include on-hand inventory and information delays. Scoring rolls out the policy and accumulates costs and constraint violations. This setup is typical of promotional or seasonal planning.
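A finite-horizon problem of this kind is commonly solved by backward induction: start from a terminal cost at the end of the horizon and work back one period at a time, storing the best order quantity for each (period, inventory) pair. The sketch below is illustrative only and is not the task's reference solution or scoring code. It assumes a simplified model that the task description does not specify: lost sales, zero lead time, integer inventory with a storage cap, a stationary demand distribution, and linear ordering, holding, and shortage costs. The function name `solve_finite_horizon` and every parameter are hypothetical.

```python
import numpy as np

def solve_finite_horizon(T, max_inv, max_order, demand_pmf,
                         c_order, c_hold, c_short, gamma=1.0):
    """Backward induction for a toy lost-sales inventory model.

    All names and the cost structure are assumptions made for illustration.
    demand_pmf[d] is the probability that demand equals d in any period.
    Returns (V, policy), where V[t, s] is the minimal expected cost-to-go
    from period t with s units on hand, and policy[t, s] is the order
    quantity that attains it.
    """
    demand_pmf = np.asarray(demand_pmf, dtype=float)
    demands = np.arange(len(demand_pmf))

    V = np.zeros((T + 1, max_inv + 1))        # V[T] = 0: no terminal cost
    policy = np.zeros((T, max_inv + 1), dtype=int)

    for t in range(T - 1, -1, -1):            # step backward in time
        for s in range(max_inv + 1):          # on-hand inventory
            best_cost, best_q = np.inf, 0
            # Orders are capped by max_order and by remaining storage.
            for q in range(min(max_order, max_inv - s) + 1):
                y = s + q                                  # stock after ordering
                leftover = np.maximum(y - demands, 0)      # next-period inventory
                shortage = np.maximum(demands - y, 0)      # unmet (lost) demand
                stage = c_order * q + c_hold * leftover + c_short * shortage
                # Expected stage cost plus discounted cost-to-go.
                total = np.dot(demand_pmf, stage + gamma * V[t + 1, leftover])
                if total < best_cost:
                    best_cost, best_q = total, q
            V[t, s] = best_cost
            policy[t, s] = best_q
    return V, policy

# Example call with made-up numbers: demand is 0, 1, or 2 units per period.
V, policy = solve_finite_horizon(
    T=4, max_inv=5, max_order=3, demand_pmf=[0.2, 0.5, 0.3],
    c_order=1.0, c_hold=0.5, c_short=5.0, gamma=0.95,
)
```

Because `policy` is indexed by period as well as by inventory level, the resulting rule is time-varying, which is the property the task description emphasizes. Information delays would enlarge the state beyond the single inventory index used here.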

Model leaderboard

#   Participant               Score
1   GPT-5.4                   100.0
2   Claude Opus 4.6            99.8
3   Grok 4.20                  79.6
4   DeepSeek V3.2              69.5
5   GLM-5                      68.4
6   Gemini 3.1 Pro Preview     60.6
7   SEED 2.0 Pro               56.0
8   Qwen3 Coder Next            0.0

Framework leaderboard

#   Participant                        Score
1   Claude Opus 4.6 + OpenEvolve       100.0
2   Claude Opus 4.6 + ShinkaiEvolve     62.5
3   GPT-OSS + ShinkaiEvolve             36.6
4   Claude Opus 4.6 + ABMCTS            14.8
5   GPT-OSS + OpenEvolve                 4.0
6   GPT-OSS + ABMCTS                     0.0

Scores are normalized for this task (0–100, higher is better).