JobShop
Task 17 / 47
ta
Job-shop scheduling (JSSP) on Taillard's TA family: large, widely used instances for makespan minimization that stress heuristics and parallel search at scale. Solutions are scored against the best-known solutions for the Taillard instances, a standard reference point for industrial-grade job-shop solvers.
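To make the objective concrete, here is a minimal sketch of how the makespan of a job-shop schedule can be computed. It is not the benchmark's evaluator. The function name `makespan`, the operation-based `order` encoding, and the toy 3-job instance are all assumptions chosen for illustration; real TA instances are far larger.

```python
from typing import List, Tuple

# Each job is a list of (machine, duration) operations in their required order.
Job = List[Tuple[int, int]]


def makespan(jobs: List[Job], order: List[int]) -> int:
    """Decode an operation-based sequence and return the schedule's makespan.

    `order` lists job indices; the k-th occurrence of job j schedules
    job j's k-th operation at the earliest time both the job and the
    required machine are free. (Hypothetical encoding, for illustration.)
    """
    n_machines = 1 + max(m for job in jobs for m, _ in job)
    job_ready = [0] * len(jobs)        # time each job finishes its last scheduled op
    machine_ready = [0] * n_machines   # time each machine becomes free
    next_op = [0] * len(jobs)          # index of each job's next unscheduled op

    for j in order:
        if next_op[j] >= len(jobs[j]):
            raise ValueError(f"job {j} appears too many times in order")
        machine, duration = jobs[j][next_op[j]]
        start = max(job_ready[j], machine_ready[machine])
        end = start + duration
        job_ready[j] = end
        machine_ready[machine] = end
        next_op[j] += 1

    if any(next_op[j] != len(jobs[j]) for j in range(len(jobs))):
        raise ValueError("order must schedule every operation exactly once")
    return max(job_ready)


# Toy instance (made up for this example): 3 jobs on 3 machines.
toy_jobs: List[Job] = [
    [(0, 3), (1, 2), (2, 2)],
    [(0, 2), (2, 1), (1, 4)],
    [(1, 4), (2, 3)],
]

interleaved = [0, 1, 2, 0, 1, 2, 0, 1]
one_job_at_a_time = [0, 0, 0, 1, 1, 1, 2, 2]

print(makespan(toy_jobs, interleaved))        # 11
print(makespan(toy_jobs, one_job_at_a_time))  # 19
```

The two orderings show why the search matters: the same operations finish at time 11 when interleaved but at time 19 when jobs run one after another. A solver for this task searches over such orderings to push the makespan toward the best-known value.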
Model leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | Claude Opus 4.6 | 100.0 |
| 2 | GLM-5 | 41.4 |
| 3 | GPT-5.4 | 31.9 |
| 4 | Gemini 3.1 Pro Preview | 25.3 |
| 5 | Qwen3 Coder Next | 23.0 |
| 6 | Grok 4.20 | 13.8 |
| 7 | DeepSeek V3.2 | 13.6 |
| 8 | SEED 2.0 Pro | 0.0 |
Framework leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | Claude Opus 4.6 + ShinkaiEvolve | 100.0 |
| 2 | Claude Opus 4.6 + OpenEvolve | 97.1 |
| 3 | Claude Opus 4.6 + ABMCTS | 68.4 |
| 4 | GPT-OSS + ShinkaiEvolve | 66.3 |
| 5 | GPT-OSS + ABMCTS | 24.7 |
| 6 | GPT-OSS + OpenEvolve | 0.0 |
Scores are normalized for this task to a 0–100 scale; higher is better.