JobShop
Task 16 / 47
swv
JSSP on the SWV family (Storer, Wu, Vaccari 1992): another standard suite stressing algorithm robustness across shop layouts and sizes. Encodings range from permutations to time-indexed MILP; scoring references published best-known values for optimality gaps.
Model leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | Claude Opus 4.6 | 100.0 |
| 2 | GPT-5.4 | 69.9 |
| 3 | GLM-5 | 67.5 |
| 4 | Grok 4.20 | 44.5 |
| 5 | Qwen3 Coder Next | 4.2 |
| 6 | SEED 2.0 Pro | 1.4 |
| 7 | DeepSeek V3.2 | 0.6 |
| 8 | Gemini 3.1 Pro Preview | 0.0 |
Framework leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | Claude Opus 4.6 + ABMCTS | 100.0 |
| 2 | Claude Opus 4.6 + ShinkaiEvolve | 91.3 |
| 3 | Claude Opus 4.6 + OpenEvolve | 63.3 |
| 4 | GPT-OSS + ShinkaiEvolve | 35.4 |
| 5 | GPT-OSS + OpenEvolve | 20.4 |
| 6 | GPT-OSS + ABMCTS | 0.0 |
Score is the normalized score for this task (0–100, higher is better).