ReactionOptimisation
Task 35 / 47
reizman_suzuki_pareto
Pareto optimization on the Reizman Suzuki emulator: choose a catalyst and set the continuous reaction conditions so as to trade off conflicting chemical metrics. The task is a multi-objective black-box search under chemistry-side constraints, scored by the benchmark's evaluator. It mirrors the joint design of a recipe (the catalyst) and its operating point (the continuous conditions).
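To make "Pareto optimization" concrete, here is a minimal sketch of non-dominated filtering over evaluated candidates. It is not the benchmark's evaluator. The `Candidate` fields, the catalyst labels, the numeric values, and the assumption that both metrics are maximized are all hypothetical and chosen only for illustration.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    catalyst: str       # hypothetical categorical choice
    temperature: float  # hypothetical continuous condition
    metric_a: float     # assumed: higher is better
    metric_b: float     # assumed: higher is better


def dominates(p: Candidate, q: Candidate) -> bool:
    """True if p is at least as good as q on both metrics and strictly better on one."""
    at_least_as_good = p.metric_a >= q.metric_a and p.metric_b >= q.metric_b
    strictly_better = p.metric_a > q.metric_a or p.metric_b > q.metric_b
    return at_least_as_good and strictly_better


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep every candidate that no other candidate dominates (O(n^2) scan)."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Made-up evaluations, standing in for emulator outputs.
candidates = [
    Candidate("cat_1", 60.0, 0.90, 20.0),
    Candidate("cat_1", 90.0, 0.70, 35.0),
    Candidate("cat_2", 75.0, 0.60, 30.0),   # dominated by the second candidate
    Candidate("cat_2", 100.0, 0.40, 50.0),
    Candidate("cat_3", 80.0, 0.40, 45.0),   # dominated by the fourth candidate
]

front = pareto_front(candidates)
for c in front:
    print(c)
```

The quadratic scan is adequate for small candidate sets. A search strategy would typically propose new catalyst and condition combinations, evaluate them, and keep only the non-dominated set.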
Model leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | GLM-5 | 100.0 |
| 2 | Claude Opus 4.6 | 96.7 |
| 3 | GPT-5.4 | 96.2 |
| 4 | DeepSeek V3.2 | 95.1 |
| 5 | Qwen3 Coder Next | 92.2 |
| 6 | SEED 2.0 Pro | 83.1 |
| 7 | Gemini 3.1 Pro Preview | 81.9 |
| 8 | Grok 4.20 | 0.0 |

Framework leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | GPT-OSS + OpenEvolve | 100.0 |
| 2 | Claude Opus 4.6 + OpenEvolve | 44.0 |
| 3 | Claude Opus 4.6 + ABMCTS | 43.8 |
| 4 | Claude Opus 4.6 + ShinkaEvolve | 36.8 |
| 5 | GPT-OSS + ShinkaEvolve | 20.0 |
| 6 | GPT-OSS + ABMCTS | 0.0 |

Scores are normalized for this task to a 0–100 scale; higher is better.