ReactionOptimisation
Task 36 / 47
snar_multiobjective
Multi-objective optimization of a continuous-flow SnAr reaction, trading productivity against waste or byproduct metrics along a Pareto front over continuous operating variables. Grounded in chemical engineering emulators (SUMMIT family), it reflects real plant trade-offs among yield, waste, and operability.
Model leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | GPT-5.4 | 100.0 |
| 2 | Claude Opus 4.6 | 54.2 |
| 3 | DeepSeek V3.2 | 37.7 |
| 4 | GLM-5 | 33.9 |
| 5 | Gemini 3.1 Pro Preview | 28.1 |
| 6 | SEED 2.0 Pro | 25.5 |
| 7 | Qwen3 Coder Next | 1.7 |
| 8 | Grok 4.20 | 0.0 |
Framework leaderboard
| # | Participant | Score |
|---|---|---|
| 1 | Claude Opus 4.6 + OpenEvolve | 100.0 |
| 2 | Claude Opus 4.6 + ShinkaiEvolve | 83.0 |
| 3 | GPT-OSS + ShinkaiEvolve | 80.3 |
| 4 | GPT-OSS + OpenEvolve | 58.4 |
| 5 | GPT-OSS + ABMCTS | 54.4 |
| 6 | Claude Opus 4.6 + ABMCTS | 0.0 |
Score is the normalized score for this task (0–100, higher is better).