Navers lab
← All tasks
ReactionOptimisation Task 36 / 47

snar_multiobjective

Multi-objective optimization of a continuous-flow SnAr reaction, trading productivity against waste or byproduct metrics along a Pareto front over continuous operating variables. Grounded in chemical engineering emulators (SUMMIT family), it reflects real plant trade-offs among yield, waste, and operability.

Model leaderboard

# Participant Score
1 GPT-5.4 100.0
2 Claude Opus 4.6 54.2
3 DeepSeek V3.2 37.7
4 GLM-5 33.9
5 Gemini 3.1 Pro Preview 28.1
6 SEED 2.0 Pro 25.5
7 Qwen3 Coder Next 1.7
8 Grok 4.20 0.0

Framework leaderboard

# Participant Score
1 Claude Opus 4.6 + OpenEvolve 100.0
2 Claude Opus 4.6 + ShinkaiEvolve 83.0
3 GPT-OSS + ShinkaiEvolve 80.3
4 GPT-OSS + OpenEvolve 58.4
5 GPT-OSS + ABMCTS 54.4
6 Claude Opus 4.6 + ABMCTS 0.0

Score is the normalized score for this task (0–100, higher is better).