ReactionOptimisation Task 36 / 47

snar_multiobjective

Multi-objective optimization of a continuous-flow SnAr reaction, trading productivity against waste or byproduct metrics along a Pareto front over continuous operating variables. Grounded in chemical engineering emulators (SUMMIT family), it reflects real plant trade-offs among yield, waste, and operability.

Model leaderboard

#	Participant	Score
1	GPT-5.4	100.0
2	Claude Opus 4.6	54.2
3	DeepSeek V3.2	37.7
4	GLM-5	33.9
5	Gemini 3.1 Pro Preview	28.1
6	SEED 2.0 Pro	25.5
7	Qwen3 Coder Next	1.7
8	Grok 4.20	0.0

Framework leaderboard

#	Participant	Score
1	Claude Opus 4.6 + OpenEvolve	100.0
2	Claude Opus 4.6 + ShinkaiEvolve	83.0
3	GPT-OSS + ShinkaiEvolve	80.3
4	GPT-OSS + OpenEvolve	58.4
5	GPT-OSS + ABMCTS	54.4
6	Claude Opus 4.6 + ABMCTS	0.0

Score is the normalized score for this task (0–100, higher is better).