ReactionOptimisation Task 35 / 47

reizman_suzuki_pareto

Pareto optimization on the Reizman Suzuki emulator over catalyst choice and continuous conditions to improve conflicting chemical metrics. Multi-objective black-box search under chemistry-side constraints is scored through the benchmark's evaluator, mirroring co-design of recipe and operating point.

Model leaderboard

#	Participant	Score
1	GLM-5	100.0
2	Claude Opus 4.6	96.7
3	GPT-5.4	96.2
4	DeepSeek V3.2	95.1
5	Qwen3 Coder Next	92.2
6	SEED 2.0 Pro	83.1
7	Gemini 3.1 Pro Preview	81.9
8	Grok 4.20	0.0

Framework leaderboard

#	Participant	Score
1	GPT-OSS + OpenEvolve	100.0
2	Claude Opus 4.6 + OpenEvolve	44.0
3	Claude Opus 4.6 + ABMCTS	43.8
4	Claude Opus 4.6 + ShinkaiEvolve	36.8
5	GPT-OSS + ShinkaiEvolve	20.0
6	GPT-OSS + ABMCTS	0.0

Score is the normalized score for this task (0–100, higher is better).