Pick 2–4 models. Per-task scores below.
Add a slug to the input above, or jump straight in via URL. For example ?models=sonnet-4-7,gpt-5.
?models=sonnet-4-7,gpt-5