gate

| field | value |
|---|---|
| default | baseline |
| status | approved; waiting remote host |
| cost | Akash/Vast GPU; Njalla/FlokiNET control baseline |
| execute | blocked until AEDL_REMOTE_HOST |

| rank | model | tag | license | size | role |
|---|---|---|---|---|---|
| 1 | gpt-oss 20B | gpt-oss:20b | Apache-2.0 + policy | 14.0 GB | general reasoning and tool-use baseline |
| 2 | Qwen3-Coder 30B-A3B | qwen3-coder:30b | Apache-2.0 | 19.0 GB | repo-scale coding and implementation |
| 3 | Qwen3 30B-A3B | qwen3:30b | Apache-2.0 | 19.0 GB | efficient multilingual generalist |
| 4 | Mistral Small 3.2 24B | mistral-small3.2:24b | Apache-2.0 | 15.0 GB | European long-context generalist |
| 5 | DeepSeek R1 Distill Qwen 32B | deepseek-r1:32b | MIT/distill notice | 20.0 GB | deliberate reasoning and critique |
| 6 | Phi-4 Mini Reasoning | phi4-mini-reasoning:latest | MIT | 3.2 GB | small-reasoner control |
| 7 | SmolLM3 3B | alibayram/smollm3:latest | Apache-2.0 | 3.0 GB | tiny open control |
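
For sizing the remote host's disk, the table can be totalled with a short sketch like the one below. The sizes are copied from the table; the helper names and the 1.2x headroom factor are illustrative assumptions, not part of the AEDL scripts.

```python
# Sizes in GB, copied from the "size" column of the baseline table.
BASELINE_SIZES_GB = {
    "gpt-oss:20b": 14.0,
    "qwen3-coder:30b": 19.0,
    "qwen3:30b": 19.0,
    "mistral-small3.2:24b": 15.0,
    "deepseek-r1:32b": 20.0,
    "phi4-mini-reasoning:latest": 3.2,
    "alibayram/smollm3:latest": 3.0,
}


def total_size_gb(sizes):
    """Sum the listed model sizes, rounded to one decimal place."""
    return round(sum(sizes.values()), 1)


def fits_on_disk(sizes, free_gb, headroom=1.2):
    """Check whether all models fit, leaving some spare space.

    The headroom factor is an assumed safety margin, not a measured value.
    """
    return total_size_gb(sizes) * headroom <= free_gb


if __name__ == "__main__":
    print(f"baseline total: {total_size_gb(BASELINE_SIZES_GB)} GB")
    print("fits in 120 GB:", fits_on_disk(BASELINE_SIZES_GB, 120))
```

The total covers only the listed model sizes; any runtime or cache overhead on the host would come on top of it.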

    python3 scripts/send-aedl-model-questionnaire-email.py
    python3 scripts/mail-verdict-poller.py --since-hours 24
    python3 scripts/aedl-remote-executor.py --selection baseline --backend akash --sku a100-40gb --json
    export AEDL_REMOTE_HOST=user@fresh-gpu-host
    python3 scripts/aedl-remote-executor.py --selection baseline --backend akash --sku a100-40gb --execute
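
The execute gate (blocked until `AEDL_REMOTE_HOST` is set) could be checked before an `--execute` run with a preflight along these lines. This is a minimal sketch, not the actual logic of `aedl-remote-executor.py`: the function name `execute_gate` is hypothetical, and the `user@host` shape is assumed from the example value in the commands.

```python
import os


def execute_gate(env=None):
    """Return (allowed, reason) for an --execute run.

    Hypothetical helper: treats the gate as open only when
    AEDL_REMOTE_HOST is set and looks like user@host.
    """
    env = os.environ if env is None else env
    host = env.get("AEDL_REMOTE_HOST", "").strip()
    if not host:
        return False, "blocked: AEDL_REMOTE_HOST is not set"
    if "@" not in host:
        # Assumption: the executor expects an SSH-style user@host target.
        return False, "blocked: expected user@host"
    return True, f"ready: {host}"


if __name__ == "__main__":
    allowed, reason = execute_gate()
    print(reason)
    raise SystemExit(0 if allowed else 1)
```

Passing a plain dict as `env` makes the check easy to exercise without touching the real environment.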