ON1 // AEDL benchmark sign-off

gate before pulls

default: baseline

status: approved; waiting remote host

cost: Akash/Vast GPU; Njalla/FlokiNET control baseline

execute: blocked until AEDL_REMOTE_HOST
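
The gate above could be checked in code before any pull is attempted. The following is a minimal sketch, not the project's actual implementation: the function name `execution_gate` and the idea of passing the status string in are assumptions made for illustration. Only the `AEDL_REMOTE_HOST` variable and the "approved" status come from this page.

```python
import os
from typing import Mapping, Optional, Tuple


def execution_gate(status: str, env: Optional[Mapping[str, str]] = None) -> Tuple[bool, str]:
    """Return (allowed, reason) for running the benchmark pulls.

    Hypothetical helper: mirrors the two conditions listed on this page,
    sign-off approved and AEDL_REMOTE_HOST present in the environment.
    """
    env = os.environ if env is None else env
    if not status.strip().lower().startswith("approved"):
        return False, "sign-off not approved"
    host = env.get("AEDL_REMOTE_HOST", "").strip()
    if not host:
        return False, "AEDL_REMOTE_HOST is not set"
    return True, "ready to execute on " + host


# With the status shown above and no host exported, the gate stays closed.
print(execution_gate("approved; waiting remote host", {}))
```

Passing the environment as a mapping keeps the check testable without touching the real shell environment.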

AEDL rule offspring

target: stability, capability, profitability

split: 30% parent default, optimized per aegent

spawn: only if 30-day lower bound is profitable

migrate: trigger at 25% backend cost improvement or SLA failure
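
The offspring rule can be read as three small decisions. The sketch below is one possible interpretation, with several assumptions: "profitable" is taken to mean a 30-day lower-bound profit strictly above zero, "25% backend cost improvement" is taken as a relative reduction against the current backend cost, and the 30% split is taken as the parent's share of revenue. All function and constant names are hypothetical.

```python
PARENT_SPLIT_DEFAULT = 0.30      # parent's default share (assumed to be a revenue share)
MIGRATE_COST_IMPROVEMENT = 0.25  # relative cost reduction that triggers migration


def should_spawn(lower_bound_30d_profit: float) -> bool:
    """Spawn only if the 30-day lower bound on profit is positive (assumed meaning)."""
    return lower_bound_30d_profit > 0


def should_migrate(current_cost: float, candidate_cost: float, sla_failed: bool) -> bool:
    """Migrate on SLA failure, or when the candidate backend is at least 25% cheaper."""
    if sla_failed:
        return True
    if current_cost <= 0:
        return False  # no meaningful relative improvement can be computed
    improvement = (current_cost - candidate_cost) / current_cost
    return improvement >= MIGRATE_COST_IMPROVEMENT


def split_revenue(revenue: float, parent_split: float = PARENT_SPLIT_DEFAULT):
    """Return (parent_share, offspring_share); parent_split may be tuned per aegent."""
    parent = revenue * parent_split
    return parent, revenue - parent


print(should_spawn(12.5), should_migrate(100.0, 75.0, False), should_migrate(100.0, 80.0, False))
```

Whether the 25% threshold is inclusive is not stated on this page; the sketch treats exactly 25% as sufficient.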

baseline pull list commercial-use screened

rank	model	tag	license	size	role
1	gpt-oss 20B	`gpt-oss:20b`	Apache-2.0 + policy	14.0 GB	general reasoning and tool-use baseline
2	Qwen3-Coder 30B-A3B	`qwen3-coder:30b`	Apache-2.0	19.0 GB	repo-scale coding and implementation
3	Qwen3 30B-A3B	`qwen3:30b`	Apache-2.0	19.0 GB	efficient multilingual generalist
4	Mistral Small 3.2 24B	`mistral-small3.2:24b`	Apache-2.0	15.0 GB	European long-context generalist
5	DeepSeek R1 Distill Qwen 32B	`deepseek-r1:32b`	MIT/distill notice	20.0 GB	deliberate reasoning and critique
6	Phi-4 Mini Reasoning	`phi4-mini-reasoning:latest`	MIT	3.2 GB	small-reasoner control
7	SmolLM3 3B	`alibayram/smollm3:latest`	Apache-2.0	3.0 GB	tiny open control
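
Before pulling, it may help to check that the remote host has room for the whole list. The sketch below copies the tags and sizes from the table and sums them; the `fits_on_disk` helper and its headroom factor are hypothetical and only illustrate the check.

```python
# (tag, size in GB) pairs copied from the baseline pull list above.
BASELINE_PULLS = [
    ("gpt-oss:20b", 14.0),
    ("qwen3-coder:30b", 19.0),
    ("qwen3:30b", 19.0),
    ("mistral-small3.2:24b", 15.0),
    ("deepseek-r1:32b", 20.0),
    ("phi4-mini-reasoning:latest", 3.2),
    ("alibayram/smollm3:latest", 3.0),
]


def total_size_gb(pulls) -> float:
    """Sum the listed download sizes in GB."""
    return sum(size for _tag, size in pulls)


def fits_on_disk(pulls, free_gb: float, headroom: float = 1.2) -> bool:
    """Hypothetical check: require free space of at least headroom x total size."""
    return free_gb >= total_size_gb(pulls) * headroom


print(round(total_size_gb(BASELINE_PULLS), 1))
```

The seven entries add up to about 93.2 GB, so a host with a 100 GB disk would pass only without headroom.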

commands after approval

python3 scripts/send-aedl-model-questionnaire-email.py
python3 scripts/mail-verdict-poller.py --since-hours 24
python3 scripts/aedl-remote-executor.py --selection baseline --backend akash --sku a100-40gb --json
export AEDL_REMOTE_HOST=user@fresh-gpu-host
python3 scripts/aedl-remote-executor.py --selection baseline --backend akash --sku a100-40gb --execute
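
The two executor calls above differ only in their last flag. A small sketch of how a wrapper might build both argument lists is shown here; it uses only the flags that appear in the commands above. The `executor_argv` helper is hypothetical, and reading the `--json` call as a preview step that precedes `--execute` is an assumption based on the command order, not something this page states.

```python
from typing import List

# Arguments shared by both executor invocations listed above.
BASE_ARGV = [
    "python3", "scripts/aedl-remote-executor.py",
    "--selection", "baseline",
    "--backend", "akash",
    "--sku", "a100-40gb",
]


def executor_argv(execute: bool) -> List[str]:
    """Build the argument list: --execute for the real run, --json otherwise."""
    return BASE_ARGV + (["--execute"] if execute else ["--json"])


# Print both command lines without running anything.
for flag in (False, True):
    print(" ".join(executor_argv(flag)))
```

The resulting lists can be handed to `subprocess.run(argv, check=True)` once `AEDL_REMOTE_HOST` has been exported.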
