SkillOrchestra

Phase 0: Exploration

Run all 6 pool models on every training query to collect execution traces.

Discover skills from contrastive traces, profile agents, and refine the taxonomy.

Evaluate candidate handbooks at different granularities and select the Pareto-optimal one on the validation set.

Deploy the selected handbook and evaluate on 500 held-out NQ test queries.

Curated examples showing skill-aware routing in action.