1) Sign in
- Open the SwarmAI app. The login screen shows Email and Password inputs plus a Sign in button.
- Enter your workspace credentials. After sign-in, if you belong to multiple projects, choose one with the project picker in the header; most pages require an active project.
2) Add a secret
- Go to Settings → Secrets.
- Click Add secret. In the “Set API secret” dialog, provide a Secret label and Secret value (your provider API key).
- Save. Secrets are stored securely and are not shown again after saving. Create a separate label per provider (e.g., openai-prod, anthropic-eval, gemini-pro).
3) Register a model for inference/eval
- In Settings → Models, click Add model.
- Fill the registry entry:
  - Display Name: the model ref you’ll reuse (e.g., openai-gpt4o, gemini-pro, anthropic-sonnet).
  - Model name: the provider’s identifier (e.g., gpt-4o, gemini-1.5-pro, claude-3.5-sonnet).
  - Provider: choose the provider you’re targeting.
  - Base URL: set the provider’s completion endpoint:
    - OpenAI: https://api.openai.com/v1/chat/completions
    - Gemini: https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-pro:generateContent
    - Anthropic: https://api.anthropic.com/v1/messages
  - Temperature / Retries / Max completion tokens: tune as needed.
  - API key (secret): select the secret you just added; use Manage secrets to jump back if the list is empty.
- Set a Default model reference at the top of the Models card so planners/evaluators can reuse it.
- Optional but common: in the Planner and Evaluation tabs, pick the same model ref for Model reference so automated planning and scoring use your registry entry.
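Before saving a registry entry, it can help to check it against the fields listed above. The following is a minimal sketch, not SwarmAI's actual schema: the `ModelEntry` class, its field names, the default values, and the `check_entry` helper are all hypothetical and simply mirror the Add model form. The expected hosts are taken from the Base URL examples in this step.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hosts taken from the Base URL examples in this guide.
EXPECTED_HOSTS = {
    "openai": "api.openai.com",
    "gemini": "generativelanguage.googleapis.com",
    "anthropic": "api.anthropic.com",
}


@dataclass
class ModelEntry:
    """Hypothetical mirror of the Add model form (not SwarmAI's real schema)."""
    display_name: str            # model ref reused elsewhere, e.g. "openai-gpt4o"
    model_name: str              # provider identifier, e.g. "gpt-4o"
    provider: str                # "openai", "gemini", or "anthropic"
    base_url: str                # provider completion endpoint
    secret_label: str            # label of the secret added in step 2
    temperature: float = 0.0     # assumed default, tune as needed
    retries: int = 2             # assumed default, tune as needed
    max_completion_tokens: int = 1024  # assumed default, tune as needed


def check_entry(entry: ModelEntry) -> list[str]:
    """Return a list of problems; an empty list means the entry looks consistent."""
    problems = []
    url = urlparse(entry.base_url)
    if url.scheme != "https":
        problems.append("base_url should use https")
    expected = EXPECTED_HOSTS.get(entry.provider)
    if expected is None:
        problems.append(f"unknown provider: {entry.provider}")
    elif url.hostname != expected:
        problems.append(f"base_url host {url.hostname} does not match {expected}")
    if not entry.secret_label:
        problems.append("no secret selected")
    return problems


entry = ModelEntry(
    display_name="openai-gpt4o",
    model_name="gpt-4o",
    provider="openai",
    base_url="https://api.openai.com/v1/chat/completions",
    secret_label="openai-prod",
)
print(check_entry(entry))  # prints []
```

A mismatch between Provider and Base URL (for example, an Anthropic entry pointing at the OpenAI endpoint) is the kind of mistake this check is meant to surface before a run fails.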
4) Build a test plan
- Open Test Plans and click the small + (Create new test plan). A full-screen editor opens.
- In Basic Details, fill Test Plan Name, Category, Status, Version, Tags, and Description.
- Add requirements in the Requirements tab (Add Requirement). Each has Title, Description, Category, Priority, and Acceptance Criteria.
- Add fixtures in the Fixtures tab (Create Fixture) with Name, Description, and JSON Fixture content (validated as an object).
- Add cases in the Test Cases tab (Add Test Case):
  - Title/Description
  - Optional Starting URL (must be http/https)
  - Optional Fixture (select from Fixtures)
  - Requirements (multi-select)
  - Priority, Test Type, Estimated Duration
  - Test Steps (ordered list) and Expected Result
- Click Save Changes (bottom of the dialog) to persist the plan.
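Two of the fields above have explicit validation rules: fixture content must be a JSON object, and a Starting URL must be http or https. The sketch below shows one way to pre-check both before pasting values into the editor. The function names are hypothetical, and the editor's own validation may be stricter or differ in detail.

```python
import json
from urllib.parse import urlparse


def fixture_is_object(raw: str) -> bool:
    """True if raw parses as JSON and the top-level value is an object."""
    try:
        return isinstance(json.loads(raw), dict)
    except json.JSONDecodeError:
        return False


def starting_url_ok(url: str) -> bool:
    """True if url is empty (the field is optional) or an http/https URL with a host."""
    if not url:
        return True
    parsed = urlparse(url)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


print(fixture_is_object('{"user": "demo"}'))   # True
print(fixture_is_object('[1, 2, 3]'))          # False: a JSON array, not an object
print(starting_url_ok("https://example.com"))  # True
print(starting_url_ok("ftp://example.com"))    # False: scheme is not http/https
```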
5) Execute the plan
- Go to Execution. In Launch New Swarm, pick your Test Plan (dropdown shows test counts) and AI Model (model refs from Settings).
- Toggle Auto-Eval if you want automatic scoring.
- Click Launch. A swarm is created and started; active swarms appear in the list with status badges.
- Expand runs to monitor status, view evaluator explanations, cancel runs, or switch to manual review if needed.
6) Generate an API key
- Open API Keys in the sidebar.
- Click Create key, supply a Label, then confirm. The secret value is shown only once, in a modal; copy it before closing.
- Use this key with the SwarmAI API or the MCP CLI to trigger plan runs programmatically.
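As a rough illustration of triggering a run programmatically, the sketch below builds (but does not send) an HTTP request that carries the API key. Everything specific in it is an assumption rather than documented SwarmAI behavior: the base URL, the `/swarms` path, the Bearer header scheme, the payload field names, and the `SWARMAI_API_KEY` environment variable are placeholders. The payload fields only mirror the Launch New Swarm form from step 5; consult the SwarmAI API reference for the real endpoint and schema.

```python
import json
import os
import urllib.request

# ASSUMPTION: placeholder base URL, not a real SwarmAI endpoint.
API_BASE = "https://swarmai.example.com/api"


def build_launch_request(api_key: str, test_plan: str, model_ref: str,
                         auto_eval: bool = True) -> urllib.request.Request:
    """Build a POST request for launching a swarm (hypothetical path and payload)."""
    payload = {
        "test_plan": test_plan,   # hypothetical field: plan chosen in Launch New Swarm
        "model_ref": model_ref,   # hypothetical field: model ref from Settings
        "auto_eval": auto_eval,   # hypothetical field: mirrors the Auto-Eval toggle
    }
    return urllib.request.Request(
        url=f"{API_BASE}/swarms",  # hypothetical path
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Read the key from the environment instead of hard-coding it in source.
api_key = os.environ.get("SWARMAI_API_KEY", "sk-placeholder")
req = build_launch_request(api_key, "checkout-smoke", "openai-gpt4o")
print(req.get_method(), req.full_url)
```

Because the key is shown only once at creation, keeping it in an environment variable or a secret manager avoids having to regenerate it and keeps it out of version control.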