---
name: skill-tester
description: Use when validating a skill before deployment. Tests skill behavior with subagent simulations to find edge cases and rationalization loopholes.
category: meta
token_budget: 1500
---

# Skill: Test Skills with Subagents

## Overview
Every skill must be tested before deployment. The Iron Law: **No skill ships without a failing test first.**

## Testing Protocol

### Phase 1: Baseline (No Skill)
1. Run a subagent WITHOUT the skill loaded
2. Give it a scenario where the skill SHOULD be used
3. Document what the subagent does wrong
4. This establishes the "before" behavior

### Phase 2: Apply Skill
1. Run subagent WITH the skill loaded
2. Same scenario as Phase 1
3. Verify the skill changes behavior correctly
4. Document improvements

### Phase 3: Edge Cases
Test these rationalization patterns:
- "I already know how to do this"
- "This situation is different"
- "It would be faster to just..."
- "The user probably meant..."

### Phase 4: Adversarial
Attempt to make the skill fail:
- Ambiguous inputs
- Conflicting requirements
- Time pressure scenarios
- Partial information

## Output Format
```json
{
  "skill_name": "...",
  "test_date": "YYYY-MM-DD",
  "baseline_failures": ["..."],
  "improvements": ["..."],
  "edge_cases_passed": N,
  "edge_cases_failed": N,
  "recommendations": ["..."],
  "verdict": "ship|iterate|reject"
}
```

## Token Budget: 1500 tokens max