Engagement Scoping

Key Questions for AI Red Team Scoping

Before testing, define the boundaries:

QuestionWhy It Matters
What model(s) are in scope?Different models have different vulnerability profiles
Is the system prompt in scope for extraction?Some clients consider this IP
Are tool/plugin integrations in scope?Indirect injection testing requires this
What data sources does the AI access?Defines indirect injection surface
Are other users' sessions in scope?Multi-tenant testing needs explicit authorization
What constitutes a successful attack?Define success criteria up front
Is automated testing permitted?Volume-based tests may trigger rate limits
Are production systems in scope or staging only?Risk tolerance for live systems

Scope Tiers

TierScopeTests Included
Tier 1: BasicChatbot interface onlyJailbreaking, system prompt extraction, basic injection
Tier 2: StandardChatbot + tool integrationsTier 1 + indirect injection, tool abuse, data exfiltration
Tier 3: ComprehensiveFull application stackTier 2 + RAG poisoning, multi-tenant isolation, API security
Tier 4: PipelineML pipeline accessTier 3 + data poisoning, model supply chain, training infra

Rules of Engagement

  • Maximum query volume per hour/day
  • Approved jailbreak categories (content policy only vs. harmful content)
  • Data handling for any PII or sensitive data extracted
  • Incident escalation procedures
  • Communication channels and check-in schedule