PyRIT

What It Is

PyRIT (Python Risk Identification Toolkit) is Microsoft's open-source framework for AI red teaming. It focuses on multi-turn attack orchestration — running automated conversations with a target to find vulnerabilities.

Repository: github.com/Azure/PyRIT

Key Concepts

ConceptDescription
OrchestratorManages the attack strategy and conversation flow
TargetThe AI system being tested
ScorerEvaluates whether an attack succeeded
ConverterTransforms prompts (encoding, translation, etc.)
MemoryStores conversation history and results

Installation

pip install pyrit

Use Cases

  • Multi-turn jailbreak automation
  • Crescendo attacks (gradual escalation)
  • Cross-domain prompt injection testing
  • Automated red team campaigns across multiple target configurations
  • Scoring and comparing model safety across versions

When to Use PyRIT vs. Garak

CriteriaGarakPyRIT
Single-shot probingBestPossible
Multi-turn attacksLimitedBest
Custom attack strategiesModerateHighly customizable
ReportingBuilt-inCustom
Learning curveLowMedium