Session setup
Model: loading
Higher values allow more model responses through. Lower values gate more aggressively.
Ready.
Interactive demo
New session
Evaluation and diagnostics
No turn has been processed yet.
Run the built-in evaluation to inspect how the gate behaves on a small red-team and benign set.