
ai-security · 7 min
Many-shot jailbreaking: when the context window becomes attack surface
On 2 April, Anthropic publishes a technique that fills the context with hundreds of harmful question/answer pairs before the real prompt. In-context learning does the rest. It scales as a power law into the hundreds of shots.
· Manuel López Pérez










