OpenAI's GPT-4 and Anthropic's Claude 3 Sonnet can be tricked into generating unsafe content - quite easily.
Navigating the frontier of machine intelligence
OpenAI's GPT-4 and Anthropic's Claude 3 Sonnet can be tricked into generating unsafe content - quite easily.