OpenAI's GPT-4 and Anthropic's Claude 3 Sonnet can be tricked into generating unsafe content quite easily.
Posts published by “University of Central Florida researchers”
Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou