Query-Relevant Images Jailbreak Large Multi-Modal Models

TLDR

A novel visual prompt attack exploits query-relevant images to jailbreak open-source LMMs, showing that LMMs can be easily attacked by this approach even when the underlying Large Language Models are safety-aligned.