Google's Gemini 1.5 Pro AI Guardrails Tested: Researcher Explores Automated X-Ray Diagnosis Prompts

Brute-forcing the LLM guardrails - Daniel Kharitonov - Medium

Exploring and pushing the limits of AIBeing able to constrain LLM outputs is widely seen as one of the keys to widespread deployment of artificial intelligence. State-of-the art models are being expertly tuned against abuse, and will flatly reject user’s attempts to seek illegal, harmful, or dubious information… or will they?Today we will explore a medium-level risk: an attempt to get a medical diagnosis from an LLM. Let us say we have put our hands on an X-Ray image and are trying a vision-enab...