Psychological Tricks Can Get AI to Break the Rules
Summary: A University of Pennsylvania preprint tested whether human-style persuasion techniques can coax large language models into answering requests they should…
