Misaligned reward functions

  1. Hidden Dangers of Rewarding Prompts (AI Secrets)